2025-05-07T19:42:32.1434555Z Current runner version: '2.323.0' 2025-05-07T19:42:32.1440650Z Runner name: 'i-0a33abd677e10917f' 2025-05-07T19:42:32.1441620Z Machine name: 'ip-10-0-72-2' 2025-05-07T19:42:32.1444621Z ##[group]GITHUB_TOKEN Permissions 2025-05-07T19:42:32.1446724Z Contents: read 2025-05-07T19:42:32.1447300Z Metadata: read 2025-05-07T19:42:32.1447834Z Packages: read 2025-05-07T19:42:32.1448322Z ##[endgroup] 2025-05-07T19:42:32.1450300Z Secret source: None 2025-05-07T19:42:32.1450912Z Prepare workflow directory 2025-05-07T19:42:32.2065875Z Prepare all required actions 2025-05-07T19:42:32.2103476Z Getting action download info 2025-05-07T19:42:32.3774478Z Download action repository 'actions/checkout@v4' (SHA:11bd71901bbe5b1630ceea73d27597364c9af683) 2025-05-07T19:42:32.6112450Z Download action repository 'actions/upload-artifact@v4' (SHA:ea165f8d65b6e75b540449e92b4886f43607fa02) 2025-05-07T19:42:33.0660674Z Complete job name: build_artifact (x86, linux.24xlarge, default, 3.12, 12.6.3, clang) 2025-05-07T19:42:33.1481778Z A job started hook has been configured by the self-hosted runner administrator 2025-05-07T19:42:33.1598840Z ##[group]Run '/home/ec2-user/runner-scripts/before_job.sh' 2025-05-07T19:42:33.1608753Z shell: /usr/bin/bash --noprofile --norc -e -o pipefail {0} 2025-05-07T19:42:33.1609658Z ##[endgroup] 2025-05-07T19:42:34.2717802Z Runner Type: linux.24xlarge 2025-05-07T19:42:34.2719099Z Instance Type: c5.24xlarge 2025-05-07T19:42:34.2720012Z AMI Name: unknown 2025-05-07T19:42:34.2762987Z AMI ID: ami-071226ecf16aa7d96 2025-05-07T19:42:39.3295156Z ##[group]Checking docker version 2025-05-07T19:42:39.3307456Z ##[command]/usr/bin/docker version --format '{{.Server.APIVersion}}' 2025-05-07T19:42:39.3508892Z '1.44' 2025-05-07T19:42:39.3524814Z Docker daemon API version: '1.44' 2025-05-07T19:42:39.3525338Z ##[command]/usr/bin/docker version --format '{{.Client.APIVersion}}' 2025-05-07T19:42:39.3712967Z '1.44' 2025-05-07T19:42:39.3722295Z Docker client API version: '1.44' 2025-05-07T19:42:39.3726973Z ##[endgroup] 2025-05-07T19:42:39.3729478Z ##[group]Clean up resources from previous jobs 2025-05-07T19:42:39.3734219Z ##[command]/usr/bin/docker ps --all --quiet --no-trunc --filter "label=77a39f" 2025-05-07T19:42:39.3883032Z ##[command]/usr/bin/docker network prune --force --filter "label=77a39f" 2025-05-07T19:42:39.4023187Z ##[endgroup] 2025-05-07T19:42:39.4023509Z ##[group]Create local container network 2025-05-07T19:42:39.4032706Z ##[command]/usr/bin/docker network create --label 77a39f github_network_b577c48a8b564d3ca14c8193d220a990 2025-05-07T19:42:39.6095001Z 9301200b2e5635cb41129782a538aa1f3e637060722df0826977ae62d003ee4f 2025-05-07T19:42:39.6116707Z ##[endgroup] 2025-05-07T19:42:39.6146499Z ##[group]Starting job container 2025-05-07T19:42:39.6170932Z ##[command]/usr/bin/docker pull amazonlinux:2023 2025-05-07T19:42:39.7819716Z 2023: Pulling from library/amazonlinux 2025-05-07T19:42:39.8290879Z 1c3112c87ab2: Pulling fs layer 2025-05-07T19:42:40.3927637Z 1c3112c87ab2: Verifying Checksum 2025-05-07T19:42:40.3928777Z 1c3112c87ab2: Download complete 2025-05-07T19:42:42.1442896Z 1c3112c87ab2: Pull complete 2025-05-07T19:42:42.1528764Z Digest: sha256:cb5b4c509d62ae388f674c139ae5e8281fc160c217d474445e912043e1941988 2025-05-07T19:42:42.1549326Z Status: Downloaded newer image for amazonlinux:2023 2025-05-07T19:42:42.1562539Z docker.io/library/amazonlinux:2023 2025-05-07T19:42:42.1646501Z ##[command]/usr/bin/docker create --name caafb01e9845451cad3dc12376cedc73_amazonlinux2023_f45b6e --label 77a39f --workdir /__w/FBGEMM/FBGEMM --network github_network_b577c48a8b564d3ca14c8193d220a990 --user root -e "HOME=/github/home" -e GITHUB_ACTIONS=true -e CI=true -v "/var/run/docker.sock":"/var/run/docker.sock" -v "/home/ec2-user/actions-runner/_work":"/__w" -v "/home/ec2-user/actions-runner/externals":"/__e":ro -v "/home/ec2-user/actions-runner/_work/_temp":"/__w/_temp" -v "/home/ec2-user/actions-runner/_work/_actions":"/__w/_actions" -v "/home/ec2-user/actions-runner/_work/_tool":"/__w/_tool" -v "/home/ec2-user/actions-runner/_work/_temp/_github_home":"/github/home" -v "/home/ec2-user/actions-runner/_work/_temp/_github_workflow":"/github/workflow" --entrypoint "tail" amazonlinux:2023 "-f" "/dev/null" 2025-05-07T19:42:42.6205133Z 2c96c3f709dd7ccfbab62bd0bb03d0fe2bfdbbcabb6920f028364b7a349d9b1f 2025-05-07T19:42:42.6234872Z ##[command]/usr/bin/docker start 2c96c3f709dd7ccfbab62bd0bb03d0fe2bfdbbcabb6920f028364b7a349d9b1f 2025-05-07T19:42:43.1048676Z 2c96c3f709dd7ccfbab62bd0bb03d0fe2bfdbbcabb6920f028364b7a349d9b1f 2025-05-07T19:42:43.1069864Z ##[command]/usr/bin/docker ps --all --filter id=2c96c3f709dd7ccfbab62bd0bb03d0fe2bfdbbcabb6920f028364b7a349d9b1f --filter status=running --no-trunc --format "{{.ID}} {{.Status}}" 2025-05-07T19:42:43.1223165Z 2c96c3f709dd7ccfbab62bd0bb03d0fe2bfdbbcabb6920f028364b7a349d9b1f Up Less than a second 2025-05-07T19:42:43.1245000Z ##[command]/usr/bin/docker inspect --format "{{range .Config.Env}}{{println .}}{{end}}" 2c96c3f709dd7ccfbab62bd0bb03d0fe2bfdbbcabb6920f028364b7a349d9b1f 2025-05-07T19:42:43.1417709Z HOME=/github/home 2025-05-07T19:42:43.1418312Z GITHUB_ACTIONS=true 2025-05-07T19:42:43.1418923Z CI=true 2025-05-07T19:42:43.1419416Z PATH=/usr/local/sbin:/usr/local/bin:/usr/sbin:/usr/bin:/sbin:/bin 2025-05-07T19:42:43.1438724Z ##[endgroup] 2025-05-07T19:42:43.1448672Z ##[group]Waiting for all services to be ready 2025-05-07T19:42:43.1450527Z ##[endgroup] 2025-05-07T19:42:43.1527866Z ##[group]Run yum update -y; yum install -y binutils findutils git pciutils sudo tar wget which 2025-05-07T19:42:43.1528646Z yum update -y; yum install -y binutils findutils git pciutils sudo tar wget which 2025-05-07T19:42:43.1529608Z shell: bash --noprofile --norc -e -o pipefail {0} 2025-05-07T19:42:43.1529979Z env: 2025-05-07T19:42:43.1530260Z PRELUDE: .github/scripts/setup_env.bash 2025-05-07T19:42:43.1530689Z BUILD_ENV: build_binary 2025-05-07T19:42:43.1530976Z BUILD_TARGET: default 2025-05-07T19:42:43.1531267Z BUILD_VARIANT: cuda 2025-05-07T19:42:43.1531687Z BUILD_CUDA_VERSION: 12.6.3 2025-05-07T19:42:43.1532011Z ##[endgroup] 2025-05-07T19:42:43.9657098Z Amazon Linux 2023 repository 64 MB/s | 37 MB 00:00 2025-05-07T19:42:50.5680051Z Last metadata expiration check: 0:00:07 ago on Wed May 7 19:42:43 2025. 2025-05-07T19:42:51.1222963Z Dependencies resolved. 2025-05-07T19:42:51.1398529Z Nothing to do. 2025-05-07T19:42:51.1400207Z Complete! 2025-05-07T19:42:51.3679514Z Last metadata expiration check: 0:00:08 ago on Wed May 7 19:42:43 2025. 2025-05-07T19:42:51.4310239Z Dependencies resolved. 2025-05-07T19:42:51.4537450Z ======================================================================================== 2025-05-07T19:42:51.4538786Z Package Arch Version Repository Size 2025-05-07T19:42:51.4539423Z ======================================================================================== 2025-05-07T19:42:51.4540034Z Installing: 2025-05-07T19:42:51.4540589Z binutils x86_64 2.41-50.amzn2023.0.3 amazonlinux 5.3 M 2025-05-07T19:42:51.4541216Z findutils x86_64 1:4.8.0-2.amzn2023.0.2 amazonlinux 539 k 2025-05-07T19:42:51.4542000Z git x86_64 2.47.1-1.amzn2023.0.2 amazonlinux 54 k 2025-05-07T19:42:51.4542692Z pciutils x86_64 3.7.0-3.amzn2023.0.2 amazonlinux 93 k 2025-05-07T19:42:51.4543264Z sudo x86_64 1.9.15-1.p5.amzn2023.0.1 amazonlinux 1.3 M 2025-05-07T19:42:51.4543839Z tar x86_64 2:1.34-1.amzn2023.0.4 amazonlinux 879 k 2025-05-07T19:42:51.4544435Z wget x86_64 1.21.3-1.amzn2023.0.4 amazonlinux 779 k 2025-05-07T19:42:51.4545042Z which x86_64 2.21-26.amzn2023.0.2 amazonlinux 42 k 2025-05-07T19:42:51.4545705Z Installing dependencies: 2025-05-07T19:42:51.4546308Z cracklib x86_64 2.9.6-27.amzn2023.0.2 amazonlinux 82 k 2025-05-07T19:42:51.4547040Z cyrus-sasl-lib x86_64 2.1.27-18.amzn2023.0.3 amazonlinux 786 k 2025-05-07T19:42:51.4547985Z elfutils-debuginfod-client x86_64 0.188-3.amzn2023.0.2 amazonlinux 41 k 2025-05-07T19:42:51.4548705Z git-core x86_64 2.47.1-1.amzn2023.0.2 amazonlinux 4.7 M 2025-05-07T19:42:51.4549343Z git-core-doc noarch 2.47.1-1.amzn2023.0.2 amazonlinux 2.8 M 2025-05-07T19:42:51.4550084Z gnutls x86_64 3.8.3-6.amzn2023.0.1 amazonlinux 1.1 M 2025-05-07T19:42:51.4550746Z groff-base x86_64 1.22.4-7.amzn2023.0.2 amazonlinux 1.0 M 2025-05-07T19:42:51.4551433Z gzip x86_64 1.12-1.amzn2023.0.1 amazonlinux 160 k 2025-05-07T19:42:51.4552250Z hwdata noarch 0.384-1.amzn2023.0.3 amazonlinux 1.6 M 2025-05-07T19:42:51.4552945Z jansson x86_64 2.14-0.amzn2023 amazonlinux 46 k 2025-05-07T19:42:51.4553547Z kmod-libs x86_64 29-2.amzn2023.0.5 amazonlinux 62 k 2025-05-07T19:42:51.4554168Z less x86_64 608-2.amzn2023.0.2 amazonlinux 168 k 2025-05-07T19:42:51.4554911Z libcbor x86_64 0.7.0-3.amzn2023.0.2 amazonlinux 57 k 2025-05-07T19:42:51.4555571Z libdb x86_64 5.3.28-49.amzn2023.0.2 amazonlinux 756 k 2025-05-07T19:42:51.4556157Z libeconf x86_64 0.4.0-1.amzn2023.0.3 amazonlinux 28 k 2025-05-07T19:42:51.4556812Z libedit x86_64 3.1-38.20210714cvs.amzn2023.0.2 amazonlinux 108 k 2025-05-07T19:42:51.4557475Z libfdisk x86_64 2.37.4-1.amzn2023.0.4 amazonlinux 153 k 2025-05-07T19:42:51.4558170Z libfido2 x86_64 1.10.0-2.amzn2023.0.2 amazonlinux 95 k 2025-05-07T19:42:51.4558871Z libmetalink x86_64 0.1.3-14.amzn2023.0.2 amazonlinux 31 k 2025-05-07T19:42:51.4559549Z libpwquality x86_64 1.4.4-6.amzn2023.0.2 amazonlinux 106 k 2025-05-07T19:42:51.4560127Z libsemanage x86_64 3.4-5.amzn2023.0.2 amazonlinux 121 k 2025-05-07T19:42:51.4560763Z libutempter x86_64 1.2.1-4.amzn2023.0.2 amazonlinux 26 k 2025-05-07T19:42:51.4561503Z nano x86_64 8.3-1.amzn2023 amazonlinux 706 k 2025-05-07T19:42:51.4562298Z ncurses x86_64 6.2-4.20200222.amzn2023.0.6 amazonlinux 394 k 2025-05-07T19:42:51.4676839Z nettle x86_64 3.10.1-1.amzn2023.0.1 amazonlinux 573 k 2025-05-07T19:42:51.4677653Z openldap x86_64 2.4.57-6.amzn2023.0.7 amazonlinux 256 k 2025-05-07T19:42:51.4678199Z openssh x86_64 8.7p1-8.amzn2023.0.14 amazonlinux 454 k 2025-05-07T19:42:51.4678756Z openssh-clients x86_64 8.7p1-8.amzn2023.0.14 amazonlinux 708 k 2025-05-07T19:42:51.4679360Z pam x86_64 1.5.1-8.amzn2023.0.4 amazonlinux 542 k 2025-05-07T19:42:51.4679998Z pciutils-libs x86_64 3.7.0-3.amzn2023.0.2 amazonlinux 41 k 2025-05-07T19:42:51.4680632Z perl-AutoLoader noarch 5.74-477.amzn2023.0.6 amazonlinux 22 k 2025-05-07T19:42:51.4681281Z perl-B x86_64 1.80-477.amzn2023.0.6 amazonlinux 179 k 2025-05-07T19:42:51.4681813Z perl-Carp noarch 1.50-458.amzn2023.0.2 amazonlinux 29 k 2025-05-07T19:42:51.4682446Z perl-Class-Struct noarch 0.66-477.amzn2023.0.6 amazonlinux 22 k 2025-05-07T19:42:51.4683057Z perl-Data-Dumper x86_64 2.174-460.amzn2023.0.2 amazonlinux 55 k 2025-05-07T19:42:51.4683631Z perl-Digest noarch 1.20-1.amzn2023.0.2 amazonlinux 26 k 2025-05-07T19:42:51.4684430Z perl-Digest-MD5 x86_64 2.58-2.amzn2023.0.2 amazonlinux 36 k 2025-05-07T19:42:51.4685015Z perl-DynaLoader x86_64 1.47-477.amzn2023.0.6 amazonlinux 26 k 2025-05-07T19:42:51.4685583Z perl-Encode x86_64 4:3.15-462.amzn2023.0.2 amazonlinux 1.7 M 2025-05-07T19:42:51.4686143Z perl-Errno x86_64 1.30-477.amzn2023.0.6 amazonlinux 15 k 2025-05-07T19:42:51.4686689Z perl-Error noarch 1:0.17029-5.amzn2023.0.2 amazonlinux 41 k 2025-05-07T19:42:51.4687277Z perl-Exporter noarch 5.74-459.amzn2023.0.2 amazonlinux 31 k 2025-05-07T19:42:51.4687838Z perl-Fcntl x86_64 1.13-477.amzn2023.0.6 amazonlinux 21 k 2025-05-07T19:42:51.4688430Z perl-File-Basename noarch 2.85-477.amzn2023.0.6 amazonlinux 18 k 2025-05-07T19:42:51.4689032Z perl-File-Find noarch 1.37-477.amzn2023.0.6 amazonlinux 26 k 2025-05-07T19:42:51.4689632Z perl-File-Path noarch 2.18-2.amzn2023.0.2 amazonlinux 36 k 2025-05-07T19:42:51.4690234Z perl-File-Temp noarch 1:0.231.100-2.amzn2023.0.2 amazonlinux 60 k 2025-05-07T19:42:51.4690943Z perl-File-stat noarch 1.09-477.amzn2023.0.6 amazonlinux 17 k 2025-05-07T19:42:51.4691577Z perl-FileHandle noarch 2.03-477.amzn2023.0.6 amazonlinux 16 k 2025-05-07T19:42:51.4692182Z perl-Getopt-Long noarch 1:2.52-2.amzn2023.0.2 amazonlinux 60 k 2025-05-07T19:42:51.4692917Z perl-Getopt-Std noarch 1.12-477.amzn2023.0.6 amazonlinux 16 k 2025-05-07T19:42:51.4693465Z perl-Git noarch 2.47.1-1.amzn2023.0.2 amazonlinux 42 k 2025-05-07T19:42:51.4693993Z perl-HTTP-Tiny noarch 0.078-1.amzn2023.0.3 amazonlinux 56 k 2025-05-07T19:42:51.4694517Z perl-IO x86_64 1.43-477.amzn2023.0.6 amazonlinux 87 k 2025-05-07T19:42:51.4695033Z perl-IPC-Open3 noarch 1.21-477.amzn2023.0.6 amazonlinux 23 k 2025-05-07T19:42:51.4695588Z perl-MIME-Base64 x86_64 3.16-2.amzn2023.0.2 amazonlinux 31 k 2025-05-07T19:42:51.4696183Z perl-Net-SSLeay x86_64 1.94-1.amzn2023.0.1 amazonlinux 392 k 2025-05-07T19:42:51.4696776Z perl-POSIX x86_64 1.94-477.amzn2023.0.6 amazonlinux 97 k 2025-05-07T19:42:51.4697311Z perl-PathTools x86_64 3.78-459.amzn2023.0.2 amazonlinux 85 k 2025-05-07T19:42:51.4697880Z perl-Pod-Escapes noarch 1:1.07-458.amzn2023.0.2 amazonlinux 20 k 2025-05-07T19:42:51.4698442Z perl-Pod-Perldoc noarch 3.28.01-459.amzn2023.0.3 amazonlinux 84 k 2025-05-07T19:42:51.4699014Z perl-Pod-Simple noarch 1:3.42-2.amzn2023.0.2 amazonlinux 215 k 2025-05-07T19:42:51.4699568Z perl-Pod-Usage noarch 4:2.01-2.amzn2023.0.2 amazonlinux 41 k 2025-05-07T19:42:51.4700148Z perl-Scalar-List-Utils x86_64 4:1.56-459.amzn2023.0.2 amazonlinux 71 k 2025-05-07T19:42:51.4700731Z perl-SelectSaver noarch 1.02-477.amzn2023.0.6 amazonlinux 12 k 2025-05-07T19:42:51.4701281Z perl-Socket x86_64 4:2.032-1.amzn2023.0.2 amazonlinux 55 k 2025-05-07T19:42:51.4701804Z perl-Storable x86_64 1:3.21-458.amzn2023.0.2 amazonlinux 96 k 2025-05-07T19:42:51.4702326Z perl-Symbol noarch 1.08-477.amzn2023.0.6 amazonlinux 15 k 2025-05-07T19:42:51.4702894Z perl-Term-ANSIColor noarch 5.01-459.amzn2023.0.2 amazonlinux 48 k 2025-05-07T19:42:51.4703455Z perl-Term-Cap noarch 1.17-458.amzn2023.0.2 amazonlinux 22 k 2025-05-07T19:42:51.4704012Z perl-TermReadKey x86_64 2.38-9.amzn2023.0.2 amazonlinux 36 k 2025-05-07T19:42:51.4704662Z perl-Text-ParseWords noarch 3.30-458.amzn2023.0.2 amazonlinux 17 k 2025-05-07T19:42:51.4705255Z perl-Text-Tabs+Wrap noarch 2021.0726-1.amzn2023.0.1 amazonlinux 22 k 2025-05-07T19:42:51.4705835Z perl-Time-Local noarch 2:1.300-5.amzn2023.0.2 amazonlinux 34 k 2025-05-07T19:42:51.4706361Z perl-URI noarch 5.09-1.amzn2023.0.2 amazonlinux 108 k 2025-05-07T19:42:51.4706883Z perl-base noarch 2.27-477.amzn2023.0.6 amazonlinux 17 k 2025-05-07T19:42:51.4707405Z perl-constant noarch 1.33-459.amzn2023.0.2 amazonlinux 23 k 2025-05-07T19:42:51.4707933Z perl-if noarch 0.60.800-477.amzn2023.0.6 amazonlinux 14 k 2025-05-07T19:42:51.4708466Z perl-interpreter x86_64 4:5.32.1-477.amzn2023.0.6 amazonlinux 71 k 2025-05-07T19:42:51.4708973Z perl-lib x86_64 0.65-477.amzn2023.0.6 amazonlinux 15 k 2025-05-07T19:42:51.4709481Z perl-libnet noarch 3.13-2.amzn2023.0.2 amazonlinux 126 k 2025-05-07T19:42:51.4709973Z perl-libs x86_64 4:5.32.1-477.amzn2023.0.6 amazonlinux 2.0 M 2025-05-07T19:42:51.4710549Z perl-mro x86_64 1.23-477.amzn2023.0.6 amazonlinux 29 k 2025-05-07T19:42:51.4711072Z perl-overload noarch 1.31-477.amzn2023.0.6 amazonlinux 46 k 2025-05-07T19:42:51.4711897Z perl-overloading noarch 0.02-477.amzn2023.0.6 amazonlinux 13 k 2025-05-07T19:42:51.4712494Z perl-parent noarch 1:0.238-458.amzn2023.0.2 amazonlinux 14 k 2025-05-07T19:42:51.4713085Z perl-podlators noarch 1:4.14-458.amzn2023.0.2 amazonlinux 112 k 2025-05-07T19:42:51.4713647Z perl-subs noarch 1.03-477.amzn2023.0.6 amazonlinux 12 k 2025-05-07T19:42:51.4714207Z perl-vars noarch 1.05-477.amzn2023.0.6 amazonlinux 13 k 2025-05-07T19:42:51.4714750Z shadow-utils x86_64 2:4.9-12.amzn2023.0.4 amazonlinux 1.1 M 2025-05-07T19:42:51.4715309Z systemd-libs x86_64 252.23-3.amzn2023 amazonlinux 613 k 2025-05-07T19:42:51.4715847Z util-linux x86_64 2.37.4-1.amzn2023.0.4 amazonlinux 2.2 M 2025-05-07T19:42:51.4716387Z util-linux-core x86_64 2.37.4-1.amzn2023.0.4 amazonlinux 432 k 2025-05-07T19:42:51.4716848Z Installing weak dependencies: 2025-05-07T19:42:51.4717300Z nano-default-editor noarch 8.3-1.amzn2023 amazonlinux 10 k 2025-05-07T19:42:51.4717925Z perl-IO-Socket-IP noarch 0.41-3.amzn2023.0.2 amazonlinux 42 k 2025-05-07T19:42:51.4718522Z perl-IO-Socket-SSL noarch 2.075-1.amzn2023.0.2 amazonlinux 218 k 2025-05-07T19:42:51.4719128Z perl-Mozilla-CA noarch 20200520-4.amzn2023.0.2 amazonlinux 13 k 2025-05-07T19:42:51.4719716Z perl-NDBM_File x86_64 1.15-477.amzn2023.0.6 amazonlinux 23 k 2025-05-07T19:42:51.4720285Z sudo-python-plugin x86_64 1.9.15-1.p5.amzn2023.0.1 amazonlinux 56 k 2025-05-07T19:42:51.4720650Z 2025-05-07T19:42:51.4720750Z Transaction Summary 2025-05-07T19:42:51.4721031Z ======================================================================================== 2025-05-07T19:42:51.4721379Z Install 107 Packages 2025-05-07T19:42:51.4721529Z 2025-05-07T19:42:51.4721653Z Total download size: 38 M 2025-05-07T19:42:51.4721914Z Installed size: 151 M 2025-05-07T19:42:51.4722178Z Downloading Packages: 2025-05-07T19:42:51.7586372Z (1/107): cracklib-2.9.6-27.amzn2023.0.2.x86_64. 3.8 MB/s | 82 kB 00:00 2025-05-07T19:42:51.7730328Z (2/107): cyrus-sasl-lib-2.1.27-18.amzn2023.0.3. 22 MB/s | 786 kB 00:00 2025-05-07T19:42:51.7745524Z (3/107): elfutils-debuginfod-client-0.188-3.amz 2.8 MB/s | 41 kB 00:00 2025-05-07T19:42:51.8026100Z (4/107): binutils-2.41-50.amzn2023.0.3.x86_64.r 81 MB/s | 5.3 MB 00:00 2025-05-07T19:42:51.8078982Z (5/107): findutils-4.8.0-2.amzn2023.0.2.x86_64. 17 MB/s | 539 kB 00:00 2025-05-07T19:42:51.8094996Z (6/107): git-2.47.1-1.amzn2023.0.2.x86_64.rpm 1.7 MB/s | 54 kB 00:00 2025-05-07T19:42:51.8312288Z (7/107): gnutls-3.8.3-6.amzn2023.0.1.x86_64.rpm 58 MB/s | 1.1 MB 00:00 2025-05-07T19:42:51.8616765Z (8/107): git-core-2.47.1-1.amzn2023.0.2.x86_64. 82 MB/s | 4.7 MB 00:00 2025-05-07T19:42:51.8796381Z (9/107): git-core-doc-2.47.1-1.amzn2023.0.2.noa 41 MB/s | 2.8 MB 00:00 2025-05-07T19:42:51.8868500Z (10/107): groff-base-1.22.4-7.amzn2023.0.2.x86_ 20 MB/s | 1.0 MB 00:00 2025-05-07T19:42:51.8903582Z (11/107): gzip-1.12-1.amzn2023.0.1.x86_64.rpm 5.9 MB/s | 160 kB 00:00 2025-05-07T19:42:51.8989619Z (12/107): jansson-2.14-0.amzn2023.x86_64.rpm 4.1 MB/s | 46 kB 00:00 2025-05-07T19:42:51.9098171Z (13/107): hwdata-0.384-1.amzn2023.0.3.noarch.rp 74 MB/s | 1.6 MB 00:00 2025-05-07T19:42:51.9115206Z (14/107): kmod-libs-29-2.amzn2023.0.5.x86_64.rp 2.9 MB/s | 62 kB 00:00 2025-05-07T19:42:51.9157059Z (15/107): less-608-2.amzn2023.0.2.x86_64.rpm 10 MB/s | 168 kB 00:00 2025-05-07T19:42:51.9194060Z (16/107): libcbor-0.7.0-3.amzn2023.0.2.x86_64.r 7.9 MB/s | 57 kB 00:00 2025-05-07T19:42:51.9236047Z (17/107): libeconf-0.4.0-1.amzn2023.0.3.x86_64. 3.6 MB/s | 28 kB 00:00 2025-05-07T19:42:51.9293896Z (18/107): libdb-5.3.28-49.amzn2023.0.2.x86_64.r 44 MB/s | 756 kB 00:00 2025-05-07T19:42:51.9316988Z (19/107): libedit-3.1-38.20210714cvs.amzn2023.0 9.0 MB/s | 108 kB 00:00 2025-05-07T19:42:51.9353301Z (20/107): libfdisk-2.37.4-1.amzn2023.0.4.x86_64 14 MB/s | 153 kB 00:00 2025-05-07T19:42:51.9376077Z (21/107): libfido2-1.10.0-2.amzn2023.0.2.x86_64 12 MB/s | 95 kB 00:00 2025-05-07T19:42:51.9395886Z (22/107): libmetalink-0.1.3-14.amzn2023.0.2.x86 4.3 MB/s | 31 kB 00:00 2025-05-07T19:42:51.9425357Z (23/107): libpwquality-1.4.4-6.amzn2023.0.2.x86 16 MB/s | 106 kB 00:00 2025-05-07T19:42:51.9450345Z (24/107): libsemanage-3.4-5.amzn2023.0.2.x86_64 17 MB/s | 121 kB 00:00 2025-05-07T19:42:51.9468255Z (25/107): libutempter-1.2.1-4.amzn2023.0.2.x86_ 3.8 MB/s | 26 kB 00:00 2025-05-07T19:42:51.9544486Z (26/107): nano-8.3-1.amzn2023.x86_64.rpm 61 MB/s | 706 kB 00:00 2025-05-07T19:42:51.9552878Z (27/107): nano-default-editor-8.3-1.amzn2023.no 1.0 MB/s | 10 kB 00:00 2025-05-07T19:42:51.9605360Z (28/107): ncurses-6.2-4.20200222.amzn2023.0.6.x 29 MB/s | 394 kB 00:00 2025-05-07T19:42:51.9653900Z (29/107): openldap-2.4.57-6.amzn2023.0.7.x86_64 30 MB/s | 256 kB 00:00 2025-05-07T19:42:51.9706397Z (30/107): nettle-3.10.1-1.amzn2023.0.1.x86_64.r 41 MB/s | 573 kB 00:00 2025-05-07T19:42:51.9751825Z (31/107): openssh-8.7p1-8.amzn2023.0.14.x86_64. 31 MB/s | 454 kB 00:00 2025-05-07T19:42:51.9810378Z (32/107): openssh-clients-8.7p1-8.amzn2023.0.14 47 MB/s | 708 kB 00:00 2025-05-07T19:42:51.9860687Z (33/107): pam-1.5.1-8.amzn2023.0.4.x86_64.rpm 36 MB/s | 542 kB 00:00 2025-05-07T19:42:51.9876872Z (34/107): pciutils-3.7.0-3.amzn2023.0.2.x86_64. 8.1 MB/s | 93 kB 00:00 2025-05-07T19:42:51.9902217Z (35/107): pciutils-libs-3.7.0-3.amzn2023.0.2.x8 5.2 MB/s | 41 kB 00:00 2025-05-07T19:42:51.9932981Z (36/107): perl-AutoLoader-5.74-477.amzn2023.0.6 4.5 MB/s | 22 kB 00:00 2025-05-07T19:42:51.9974903Z (37/107): perl-B-1.80-477.amzn2023.0.6.x86_64.r 19 MB/s | 179 kB 00:00 2025-05-07T19:42:51.9995108Z (38/107): perl-Class-Struct-0.66-477.amzn2023.0 3.8 MB/s | 22 kB 00:00 2025-05-07T19:42:52.0012569Z (39/107): perl-Carp-1.50-458.amzn2023.0.2.noarc 2.7 MB/s | 29 kB 00:00 2025-05-07T19:42:52.0031560Z (40/107): perl-Data-Dumper-2.174-460.amzn2023.0 11 MB/s | 55 kB 00:00 2025-05-07T19:42:52.0050369Z (41/107): perl-Digest-1.20-1.amzn2023.0.2.noarc 5.3 MB/s | 26 kB 00:00 2025-05-07T19:42:52.0074053Z (42/107): perl-Digest-MD5-2.58-2.amzn2023.0.2.x 5.9 MB/s | 36 kB 00:00 2025-05-07T19:42:52.0094480Z (43/107): perl-DynaLoader-1.47-477.amzn2023.0.6 4.2 MB/s | 26 kB 00:00 2025-05-07T19:42:52.0211366Z (44/107): perl-Encode-3.15-462.amzn2023.0.2.x86 109 MB/s | 1.7 MB 00:00 2025-05-07T19:42:52.0227257Z (45/107): perl-Errno-1.30-477.amzn2023.0.6.x86_ 1.0 MB/s | 15 kB 00:00 2025-05-07T19:42:52.0239595Z (46/107): perl-Error-0.17029-5.amzn2023.0.2.noa 3.0 MB/s | 41 kB 00:00 2025-05-07T19:42:52.0266479Z (47/107): perl-Exporter-5.74-459.amzn2023.0.2.n 6.2 MB/s | 31 kB 00:00 2025-05-07T19:42:52.0298840Z (48/107): perl-File-Basename-2.85-477.amzn2023. 3.6 MB/s | 18 kB 00:00 2025-05-07T19:42:52.0316014Z (49/107): perl-Fcntl-1.13-477.amzn2023.0.6.x86_ 3.0 MB/s | 21 kB 00:00 2025-05-07T19:42:52.0340019Z (50/107): perl-File-Find-1.37-477.amzn2023.0.6. 3.5 MB/s | 26 kB 00:00 2025-05-07T19:42:52.0357491Z (51/107): perl-File-Path-2.18-2.amzn2023.0.2.no 6.2 MB/s | 36 kB 00:00 2025-05-07T19:42:52.0376965Z (52/107): perl-File-Temp-0.231.100-2.amzn2023.0 10 MB/s | 60 kB 00:00 2025-05-07T19:42:52.0407978Z (53/107): perl-File-stat-1.09-477.amzn2023.0.6. 2.7 MB/s | 17 kB 00:00 2025-05-07T19:42:52.0429119Z (54/107): perl-FileHandle-2.03-477.amzn2023.0.6 2.4 MB/s | 16 kB 00:00 2025-05-07T19:42:52.0449637Z (55/107): perl-Getopt-Long-2.52-2.amzn2023.0.2. 9.6 MB/s | 60 kB 00:00 2025-05-07T19:42:52.0466897Z (56/107): perl-Getopt-Std-1.12-477.amzn2023.0.6 2.9 MB/s | 16 kB 00:00 2025-05-07T19:42:52.0483182Z (57/107): perl-Git-2.47.1-1.amzn2023.0.2.noarch 7.9 MB/s | 42 kB 00:00 2025-05-07T19:42:52.0512932Z (58/107): perl-HTTP-Tiny-0.078-1.amzn2023.0.3.n 9.3 MB/s | 56 kB 00:00 2025-05-07T19:42:52.0550263Z (59/107): perl-IO-1.43-477.amzn2023.0.6.x86_64. 11 MB/s | 87 kB 00:00 2025-05-07T19:42:52.0567060Z (60/107): perl-IO-Socket-IP-0.41-3.amzn2023.0.2 5.4 MB/s | 42 kB 00:00 2025-05-07T19:42:52.0608599Z (61/107): perl-IO-Socket-SSL-2.075-1.amzn2023.0 24 MB/s | 218 kB 00:00 2025-05-07T19:42:52.0628341Z (62/107): perl-IPC-Open3-1.21-477.amzn2023.0.6. 4.5 MB/s | 23 kB 00:00 2025-05-07T19:42:52.0642856Z (63/107): perl-MIME-Base64-3.16-2.amzn2023.0.2. 4.5 MB/s | 31 kB 00:00 2025-05-07T19:42:52.0665544Z (64/107): perl-Mozilla-CA-20200520-4.amzn2023.0 2.4 MB/s | 13 kB 00:00 2025-05-07T19:42:52.0691196Z (65/107): perl-NDBM_File-1.15-477.amzn2023.0.6. 3.8 MB/s | 23 kB 00:00 2025-05-07T19:42:52.0729774Z (66/107): perl-POSIX-1.94-477.amzn2023.0.6.x86_ 16 MB/s | 97 kB 00:00 2025-05-07T19:42:52.0767253Z (67/107): perl-Net-SSLeay-1.94-1.amzn2023.0.1.x 32 MB/s | 392 kB 00:00 2025-05-07T19:42:52.0792440Z (68/107): perl-PathTools-3.78-459.amzn2023.0.2. 8.5 MB/s | 85 kB 00:00 2025-05-07T19:42:52.0803337Z (69/107): perl-Pod-Escapes-1.07-458.amzn2023.0. 2.9 MB/s | 20 kB 00:00 2025-05-07T19:42:52.0877754Z (70/107): perl-Pod-Simple-3.42-2.amzn2023.0.2.n 30 MB/s | 215 kB 00:00 2025-05-07T19:42:52.0896205Z (71/107): perl-Pod-Usage-2.01-2.amzn2023.0.2.no 4.8 MB/s | 41 kB 00:00 2025-05-07T19:42:52.0919698Z (72/107): perl-Pod-Perldoc-3.28.01-459.amzn2023 5.5 MB/s | 84 kB 00:00 2025-05-07T19:42:52.0958562Z (73/107): perl-Scalar-List-Utils-1.56-459.amzn2 13 MB/s | 71 kB 00:00 2025-05-07T19:42:52.0967748Z (74/107): perl-SelectSaver-1.02-477.amzn2023.0. 1.8 MB/s | 12 kB 00:00 2025-05-07T19:42:52.0994160Z (75/107): perl-Socket-2.032-1.amzn2023.0.2.x86_ 7.6 MB/s | 55 kB 00:00 2025-05-07T19:42:52.1028493Z (76/107): perl-Symbol-1.08-477.amzn2023.0.6.noa 2.7 MB/s | 15 kB 00:00 2025-05-07T19:42:52.1060093Z (77/107): perl-Term-ANSIColor-5.01-459.amzn2023 7.6 MB/s | 48 kB 00:00 2025-05-07T19:42:52.1082918Z (78/107): perl-Storable-3.21-458.amzn2023.0.2.x 8.6 MB/s | 96 kB 00:00 2025-05-07T19:42:52.1110108Z (79/107): perl-Term-Cap-1.17-458.amzn2023.0.2.n 2.8 MB/s | 22 kB 00:00 2025-05-07T19:42:52.1133779Z (80/107): perl-TermReadKey-2.38-9.amzn2023.0.2. 5.2 MB/s | 36 kB 00:00 2025-05-07T19:42:52.1154779Z (81/107): perl-Text-ParseWords-3.30-458.amzn202 2.6 MB/s | 17 kB 00:00 2025-05-07T19:42:52.1192720Z (82/107): perl-Text-Tabs+Wrap-2021.0726-1.amzn2 3.0 MB/s | 22 kB 00:00 2025-05-07T19:42:52.1217465Z (83/107): perl-Time-Local-1.300-5.amzn2023.0.2. 4.4 MB/s | 34 kB 00:00 2025-05-07T19:42:52.1238389Z (84/107): perl-URI-5.09-1.amzn2023.0.2.noarch.r 13 MB/s | 108 kB 00:00 2025-05-07T19:42:52.1262073Z (85/107): perl-base-2.27-477.amzn2023.0.6.noarc 2.8 MB/s | 17 kB 00:00 2025-05-07T19:42:52.1306284Z (86/107): perl-constant-1.33-459.amzn2023.0.2.n 3.8 MB/s | 23 kB 00:00 2025-05-07T19:42:52.1315590Z (87/107): perl-if-0.60.800-477.amzn2023.0.6.noa 1.9 MB/s | 14 kB 00:00 2025-05-07T19:42:52.1349021Z (88/107): perl-interpreter-5.32.1-477.amzn2023. 8.6 MB/s | 71 kB 00:00 2025-05-07T19:42:52.1367771Z (89/107): perl-lib-0.65-477.amzn2023.0.6.x86_64 3.5 MB/s | 15 kB 00:00 2025-05-07T19:42:52.1394185Z (90/107): perl-libnet-3.13-2.amzn2023.0.2.noarc 18 MB/s | 126 kB 00:00 2025-05-07T19:42:52.1547282Z (91/107): perl-libs-5.32.1-477.amzn2023.0.6.x86 105 MB/s | 2.0 MB 00:00 2025-05-07T19:42:52.1567457Z (92/107): perl-mro-1.23-477.amzn2023.0.6.x86_64 1.5 MB/s | 29 kB 00:00 2025-05-07T19:42:52.1586621Z (93/107): perl-overload-1.31-477.amzn2023.0.6.n 2.7 MB/s | 46 kB 00:00 2025-05-07T19:42:52.1625578Z (94/107): perl-overloading-0.02-477.amzn2023.0. 1.8 MB/s | 13 kB 00:00 2025-05-07T19:42:52.1669009Z (95/107): perl-podlators-4.14-458.amzn2023.0.2. 14 MB/s | 112 kB 00:00 2025-05-07T19:42:52.1691681Z (96/107): perl-parent-0.238-458.amzn2023.0.2.no 1.4 MB/s | 14 kB 00:00 2025-05-07T19:42:52.1708634Z (97/107): perl-subs-1.03-477.amzn2023.0.6.noarc 1.5 MB/s | 12 kB 00:00 2025-05-07T19:42:52.1719870Z (98/107): perl-vars-1.05-477.amzn2023.0.6.noarc 2.8 MB/s | 13 kB 00:00 2025-05-07T19:42:52.1836962Z (99/107): shadow-utils-4.9-12.amzn2023.0.4.x86_ 77 MB/s | 1.1 MB 00:00 2025-05-07T19:42:52.1932024Z (100/107): sudo-1.9.15-1.p5.amzn2023.0.1.x86_64 57 MB/s | 1.3 MB 00:00 2025-05-07T19:42:52.1946207Z (101/107): sudo-python-plugin-1.9.15-1.p5.amzn2 2.5 MB/s | 56 kB 00:00 2025-05-07T19:42:52.2016400Z (102/107): systemd-libs-252.23-3.amzn2023.x86_6 39 MB/s | 613 kB 00:00 2025-05-07T19:42:52.2105159Z (103/107): tar-1.34-1.amzn2023.0.4.x86_64.rpm 60 MB/s | 879 kB 00:00 2025-05-07T19:42:52.2259485Z (104/107): util-linux-2.37.4-1.amzn2023.0.4.x86 73 MB/s | 2.2 MB 00:00 2025-05-07T19:42:52.2316782Z (105/107): util-linux-core-2.37.4-1.amzn2023.0. 15 MB/s | 432 kB 00:00 2025-05-07T19:42:52.2367341Z (106/107): wget-1.21.3-1.amzn2023.0.4.x86_64.rp 32 MB/s | 779 kB 00:00 2025-05-07T19:42:52.2388133Z (107/107): which-2.21-26.amzn2023.0.2.x86_64.rp 3.7 MB/s | 42 kB 00:00 2025-05-07T19:42:52.2400782Z -------------------------------------------------------------------------------- 2025-05-07T19:42:52.2402119Z Total 48 MB/s | 38 MB 00:00 2025-05-07T19:42:53.2905048Z Running transaction check 2025-05-07T19:42:53.3358568Z Transaction check succeeded. 2025-05-07T19:42:53.3359451Z Running transaction test 2025-05-07T19:42:53.7029776Z Transaction test succeeded. 2025-05-07T19:42:53.7030495Z Running transaction 2025-05-07T19:42:54.4254293Z Preparing : 1/1 2025-05-07T19:42:54.4390715Z Installing : systemd-libs-252.23-3.amzn2023.x86_64 1/107 2025-05-07T19:42:54.4632185Z Installing : nettle-3.10.1-1.amzn2023.0.1.x86_64 2/107 2025-05-07T19:42:54.4832374Z Installing : gnutls-3.8.3-6.amzn2023.0.1.x86_64 3/107 2025-05-07T19:42:54.4902044Z Installing : util-linux-core-2.37.4-1.amzn2023.0.4.x86_64 4/107 2025-05-07T19:42:54.4966892Z Running scriptlet: util-linux-core-2.37.4-1.amzn2023.0.4.x86_64 4/107 2025-05-07T19:42:54.5060796Z Installing : pciutils-libs-3.7.0-3.amzn2023.0.2.x86_64 5/107 2025-05-07T19:42:54.5340345Z Installing : ncurses-6.2-4.20200222.amzn2023.0.6.x86_64 6/107 2025-05-07T19:42:54.5398152Z Installing : nano-8.3-1.amzn2023.x86_64 7/107 2025-05-07T19:42:54.5450312Z Installing : nano-default-editor-8.3-1.amzn2023.noarch 8/107 2025-05-07T19:42:54.5950720Z Installing : libsemanage-3.4-5.amzn2023.0.2.x86_64 9/107 2025-05-07T19:42:54.6032282Z Installing : shadow-utils-2:4.9-12.amzn2023.0.4.x86_64 10/107 2025-05-07T19:42:54.6338808Z Running scriptlet: libutempter-1.2.1-4.amzn2023.0.2.x86_64 11/107 2025-05-07T19:42:54.6390080Z Installing : libutempter-1.2.1-4.amzn2023.0.2.x86_64 11/107 2025-05-07T19:42:54.6449557Z Installing : libmetalink-0.1.3-14.amzn2023.0.2.x86_64 12/107 2025-05-07T19:42:54.6506071Z Installing : libfdisk-2.37.4-1.amzn2023.0.4.x86_64 13/107 2025-05-07T19:42:54.6553102Z Installing : libedit-3.1-38.20210714cvs.amzn2023.0.2.x86_64 14/107 2025-05-07T19:42:54.6690270Z Installing : libeconf-0.4.0-1.amzn2023.0.3.x86_64 15/107 2025-05-07T19:42:54.6735888Z Installing : libdb-5.3.28-49.amzn2023.0.2.x86_64 16/107 2025-05-07T19:42:54.6792120Z Installing : libcbor-0.7.0-3.amzn2023.0.2.x86_64 17/107 2025-05-07T19:42:54.6866803Z Installing : libfido2-1.10.0-2.amzn2023.0.2.x86_64 18/107 2025-05-07T19:42:54.6924768Z Installing : less-608-2.amzn2023.0.2.x86_64 19/107 2025-05-07T19:42:54.6973186Z Installing : kmod-libs-29-2.amzn2023.0.5.x86_64 20/107 2025-05-07T19:42:54.7394921Z Installing : jansson-2.14-0.amzn2023.x86_64 21/107 2025-05-07T19:42:54.7472205Z Installing : hwdata-0.384-1.amzn2023.0.3.noarch 22/107 2025-05-07T19:42:54.7612313Z Installing : gzip-1.12-1.amzn2023.0.1.x86_64 23/107 2025-05-07T19:42:54.8028932Z Installing : cracklib-2.9.6-27.amzn2023.0.2.x86_64 24/107 2025-05-07T19:42:54.8193778Z Installing : pam-1.5.1-8.amzn2023.0.4.x86_64 25/107 2025-05-07T19:42:54.8996457Z Installing : libpwquality-1.4.4-6.amzn2023.0.2.x86_64 26/107 2025-05-07T19:42:54.8998142Z Installing : util-linux-2.37.4-1.amzn2023.0.4.x86_64 27/107 2025-05-07T19:42:54.8999539Z warning: /etc/adjtime created as /etc/adjtime.rpmnew 2025-05-07T19:42:54.9000298Z 2025-05-07T19:42:54.9186409Z Running scriptlet: util-linux-2.37.4-1.amzn2023.0.4.x86_64 27/107 2025-05-07T19:42:54.9444569Z Running scriptlet: openssh-8.7p1-8.amzn2023.0.14.x86_64 28/107 2025-05-07T19:42:54.9619599Z Installing : openssh-8.7p1-8.amzn2023.0.14.x86_64 28/107 2025-05-07T19:42:54.9661444Z Installing : openssh-clients-8.7p1-8.amzn2023.0.14.x86_64 29/107 2025-05-07T19:42:55.0762588Z Running scriptlet: openssh-clients-8.7p1-8.amzn2023.0.14.x86_64 29/107 2025-05-07T19:42:55.2235755Z Installing : git-core-2.47.1-1.amzn2023.0.2.x86_64 30/107 2025-05-07T19:42:55.2336181Z Installing : git-core-doc-2.47.1-1.amzn2023.0.2.noarch 31/107 2025-05-07T19:42:55.2755352Z Running scriptlet: groff-base-1.22.4-7.amzn2023.0.2.x86_64 32/107 2025-05-07T19:42:55.2823369Z Installing : groff-base-1.22.4-7.amzn2023.0.2.x86_64 32/107 2025-05-07T19:42:55.2898996Z Running scriptlet: groff-base-1.22.4-7.amzn2023.0.2.x86_64 32/107 2025-05-07T19:42:55.2954766Z Installing : perl-Digest-1.20-1.amzn2023.0.2.noarch 33/107 2025-05-07T19:42:55.3033413Z Installing : perl-Digest-MD5-2.58-2.amzn2023.0.2.x86_64 34/107 2025-05-07T19:42:55.3072747Z Installing : perl-B-1.80-477.amzn2023.0.6.x86_64 35/107 2025-05-07T19:42:55.3107088Z Installing : perl-FileHandle-2.03-477.amzn2023.0.6.noarch 36/107 2025-05-07T19:42:55.3148505Z Installing : perl-AutoLoader-5.74-477.amzn2023.0.6.noarch 37/107 2025-05-07T19:42:55.3230848Z Installing : perl-Data-Dumper-2.174-460.amzn2023.0.2.x86_64 38/107 2025-05-07T19:42:55.3275213Z Installing : perl-libnet-3.13-2.amzn2023.0.2.noarch 39/107 2025-05-07T19:42:55.3356003Z Installing : perl-base-2.27-477.amzn2023.0.6.noarch 40/107 2025-05-07T19:42:55.3548776Z Installing : perl-URI-5.09-1.amzn2023.0.2.noarch 41/107 2025-05-07T19:42:55.3611907Z Installing : perl-Net-SSLeay-1.94-1.amzn2023.0.1.x86_64 42/107 2025-05-07T19:42:55.3647803Z Installing : perl-Text-Tabs+Wrap-2021.0726-1.amzn2023.0.1.noa 43/107 2025-05-07T19:42:55.3674935Z Installing : perl-Mozilla-CA-20200520-4.amzn2023.0.2.noarch 44/107 2025-05-07T19:42:55.3724128Z Installing : perl-if-0.60.800-477.amzn2023.0.6.noarch 45/107 2025-05-07T19:42:55.3772109Z Installing : perl-IO-Socket-IP-0.41-3.amzn2023.0.2.noarch 46/107 2025-05-07T19:42:55.3812928Z Installing : perl-Time-Local-2:1.300-5.amzn2023.0.2.noarch 47/107 2025-05-07T19:42:55.3889684Z Installing : perl-File-Path-2.18-2.amzn2023.0.2.noarch 48/107 2025-05-07T19:42:55.3937926Z Installing : perl-IO-Socket-SSL-2.075-1.amzn2023.0.2.noarch 49/107 2025-05-07T19:42:55.3974962Z Installing : perl-Pod-Escapes-1:1.07-458.amzn2023.0.2.noarch 50/107 2025-05-07T19:42:55.4028770Z Installing : perl-Class-Struct-0.66-477.amzn2023.0.6.noarch 51/107 2025-05-07T19:42:55.4073007Z Installing : perl-POSIX-1.94-477.amzn2023.0.6.x86_64 52/107 2025-05-07T19:42:55.4110694Z Installing : perl-Term-ANSIColor-5.01-459.amzn2023.0.2.noarch 53/107 2025-05-07T19:42:55.4146041Z Installing : perl-IPC-Open3-1.21-477.amzn2023.0.6.noarch 54/107 2025-05-07T19:42:55.4192113Z Installing : perl-subs-1.03-477.amzn2023.0.6.noarch 55/107 2025-05-07T19:42:55.4239973Z Installing : perl-File-Temp-1:0.231.100-2.amzn2023.0.2.noarch 56/107 2025-05-07T19:42:55.4288833Z Installing : perl-HTTP-Tiny-0.078-1.amzn2023.0.3.noarch 57/107 2025-05-07T19:42:55.4383727Z Installing : perl-Term-Cap-1.17-458.amzn2023.0.2.noarch 58/107 2025-05-07T19:42:55.4448801Z Installing : perl-Pod-Simple-1:3.42-2.amzn2023.0.2.noarch 59/107 2025-05-07T19:42:55.4496692Z Installing : perl-Socket-4:2.032-1.amzn2023.0.2.x86_64 60/107 2025-05-07T19:42:55.4524830Z Installing : perl-SelectSaver-1.02-477.amzn2023.0.6.noarch 61/107 2025-05-07T19:42:55.4554678Z Installing : perl-Symbol-1.08-477.amzn2023.0.6.noarch 62/107 2025-05-07T19:42:55.4617384Z Installing : perl-File-stat-1.09-477.amzn2023.0.6.noarch 63/107 2025-05-07T19:42:55.4684553Z Installing : perl-podlators-1:4.14-458.amzn2023.0.2.noarch 64/107 2025-05-07T19:42:55.4732479Z Installing : perl-Pod-Perldoc-3.28.01-459.amzn2023.0.3.noarch 65/107 2025-05-07T19:42:55.4769550Z Installing : perl-Fcntl-1.13-477.amzn2023.0.6.x86_64 66/107 2025-05-07T19:42:55.4811952Z Installing : perl-Text-ParseWords-3.30-458.amzn2023.0.2.noarc 67/107 2025-05-07T19:42:55.4872057Z Installing : perl-mro-1.23-477.amzn2023.0.6.x86_64 68/107 2025-05-07T19:42:55.4918323Z Installing : perl-IO-1.43-477.amzn2023.0.6.x86_64 69/107 2025-05-07T19:42:55.4957045Z Installing : perl-overloading-0.02-477.amzn2023.0.6.noarch 70/107 2025-05-07T19:42:55.5009899Z Installing : perl-Pod-Usage-4:2.01-2.amzn2023.0.2.noarch 71/107 2025-05-07T19:42:55.5044708Z Installing : perl-Errno-1.30-477.amzn2023.0.6.x86_64 72/107 2025-05-07T19:42:55.5074926Z Installing : perl-File-Basename-2.85-477.amzn2023.0.6.noarch 73/107 2025-05-07T19:42:55.5128009Z Installing : perl-Getopt-Std-1.12-477.amzn2023.0.6.noarch 74/107 2025-05-07T19:42:55.5189282Z Installing : perl-MIME-Base64-3.16-2.amzn2023.0.2.x86_64 75/107 2025-05-07T19:42:55.5238624Z Installing : perl-Scalar-List-Utils-4:1.56-459.amzn2023.0.2.x 76/107 2025-05-07T19:42:55.5291599Z Installing : perl-constant-1.33-459.amzn2023.0.2.noarch 77/107 2025-05-07T19:42:55.5331352Z Installing : perl-Storable-1:3.21-458.amzn2023.0.2.x86_64 78/107 2025-05-07T19:42:55.5365849Z Installing : perl-overload-1.31-477.amzn2023.0.6.noarch 79/107 2025-05-07T19:42:55.5403346Z Installing : perl-parent-1:0.238-458.amzn2023.0.2.noarch 80/107 2025-05-07T19:42:55.5449744Z Installing : perl-vars-1.05-477.amzn2023.0.6.noarch 81/107 2025-05-07T19:42:55.5489604Z Installing : perl-Getopt-Long-1:2.52-2.amzn2023.0.2.noarch 82/107 2025-05-07T19:42:55.5518040Z Installing : perl-DynaLoader-1.47-477.amzn2023.0.6.x86_64 83/107 2025-05-07T19:42:55.5560205Z Installing : perl-Carp-1.50-458.amzn2023.0.2.noarch 84/107 2025-05-07T19:42:55.5600410Z Installing : perl-Exporter-5.74-459.amzn2023.0.2.noarch 85/107 2025-05-07T19:42:55.5666357Z Installing : perl-NDBM_File-1.15-477.amzn2023.0.6.x86_64 86/107 2025-05-07T19:42:55.6176631Z Installing : perl-PathTools-3.78-459.amzn2023.0.2.x86_64 87/107 2025-05-07T19:42:55.7117301Z Installing : perl-Encode-4:3.15-462.amzn2023.0.2.x86_64 88/107 2025-05-07T19:42:55.7211488Z Installing : perl-libs-4:5.32.1-477.amzn2023.0.6.x86_64 89/107 2025-05-07T19:42:55.7274509Z Installing : perl-interpreter-4:5.32.1-477.amzn2023.0.6.x86_6 90/107 2025-05-07T19:42:55.7316159Z Installing : perl-Error-1:0.17029-5.amzn2023.0.2.noarch 91/107 2025-05-07T19:42:55.7365206Z Installing : perl-File-Find-1.37-477.amzn2023.0.6.noarch 92/107 2025-05-07T19:42:55.7414772Z Installing : perl-TermReadKey-2.38-9.amzn2023.0.2.x86_64 93/107 2025-05-07T19:42:55.7444046Z Installing : perl-lib-0.65-477.amzn2023.0.6.x86_64 94/107 2025-05-07T19:42:55.7489679Z Installing : perl-Git-2.47.1-1.amzn2023.0.2.noarch 95/107 2025-05-07T19:42:55.7538423Z Installing : git-2.47.1-1.amzn2023.0.2.x86_64 96/107 2025-05-07T19:42:55.7710637Z Installing : elfutils-debuginfod-client-0.188-3.amzn2023.0.2. 97/107 2025-05-07T19:42:55.7814574Z Installing : cyrus-sasl-lib-2.1.27-18.amzn2023.0.3.x86_64 98/107 2025-05-07T19:42:55.7871783Z Installing : openldap-2.4.57-6.amzn2023.0.7.x86_64 99/107 2025-05-07T19:42:55.8258964Z Installing : sudo-python-plugin-1.9.15-1.p5.amzn2023.0.1.x86_ 100/107 2025-05-07T19:42:55.9454173Z Installing : sudo-1.9.15-1.p5.amzn2023.0.1.x86_64 101/107 2025-05-07T19:42:55.9519272Z Installing : binutils-2.41-50.amzn2023.0.3.x86_64 102/107 2025-05-07T19:42:55.9629113Z Running scriptlet: binutils-2.41-50.amzn2023.0.3.x86_64 102/107 2025-05-07T19:42:55.9911268Z Installing : pciutils-3.7.0-3.amzn2023.0.2.x86_64 103/107 2025-05-07T19:42:55.9982885Z Installing : wget-1.21.3-1.amzn2023.0.4.x86_64 104/107 2025-05-07T19:42:56.0216059Z Installing : which-2.21-26.amzn2023.0.2.x86_64 105/107 2025-05-07T19:42:56.0403421Z Installing : tar-2:1.34-1.amzn2023.0.4.x86_64 106/107 2025-05-07T19:42:56.0460451Z Installing : findutils-1:4.8.0-2.amzn2023.0.2.x86_64 107/107 2025-05-07T19:42:56.0578251Z Running scriptlet: pam-1.5.1-8.amzn2023.0.4.x86_64 107/107 2025-05-07T19:42:56.8158466Z Running scriptlet: findutils-1:4.8.0-2.amzn2023.0.2.x86_64 107/107 2025-05-07T19:42:56.8160388Z Verifying : binutils-2.41-50.amzn2023.0.3.x86_64 1/107 2025-05-07T19:42:56.8162401Z Verifying : cracklib-2.9.6-27.amzn2023.0.2.x86_64 2/107 2025-05-07T19:42:56.8163700Z Verifying : cyrus-sasl-lib-2.1.27-18.amzn2023.0.3.x86_64 3/107 2025-05-07T19:42:56.8164338Z Verifying : elfutils-debuginfod-client-0.188-3.amzn2023.0.2. 4/107 2025-05-07T19:42:56.8165295Z Verifying : findutils-1:4.8.0-2.amzn2023.0.2.x86_64 5/107 2025-05-07T19:42:56.8165890Z Verifying : git-2.47.1-1.amzn2023.0.2.x86_64 6/107 2025-05-07T19:42:56.8166528Z Verifying : git-core-2.47.1-1.amzn2023.0.2.x86_64 7/107 2025-05-07T19:42:56.8167130Z Verifying : git-core-doc-2.47.1-1.amzn2023.0.2.noarch 8/107 2025-05-07T19:42:56.8168092Z Verifying : gnutls-3.8.3-6.amzn2023.0.1.x86_64 9/107 2025-05-07T19:42:56.8168748Z Verifying : groff-base-1.22.4-7.amzn2023.0.2.x86_64 10/107 2025-05-07T19:42:56.8169344Z Verifying : gzip-1.12-1.amzn2023.0.1.x86_64 11/107 2025-05-07T19:42:56.8170015Z Verifying : hwdata-0.384-1.amzn2023.0.3.noarch 12/107 2025-05-07T19:42:56.8170599Z Verifying : jansson-2.14-0.amzn2023.x86_64 13/107 2025-05-07T19:42:56.8171259Z Verifying : kmod-libs-29-2.amzn2023.0.5.x86_64 14/107 2025-05-07T19:42:56.8172049Z Verifying : less-608-2.amzn2023.0.2.x86_64 15/107 2025-05-07T19:42:56.8172644Z Verifying : libcbor-0.7.0-3.amzn2023.0.2.x86_64 16/107 2025-05-07T19:42:56.8173299Z Verifying : libdb-5.3.28-49.amzn2023.0.2.x86_64 17/107 2025-05-07T19:42:56.8173917Z Verifying : libeconf-0.4.0-1.amzn2023.0.3.x86_64 18/107 2025-05-07T19:42:56.8174569Z Verifying : libedit-3.1-38.20210714cvs.amzn2023.0.2.x86_64 19/107 2025-05-07T19:42:56.8175230Z Verifying : libfdisk-2.37.4-1.amzn2023.0.4.x86_64 20/107 2025-05-07T19:42:56.8175838Z Verifying : libfido2-1.10.0-2.amzn2023.0.2.x86_64 21/107 2025-05-07T19:42:56.8176582Z Verifying : libmetalink-0.1.3-14.amzn2023.0.2.x86_64 22/107 2025-05-07T19:42:56.8177176Z Verifying : libpwquality-1.4.4-6.amzn2023.0.2.x86_64 23/107 2025-05-07T19:42:56.8178113Z Verifying : libsemanage-3.4-5.amzn2023.0.2.x86_64 24/107 2025-05-07T19:42:56.8178782Z Verifying : libutempter-1.2.1-4.amzn2023.0.2.x86_64 25/107 2025-05-07T19:42:56.8179427Z Verifying : nano-8.3-1.amzn2023.x86_64 26/107 2025-05-07T19:42:56.8180192Z Verifying : nano-default-editor-8.3-1.amzn2023.noarch 27/107 2025-05-07T19:42:56.8180813Z Verifying : ncurses-6.2-4.20200222.amzn2023.0.6.x86_64 28/107 2025-05-07T19:42:56.8181496Z Verifying : nettle-3.10.1-1.amzn2023.0.1.x86_64 29/107 2025-05-07T19:42:56.8182165Z Verifying : openldap-2.4.57-6.amzn2023.0.7.x86_64 30/107 2025-05-07T19:42:56.8182758Z Verifying : openssh-8.7p1-8.amzn2023.0.14.x86_64 31/107 2025-05-07T19:42:56.8183437Z Verifying : openssh-clients-8.7p1-8.amzn2023.0.14.x86_64 32/107 2025-05-07T19:42:56.8184052Z Verifying : pam-1.5.1-8.amzn2023.0.4.x86_64 33/107 2025-05-07T19:42:56.8184696Z Verifying : pciutils-3.7.0-3.amzn2023.0.2.x86_64 34/107 2025-05-07T19:42:56.8185529Z Verifying : pciutils-libs-3.7.0-3.amzn2023.0.2.x86_64 35/107 2025-05-07T19:42:56.8186180Z Verifying : perl-AutoLoader-5.74-477.amzn2023.0.6.noarch 36/107 2025-05-07T19:42:56.8186863Z Verifying : perl-B-1.80-477.amzn2023.0.6.x86_64 37/107 2025-05-07T19:42:56.8187434Z Verifying : perl-Carp-1.50-458.amzn2023.0.2.noarch 38/107 2025-05-07T19:42:56.8188245Z Verifying : perl-Class-Struct-0.66-477.amzn2023.0.6.noarch 39/107 2025-05-07T19:42:56.8188882Z Verifying : perl-Data-Dumper-2.174-460.amzn2023.0.2.x86_64 40/107 2025-05-07T19:42:56.8189526Z Verifying : perl-Digest-1.20-1.amzn2023.0.2.noarch 41/107 2025-05-07T19:42:56.8190245Z Verifying : perl-Digest-MD5-2.58-2.amzn2023.0.2.x86_64 42/107 2025-05-07T19:42:56.8190897Z Verifying : perl-DynaLoader-1.47-477.amzn2023.0.6.x86_64 43/107 2025-05-07T19:42:56.8191724Z Verifying : perl-Encode-4:3.15-462.amzn2023.0.2.x86_64 44/107 2025-05-07T19:42:56.8192445Z Verifying : perl-Errno-1.30-477.amzn2023.0.6.x86_64 45/107 2025-05-07T19:42:56.8193127Z Verifying : perl-Error-1:0.17029-5.amzn2023.0.2.noarch 46/107 2025-05-07T19:42:56.8193724Z Verifying : perl-Exporter-5.74-459.amzn2023.0.2.noarch 47/107 2025-05-07T19:42:56.8194287Z Verifying : perl-Fcntl-1.13-477.amzn2023.0.6.x86_64 48/107 2025-05-07T19:42:56.8194831Z Verifying : perl-File-Basename-2.85-477.amzn2023.0.6.noarch 49/107 2025-05-07T19:42:56.8195412Z Verifying : perl-File-Find-1.37-477.amzn2023.0.6.noarch 50/107 2025-05-07T19:42:56.8195973Z Verifying : perl-File-Path-2.18-2.amzn2023.0.2.noarch 51/107 2025-05-07T19:42:56.8196515Z Verifying : perl-File-Temp-1:0.231.100-2.amzn2023.0.2.noarch 52/107 2025-05-07T19:42:56.8197075Z Verifying : perl-File-stat-1.09-477.amzn2023.0.6.noarch 53/107 2025-05-07T19:42:56.8197624Z Verifying : perl-FileHandle-2.03-477.amzn2023.0.6.noarch 54/107 2025-05-07T19:42:56.8198198Z Verifying : perl-Getopt-Long-1:2.52-2.amzn2023.0.2.noarch 55/107 2025-05-07T19:42:56.8198773Z Verifying : perl-Getopt-Std-1.12-477.amzn2023.0.6.noarch 56/107 2025-05-07T19:42:56.8199316Z Verifying : perl-Git-2.47.1-1.amzn2023.0.2.noarch 57/107 2025-05-07T19:42:56.8199874Z Verifying : perl-HTTP-Tiny-0.078-1.amzn2023.0.3.noarch 58/107 2025-05-07T19:42:56.8200402Z Verifying : perl-IO-1.43-477.amzn2023.0.6.x86_64 59/107 2025-05-07T19:42:56.8200958Z Verifying : perl-IO-Socket-IP-0.41-3.amzn2023.0.2.noarch 60/107 2025-05-07T19:42:56.8201511Z Verifying : perl-IO-Socket-SSL-2.075-1.amzn2023.0.2.noarch 61/107 2025-05-07T19:42:56.8202084Z Verifying : perl-IPC-Open3-1.21-477.amzn2023.0.6.noarch 62/107 2025-05-07T19:42:56.8202653Z Verifying : perl-MIME-Base64-3.16-2.amzn2023.0.2.x86_64 63/107 2025-05-07T19:42:56.8203201Z Verifying : perl-Mozilla-CA-20200520-4.amzn2023.0.2.noarch 64/107 2025-05-07T19:42:56.8203765Z Verifying : perl-NDBM_File-1.15-477.amzn2023.0.6.x86_64 65/107 2025-05-07T19:42:56.8204299Z Verifying : perl-Net-SSLeay-1.94-1.amzn2023.0.1.x86_64 66/107 2025-05-07T19:42:56.8204854Z Verifying : perl-POSIX-1.94-477.amzn2023.0.6.x86_64 67/107 2025-05-07T19:42:56.8205395Z Verifying : perl-PathTools-3.78-459.amzn2023.0.2.x86_64 68/107 2025-05-07T19:42:56.8205964Z Verifying : perl-Pod-Escapes-1:1.07-458.amzn2023.0.2.noarch 69/107 2025-05-07T19:42:56.8206533Z Verifying : perl-Pod-Perldoc-3.28.01-459.amzn2023.0.3.noarch 70/107 2025-05-07T19:42:56.8207082Z Verifying : perl-Pod-Simple-1:3.42-2.amzn2023.0.2.noarch 71/107 2025-05-07T19:42:56.8207722Z Verifying : perl-Pod-Usage-4:2.01-2.amzn2023.0.2.noarch 72/107 2025-05-07T19:42:56.8208256Z Verifying : perl-Scalar-List-Utils-4:1.56-459.amzn2023.0.2.x 73/107 2025-05-07T19:42:56.8208830Z Verifying : perl-SelectSaver-1.02-477.amzn2023.0.6.noarch 74/107 2025-05-07T19:42:56.8209383Z Verifying : perl-Socket-4:2.032-1.amzn2023.0.2.x86_64 75/107 2025-05-07T19:42:56.8209903Z Verifying : perl-Storable-1:3.21-458.amzn2023.0.2.x86_64 76/107 2025-05-07T19:42:56.8210450Z Verifying : perl-Symbol-1.08-477.amzn2023.0.6.noarch 77/107 2025-05-07T19:42:56.8210996Z Verifying : perl-Term-ANSIColor-5.01-459.amzn2023.0.2.noarch 78/107 2025-05-07T19:42:56.8211559Z Verifying : perl-Term-Cap-1.17-458.amzn2023.0.2.noarch 79/107 2025-05-07T19:42:56.8212097Z Verifying : perl-TermReadKey-2.38-9.amzn2023.0.2.x86_64 80/107 2025-05-07T19:42:56.8212676Z Verifying : perl-Text-ParseWords-3.30-458.amzn2023.0.2.noarc 81/107 2025-05-07T19:42:56.8213253Z Verifying : perl-Text-Tabs+Wrap-2021.0726-1.amzn2023.0.1.noa 82/107 2025-05-07T19:42:56.8213796Z Verifying : perl-Time-Local-2:1.300-5.amzn2023.0.2.noarch 83/107 2025-05-07T19:42:56.8214408Z Verifying : perl-URI-5.09-1.amzn2023.0.2.noarch 84/107 2025-05-07T19:42:56.8214931Z Verifying : perl-base-2.27-477.amzn2023.0.6.noarch 85/107 2025-05-07T19:42:56.8215489Z Verifying : perl-constant-1.33-459.amzn2023.0.2.noarch 86/107 2025-05-07T19:42:56.8216045Z Verifying : perl-if-0.60.800-477.amzn2023.0.6.noarch 87/107 2025-05-07T19:42:56.8216572Z Verifying : perl-interpreter-4:5.32.1-477.amzn2023.0.6.x86_6 88/107 2025-05-07T19:42:56.8217116Z Verifying : perl-lib-0.65-477.amzn2023.0.6.x86_64 89/107 2025-05-07T19:42:56.8217639Z Verifying : perl-libnet-3.13-2.amzn2023.0.2.noarch 90/107 2025-05-07T19:42:56.8218209Z Verifying : perl-libs-4:5.32.1-477.amzn2023.0.6.x86_64 91/107 2025-05-07T19:42:56.8218831Z Verifying : perl-mro-1.23-477.amzn2023.0.6.x86_64 92/107 2025-05-07T19:42:56.8219366Z Verifying : perl-overload-1.31-477.amzn2023.0.6.noarch 93/107 2025-05-07T19:42:56.8219923Z Verifying : perl-overloading-0.02-477.amzn2023.0.6.noarch 94/107 2025-05-07T19:42:56.8220458Z Verifying : perl-parent-1:0.238-458.amzn2023.0.2.noarch 95/107 2025-05-07T19:42:56.8220998Z Verifying : perl-podlators-1:4.14-458.amzn2023.0.2.noarch 96/107 2025-05-07T19:42:56.8221520Z Verifying : perl-subs-1.03-477.amzn2023.0.6.noarch 97/107 2025-05-07T19:42:56.8222055Z Verifying : perl-vars-1.05-477.amzn2023.0.6.noarch 98/107 2025-05-07T19:42:56.8222586Z Verifying : shadow-utils-2:4.9-12.amzn2023.0.4.x86_64 99/107 2025-05-07T19:42:56.8223086Z Verifying : sudo-1.9.15-1.p5.amzn2023.0.1.x86_64 100/107 2025-05-07T19:42:56.8223629Z Verifying : sudo-python-plugin-1.9.15-1.p5.amzn2023.0.1.x86_ 101/107 2025-05-07T19:42:56.8224165Z Verifying : systemd-libs-252.23-3.amzn2023.x86_64 102/107 2025-05-07T19:42:56.8224675Z Verifying : tar-2:1.34-1.amzn2023.0.4.x86_64 103/107 2025-05-07T19:42:56.8225168Z Verifying : util-linux-2.37.4-1.amzn2023.0.4.x86_64 104/107 2025-05-07T19:42:56.8225701Z Verifying : util-linux-core-2.37.4-1.amzn2023.0.4.x86_64 105/107 2025-05-07T19:42:56.8226227Z Verifying : wget-1.21.3-1.amzn2023.0.4.x86_64 106/107 2025-05-07T19:42:56.9217737Z Verifying : which-2.21-26.amzn2023.0.2.x86_64 107/107 2025-05-07T19:42:56.9218872Z 2025-05-07T19:42:56.9219117Z Installed: 2025-05-07T19:42:56.9220069Z binutils-2.41-50.amzn2023.0.3.x86_64 2025-05-07T19:42:56.9221137Z cracklib-2.9.6-27.amzn2023.0.2.x86_64 2025-05-07T19:42:56.9221694Z cyrus-sasl-lib-2.1.27-18.amzn2023.0.3.x86_64 2025-05-07T19:42:56.9222286Z elfutils-debuginfod-client-0.188-3.amzn2023.0.2.x86_64 2025-05-07T19:42:56.9222873Z findutils-1:4.8.0-2.amzn2023.0.2.x86_64 2025-05-07T19:42:56.9223380Z git-2.47.1-1.amzn2023.0.2.x86_64 2025-05-07T19:42:56.9223898Z git-core-2.47.1-1.amzn2023.0.2.x86_64 2025-05-07T19:42:56.9224528Z git-core-doc-2.47.1-1.amzn2023.0.2.noarch 2025-05-07T19:42:56.9225051Z gnutls-3.8.3-6.amzn2023.0.1.x86_64 2025-05-07T19:42:56.9225571Z groff-base-1.22.4-7.amzn2023.0.2.x86_64 2025-05-07T19:42:56.9226188Z gzip-1.12-1.amzn2023.0.1.x86_64 2025-05-07T19:42:56.9226678Z hwdata-0.384-1.amzn2023.0.3.noarch 2025-05-07T19:42:56.9228361Z jansson-2.14-0.amzn2023.x86_64 2025-05-07T19:42:56.9228878Z kmod-libs-29-2.amzn2023.0.5.x86_64 2025-05-07T19:42:56.9229368Z less-608-2.amzn2023.0.2.x86_64 2025-05-07T19:42:56.9229834Z libcbor-0.7.0-3.amzn2023.0.2.x86_64 2025-05-07T19:42:56.9230325Z libdb-5.3.28-49.amzn2023.0.2.x86_64 2025-05-07T19:42:56.9230799Z libeconf-0.4.0-1.amzn2023.0.3.x86_64 2025-05-07T19:42:56.9231313Z libedit-3.1-38.20210714cvs.amzn2023.0.2.x86_64 2025-05-07T19:42:56.9232139Z libfdisk-2.37.4-1.amzn2023.0.4.x86_64 2025-05-07T19:42:56.9232693Z libfido2-1.10.0-2.amzn2023.0.2.x86_64 2025-05-07T19:42:56.9233247Z libmetalink-0.1.3-14.amzn2023.0.2.x86_64 2025-05-07T19:42:56.9233804Z libpwquality-1.4.4-6.amzn2023.0.2.x86_64 2025-05-07T19:42:56.9234376Z libsemanage-3.4-5.amzn2023.0.2.x86_64 2025-05-07T19:42:56.9234926Z libutempter-1.2.1-4.amzn2023.0.2.x86_64 2025-05-07T19:42:56.9235454Z nano-8.3-1.amzn2023.x86_64 2025-05-07T19:42:56.9235985Z nano-default-editor-8.3-1.amzn2023.noarch 2025-05-07T19:42:56.9236562Z ncurses-6.2-4.20200222.amzn2023.0.6.x86_64 2025-05-07T19:42:56.9237105Z nettle-3.10.1-1.amzn2023.0.1.x86_64 2025-05-07T19:42:56.9237626Z openldap-2.4.57-6.amzn2023.0.7.x86_64 2025-05-07T19:42:56.9238276Z openssh-8.7p1-8.amzn2023.0.14.x86_64 2025-05-07T19:42:56.9238793Z openssh-clients-8.7p1-8.amzn2023.0.14.x86_64 2025-05-07T19:42:56.9239300Z pam-1.5.1-8.amzn2023.0.4.x86_64 2025-05-07T19:42:56.9239783Z pciutils-3.7.0-3.amzn2023.0.2.x86_64 2025-05-07T19:42:56.9240288Z pciutils-libs-3.7.0-3.amzn2023.0.2.x86_64 2025-05-07T19:42:56.9240849Z perl-AutoLoader-5.74-477.amzn2023.0.6.noarch 2025-05-07T19:42:56.9241369Z perl-B-1.80-477.amzn2023.0.6.x86_64 2025-05-07T19:42:56.9241891Z perl-Carp-1.50-458.amzn2023.0.2.noarch 2025-05-07T19:42:56.9242519Z perl-Class-Struct-0.66-477.amzn2023.0.6.noarch 2025-05-07T19:42:56.9243088Z perl-Data-Dumper-2.174-460.amzn2023.0.2.x86_64 2025-05-07T19:42:56.9243650Z perl-Digest-1.20-1.amzn2023.0.2.noarch 2025-05-07T19:42:56.9244175Z perl-Digest-MD5-2.58-2.amzn2023.0.2.x86_64 2025-05-07T19:42:56.9244733Z perl-DynaLoader-1.47-477.amzn2023.0.6.x86_64 2025-05-07T19:42:56.9245258Z perl-Encode-4:3.15-462.amzn2023.0.2.x86_64 2025-05-07T19:42:56.9245790Z perl-Errno-1.30-477.amzn2023.0.6.x86_64 2025-05-07T19:42:56.9246333Z perl-Error-1:0.17029-5.amzn2023.0.2.noarch 2025-05-07T19:42:56.9246862Z perl-Exporter-5.74-459.amzn2023.0.2.noarch 2025-05-07T19:42:56.9247401Z perl-Fcntl-1.13-477.amzn2023.0.6.x86_64 2025-05-07T19:42:56.9247925Z perl-File-Basename-2.85-477.amzn2023.0.6.noarch 2025-05-07T19:42:56.9248553Z perl-File-Find-1.37-477.amzn2023.0.6.noarch 2025-05-07T19:42:56.9249076Z perl-File-Path-2.18-2.amzn2023.0.2.noarch 2025-05-07T19:42:56.9249604Z perl-File-Temp-1:0.231.100-2.amzn2023.0.2.noarch 2025-05-07T19:42:56.9250135Z perl-File-stat-1.09-477.amzn2023.0.6.noarch 2025-05-07T19:42:56.9250663Z perl-FileHandle-2.03-477.amzn2023.0.6.noarch 2025-05-07T19:42:56.9251205Z perl-Getopt-Long-1:2.52-2.amzn2023.0.2.noarch 2025-05-07T19:42:56.9251724Z perl-Getopt-Std-1.12-477.amzn2023.0.6.noarch 2025-05-07T19:42:56.9252253Z perl-Git-2.47.1-1.amzn2023.0.2.noarch 2025-05-07T19:42:56.9252765Z perl-HTTP-Tiny-0.078-1.amzn2023.0.3.noarch 2025-05-07T19:42:56.9253288Z perl-IO-1.43-477.amzn2023.0.6.x86_64 2025-05-07T19:42:56.9253837Z perl-IO-Socket-IP-0.41-3.amzn2023.0.2.noarch 2025-05-07T19:42:56.9254380Z perl-IO-Socket-SSL-2.075-1.amzn2023.0.2.noarch 2025-05-07T19:42:56.9254946Z perl-IPC-Open3-1.21-477.amzn2023.0.6.noarch 2025-05-07T19:42:56.9255488Z perl-MIME-Base64-3.16-2.amzn2023.0.2.x86_64 2025-05-07T19:42:56.9256065Z perl-Mozilla-CA-20200520-4.amzn2023.0.2.noarch 2025-05-07T19:42:56.9256630Z perl-NDBM_File-1.15-477.amzn2023.0.6.x86_64 2025-05-07T19:42:56.9257154Z perl-Net-SSLeay-1.94-1.amzn2023.0.1.x86_64 2025-05-07T19:42:56.9257704Z perl-POSIX-1.94-477.amzn2023.0.6.x86_64 2025-05-07T19:42:56.9258231Z perl-PathTools-3.78-459.amzn2023.0.2.x86_64 2025-05-07T19:42:56.9258792Z perl-Pod-Escapes-1:1.07-458.amzn2023.0.2.noarch 2025-05-07T19:42:56.9259334Z perl-Pod-Perldoc-3.28.01-459.amzn2023.0.3.noarch 2025-05-07T19:42:56.9259892Z perl-Pod-Simple-1:3.42-2.amzn2023.0.2.noarch 2025-05-07T19:42:56.9260430Z perl-Pod-Usage-4:2.01-2.amzn2023.0.2.noarch 2025-05-07T19:42:56.9260962Z perl-Scalar-List-Utils-4:1.56-459.amzn2023.0.2.x86_64 2025-05-07T19:42:56.9261532Z perl-SelectSaver-1.02-477.amzn2023.0.6.noarch 2025-05-07T19:42:56.9262067Z perl-Socket-4:2.032-1.amzn2023.0.2.x86_64 2025-05-07T19:42:56.9262680Z perl-Storable-1:3.21-458.amzn2023.0.2.x86_64 2025-05-07T19:42:56.9263226Z perl-Symbol-1.08-477.amzn2023.0.6.noarch 2025-05-07T19:42:56.9263775Z perl-Term-ANSIColor-5.01-459.amzn2023.0.2.noarch 2025-05-07T19:42:56.9264340Z perl-Term-Cap-1.17-458.amzn2023.0.2.noarch 2025-05-07T19:42:56.9265291Z perl-TermReadKey-2.38-9.amzn2023.0.2.x86_64 2025-05-07T19:42:56.9265902Z perl-Text-ParseWords-3.30-458.amzn2023.0.2.noarch 2025-05-07T19:42:56.9266500Z perl-Text-Tabs+Wrap-2021.0726-1.amzn2023.0.1.noarch 2025-05-07T19:42:56.9267092Z perl-Time-Local-2:1.300-5.amzn2023.0.2.noarch 2025-05-07T19:42:56.9267647Z perl-URI-5.09-1.amzn2023.0.2.noarch 2025-05-07T19:42:56.9268188Z perl-base-2.27-477.amzn2023.0.6.noarch 2025-05-07T19:42:56.9268758Z perl-constant-1.33-459.amzn2023.0.2.noarch 2025-05-07T19:42:56.9269317Z perl-if-0.60.800-477.amzn2023.0.6.noarch 2025-05-07T19:42:56.9270086Z perl-interpreter-4:5.32.1-477.amzn2023.0.6.x86_64 2025-05-07T19:42:56.9270632Z perl-lib-0.65-477.amzn2023.0.6.x86_64 2025-05-07T19:42:56.9271186Z perl-libnet-3.13-2.amzn2023.0.2.noarch 2025-05-07T19:42:56.9271856Z perl-libs-4:5.32.1-477.amzn2023.0.6.x86_64 2025-05-07T19:42:56.9272401Z perl-mro-1.23-477.amzn2023.0.6.x86_64 2025-05-07T19:42:56.9272962Z perl-overload-1.31-477.amzn2023.0.6.noarch 2025-05-07T19:42:56.9273537Z perl-overloading-0.02-477.amzn2023.0.6.noarch 2025-05-07T19:42:56.9274127Z perl-parent-1:0.238-458.amzn2023.0.2.noarch 2025-05-07T19:42:56.9274692Z perl-podlators-1:4.14-458.amzn2023.0.2.noarch 2025-05-07T19:42:56.9275247Z perl-subs-1.03-477.amzn2023.0.6.noarch 2025-05-07T19:42:56.9275809Z perl-vars-1.05-477.amzn2023.0.6.noarch 2025-05-07T19:42:56.9276354Z shadow-utils-2:4.9-12.amzn2023.0.4.x86_64 2025-05-07T19:42:56.9276899Z sudo-1.9.15-1.p5.amzn2023.0.1.x86_64 2025-05-07T19:42:56.9277438Z sudo-python-plugin-1.9.15-1.p5.amzn2023.0.1.x86_64 2025-05-07T19:42:56.9278115Z systemd-libs-252.23-3.amzn2023.x86_64 2025-05-07T19:42:56.9278605Z tar-2:1.34-1.amzn2023.0.4.x86_64 2025-05-07T19:42:56.9279075Z util-linux-2.37.4-1.amzn2023.0.4.x86_64 2025-05-07T19:42:56.9279736Z util-linux-core-2.37.4-1.amzn2023.0.4.x86_64 2025-05-07T19:42:56.9280220Z wget-1.21.3-1.amzn2023.0.4.x86_64 2025-05-07T19:42:56.9280696Z which-2.21-26.amzn2023.0.2.x86_64 2025-05-07T19:42:56.9280985Z 2025-05-07T19:42:56.9281091Z Complete! 2025-05-07T19:42:56.9965706Z ##[group]Run actions/checkout@v4 2025-05-07T19:42:56.9966116Z with: 2025-05-07T19:42:56.9966360Z submodules: true 2025-05-07T19:42:56.9966666Z repository: pytorch/FBGEMM 2025-05-07T19:42:56.9967224Z token: *** 2025-05-07T19:42:56.9967464Z ssh-strict: true 2025-05-07T19:42:56.9967743Z ssh-user: git 2025-05-07T19:42:56.9968007Z persist-credentials: true 2025-05-07T19:42:56.9968336Z clean: true 2025-05-07T19:42:56.9968602Z sparse-checkout-cone-mode: true 2025-05-07T19:42:56.9969160Z fetch-depth: 1 2025-05-07T19:42:56.9969411Z fetch-tags: false 2025-05-07T19:42:56.9969703Z show-progress: true 2025-05-07T19:42:56.9969967Z lfs: false 2025-05-07T19:42:56.9970245Z set-safe-directory: true 2025-05-07T19:42:56.9970525Z env: 2025-05-07T19:42:56.9970797Z PRELUDE: .github/scripts/setup_env.bash 2025-05-07T19:42:56.9971161Z BUILD_ENV: build_binary 2025-05-07T19:42:56.9971430Z BUILD_TARGET: default 2025-05-07T19:42:56.9971768Z BUILD_VARIANT: cuda 2025-05-07T19:42:56.9972108Z BUILD_CUDA_VERSION: 12.6.3 2025-05-07T19:42:56.9972414Z ##[endgroup] 2025-05-07T19:42:57.0016295Z ##[command]/usr/bin/docker exec 2c96c3f709dd7ccfbab62bd0bb03d0fe2bfdbbcabb6920f028364b7a349d9b1f sh -c "cat /etc/*release | grep ^ID" 2025-05-07T19:42:57.2855781Z Syncing repository: pytorch/FBGEMM 2025-05-07T19:42:57.2857157Z ##[group]Getting Git version info 2025-05-07T19:42:57.2857527Z Working directory is '/__w/FBGEMM/FBGEMM' 2025-05-07T19:42:57.2858070Z [command]/usr/bin/git version 2025-05-07T19:42:57.2858380Z git version 2.47.1 2025-05-07T19:42:57.2859339Z ##[endgroup] 2025-05-07T19:42:57.2863622Z Temporarily overriding HOME='/__w/_temp/65996e45-506e-495c-87ad-92ea900fa516' before making global git config changes 2025-05-07T19:42:57.2864437Z Adding repository directory to the temporary git global config as a safe directory 2025-05-07T19:42:57.2867932Z [command]/usr/bin/git config --global --add safe.directory /__w/FBGEMM/FBGEMM 2025-05-07T19:42:57.2900461Z [command]/usr/bin/git config --local --get remote.origin.url 2025-05-07T19:42:57.2917535Z https://github.com/pytorch/FBGEMM 2025-05-07T19:42:57.2932588Z ##[group]Removing previously created refs, to avoid conflicts 2025-05-07T19:42:57.2935798Z [command]/usr/bin/git rev-parse --symbolic-full-name --verify --quiet HEAD 2025-05-07T19:42:57.2954317Z HEAD 2025-05-07T19:42:57.2990064Z ##[endgroup] 2025-05-07T19:42:57.2990822Z [command]/usr/bin/git submodule status 2025-05-07T19:42:57.3358362Z e5d7c0bd5d9aec44d68830187138149e6a8c4e32 external/asmjit (e5d7c0b) 2025-05-07T19:42:57.3434183Z 4a61bdd4bd4ed730e078aebc7c0fcf046ff29406 external/composable_kernel (remotes/origin/FBGEMM) 2025-05-07T19:42:57.3538126Z 6543fec09b2f04ac4a666882998b534afc9c1349 external/cpuinfo (6543fec) 2025-05-07T19:42:57.3611102Z 3ed8d2ec4ba35ef5d9d8353826209b6f868f63d3 external/cutlass (remotes/origin/FBGEMM) 2025-05-07T19:42:57.3835213Z f8d7d77c06936315286eb55f8de22cd23c188571 external/googletest (release-1.8.0-3335-gf8d7d77c) 2025-05-07T19:42:57.3916088Z 420084499c7c1e1c2d801922f40df202eac5f3a0 external/hipify_torch (remotes/origin/mmelesse-9-g4200844) 2025-05-07T19:42:57.3949864Z 9cca280a4d0ccf0c08f47a99aa71d1b0e52f8d03 external/json (v3.11.2-84-g9cca280a) 2025-05-07T19:42:57.3966965Z ##[group]Cleaning the repository 2025-05-07T19:42:57.3968018Z [command]/usr/bin/git clean -ffdx 2025-05-07T19:42:57.4825678Z Removing amdgpu-install_6.2.60204-1_all.deb 2025-05-07T19:42:57.4826801Z Removing collect_env.py 2025-05-07T19:42:57.4827570Z Removing fbgemm_gpu/_skbuild/ 2025-05-07T19:42:57.4828674Z Removing fbgemm_gpu/bench/verify_fp16_stochastic_benchmark.hip 2025-05-07T19:42:57.4829994Z Removing fbgemm_gpu/codegen/genscript/__pycache__/ 2025-05-07T19:42:57.4831834Z Removing fbgemm_gpu/codegen/inference/embedding_forward_quantized_cpu_template_hip.cpp 2025-05-07T19:42:57.4833712Z Removing fbgemm_gpu/codegen/inference/embedding_forward_quantized_host_cpu_hip.cpp 2025-05-07T19:42:57.4834376Z Removing fbgemm_gpu/codegen/inference/embedding_forward_quantized_host_hip.cpp 2025-05-07T19:42:57.4835338Z Removing fbgemm_gpu/codegen/inference/embedding_forward_quantized_split_lookup.hip 2025-05-07T19:42:57.4836090Z Removing fbgemm_gpu/codegen/inference/embedding_forward_quantized_split_nbit_host_template.hip 2025-05-07T19:42:57.4836883Z Removing fbgemm_gpu/codegen/inference/embedding_forward_quantized_split_nbit_kernel_template.hip 2025-05-07T19:42:57.4837667Z Removing fbgemm_gpu/codegen/training/backward/embedding_backward_dense_host_cpu_hip.cpp 2025-05-07T19:42:57.4838752Z Removing fbgemm_gpu/codegen/training/backward/embedding_backward_split_cpu_approx_template_hip.cpp 2025-05-07T19:42:57.4839707Z Removing fbgemm_gpu/codegen/training/backward/embedding_backward_split_cpu_template_hip.cpp 2025-05-07T19:42:57.4840523Z Removing fbgemm_gpu/codegen/training/backward/embedding_backward_split_device_kernel_template_hip.cuh 2025-05-07T19:42:57.4841304Z Removing fbgemm_gpu/codegen/training/backward/embedding_backward_split_grad_template.hip 2025-05-07T19:42:57.4842087Z Removing fbgemm_gpu/codegen/training/backward/embedding_backward_split_host_cpu_template_hip.cpp 2025-05-07T19:42:57.4842884Z Removing fbgemm_gpu/codegen/training/backward/embedding_backward_split_host_template_hip.cpp 2025-05-07T19:42:57.4843783Z Removing fbgemm_gpu/codegen/training/backward/embedding_backward_split_indice_weights_template.hip 2025-05-07T19:42:57.4844578Z Removing fbgemm_gpu/codegen/training/backward/embedding_backward_split_kernel_cta_template.hip 2025-05-07T19:42:57.4845434Z Removing fbgemm_gpu/codegen/training/backward/embedding_backward_split_kernel_warp_template.hip 2025-05-07T19:42:57.4846176Z Removing fbgemm_gpu/codegen/training/backward/embedding_backward_split_meta_template_hip.cpp 2025-05-07T19:42:57.4846856Z Removing fbgemm_gpu/codegen/training/backward/embedding_backward_split_template.hip 2025-05-07T19:42:57.4847478Z Removing fbgemm_gpu/codegen/training/forward/embedding_forward_split_cpu_hip.cpp 2025-05-07T19:42:57.4848187Z Removing fbgemm_gpu/codegen/training/forward/embedding_forward_split_kernel_nobag_small_template.hip 2025-05-07T19:42:57.4849101Z Removing fbgemm_gpu/codegen/training/forward/embedding_forward_split_kernel_template.hip 2025-05-07T19:42:57.4850084Z Removing fbgemm_gpu/codegen/training/forward/embedding_forward_split_kernel_v2_template.hip 2025-05-07T19:42:57.4850796Z Removing fbgemm_gpu/codegen/training/forward/embedding_forward_split_template.hip 2025-05-07T19:42:57.4851507Z Removing fbgemm_gpu/codegen/training/index_select/batch_index_select_dim0_cpu_host_hip.cpp 2025-05-07T19:42:57.4852225Z Removing fbgemm_gpu/codegen/training/index_select/batch_index_select_dim0_ops_hip.cpp 2025-05-07T19:42:57.4853004Z Removing fbgemm_gpu/codegen/training/optimizer/embedding_optimizer_split_device_kernel_template_hip.cuh 2025-05-07T19:42:57.4853854Z Removing fbgemm_gpu/codegen/training/optimizer/embedding_optimizer_split_host_template_hip.cpp 2025-05-07T19:42:57.4854631Z Removing fbgemm_gpu/codegen/training/optimizer/embedding_optimizer_split_kernel_template.hip 2025-05-07T19:42:57.4855491Z Removing fbgemm_gpu/codegen/training/optimizer/embedding_optimizer_split_template.hip 2025-05-07T19:42:57.4856313Z Removing fbgemm_gpu/codegen/training/pt2/embedding_split_host_pt2_autograd_template_hip.cpp 2025-05-07T19:42:57.4857017Z Removing fbgemm_gpu/codegen/training/pt2/embedding_split_host_pt2_cpu_wrapper_template_hip.cpp 2025-05-07T19:42:57.4857729Z Removing fbgemm_gpu/codegen/training/pt2/embedding_split_host_pt2_hip_wrapper_template.cpp 2025-05-07T19:42:57.4858343Z Removing fbgemm_gpu/codegen/utils/embedding_bounds_check_host_cpu_hip.cpp 2025-05-07T19:42:57.4858887Z Removing fbgemm_gpu/codegen/utils/embedding_bounds_check_host_hip.cpp 2025-05-07T19:42:57.4859384Z Removing fbgemm_gpu/codegen/utils/embedding_bounds_check_v1.hip 2025-05-07T19:42:57.4859841Z Removing fbgemm_gpu/codegen/utils/embedding_bounds_check_v2.hip 2025-05-07T19:42:57.4860224Z Removing fbgemm_gpu/dist/ 2025-05-07T19:42:57.4860569Z Removing fbgemm_gpu/experimental/example/src/cutlass_sgemm_nn.hip 2025-05-07T19:42:57.4861183Z Removing fbgemm_gpu/experimental/example/src/example_nccl_hip.cpp 2025-05-07T19:42:57.4861703Z Removing fbgemm_gpu/experimental/gen_ai/src/attention/gqa_attn_splitk.hip 2025-05-07T19:42:57.4862232Z Removing fbgemm_gpu/experimental/gen_ai/src/coalesce/coalesce.hip 2025-05-07T19:42:57.4862694Z Removing fbgemm_gpu/experimental/gen_ai/src/comm/car.hip 2025-05-07T19:42:57.4863112Z Removing fbgemm_gpu/experimental/gen_ai/src/comm/car_hip.cpp 2025-05-07T19:42:57.4896073Z Removing fbgemm_gpu/experimental/gen_ai/src/gather_scatter/gather_scatter.hip 2025-05-07T19:42:57.4896935Z Removing fbgemm_gpu/experimental/gen_ai/src/kv_cache/kv_cache.hip 2025-05-07T19:42:57.4897501Z Removing fbgemm_gpu/experimental/gen_ai/src/kv_cache/kv_cache_hip.cpp 2025-05-07T19:42:57.4898315Z Removing fbgemm_gpu/experimental/gen_ai/src/moe/index_shuffling.hip 2025-05-07T19:42:57.4899068Z Removing fbgemm_gpu/experimental/gen_ai/src/quantize/ck_extensions/bf16_grouped/kernels/bf16_grouped_common_hip.h 2025-05-07T19:42:57.4900000Z Removing fbgemm_gpu/experimental/gen_ai/src/quantize/ck_extensions/fp8_rowwise/kernels/fp8_rowwise_common_hip.h 2025-05-07T19:42:57.4901004Z Removing fbgemm_gpu/experimental/gen_ai/src/quantize/ck_extensions/fp8_rowwise_batched/kernels/fp8_rowwise_batched_common_hip.h 2025-05-07T19:42:57.4902045Z Removing fbgemm_gpu/experimental/gen_ai/src/quantize/ck_extensions/fp8_rowwise_grouped/kernels/fp8_rowwise_grouped_common_hip.h 2025-05-07T19:42:57.4902972Z Removing fbgemm_gpu/experimental/gen_ai/src/quantize/ck_extensions/fused_moe/fused_moe_op_hip.cpp 2025-05-07T19:42:57.4903664Z Removing fbgemm_gpu/experimental/gen_ai/src/quantize/cublas_utils_hip.h 2025-05-07T19:42:57.4904338Z Removing fbgemm_gpu/experimental/gen_ai/src/quantize/cutlass_extensions/bf16bf16bf16_grouped.hip 2025-05-07T19:42:57.4905085Z Removing fbgemm_gpu/experimental/gen_ai/src/quantize/cutlass_extensions/bf16i4bf16.hip 2025-05-07T19:42:57.4905848Z Removing fbgemm_gpu/experimental/gen_ai/src/quantize/cutlass_extensions/bf16i4bf16_rowwise_batched.hip 2025-05-07T19:42:57.4906702Z Removing fbgemm_gpu/experimental/gen_ai/src/quantize/cutlass_extensions/bf16i4bf16_shuffled_grouped.hip 2025-05-07T19:42:57.4907476Z Removing fbgemm_gpu/experimental/gen_ai/src/quantize/cutlass_extensions/f4f4bf16.hip 2025-05-07T19:42:57.4908357Z Removing fbgemm_gpu/experimental/gen_ai/src/quantize/cutlass_extensions/f4f4bf16/f4f4bf16_128_128_4_1_1_f.hip 2025-05-07T19:42:57.4909221Z Removing fbgemm_gpu/experimental/gen_ai/src/quantize/cutlass_extensions/f4f4bf16/f4f4bf16_128_128_4_1_1_t.hip 2025-05-07T19:42:57.4910072Z Removing fbgemm_gpu/experimental/gen_ai/src/quantize/cutlass_extensions/f4f4bf16/f4f4bf16_128_192_2_2_1_f.hip 2025-05-07T19:42:57.4910937Z Removing fbgemm_gpu/experimental/gen_ai/src/quantize/cutlass_extensions/f4f4bf16/f4f4bf16_128_192_2_2_1_t.hip 2025-05-07T19:42:57.4912083Z Removing fbgemm_gpu/experimental/gen_ai/src/quantize/cutlass_extensions/f4f4bf16/f4f4bf16_128_256_2_1_1_f.hip 2025-05-07T19:42:57.4912963Z Removing fbgemm_gpu/experimental/gen_ai/src/quantize/cutlass_extensions/f4f4bf16/f4f4bf16_128_256_2_1_1_t.hip 2025-05-07T19:42:57.4913849Z Removing fbgemm_gpu/experimental/gen_ai/src/quantize/cutlass_extensions/f4f4bf16/f4f4bf16_256_128_2_2_1_f.hip 2025-05-07T19:42:57.4914718Z Removing fbgemm_gpu/experimental/gen_ai/src/quantize/cutlass_extensions/f4f4bf16/f4f4bf16_256_128_2_2_1_t.hip 2025-05-07T19:42:57.4915601Z Removing fbgemm_gpu/experimental/gen_ai/src/quantize/cutlass_extensions/f4f4bf16/f4f4bf16_256_128_2_4_1_f.hip 2025-05-07T19:42:57.4916486Z Removing fbgemm_gpu/experimental/gen_ai/src/quantize/cutlass_extensions/f4f4bf16/f4f4bf16_256_128_2_4_1_t.hip 2025-05-07T19:42:57.4917366Z Removing fbgemm_gpu/experimental/gen_ai/src/quantize/cutlass_extensions/f4f4bf16/f4f4bf16_256_192_2_2_1_f.hip 2025-05-07T19:42:57.4918249Z Removing fbgemm_gpu/experimental/gen_ai/src/quantize/cutlass_extensions/f4f4bf16/f4f4bf16_256_192_2_2_1_t.hip 2025-05-07T19:42:57.4919117Z Removing fbgemm_gpu/experimental/gen_ai/src/quantize/cutlass_extensions/f4f4bf16/f4f4bf16_256_192_2_4_1_f.hip 2025-05-07T19:42:57.4920121Z Removing fbgemm_gpu/experimental/gen_ai/src/quantize/cutlass_extensions/f4f4bf16/f4f4bf16_256_192_2_4_1_t.hip 2025-05-07T19:42:57.4921001Z Removing fbgemm_gpu/experimental/gen_ai/src/quantize/cutlass_extensions/f4f4bf16/f4f4bf16_256_192_4_1_1_f.hip 2025-05-07T19:42:57.4921869Z Removing fbgemm_gpu/experimental/gen_ai/src/quantize/cutlass_extensions/f4f4bf16/f4f4bf16_256_192_4_1_1_t.hip 2025-05-07T19:42:57.4922750Z Removing fbgemm_gpu/experimental/gen_ai/src/quantize/cutlass_extensions/f4f4bf16/f4f4bf16_256_256_2_1_1_f.hip 2025-05-07T19:42:57.4923675Z Removing fbgemm_gpu/experimental/gen_ai/src/quantize/cutlass_extensions/f4f4bf16/f4f4bf16_256_256_2_1_1_t.hip 2025-05-07T19:42:57.4924624Z Removing fbgemm_gpu/experimental/gen_ai/src/quantize/cutlass_extensions/f4f4bf16/f4f4bf16_256_256_2_2_1_f.hip 2025-05-07T19:42:57.4925439Z Removing fbgemm_gpu/experimental/gen_ai/src/quantize/cutlass_extensions/f4f4bf16/f4f4bf16_256_256_2_2_1_t.hip 2025-05-07T19:42:57.4926241Z Removing fbgemm_gpu/experimental/gen_ai/src/quantize/cutlass_extensions/f4f4bf16/f4f4bf16_256_256_2_4_1_f.hip 2025-05-07T19:42:57.4927056Z Removing fbgemm_gpu/experimental/gen_ai/src/quantize/cutlass_extensions/f4f4bf16/f4f4bf16_256_256_2_4_1_t.hip 2025-05-07T19:42:57.4927860Z Removing fbgemm_gpu/experimental/gen_ai/src/quantize/cutlass_extensions/f4f4bf16/f4f4bf16_256_256_4_1_1_f.hip 2025-05-07T19:42:57.4928676Z Removing fbgemm_gpu/experimental/gen_ai/src/quantize/cutlass_extensions/f4f4bf16/f4f4bf16_256_256_4_1_1_t.hip 2025-05-07T19:42:57.4929488Z Removing fbgemm_gpu/experimental/gen_ai/src/quantize/cutlass_extensions/f4f4bf16/f4f4bf16_common_hip.cuh 2025-05-07T19:42:57.4930274Z Removing fbgemm_gpu/experimental/gen_ai/src/quantize/cutlass_extensions/f4f4bf16/f4f4bf16_manifest_hip.cuh 2025-05-07T19:42:57.4930997Z Removing fbgemm_gpu/experimental/gen_ai/src/quantize/cutlass_extensions/f8f8bf16.hip 2025-05-07T19:42:57.4931670Z Removing fbgemm_gpu/experimental/gen_ai/src/quantize/cutlass_extensions/f8f8bf16_blockwise.hip 2025-05-07T19:42:57.4932379Z Removing fbgemm_gpu/experimental/gen_ai/src/quantize/cutlass_extensions/f8f8bf16_cublas.hip 2025-05-07T19:42:57.4933060Z Removing fbgemm_gpu/experimental/gen_ai/src/quantize/cutlass_extensions/f8f8bf16_lite.hip 2025-05-07T19:42:57.4933729Z Removing fbgemm_gpu/experimental/gen_ai/src/quantize/cutlass_extensions/f8f8bf16_rowwise.hip 2025-05-07T19:42:57.4934584Z Removing fbgemm_gpu/experimental/gen_ai/src/quantize/cutlass_extensions/f8f8bf16_rowwise/f8f8bf16_rowwise_128_128_128_2_1_1_t_f.hip 2025-05-07T19:42:57.4935573Z Removing fbgemm_gpu/experimental/gen_ai/src/quantize/cutlass_extensions/f8f8bf16_rowwise/f8f8bf16_rowwise_128_256_128_2_1_1_f_t.hip 2025-05-07T19:42:57.4936570Z Removing fbgemm_gpu/experimental/gen_ai/src/quantize/cutlass_extensions/f8f8bf16_rowwise/f8f8bf16_rowwise_128_256_128_4_4_1_f_t.hip 2025-05-07T19:42:57.4937562Z Removing fbgemm_gpu/experimental/gen_ai/src/quantize/cutlass_extensions/f8f8bf16_rowwise/f8f8bf16_rowwise_64_128_128_1_1_1_f_f.hip 2025-05-07T19:42:57.4938541Z Removing fbgemm_gpu/experimental/gen_ai/src/quantize/cutlass_extensions/f8f8bf16_rowwise/f8f8bf16_rowwise_64_16_128_1_1_1_f_f.hip 2025-05-07T19:42:57.4939523Z Removing fbgemm_gpu/experimental/gen_ai/src/quantize/cutlass_extensions/f8f8bf16_rowwise/f8f8bf16_rowwise_64_256_128_1_1_1_f_f.hip 2025-05-07T19:42:57.4940506Z Removing fbgemm_gpu/experimental/gen_ai/src/quantize/cutlass_extensions/f8f8bf16_rowwise/f8f8bf16_rowwise_64_256_128_2_1_1_f_f.hip 2025-05-07T19:42:57.4941476Z Removing fbgemm_gpu/experimental/gen_ai/src/quantize/cutlass_extensions/f8f8bf16_rowwise/f8f8bf16_rowwise_64_32_128_2_1_1_f_f.hip 2025-05-07T19:42:57.4942460Z Removing fbgemm_gpu/experimental/gen_ai/src/quantize/cutlass_extensions/f8f8bf16_rowwise/f8f8bf16_rowwise_64_64_128_2_1_1_f_f.hip 2025-05-07T19:42:57.4943403Z Removing fbgemm_gpu/experimental/gen_ai/src/quantize/cutlass_extensions/f8f8bf16_rowwise/f8f8bf16_rowwise_common_hip.cuh 2025-05-07T19:42:57.4944346Z Removing fbgemm_gpu/experimental/gen_ai/src/quantize/cutlass_extensions/f8f8bf16_rowwise_batched/common_hip.cuh 2025-05-07T19:42:57.4945435Z Removing fbgemm_gpu/experimental/gen_ai/src/quantize/cutlass_extensions/f8f8bf16_rowwise_batched/dispatch_fp8_rowwise_batched_kernel_on_cluster_size_and_transpose.hip 2025-05-07T19:42:57.4946662Z Removing fbgemm_gpu/experimental/gen_ai/src/quantize/cutlass_extensions/f8f8bf16_rowwise_batched/dispatch_fp8_rowwise_batched_kernel_on_tile_size.hip 2025-05-07T19:42:57.4947736Z Removing fbgemm_gpu/experimental/gen_ai/src/quantize/cutlass_extensions/f8f8bf16_rowwise_batched/f8f8bf16_rowwise_batched.hip 2025-05-07T19:42:57.4948788Z Removing fbgemm_gpu/experimental/gen_ai/src/quantize/cutlass_extensions/f8f8bf16_rowwise_batched/f8f8bf16_rowwise_batched_impl.hip 2025-05-07T19:42:57.4949758Z Removing fbgemm_gpu/experimental/gen_ai/src/quantize/cutlass_extensions/f8f8bf16_rowwise_batched/handle_transposition.hip 2025-05-07T19:42:57.4950615Z Removing fbgemm_gpu/experimental/gen_ai/src/quantize/cutlass_extensions/f8f8bf16_rowwise_grouped.hip 2025-05-07T19:42:57.4951434Z Removing fbgemm_gpu/experimental/gen_ai/src/quantize/cutlass_extensions/f8f8bf16_tensorwise.hip 2025-05-07T19:42:57.4952354Z Removing fbgemm_gpu/experimental/gen_ai/src/quantize/cutlass_extensions/f8i4bf16_rowwise.hip 2025-05-07T19:42:57.4953187Z Removing fbgemm_gpu/experimental/gen_ai/src/quantize/cutlass_extensions/f8i4bf16_shuffled.hip 2025-05-07T19:42:57.4953979Z Removing fbgemm_gpu/experimental/gen_ai/src/quantize/cutlass_extensions/f8i4bf16_shuffled_grouped.hip 2025-05-07T19:42:57.4954738Z Removing fbgemm_gpu/experimental/gen_ai/src/quantize/cutlass_extensions/i8i8bf16.hip 2025-05-07T19:42:57.4955453Z Removing fbgemm_gpu/experimental/gen_ai/src/quantize/cutlass_extensions/i8i8bf16_dynamic.hip 2025-05-07T19:42:57.4956315Z Removing fbgemm_gpu/experimental/gen_ai/src/quantize/cutlass_extensions/include/fp8_blockwise_cutlass_helpers_hip.h 2025-05-07T19:42:57.4957188Z Removing fbgemm_gpu/experimental/gen_ai/src/quantize/cutlass_extensions/mixed_dtype_utils.hip 2025-05-07T19:42:57.4957892Z Removing fbgemm_gpu/experimental/gen_ai/src/quantize/fast_gemv/bf16_fast_gemv.hip 2025-05-07T19:42:57.4958761Z Removing fbgemm_gpu/experimental/gen_ai/src/quantize/fast_gemv/bf16fp8bf16_fast_gemv.hip 2025-05-07T19:42:57.4959459Z Removing fbgemm_gpu/experimental/gen_ai/src/quantize/fast_gemv/fp8fp8bf16_fast_gemv.hip 2025-05-07T19:42:57.4960153Z Removing fbgemm_gpu/experimental/gen_ai/src/quantize/fast_gemv/include/fast_gemv.hip 2025-05-07T19:42:57.4960850Z Removing fbgemm_gpu/experimental/gen_ai/src/quantize/fast_gemv/include/fast_gemv_hip.cuh 2025-05-07T19:42:57.4961541Z Removing fbgemm_gpu/experimental/gen_ai/src/quantize/fast_gemv/include/utility_hip.cuh 2025-05-07T19:42:57.4962155Z Removing fbgemm_gpu/experimental/gen_ai/src/quantize/quantize.hip 2025-05-07T19:42:57.4962684Z Removing fbgemm_gpu/experimental/gen_ai/src/quantize/quantize_hip.cpp 2025-05-07T19:42:57.4963161Z Removing fbgemm_gpu/fbgemm_gpu/docs/version.py 2025-05-07T19:42:57.4963540Z Removing fbgemm_gpu/fbgemm_gpu_nightly.egg-info/ 2025-05-07T19:42:57.4963971Z Removing fbgemm_gpu/include/fbgemm_gpu/cumem_utils_hip.h 2025-05-07T19:42:57.4964531Z Removing fbgemm_gpu/include/fbgemm_gpu/embedding_backward_template_helpers_hip.cuh 2025-05-07T19:42:57.4965334Z Removing fbgemm_gpu/include/fbgemm_gpu/embedding_forward_split_cpu_hip.h 2025-05-07T19:42:57.4965957Z Removing fbgemm_gpu/include/fbgemm_gpu/embedding_forward_template_helpers_hip.cuh 2025-05-07T19:42:57.4966541Z Removing fbgemm_gpu/include/fbgemm_gpu/layout_transform_ops_hip.cuh 2025-05-07T19:42:57.4967121Z Removing fbgemm_gpu/include/fbgemm_gpu/permute_multi_embedding_function_hip.h 2025-05-07T19:42:57.4967667Z Removing fbgemm_gpu/include/fbgemm_gpu/quantize_ops_hip.cuh 2025-05-07T19:42:57.4968125Z Removing fbgemm_gpu/include/fbgemm_gpu/sparse_ops_hip.cuh 2025-05-07T19:42:57.4968633Z Removing fbgemm_gpu/include/fbgemm_gpu/split_embeddings_utils_hip.cuh 2025-05-07T19:42:57.4969177Z Removing fbgemm_gpu/include/fbgemm_gpu/utils/barrier_isolation_hip.cuh 2025-05-07T19:42:57.4969832Z Removing fbgemm_gpu/include/fbgemm_gpu/utils/bench_utils_hip.cuh 2025-05-07T19:42:57.4970339Z Removing fbgemm_gpu/include/fbgemm_gpu/utils/bitonic_sort_hip.cuh 2025-05-07T19:42:57.4970900Z Removing fbgemm_gpu/include/fbgemm_gpu/utils/cub_namespace_postfix_hip.cuh 2025-05-07T19:42:57.4971496Z Removing fbgemm_gpu/include/fbgemm_gpu/utils/cub_namespace_prefix_hip.cuh 2025-05-07T19:42:57.4972071Z Removing fbgemm_gpu/include/fbgemm_gpu/utils/device_cache_flusher_hip.cuh 2025-05-07T19:42:57.4972656Z Removing fbgemm_gpu/include/fbgemm_gpu/utils/device_properties_hip.cuh 2025-05-07T19:42:57.4973287Z Removing fbgemm_gpu/include/fbgemm_gpu/utils/dispatch_macros_hip.h 2025-05-07T19:42:57.4973885Z Removing fbgemm_gpu/include/fbgemm_gpu/utils/embedding_bounds_check_common_hip.cuh 2025-05-07T19:42:57.4974465Z Removing fbgemm_gpu/include/fbgemm_gpu/utils/find_qparams_hip.cuh 2025-05-07T19:42:57.4974965Z Removing fbgemm_gpu/include/fbgemm_gpu/utils/float_hip.cuh 2025-05-07T19:42:57.4975438Z Removing fbgemm_gpu/include/fbgemm_gpu/utils/hip_prelude.cuh 2025-05-07T19:42:57.4975978Z Removing fbgemm_gpu/include/fbgemm_gpu/utils/host_device_buffer_pair_hip.cuh 2025-05-07T19:42:57.4976578Z Removing fbgemm_gpu/include/fbgemm_gpu/utils/inclusive_sum_scan_hip.cuh 2025-05-07T19:42:57.4977235Z Removing fbgemm_gpu/include/fbgemm_gpu/utils/kernel_launcher_hip.cuh 2025-05-07T19:42:57.4977772Z Removing fbgemm_gpu/include/fbgemm_gpu/utils/rocm/stochastic_rounding_hip.h 2025-05-07T19:42:57.4978270Z Removing fbgemm_gpu/include/fbgemm_gpu/utils/rocm/vec2_hip.h 2025-05-07T19:42:57.4978743Z Removing fbgemm_gpu/include/fbgemm_gpu/utils/rocm/weight_row_hip.h 2025-05-07T19:42:57.4979240Z Removing fbgemm_gpu/include/fbgemm_gpu/utils/shared_memory_hip.cuh 2025-05-07T19:42:57.4979738Z Removing fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding_hip.cuh 2025-05-07T19:42:57.4980289Z Removing fbgemm_gpu/include/fbgemm_gpu/utils/tensor_accessor_builder_hip.h 2025-05-07T19:42:57.4980797Z Removing fbgemm_gpu/include/fbgemm_gpu/utils/tensor_accessor_hip.h 2025-05-07T19:42:57.4981261Z Removing fbgemm_gpu/include/fbgemm_gpu/utils/vec4_hip.cuh 2025-05-07T19:42:57.4981681Z Removing fbgemm_gpu/include/fbgemm_gpu/utils/vec4acc_hip.cuh 2025-05-07T19:42:57.4982132Z Removing fbgemm_gpu/include/fbgemm_gpu/utils/vec_quant_hip.cuh 2025-05-07T19:42:57.4982577Z Removing fbgemm_gpu/include/fbgemm_gpu/utils/vecn_hip.cuh 2025-05-07T19:42:57.4983011Z Removing fbgemm_gpu/include/fbgemm_gpu/utils/weight_row_hip.cuh 2025-05-07T19:42:57.4983531Z Removing fbgemm_gpu/src/dram_kv_embedding_cache/dram_kv_embedding_cache_hip.h 2025-05-07T19:42:57.4984114Z Removing fbgemm_gpu/src/dram_kv_embedding_cache/dram_kv_embedding_cache_wrapper_hip.h 2025-05-07T19:42:57.4984712Z Removing fbgemm_gpu/src/embedding_inplace_ops/embedding_inplace_update.hip 2025-05-07T19:42:57.4985304Z Removing fbgemm_gpu/src/embedding_inplace_ops/embedding_inplace_update_gpu_hip.cpp 2025-05-07T19:42:57.4985831Z Removing fbgemm_gpu/src/histogram_binning_calibration_ops.hip 2025-05-07T19:42:57.4986289Z Removing fbgemm_gpu/src/input_combine_ops/input_combine.hip 2025-05-07T19:42:57.4986749Z Removing fbgemm_gpu/src/input_combine_ops/input_combine_cpu_hip.cpp 2025-05-07T19:42:57.4987335Z Removing fbgemm_gpu/src/intraining_embedding_pruning_ops/intraining_embedding_pruning.hip 2025-05-07T19:42:57.4988027Z Removing fbgemm_gpu/src/intraining_embedding_pruning_ops/intraining_embedding_pruning_gpu_hip.cpp 2025-05-07T19:42:57.4988711Z Removing fbgemm_gpu/src/jagged_tensor_ops/batched_dense_vec_jagged_2d_mul_backward.hip 2025-05-07T19:42:57.4989334Z Removing fbgemm_gpu/src/jagged_tensor_ops/batched_dense_vec_jagged_2d_mul_forward.hip 2025-05-07T19:42:57.4989851Z Removing fbgemm_gpu/src/jagged_tensor_ops/common_hip.cuh 2025-05-07T19:42:57.4990309Z Removing fbgemm_gpu/src/jagged_tensor_ops/dense_to_jagged_forward.hip 2025-05-07T19:42:57.4990811Z Removing fbgemm_gpu/src/jagged_tensor_ops/jagged_dense_bmm_forward.hip 2025-05-07T19:42:57.4991524Z Removing fbgemm_gpu/src/jagged_tensor_ops/jagged_dense_dense_elementwise_add_jagged_output_forward.hip 2025-05-07T19:42:57.4992575Z Removing fbgemm_gpu/src/jagged_tensor_ops/jagged_dense_elementwise_mul_backward.hip 2025-05-07T19:42:57.4993223Z Removing fbgemm_gpu/src/jagged_tensor_ops/jagged_dense_elementwise_mul_forward.hip 2025-05-07T19:42:57.4993841Z Removing fbgemm_gpu/src/jagged_tensor_ops/jagged_index_add_2d_forward.hip 2025-05-07T19:42:57.4994419Z Removing fbgemm_gpu/src/jagged_tensor_ops/jagged_index_select_2d_forward.hip 2025-05-07T19:42:57.4995003Z Removing fbgemm_gpu/src/jagged_tensor_ops/jagged_jagged_bmm_forward.hip 2025-05-07T19:42:57.4995609Z Removing fbgemm_gpu/src/jagged_tensor_ops/jagged_softmax_backward.hip 2025-05-07T19:42:57.4996145Z Removing fbgemm_gpu/src/jagged_tensor_ops/jagged_softmax_forward.hip 2025-05-07T19:42:57.4996681Z Removing fbgemm_gpu/src/jagged_tensor_ops/jagged_tensor_ops.hip 2025-05-07T19:42:57.4997203Z Removing fbgemm_gpu/src/jagged_tensor_ops/jagged_tensor_ops_cpu_hip.cpp 2025-05-07T19:42:57.4997800Z Removing fbgemm_gpu/src/jagged_tensor_ops/jagged_to_padded_dense_backward.hip 2025-05-07T19:42:57.4998419Z Removing fbgemm_gpu/src/jagged_tensor_ops/jagged_to_padded_dense_forward.hip 2025-05-07T19:42:57.4998979Z Removing fbgemm_gpu/src/jagged_tensor_ops/jagged_unique_indices.hip 2025-05-07T19:42:57.4999543Z Removing fbgemm_gpu/src/jagged_tensor_ops/keyed_jagged_index_select_dim1.hip 2025-05-07T19:42:57.5000111Z Removing fbgemm_gpu/src/layout_transform_ops/layout_transform_ops.hip 2025-05-07T19:42:57.5000695Z Removing fbgemm_gpu/src/layout_transform_ops/layout_transform_ops_cpu_hip.cpp 2025-05-07T19:42:57.5001204Z Removing fbgemm_gpu/src/memory_utils/common_hip.cuh 2025-05-07T19:42:57.5001621Z Removing fbgemm_gpu/src/memory_utils/memory_utils.hip 2025-05-07T19:42:57.5002066Z Removing fbgemm_gpu/src/memory_utils/memory_utils_hip.cpp 2025-05-07T19:42:57.5002506Z Removing fbgemm_gpu/src/memory_utils/memory_utils_ops.hip 2025-05-07T19:42:57.5002976Z Removing fbgemm_gpu/src/memory_utils/memory_utils_ops_hip.cpp 2025-05-07T19:42:57.5003558Z Removing fbgemm_gpu/src/merge_pooled_embedding_ops/merge_pooled_embedding_ops_cpu_hip.cpp 2025-05-07T19:42:57.5004362Z Removing fbgemm_gpu/src/merge_pooled_embedding_ops/merge_pooled_embedding_ops_gpu_hip.cpp 2025-05-07T19:42:57.5004880Z Removing fbgemm_gpu/src/metric_ops/metric_ops.hip 2025-05-07T19:42:57.5005390Z Removing fbgemm_gpu/src/permute_multi_embedding_ops/permute_multi_embedding_function_hip.cpp 2025-05-07T19:42:57.5006045Z Removing fbgemm_gpu/src/permute_multi_embedding_ops/permute_multi_embedding_ops.hip 2025-05-07T19:42:57.5006678Z Removing fbgemm_gpu/src/permute_multi_embedding_ops/permute_multi_embedding_ops_cpu_hip.cpp 2025-05-07T19:42:57.5007341Z Removing fbgemm_gpu/src/permute_pooled_embedding_ops/permute_pooled_embedding_ops.hip 2025-05-07T19:42:57.5007993Z Removing fbgemm_gpu/src/permute_pooled_embedding_ops/permute_pooled_embedding_ops_cpu_hip.cpp 2025-05-07T19:42:57.5008683Z Removing fbgemm_gpu/src/permute_pooled_embedding_ops/permute_pooled_embedding_ops_split.hip 2025-05-07T19:42:57.5009369Z Removing fbgemm_gpu/src/ps_split_embeddings_cache/ps_split_table_batched_embeddings_hip.cpp 2025-05-07T19:42:57.5009999Z Removing fbgemm_gpu/src/ps_split_embeddings_cache/ps_table_batched_embeddings_hip.h 2025-05-07T19:42:57.5010500Z Removing fbgemm_gpu/src/quantize_ops/common_hip.cuh 2025-05-07T19:42:57.5010879Z Removing fbgemm_gpu/src/quantize_ops/mx/common_hip.cuh 2025-05-07T19:42:57.5011278Z Removing fbgemm_gpu/src/quantize_ops/mx_common_hip.cuh 2025-05-07T19:42:57.5011692Z Removing fbgemm_gpu/src/quantize_ops/quantize_bfloat16.hip 2025-05-07T19:42:57.5012122Z Removing fbgemm_gpu/src/quantize_ops/quantize_fp8_rowwise.hip 2025-05-07T19:42:57.5012598Z Removing fbgemm_gpu/src/quantize_ops/quantize_fused_8bit_rowwise.hip 2025-05-07T19:42:57.5013091Z Removing fbgemm_gpu/src/quantize_ops/quantize_fused_nbit_rowwise.hip 2025-05-07T19:42:57.5013546Z Removing fbgemm_gpu/src/quantize_ops/quantize_hfp8.hip 2025-05-07T19:42:57.5013929Z Removing fbgemm_gpu/src/quantize_ops/quantize_msfp.hip 2025-05-07T19:42:57.5014323Z Removing fbgemm_gpu/src/quantize_ops/quantize_mx.hip 2025-05-07T19:42:57.5014789Z Removing fbgemm_gpu/src/quantize_ops/quantize_mx_hip.cuh 2025-05-07T19:42:57.5015226Z Removing fbgemm_gpu/src/quantize_ops/quantize_ops_cpu_hip.cpp 2025-05-07T19:42:57.5015700Z Removing fbgemm_gpu/src/quantize_ops/quantize_padded_fp8_rowwise.hip 2025-05-07T19:42:57.5016130Z Removing fbgemm_gpu/src/sparse_ops/common_hip.cuh 2025-05-07T19:42:57.5016556Z Removing fbgemm_gpu/src/sparse_ops/sparse_async_batched_cumsum.hip 2025-05-07T19:42:57.5017047Z Removing fbgemm_gpu/src/sparse_ops/sparse_async_batched_cumsum_hip.cpp 2025-05-07T19:42:57.5017590Z Removing fbgemm_gpu/src/sparse_ops/sparse_async_cumsum.hip 2025-05-07T19:42:57.5018023Z Removing fbgemm_gpu/src/sparse_ops/sparse_async_cumsum_hip.cpp 2025-05-07T19:42:57.5018516Z Removing fbgemm_gpu/src/sparse_ops/sparse_batched_unary_embeddings.hip 2025-05-07T19:42:57.5019043Z Removing fbgemm_gpu/src/sparse_ops/sparse_block_bucketize_features.hip 2025-05-07T19:42:57.5019528Z Removing fbgemm_gpu/src/sparse_ops/sparse_bucketize_features.hip 2025-05-07T19:42:57.5020039Z Removing fbgemm_gpu/src/sparse_ops/sparse_compute_frequency_sequence.hip 2025-05-07T19:42:57.5020563Z Removing fbgemm_gpu/src/sparse_ops/sparse_expand_into_jagged_permute.hip 2025-05-07T19:42:57.5021048Z Removing fbgemm_gpu/src/sparse_ops/sparse_group_index.hip 2025-05-07T19:42:57.5021466Z Removing fbgemm_gpu/src/sparse_ops/sparse_index_add.hip 2025-05-07T19:42:57.5021870Z Removing fbgemm_gpu/src/sparse_ops/sparse_index_select.hip 2025-05-07T19:42:57.5022307Z Removing fbgemm_gpu/src/sparse_ops/sparse_invert_permute.hip 2025-05-07T19:42:57.5022732Z Removing fbgemm_gpu/src/sparse_ops/sparse_ops_cpu_hip.cpp 2025-05-07T19:42:57.5023199Z Removing fbgemm_gpu/src/sparse_ops/sparse_pack_segments_backward.hip 2025-05-07T19:42:57.5023688Z Removing fbgemm_gpu/src/sparse_ops/sparse_pack_segments_forward.hip 2025-05-07T19:42:57.5024158Z Removing fbgemm_gpu/src/sparse_ops/sparse_permute102.hip 2025-05-07T19:42:57.5024579Z Removing fbgemm_gpu/src/sparse_ops/sparse_permute_1d.hip 2025-05-07T19:42:57.5024988Z Removing fbgemm_gpu/src/sparse_ops/sparse_permute_2d.hip 2025-05-07T19:42:57.5025433Z Removing fbgemm_gpu/src/sparse_ops/sparse_permute_embeddings.hip 2025-05-07T19:42:57.5025850Z Removing fbgemm_gpu/src/sparse_ops/sparse_range.hip 2025-05-07T19:42:57.5026274Z Removing fbgemm_gpu/src/sparse_ops/sparse_reorder_batched_ad.hip 2025-05-07T19:42:57.5026720Z Removing fbgemm_gpu/src/sparse_ops/sparse_segment_sum_csr.hip 2025-05-07T19:42:57.5027134Z Removing fbgemm_gpu/src/sparse_ops/sparse_zipf.hip 2025-05-07T19:42:57.5027571Z Removing fbgemm_gpu/src/split_embeddings_cache/cachelib_cache_hip.cpp 2025-05-07T19:42:57.5028161Z Removing fbgemm_gpu/src/split_embeddings_cache/common_hip.cuh 2025-05-07T19:42:57.5028610Z Removing fbgemm_gpu/src/split_embeddings_cache/common_hip.h 2025-05-07T19:42:57.5029057Z Removing fbgemm_gpu/src/split_embeddings_cache/lfu_cache_find.hip 2025-05-07T19:42:57.5029561Z Removing fbgemm_gpu/src/split_embeddings_cache/lfu_cache_populate.hip 2025-05-07T19:42:57.5030085Z Removing fbgemm_gpu/src/split_embeddings_cache/lfu_cache_populate_byte.hip 2025-05-07T19:42:57.5030651Z Removing fbgemm_gpu/src/split_embeddings_cache/lfu_cache_populate_byte_hip.cpp 2025-05-07T19:42:57.5031213Z Removing fbgemm_gpu/src/split_embeddings_cache/linearize_cache_indices.hip 2025-05-07T19:42:57.5032020Z Removing fbgemm_gpu/src/split_embeddings_cache/linearize_cache_indices_hip.cpp 2025-05-07T19:42:57.5032603Z Removing fbgemm_gpu/src/split_embeddings_cache/lru_cache_find.hip 2025-05-07T19:42:57.5033214Z Removing fbgemm_gpu/src/split_embeddings_cache/lru_cache_populate.hip 2025-05-07T19:42:57.5033793Z Removing fbgemm_gpu/src/split_embeddings_cache/lru_cache_populate_byte.hip 2025-05-07T19:42:57.5034385Z Removing fbgemm_gpu/src/split_embeddings_cache/lru_cache_populate_byte_hip.cpp 2025-05-07T19:42:57.5034938Z Removing fbgemm_gpu/src/split_embeddings_cache/lxu_cache.hip 2025-05-07T19:42:57.5035431Z Removing fbgemm_gpu/src/split_embeddings_cache/lxu_cache_hip.cpp 2025-05-07T19:42:57.5036074Z Removing fbgemm_gpu/src/split_embeddings_cache/reset_weight_momentum.hip 2025-05-07T19:42:57.5036680Z Removing fbgemm_gpu/src/split_embeddings_cache/split_embeddings_cache_ops.hip 2025-05-07T19:42:57.5037300Z Removing fbgemm_gpu/src/split_embeddings_cache/split_embeddings_cache_ops_hip.cpp 2025-05-07T19:42:57.5037913Z Removing fbgemm_gpu/src/split_embeddings_utils/generate_vbe_metadata.hip 2025-05-07T19:42:57.5038477Z Removing fbgemm_gpu/src/split_embeddings_utils/get_infos_metadata.hip 2025-05-07T19:42:57.5039005Z Removing fbgemm_gpu/src/split_embeddings_utils/radix_sort_pairs.hip 2025-05-07T19:42:57.5039990Z Removing fbgemm_gpu/src/split_embeddings_utils/split_embeddings_utils_hip.cpp 2025-05-07T19:42:57.5040591Z Removing fbgemm_gpu/src/split_embeddings_utils/transpose_embedding_input.hip 2025-05-07T19:42:57.5041223Z Removing fbgemm_gpu/src/ssd_split_embeddings_cache/embedding_rocksdb_wrapper_hip.h 2025-05-07T19:42:57.5041816Z Removing fbgemm_gpu/src/ssd_split_embeddings_cache/kv_db_hip_utils.cpp 2025-05-07T19:42:57.5042364Z Removing fbgemm_gpu/src/ssd_split_embeddings_cache/kv_db_hip_utils.h 2025-05-07T19:42:57.5042983Z Removing fbgemm_gpu/src/ssd_split_embeddings_cache/kv_db_table_batched_embeddings_hip.cpp 2025-05-07T19:42:57.5043661Z Removing fbgemm_gpu/src/ssd_split_embeddings_cache/kv_db_table_batched_embeddings_hip.h 2025-05-07T19:42:57.5044412Z Removing fbgemm_gpu/src/ssd_split_embeddings_cache/kv_tensor_wrapper_cpu_hip.cpp 2025-05-07T19:42:57.5045011Z Removing fbgemm_gpu/src/ssd_split_embeddings_cache/ssd_scratch_pad_indices_queue_hip.cpp 2025-05-07T19:42:57.5045644Z Removing fbgemm_gpu/src/ssd_split_embeddings_cache/ssd_split_embeddings_cache_hip.hip 2025-05-07T19:42:57.5046298Z Removing fbgemm_gpu/src/ssd_split_embeddings_cache/ssd_split_table_batched_embeddings_hip.cpp 2025-05-07T19:42:57.5046939Z Removing fbgemm_gpu/src/ssd_split_embeddings_cache/ssd_table_batched_embeddings_hip.h 2025-05-07T19:42:57.5047437Z Removing fbgemm_gpu/src/topology_utils_hip.cpp 2025-05-07T19:42:57.5047819Z Removing fbgemm_gpu/test/tbe/utils/cpu_kernel_test_hip.cpp 2025-05-07T19:42:57.5048238Z Removing fbgemm_gpu/test/utils/kernel_launcher_test.hip 2025-05-07T19:42:57.5048650Z Removing fbgemm_gpu/test/utils/stochastic_rounding_test.hip 2025-05-07T19:42:57.5049082Z Removing fbgemm_gpu/test/utils/tensor_accessor2_test.hip 2025-05-07T19:42:57.5049523Z Removing fbgemm_gpu/test/utils/tensor_accessor_builder_test.hip 2025-05-07T19:42:57.5050032Z Removing fbgemm_gpu/test/utils/tensor_accessor_builder_with_memcheck_test.hip 2025-05-07T19:42:57.5050529Z Removing fbgemm_gpu/test/utils/tensor_accessor_test.hip 2025-05-07T19:42:57.5050976Z Removing fbgemm_gpu/test/utils/tensor_accessor_with_memcheck_test.hip 2025-05-07T19:42:57.5051422Z Removing fbgemm_gpu/test/utils/weight_row_test.hip 2025-05-07T19:42:57.5053441Z [command]/usr/bin/git reset --hard HEAD 2025-05-07T19:42:57.5950083Z HEAD is now at 1c9ad64 Merge f6528e7b1e8f5602e7dba30cd73b48ae6630981c into fd4df5f456e0cca514bacd98a39efb72990fd9f4 2025-05-07T19:42:57.5953713Z ##[endgroup] 2025-05-07T19:42:57.5954936Z ##[group]Disabling automatic garbage collection 2025-05-07T19:42:57.5959480Z [command]/usr/bin/git config --local gc.auto 0 2025-05-07T19:42:57.5989627Z ##[endgroup] 2025-05-07T19:42:57.5990721Z ##[group]Setting up auth 2025-05-07T19:42:57.5992627Z [command]/usr/bin/git config --local --name-only --get-regexp core\.sshCommand 2025-05-07T19:42:57.6020473Z [command]/usr/bin/git submodule foreach --recursive sh -c "git config --local --name-only --get-regexp 'core\.sshCommand' && git config --local --unset-all 'core.sshCommand' || :" 2025-05-07T19:42:57.6301995Z Entering 'external/asmjit' 2025-05-07T19:42:57.6350185Z Entering 'external/composable_kernel' 2025-05-07T19:42:57.6401925Z Entering 'external/cpuinfo' 2025-05-07T19:42:57.6451525Z Entering 'external/cutlass' 2025-05-07T19:42:57.6526679Z Entering 'external/googletest' 2025-05-07T19:42:57.6574061Z Entering 'external/hipify_torch' 2025-05-07T19:42:57.6624210Z Entering 'external/json' 2025-05-07T19:42:57.6680651Z [command]/usr/bin/git config --local --name-only --get-regexp http\.https\:\/\/github\.com\/\.extraheader 2025-05-07T19:42:57.6706767Z [command]/usr/bin/git submodule foreach --recursive sh -c "git config --local --name-only --get-regexp 'http\.https\:\/\/github\.com\/\.extraheader' && git config --local --unset-all 'http.https://github.com/.extraheader' || :" 2025-05-07T19:42:57.6973558Z Entering 'external/asmjit' 2025-05-07T19:42:57.7023337Z Entering 'external/composable_kernel' 2025-05-07T19:42:57.7082905Z Entering 'external/cpuinfo' 2025-05-07T19:42:57.7132847Z Entering 'external/cutlass' 2025-05-07T19:42:57.7193186Z Entering 'external/googletest' 2025-05-07T19:42:57.7250735Z Entering 'external/hipify_torch' 2025-05-07T19:42:57.7298717Z Entering 'external/json' 2025-05-07T19:42:57.7358554Z [command]/usr/bin/git config --local http.https://github.com/.extraheader AUTHORIZATION: basic *** 2025-05-07T19:42:57.7402698Z ##[endgroup] 2025-05-07T19:42:57.7403126Z ##[group]Fetching the repository 2025-05-07T19:42:57.7410237Z [command]/usr/bin/git -c protocol.version=2 fetch --no-tags --prune --no-recurse-submodules --depth=1 origin +a2f4c52051596e74bc8c16e3d2867a4ecdd271e0:refs/remotes/pull/4066/merge 2025-05-07T19:42:57.9261648Z From https://github.com/pytorch/FBGEMM 2025-05-07T19:42:57.9263462Z + 1c9ad64...a2f4c52 a2f4c52051596e74bc8c16e3d2867a4ecdd271e0 -> pull/4066/merge (forced update) 2025-05-07T19:42:57.9275227Z ##[endgroup] 2025-05-07T19:42:57.9275638Z ##[group]Determining the checkout info 2025-05-07T19:42:57.9277342Z ##[endgroup] 2025-05-07T19:42:57.9281742Z [command]/usr/bin/git sparse-checkout disable 2025-05-07T19:42:57.9791945Z [command]/usr/bin/git config --local --unset-all extensions.worktreeConfig 2025-05-07T19:42:57.9817715Z ##[group]Checking out the ref 2025-05-07T19:42:57.9819050Z [command]/usr/bin/git checkout --progress --force refs/remotes/pull/4066/merge 2025-05-07T19:42:58.0790055Z Warning: you are leaving 1 commit behind, not connected to 2025-05-07T19:42:58.0791220Z any of your branches: 2025-05-07T19:42:58.0791917Z 2025-05-07T19:42:58.0793056Z 1c9ad64 Merge f6528e7b1e8f5602e7dba30cd73b48ae6630981c into fd4df5f456e0cca514bacd98a39efb72990fd9f4 2025-05-07T19:42:58.0794409Z 2025-05-07T19:42:58.0794622Z If you want to keep it by creating a new branch, this may be a good time 2025-05-07T19:42:58.0795049Z to do so with: 2025-05-07T19:42:58.0795186Z 2025-05-07T19:42:58.0795330Z git branch 1c9ad64 2025-05-07T19:42:58.0795543Z 2025-05-07T19:42:58.0795942Z HEAD is now at a2f4c52 Merge 6060cd4b5f971680caecdcc657faccb5720d1c3e into fd4df5f456e0cca514bacd98a39efb72990fd9f4 2025-05-07T19:42:58.0797268Z ##[endgroup] 2025-05-07T19:42:58.0797684Z ##[group]Setting up auth for fetching submodules 2025-05-07T19:42:58.0798293Z [command]/usr/bin/git config --global http.https://github.com/.extraheader AUTHORIZATION: basic *** 2025-05-07T19:42:58.0852293Z [command]/usr/bin/git config --global --unset-all url.https://github.com/.insteadOf 2025-05-07T19:42:58.0872455Z [command]/usr/bin/git config --global --add url.https://github.com/.insteadOf git@github.com: 2025-05-07T19:42:58.0899107Z [command]/usr/bin/git config --global --add url.https://github.com/.insteadOf org-21003710@github.com: 2025-05-07T19:42:58.0925505Z ##[endgroup] 2025-05-07T19:42:58.0926737Z ##[group]Fetching submodules 2025-05-07T19:42:58.0928340Z [command]/usr/bin/git submodule sync 2025-05-07T19:42:58.1232847Z Synchronizing submodule url for 'external/asmjit' 2025-05-07T19:42:58.1234244Z Synchronizing submodule url for 'external/composable_kernel' 2025-05-07T19:42:58.1235602Z Synchronizing submodule url for 'external/cpuinfo' 2025-05-07T19:42:58.1236772Z Synchronizing submodule url for 'external/cutlass' 2025-05-07T19:42:58.1237960Z Synchronizing submodule url for 'external/googletest' 2025-05-07T19:42:58.1239038Z Synchronizing submodule url for 'external/hipify_torch' 2025-05-07T19:42:58.1239452Z Synchronizing submodule url for 'external/json' 2025-05-07T19:42:58.1240574Z [command]/usr/bin/git -c protocol.version=2 submodule update --init --force --depth=1 2025-05-07T19:42:58.2005855Z Submodule path 'external/asmjit': checked out 'e5d7c0bd5d9aec44d68830187138149e6a8c4e32' 2025-05-07T19:42:58.4796413Z Submodule path 'external/composable_kernel': checked out '4a61bdd4bd4ed730e078aebc7c0fcf046ff29406' 2025-05-07T19:42:58.5833964Z Submodule path 'external/cpuinfo': checked out '6543fec09b2f04ac4a666882998b534afc9c1349' 2025-05-07T19:42:59.2594571Z Submodule path 'external/cutlass': checked out '3ed8d2ec4ba35ef5d9d8353826209b6f868f63d3' 2025-05-07T19:42:59.3035909Z Submodule path 'external/googletest': checked out 'f8d7d77c06936315286eb55f8de22cd23c188571' 2025-05-07T19:42:59.3115968Z Submodule path 'external/hipify_torch': checked out '420084499c7c1e1c2d801922f40df202eac5f3a0' 2025-05-07T19:42:59.4314624Z Submodule path 'external/json': checked out '9cca280a4d0ccf0c08f47a99aa71d1b0e52f8d03' 2025-05-07T19:42:59.4325990Z [command]/usr/bin/git submodule foreach git config --local gc.auto 0 2025-05-07T19:42:59.4624202Z Entering 'external/asmjit' 2025-05-07T19:42:59.4644066Z Entering 'external/composable_kernel' 2025-05-07T19:42:59.4671174Z Entering 'external/cpuinfo' 2025-05-07T19:42:59.4703740Z Entering 'external/cutlass' 2025-05-07T19:42:59.4724200Z Entering 'external/googletest' 2025-05-07T19:42:59.4751187Z Entering 'external/hipify_torch' 2025-05-07T19:42:59.4786570Z Entering 'external/json' 2025-05-07T19:42:59.4827140Z ##[endgroup] 2025-05-07T19:42:59.4828334Z ##[group]Persisting credentials for submodules 2025-05-07T19:42:59.4831622Z [command]/usr/bin/git submodule foreach --recursive sh -c "git config --local --name-only --get-regexp 'url\.https\:\/\/github\.com\/\.insteadOf' && git config --local --unset-all 'url.https://github.com/.insteadOf' || :" 2025-05-07T19:42:59.5171433Z Entering 'external/asmjit' 2025-05-07T19:42:59.5231916Z Entering 'external/composable_kernel' 2025-05-07T19:42:59.5302204Z Entering 'external/cpuinfo' 2025-05-07T19:42:59.5355606Z Entering 'external/cutlass' 2025-05-07T19:42:59.5432031Z Entering 'external/googletest' 2025-05-07T19:42:59.5489558Z Entering 'external/hipify_torch' 2025-05-07T19:42:59.5550732Z Entering 'external/json' 2025-05-07T19:42:59.5621668Z [command]/usr/bin/git submodule foreach sh -c "git config --local 'http.https://github.com/.extraheader' 'AUTHORIZATION: basic ***' && git config --local --show-origin --name-only --get-regexp remote.origin.url" 2025-05-07T19:42:59.5913761Z Entering 'external/asmjit' 2025-05-07T19:42:59.5957875Z file:/__w/FBGEMM/FBGEMM/.git/modules/external/asmjit/config remote.origin.url 2025-05-07T19:42:59.5958422Z Entering 'external/composable_kernel' 2025-05-07T19:42:59.6017789Z file:/__w/FBGEMM/FBGEMM/.git/modules/external/composable_kernel/config remote.origin.url 2025-05-07T19:42:59.6020320Z Entering 'external/cpuinfo' 2025-05-07T19:42:59.6072482Z file:/__w/FBGEMM/FBGEMM/.git/modules/external/cpuinfo/config remote.origin.url 2025-05-07T19:42:59.6074250Z Entering 'external/cutlass' 2025-05-07T19:42:59.6124806Z file:/__w/FBGEMM/FBGEMM/.git/modules/external/cutlass/config remote.origin.url 2025-05-07T19:42:59.6127353Z Entering 'external/googletest' 2025-05-07T19:42:59.6176752Z file:/__w/FBGEMM/FBGEMM/.git/modules/external/googletest/config remote.origin.url 2025-05-07T19:42:59.6180658Z Entering 'external/hipify_torch' 2025-05-07T19:42:59.6230458Z file:/__w/FBGEMM/FBGEMM/.git/modules/external/hipify_torch/config remote.origin.url 2025-05-07T19:42:59.6232948Z Entering 'external/json' 2025-05-07T19:42:59.6278082Z file:/__w/FBGEMM/FBGEMM/.git/modules/external/json/config remote.origin.url 2025-05-07T19:42:59.6394916Z [command]/usr/bin/git submodule foreach git config --local --add 'url.https://github.com/.insteadOf' 'git@github.com:' 2025-05-07T19:42:59.6669859Z Entering 'external/asmjit' 2025-05-07T19:42:59.6695190Z Entering 'external/composable_kernel' 2025-05-07T19:42:59.6723775Z Entering 'external/cpuinfo' 2025-05-07T19:42:59.6753156Z Entering 'external/cutlass' 2025-05-07T19:42:59.6788864Z Entering 'external/googletest' 2025-05-07T19:42:59.6816581Z Entering 'external/hipify_torch' 2025-05-07T19:42:59.6853613Z Entering 'external/json' 2025-05-07T19:42:59.6890101Z [command]/usr/bin/git submodule foreach git config --local --add 'url.https://github.com/.insteadOf' 'org-21003710@github.com:' 2025-05-07T19:42:59.7159737Z Entering 'external/asmjit' 2025-05-07T19:42:59.7186909Z Entering 'external/composable_kernel' 2025-05-07T19:42:59.7211557Z Entering 'external/cpuinfo' 2025-05-07T19:42:59.7246564Z Entering 'external/cutlass' 2025-05-07T19:42:59.7275674Z Entering 'external/googletest' 2025-05-07T19:42:59.7307834Z Entering 'external/hipify_torch' 2025-05-07T19:42:59.7339133Z Entering 'external/json' 2025-05-07T19:42:59.7368827Z ##[endgroup] 2025-05-07T19:42:59.7398046Z [command]/usr/bin/git log -1 --format=%H 2025-05-07T19:42:59.7420229Z a2f4c52051596e74bc8c16e3d2867a4ecdd271e0 2025-05-07T19:42:59.7603912Z ##[group]Run . $PRELUDE; print_system_info 2025-05-07T19:42:59.7604338Z . $PRELUDE; print_system_info 2025-05-07T19:42:59.7604862Z shell: bash --noprofile --norc -e -o pipefail {0} 2025-05-07T19:42:59.7605223Z env: 2025-05-07T19:42:59.7605494Z PRELUDE: .github/scripts/setup_env.bash 2025-05-07T19:42:59.7605814Z BUILD_ENV: build_binary 2025-05-07T19:42:59.7606105Z BUILD_TARGET: default 2025-05-07T19:42:59.7606354Z BUILD_VARIANT: cuda 2025-05-07T19:42:59.7606635Z BUILD_CUDA_VERSION: 12.6.3 2025-05-07T19:42:59.7606899Z ##[endgroup] 2025-05-07T19:43:00.2052559Z ################################################################################ 2025-05-07T19:43:00.2052974Z # Print System Info 2025-05-07T19:43:00.2053369Z # 2025-05-07T19:43:00.2072895Z # [2025-05-07T19:43:00.206Z] + print_system_info 2025-05-07T19:43:00.2073963Z ################################################################################ 2025-05-07T19:43:00.2074479Z 2025-05-07T19:43:00.2074722Z ################################################################################ 2025-05-07T19:43:00.2075121Z [INFO] Printing environment variables ... 2025-05-07T19:43:00.2075482Z + printenv 2025-05-07T19:43:00.2075614Z 2025-05-07T19:43:00.2090838Z GITHUB_WORKSPACE=/__w/FBGEMM/FBGEMM 2025-05-07T19:43:00.2091291Z BUILD_VARIANT=cuda 2025-05-07T19:43:00.2091830Z HOSTNAME=2c96c3f709dd 2025-05-07T19:43:00.2092418Z GITHUB_PATH=/__w/_temp/_runner_file_commands/add_path_709f67cf-745c-4d20-b536-d042ece5385a 2025-05-07T19:43:00.2093036Z GITHUB_ACTION=__run_2 2025-05-07T19:43:00.2093312Z GITHUB_RUN_NUMBER=10601 2025-05-07T19:43:00.2093634Z RUNNER_NAME=i-0a33abd677e10917f 2025-05-07T19:43:00.2093981Z GITHUB_REPOSITORY_OWNER_ID=21003710 2025-05-07T19:43:00.2094375Z PLATFORM_NAME_LC=linux-x86_64 2025-05-07T19:43:00.2094676Z MACHINE_NAME_LC=x86_64 2025-05-07T19:43:00.2094982Z GITHUB_TRIGGERING_ACTOR=q10 2025-05-07T19:43:00.2095328Z PRELUDE=.github/scripts/setup_env.bash 2025-05-07T19:43:00.2095683Z GITHUB_REF_TYPE=branch 2025-05-07T19:43:00.2096278Z *** 2025-05-07T19:43:00.2096517Z GITHUB_REPOSITORY_ID=150154628 2025-05-07T19:43:00.2096842Z GITHUB_ACTIONS=true 2025-05-07T19:43:00.2097178Z GITHUB_SHA=a2f4c52051596e74bc8c16e3d2867a4ecdd271e0 2025-05-07T19:43:00.2097817Z GITHUB_WORKFLOW_REF=pytorch/FBGEMM/.github/workflows/fbgemm_gpu_ci_cuda.yml@refs/pull/4066/merge 2025-05-07T19:43:00.2098399Z RUNNER_ENVIRONMENT=self-hosted 2025-05-07T19:43:00.2098756Z GITHUB_REF=refs/pull/4066/merge 2025-05-07T19:43:00.2099079Z RUNNER_OS=Linux 2025-05-07T19:43:00.2099336Z GITHUB_REF_PROTECTED=false 2025-05-07T19:43:00.2099641Z HOME=/github/home 2025-05-07T19:43:00.2099926Z GITHUB_API_URL=https://api.github.com 2025-05-07T19:43:00.2100273Z RUNNER_ARCH=X64 2025-05-07T19:43:00.2100516Z RUNNER_TEMP=/__w/_temp 2025-05-07T19:43:00.2100821Z BUILD_TARGET=default 2025-05-07T19:43:00.2101268Z GITHUB_STATE=/__w/_temp/_runner_file_commands/save_state_709f67cf-745c-4d20-b536-d042ece5385a 2025-05-07T19:43:00.2101969Z GITHUB_ENV=/__w/_temp/_runner_file_commands/set_env_709f67cf-745c-4d20-b536-d042ece5385a 2025-05-07T19:43:00.2102520Z GITHUB_EVENT_PATH=/github/workflow/event.json 2025-05-07T19:43:00.2102876Z GITHUB_EVENT_NAME=pull_request 2025-05-07T19:43:00.2103209Z GITHUB_RUN_ID=14891846252 2025-05-07T19:43:00.2103702Z GITHUB_STEP_SUMMARY=/__w/_temp/_runner_file_commands/step_summary_709f67cf-745c-4d20-b536-d042ece5385a 2025-05-07T19:43:00.2104271Z BUILD_ENV=build_binary 2025-05-07T19:43:00.2104529Z GITHUB_ACTOR=q10 2025-05-07T19:43:00.2104801Z GITHUB_RUN_ATTEMPT=1 2025-05-07T19:43:00.2105055Z KERN_NAME_LC=linux 2025-05-07T19:43:00.2105368Z BUILD_CUDA_VERSION=12.6.3 2025-05-07T19:43:00.2105711Z GITHUB_GRAPHQL_URL=https://api.github.com/graphql 2025-05-07T19:43:00.2106116Z PLATFORM_NAME=Linux-x86_64 2025-05-07T19:43:00.2106688Z GITHUB_SERVER_URL=https://github.com 2025-05-07T19:43:00.2107043Z SHLVL=1 2025-05-07T19:43:00.2107280Z GITHUB_ACTOR_ID=255046 2025-05-07T19:43:00.2107594Z RUNNER_TOOL_CACHE=/__w/_tool 2025-05-07T19:43:00.2108104Z GITHUB_WORKFLOW_SHA=6060cd4b5f971680caecdcc657faccb5720d1c3e 2025-05-07T19:43:00.2108551Z GITHUB_REF_NAME=4066/merge 2025-05-07T19:43:00.2108872Z KERN_NAME=Linux 2025-05-07T19:43:00.2109133Z GITHUB_JOB=build_artifact 2025-05-07T19:43:00.2109467Z GITHUB_REPOSITORY=pytorch/FBGEMM 2025-05-07T19:43:00.2109789Z GITHUB_RETENTION_DAYS=90 2025-05-07T19:43:00.2110115Z RUNNER_WORKSPACE=/__w/FBGEMM 2025-05-07T19:43:00.2110415Z GITHUB_ACTION_REPOSITORY= 2025-05-07T19:43:00.2110833Z PATH=/usr/local/sbin:/usr/local/bin:/usr/sbin:/usr/bin:/sbin:/bin 2025-05-07T19:43:00.2111253Z GITHUB_BASE_REF=main 2025-05-07T19:43:00.2111646Z CI=true 2025-05-07T19:43:00.2111888Z GITHUB_REPOSITORY_OWNER=pytorch 2025-05-07T19:43:00.2112250Z GITHUB_HEAD_REF=bm/genai-rocm-oss-6 2025-05-07T19:43:00.2112570Z GITHUB_ACTION_REF= 2025-05-07T19:43:00.2112906Z GITHUB_WORKFLOW=FBGEMM GPU/GenAI CUDA CI 2025-05-07T19:43:00.2113473Z GITHUB_OUTPUT=/__w/_temp/_runner_file_commands/set_output_709f67cf-745c-4d20-b536-d042ece5385a 2025-05-07T19:43:00.2113998Z MACHINE_NAME=x86_64 2025-05-07T19:43:00.2114295Z _=/usr/bin/printenv 2025-05-07T19:43:00.2114455Z 2025-05-07T19:43:00.2114590Z ################################################################################ 2025-05-07T19:43:00.2114968Z [INFO] Print ldd version ... 2025-05-07T19:43:00.2115258Z + ldd --version 2025-05-07T19:43:00.2115438Z 2025-05-07T19:43:00.2115560Z ldd (GNU libc) 2.34 2025-05-07T19:43:00.2115894Z Copyright (C) 2021 Free Software Foundation, Inc. 2025-05-07T19:43:00.2116385Z This is free software; see the source for copying conditions. There is NO 2025-05-07T19:43:00.2117007Z warranty; not even for MERCHANTABILITY or FITNESS FOR A PARTICULAR PURPOSE. 2025-05-07T19:43:00.2117508Z Written by Roland McGrath and Ulrich Drepper. 2025-05-07T19:43:00.2117784Z 2025-05-07T19:43:00.2117916Z ################################################################################ 2025-05-07T19:43:00.2118263Z [INFO] Print CPU info ... 2025-05-07T19:43:00.2118554Z + nproc 2025-05-07T19:43:00.2118679Z 2025-05-07T19:43:00.2131042Z 96 2025-05-07T19:43:00.2131776Z 2025-05-07T19:43:00.2132248Z + lscpu 2025-05-07T19:43:00.2132568Z 2025-05-07T19:43:00.2411896Z Architecture: x86_64 2025-05-07T19:43:00.2413084Z CPU op-mode(s): 32-bit, 64-bit 2025-05-07T19:43:00.2414297Z Address sizes: 46 bits physical, 48 bits virtual 2025-05-07T19:43:00.2415521Z Byte Order: Little Endian 2025-05-07T19:43:00.2416282Z CPU(s): 96 2025-05-07T19:43:00.2416716Z On-line CPU(s) list: 0-95 2025-05-07T19:43:00.2417041Z Vendor ID: GenuineIntel 2025-05-07T19:43:00.2417457Z Model name: Intel(R) Xeon(R) Platinum 8275CL CPU @ 3.00GHz 2025-05-07T19:43:00.2417853Z CPU family: 6 2025-05-07T19:43:00.2418176Z Model: 85 2025-05-07T19:43:00.2418497Z Thread(s) per core: 2 2025-05-07T19:43:00.2418805Z Core(s) per socket: 24 2025-05-07T19:43:00.2419124Z Socket(s): 2 2025-05-07T19:43:00.2419414Z Stepping: 7 2025-05-07T19:43:00.2419747Z BogoMIPS: 5999.99 2025-05-07T19:43:00.2421990Z Flags: fpu vme de pse tsc msr pae mce cx8 apic sep mtrr pge mca cmov pat pse36 clflush mmx fxsr sse sse2 ss ht syscall nx pdpe1gb rdtscp lm constant_tsc arch_perfmon rep_good nopl xtopology nonstop_tsc cpuid aperfmperf tsc_known_freq pni pclmulqdq monitor ssse3 fma cx16 pcid sse4_1 sse4_2 x2apic movbe popcnt tsc_deadline_timer aes xsave avx f16c rdrand hypervisor lahf_lm abm 3dnowprefetch invpcid_single pti fsgsbase tsc_adjust bmi1 avx2 smep bmi2 erms invpcid mpx avx512f avx512dq rdseed adx smap clflushopt clwb avx512cd avx512bw avx512vl xsaveopt xsavec xgetbv1 xsaves ida arat pku ospke avx512_vnni 2025-05-07T19:43:00.2430067Z Hypervisor vendor: KVM 2025-05-07T19:43:00.2430552Z Virtualization type: full 2025-05-07T19:43:00.2430970Z L1d cache: 1.5 MiB (48 instances) 2025-05-07T19:43:00.2431550Z L1i cache: 1.5 MiB (48 instances) 2025-05-07T19:43:00.2431985Z L2 cache: 48 MiB (48 instances) 2025-05-07T19:43:00.2432431Z L3 cache: 71.5 MiB (2 instances) 2025-05-07T19:43:00.2432834Z NUMA node(s): 2 2025-05-07T19:43:00.2433210Z NUMA node0 CPU(s): 0-23,48-71 2025-05-07T19:43:00.2433574Z NUMA node1 CPU(s): 24-47,72-95 2025-05-07T19:43:00.2434101Z Vulnerability Gather data sampling: Unknown: Dependent on hypervisor status 2025-05-07T19:43:00.2434725Z Vulnerability Itlb multihit: KVM: Mitigation: VMX unsupported 2025-05-07T19:43:00.2435294Z Vulnerability L1tf: Mitigation; PTE Inversion 2025-05-07T19:43:00.2435941Z Vulnerability Mds: Vulnerable: Clear CPU buffers attempted, no microcode; SMT Host state unknown 2025-05-07T19:43:00.2436605Z Vulnerability Meltdown: Mitigation; PTI 2025-05-07T19:43:00.2437286Z Vulnerability Mmio stale data: Vulnerable: Clear CPU buffers attempted, no microcode; SMT Host state unknown 2025-05-07T19:43:00.2437933Z Vulnerability Reg file data sampling: Not affected 2025-05-07T19:43:00.2438368Z Vulnerability Retbleed: Vulnerable 2025-05-07T19:43:00.2438776Z Vulnerability Spec rstack overflow: Not affected 2025-05-07T19:43:00.2439201Z Vulnerability Spec store bypass: Vulnerable 2025-05-07T19:43:00.2439801Z Vulnerability Spectre v1: Mitigation; usercopy/swapgs barriers and __user pointer sanitization 2025-05-07T19:43:00.2440712Z Vulnerability Spectre v2: Mitigation; Retpolines; STIBP disabled; RSB filling; PBRSB-eIBRS Not affected; BHI Retpoline 2025-05-07T19:43:00.2441420Z Vulnerability Srbds: Not affected 2025-05-07T19:43:00.2441822Z Vulnerability Tsx async abort: Not affected 2025-05-07T19:43:00.2442114Z 2025-05-07T19:43:00.2442215Z + cat /proc/cpuinfo 2025-05-07T19:43:00.2442367Z 2025-05-07T19:43:00.2442877Z processor : 0 2025-05-07T19:43:00.2443124Z vendor_id : GenuineIntel 2025-05-07T19:43:00.2443426Z cpu family : 6 2025-05-07T19:43:00.2443655Z model : 85 2025-05-07T19:43:00.2443998Z model name : Intel(R) Xeon(R) Platinum 8275CL CPU @ 3.00GHz 2025-05-07T19:43:00.2444388Z stepping : 7 2025-05-07T19:43:00.2444647Z microcode : 0x5003901 2025-05-07T19:43:00.2444931Z cpu MHz : 1200.898 2025-05-07T19:43:00.2445173Z cache size : 36608 KB 2025-05-07T19:43:00.2445434Z physical id : 0 2025-05-07T19:43:00.2445695Z siblings : 48 2025-05-07T19:43:00.2445911Z core id : 0 2025-05-07T19:43:00.2446113Z cpu cores : 24 2025-05-07T19:43:00.2446346Z apicid : 0 2025-05-07T19:43:00.2446546Z initial apicid : 0 2025-05-07T19:43:00.2446786Z fpu : yes 2025-05-07T19:43:00.2446987Z fpu_exception : yes 2025-05-07T19:43:00.2447227Z cpuid level : 13 2025-05-07T19:43:00.2447446Z wp : yes 2025-05-07T19:43:00.2449778Z flags : fpu vme de pse tsc msr pae mce cx8 apic sep mtrr pge mca cmov pat pse36 clflush mmx fxsr sse sse2 ss ht syscall nx pdpe1gb rdtscp lm constant_tsc arch_perfmon rep_good nopl xtopology nonstop_tsc cpuid aperfmperf tsc_known_freq pni pclmulqdq monitor ssse3 fma cx16 pcid sse4_1 sse4_2 x2apic movbe popcnt tsc_deadline_timer aes xsave avx f16c rdrand hypervisor lahf_lm abm 3dnowprefetch invpcid_single pti fsgsbase tsc_adjust bmi1 avx2 smep bmi2 erms invpcid mpx avx512f avx512dq rdseed adx smap clflushopt clwb avx512cd avx512bw avx512vl xsaveopt xsavec xgetbv1 xsaves ida arat pku ospke avx512_vnni 2025-05-07T19:43:00.2452532Z bugs : cpu_meltdown spectre_v1 spectre_v2 spec_store_bypass l1tf mds swapgs itlb_multihit mmio_stale_data retbleed gds bhi 2025-05-07T19:43:00.2453311Z bogomips : 5999.99 2025-05-07T19:43:00.2453569Z clflush size : 64 2025-05-07T19:43:00.2453858Z cache_alignment : 64 2025-05-07T19:43:00.2454158Z address sizes : 46 bits physical, 48 bits virtual 2025-05-07T19:43:00.2454566Z power management: 2025-05-07T19:43:00.2454713Z 2025-05-07T19:43:00.2454802Z processor : 1 2025-05-07T19:43:00.2455048Z vendor_id : GenuineIntel 2025-05-07T19:43:00.2455299Z cpu family : 6 2025-05-07T19:43:00.2455537Z model : 85 2025-05-07T19:43:00.2455835Z model name : Intel(R) Xeon(R) Platinum 8275CL CPU @ 3.00GHz 2025-05-07T19:43:00.2456201Z stepping : 7 2025-05-07T19:43:00.2456429Z microcode : 0x5003901 2025-05-07T19:43:00.2456660Z cpu MHz : 1200.526 2025-05-07T19:43:00.2456895Z cache size : 36608 KB 2025-05-07T19:43:00.2457123Z physical id : 0 2025-05-07T19:43:00.2457349Z siblings : 48 2025-05-07T19:43:00.2457556Z core id : 1 2025-05-07T19:43:00.2457771Z cpu cores : 24 2025-05-07T19:43:00.2457976Z apicid : 2 2025-05-07T19:43:00.2458216Z initial apicid : 2 2025-05-07T19:43:00.2458456Z fpu : yes 2025-05-07T19:43:00.2458705Z fpu_exception : yes 2025-05-07T19:43:00.2458960Z cpuid level : 13 2025-05-07T19:43:00.2459218Z wp : yes 2025-05-07T19:43:00.2461548Z flags : fpu vme de pse tsc msr pae mce cx8 apic sep mtrr pge mca cmov pat pse36 clflush mmx fxsr sse sse2 ss ht syscall nx pdpe1gb rdtscp lm constant_tsc arch_perfmon rep_good nopl xtopology nonstop_tsc cpuid aperfmperf tsc_known_freq pni pclmulqdq monitor ssse3 fma cx16 pcid sse4_1 sse4_2 x2apic movbe popcnt tsc_deadline_timer aes xsave avx f16c rdrand hypervisor lahf_lm abm 3dnowprefetch invpcid_single pti fsgsbase tsc_adjust bmi1 avx2 smep bmi2 erms invpcid mpx avx512f avx512dq rdseed adx smap clflushopt clwb avx512cd avx512bw avx512vl xsaveopt xsavec xgetbv1 xsaves ida arat pku ospke avx512_vnni 2025-05-07T19:43:00.2464236Z bugs : cpu_meltdown spectre_v1 spectre_v2 spec_store_bypass l1tf mds swapgs itlb_multihit mmio_stale_data retbleed gds bhi 2025-05-07T19:43:00.2465049Z bogomips : 5999.99 2025-05-07T19:43:00.2465324Z clflush size : 64 2025-05-07T19:43:00.2465564Z cache_alignment : 64 2025-05-07T19:43:00.2465889Z address sizes : 46 bits physical, 48 bits virtual 2025-05-07T19:43:00.2466240Z power management: 2025-05-07T19:43:00.2466416Z 2025-05-07T19:43:00.2466536Z processor : 2 2025-05-07T19:43:00.2466775Z vendor_id : GenuineIntel 2025-05-07T19:43:00.2467070Z cpu family : 6 2025-05-07T19:43:00.2467298Z model : 85 2025-05-07T19:43:00.2467631Z model name : Intel(R) Xeon(R) Platinum 8275CL CPU @ 3.00GHz 2025-05-07T19:43:00.2468046Z stepping : 7 2025-05-07T19:43:00.2468279Z microcode : 0x5003901 2025-05-07T19:43:00.2468567Z cpu MHz : 1199.735 2025-05-07T19:43:00.2468808Z cache size : 36608 KB 2025-05-07T19:43:00.2469083Z physical id : 0 2025-05-07T19:43:00.2469316Z siblings : 48 2025-05-07T19:43:00.2469570Z core id : 2 2025-05-07T19:43:00.2469791Z cpu cores : 24 2025-05-07T19:43:00.2470045Z apicid : 4 2025-05-07T19:43:00.2470267Z initial apicid : 4 2025-05-07T19:43:00.2470537Z fpu : yes 2025-05-07T19:43:00.2470771Z fpu_exception : yes 2025-05-07T19:43:00.2471046Z cpuid level : 13 2025-05-07T19:43:00.2471303Z wp : yes 2025-05-07T19:43:00.2473747Z flags : fpu vme de pse tsc msr pae mce cx8 apic sep mtrr pge mca cmov pat pse36 clflush mmx fxsr sse sse2 ss ht syscall nx pdpe1gb rdtscp lm constant_tsc arch_perfmon rep_good nopl xtopology nonstop_tsc cpuid aperfmperf tsc_known_freq pni pclmulqdq monitor ssse3 fma cx16 pcid sse4_1 sse4_2 x2apic movbe popcnt tsc_deadline_timer aes xsave avx f16c rdrand hypervisor lahf_lm abm 3dnowprefetch invpcid_single pti fsgsbase tsc_adjust bmi1 avx2 smep bmi2 erms invpcid mpx avx512f avx512dq rdseed adx smap clflushopt clwb avx512cd avx512bw avx512vl xsaveopt xsavec xgetbv1 xsaves ida arat pku ospke avx512_vnni 2025-05-07T19:43:00.2476481Z bugs : cpu_meltdown spectre_v1 spectre_v2 spec_store_bypass l1tf mds swapgs itlb_multihit mmio_stale_data retbleed gds bhi 2025-05-07T19:43:00.2477125Z bogomips : 5999.99 2025-05-07T19:43:00.2477362Z clflush size : 64 2025-05-07T19:43:00.2477778Z cache_alignment : 64 2025-05-07T19:43:00.2478087Z address sizes : 46 bits physical, 48 bits virtual 2025-05-07T19:43:00.2478470Z power management: 2025-05-07T19:43:00.2478619Z 2025-05-07T19:43:00.2478720Z processor : 3 2025-05-07T19:43:00.2479086Z vendor_id : GenuineIntel 2025-05-07T19:43:00.2479359Z cpu family : 6 2025-05-07T19:43:00.2479615Z model : 85 2025-05-07T19:43:00.2479946Z model name : Intel(R) Xeon(R) Platinum 8275CL CPU @ 3.00GHz 2025-05-07T19:43:00.2480328Z stepping : 7 2025-05-07T19:43:00.2480588Z microcode : 0x5003901 2025-05-07T19:43:00.2480844Z cpu MHz : 1199.219 2025-05-07T19:43:00.2481118Z cache size : 36608 KB 2025-05-07T19:43:00.2481369Z physical id : 0 2025-05-07T19:43:00.2481623Z siblings : 48 2025-05-07T19:43:00.2481846Z core id : 3 2025-05-07T19:43:00.2482090Z cpu cores : 24 2025-05-07T19:43:00.2482316Z apicid : 6 2025-05-07T19:43:00.2482575Z initial apicid : 6 2025-05-07T19:43:00.2482840Z fpu : yes 2025-05-07T19:43:00.2483063Z fpu_exception : yes 2025-05-07T19:43:00.2483450Z cpuid level : 13 2025-05-07T19:43:00.2483672Z wp : yes 2025-05-07T19:43:00.2485852Z flags : fpu vme de pse tsc msr pae mce cx8 apic sep mtrr pge mca cmov pat pse36 clflush mmx fxsr sse sse2 ss ht syscall nx pdpe1gb rdtscp lm constant_tsc arch_perfmon rep_good nopl xtopology nonstop_tsc cpuid aperfmperf tsc_known_freq pni pclmulqdq monitor ssse3 fma cx16 pcid sse4_1 sse4_2 x2apic movbe popcnt tsc_deadline_timer aes xsave avx f16c rdrand hypervisor lahf_lm abm 3dnowprefetch invpcid_single pti fsgsbase tsc_adjust bmi1 avx2 smep bmi2 erms invpcid mpx avx512f avx512dq rdseed adx smap clflushopt clwb avx512cd avx512bw avx512vl xsaveopt xsavec xgetbv1 xsaves ida arat pku ospke avx512_vnni 2025-05-07T19:43:00.2488364Z bugs : cpu_meltdown spectre_v1 spectre_v2 spec_store_bypass l1tf mds swapgs itlb_multihit mmio_stale_data retbleed gds bhi 2025-05-07T19:43:00.2488946Z bogomips : 5999.99 2025-05-07T19:43:00.2489210Z clflush size : 64 2025-05-07T19:43:00.2489443Z cache_alignment : 64 2025-05-07T19:43:00.2489764Z address sizes : 46 bits physical, 48 bits virtual 2025-05-07T19:43:00.2490130Z power management: 2025-05-07T19:43:00.2490271Z 2025-05-07T19:43:00.2490366Z processor : 4 2025-05-07T19:43:00.2490632Z vendor_id : GenuineIntel 2025-05-07T19:43:00.2490894Z cpu family : 6 2025-05-07T19:43:00.2491141Z model : 85 2025-05-07T19:43:00.2491433Z model name : Intel(R) Xeon(R) Platinum 8275CL CPU @ 3.00GHz 2025-05-07T19:43:00.2491823Z stepping : 7 2025-05-07T19:43:00.2492049Z microcode : 0x5003901 2025-05-07T19:43:00.2492322Z cpu MHz : 1200.586 2025-05-07T19:43:00.2492559Z cache size : 36608 KB 2025-05-07T19:43:00.2492836Z physical id : 0 2025-05-07T19:43:00.2493087Z siblings : 48 2025-05-07T19:43:00.2493314Z core id : 4 2025-05-07T19:43:00.2493566Z cpu cores : 24 2025-05-07T19:43:00.2493781Z apicid : 8 2025-05-07T19:43:00.2494007Z initial apicid : 8 2025-05-07T19:43:00.2494231Z fpu : yes 2025-05-07T19:43:00.2494465Z fpu_exception : yes 2025-05-07T19:43:00.2494688Z cpuid level : 13 2025-05-07T19:43:00.2494927Z wp : yes 2025-05-07T19:43:00.2497071Z flags : fpu vme de pse tsc msr pae mce cx8 apic sep mtrr pge mca cmov pat pse36 clflush mmx fxsr sse sse2 ss ht syscall nx pdpe1gb rdtscp lm constant_tsc arch_perfmon rep_good nopl xtopology nonstop_tsc cpuid aperfmperf tsc_known_freq pni pclmulqdq monitor ssse3 fma cx16 pcid sse4_1 sse4_2 x2apic movbe popcnt tsc_deadline_timer aes xsave avx f16c rdrand hypervisor lahf_lm abm 3dnowprefetch invpcid_single pti fsgsbase tsc_adjust bmi1 avx2 smep bmi2 erms invpcid mpx avx512f avx512dq rdseed adx smap clflushopt clwb avx512cd avx512bw avx512vl xsaveopt xsavec xgetbv1 xsaves ida arat pku ospke avx512_vnni 2025-05-07T19:43:00.2499579Z bugs : cpu_meltdown spectre_v1 spectre_v2 spec_store_bypass l1tf mds swapgs itlb_multihit mmio_stale_data retbleed gds bhi 2025-05-07T19:43:00.2500168Z bogomips : 5999.99 2025-05-07T19:43:00.2500393Z clflush size : 64 2025-05-07T19:43:00.2500609Z cache_alignment : 64 2025-05-07T19:43:00.2500877Z address sizes : 46 bits physical, 48 bits virtual 2025-05-07T19:43:00.2501255Z power management: 2025-05-07T19:43:00.2501384Z 2025-05-07T19:43:00.2501485Z processor : 5 2025-05-07T19:43:00.2501692Z vendor_id : GenuineIntel 2025-05-07T19:43:00.2501932Z cpu family : 6 2025-05-07T19:43:00.2502172Z model : 85 2025-05-07T19:43:00.2502450Z model name : Intel(R) Xeon(R) Platinum 8275CL CPU @ 3.00GHz 2025-05-07T19:43:00.2502802Z stepping : 7 2025-05-07T19:43:00.2503042Z microcode : 0x5003901 2025-05-07T19:43:00.2503277Z cpu MHz : 1201.803 2025-05-07T19:43:00.2503527Z cache size : 36608 KB 2025-05-07T19:43:00.2503765Z physical id : 0 2025-05-07T19:43:00.2504011Z siblings : 48 2025-05-07T19:43:00.2504244Z core id : 5 2025-05-07T19:43:00.2504461Z cpu cores : 24 2025-05-07T19:43:00.2504698Z apicid : 10 2025-05-07T19:43:00.2504917Z initial apicid : 10 2025-05-07T19:43:00.2505166Z fpu : yes 2025-05-07T19:43:00.2505378Z fpu_exception : yes 2025-05-07T19:43:00.2505617Z cpuid level : 13 2025-05-07T19:43:00.2505822Z wp : yes 2025-05-07T19:43:00.2507951Z flags : fpu vme de pse tsc msr pae mce cx8 apic sep mtrr pge mca cmov pat pse36 clflush mmx fxsr sse sse2 ss ht syscall nx pdpe1gb rdtscp lm constant_tsc arch_perfmon rep_good nopl xtopology nonstop_tsc cpuid aperfmperf tsc_known_freq pni pclmulqdq monitor ssse3 fma cx16 pcid sse4_1 sse4_2 x2apic movbe popcnt tsc_deadline_timer aes xsave avx f16c rdrand hypervisor lahf_lm abm 3dnowprefetch invpcid_single pti fsgsbase tsc_adjust bmi1 avx2 smep bmi2 erms invpcid mpx avx512f avx512dq rdseed adx smap clflushopt clwb avx512cd avx512bw avx512vl xsaveopt xsavec xgetbv1 xsaves ida arat pku ospke avx512_vnni 2025-05-07T19:43:00.2510448Z bugs : cpu_meltdown spectre_v1 spectre_v2 spec_store_bypass l1tf mds swapgs itlb_multihit mmio_stale_data retbleed gds bhi 2025-05-07T19:43:00.2511017Z bogomips : 5999.99 2025-05-07T19:43:00.2511264Z clflush size : 64 2025-05-07T19:43:00.2511571Z cache_alignment : 64 2025-05-07T19:43:00.2512057Z address sizes : 46 bits physical, 48 bits virtual 2025-05-07T19:43:00.2512527Z power management: 2025-05-07T19:43:00.2512678Z 2025-05-07T19:43:00.2512777Z processor : 6 2025-05-07T19:43:00.2513041Z vendor_id : GenuineIntel 2025-05-07T19:43:00.2513301Z cpu family : 6 2025-05-07T19:43:00.2513553Z model : 85 2025-05-07T19:43:00.2513852Z model name : Intel(R) Xeon(R) Platinum 8275CL CPU @ 3.00GHz 2025-05-07T19:43:00.2514251Z stepping : 7 2025-05-07T19:43:00.2514480Z microcode : 0x5003901 2025-05-07T19:43:00.2514756Z cpu MHz : 2999.996 2025-05-07T19:43:00.2514999Z cache size : 36608 KB 2025-05-07T19:43:00.2515272Z physical id : 0 2025-05-07T19:43:00.2515527Z siblings : 48 2025-05-07T19:43:00.2515752Z core id : 6 2025-05-07T19:43:00.2515993Z cpu cores : 24 2025-05-07T19:43:00.2516216Z apicid : 12 2025-05-07T19:43:00.2516465Z initial apicid : 12 2025-05-07T19:43:00.2516702Z fpu : yes 2025-05-07T19:43:00.2516944Z fpu_exception : yes 2025-05-07T19:43:00.2517184Z cpuid level : 13 2025-05-07T19:43:00.2517435Z wp : yes 2025-05-07T19:43:00.2519739Z flags : fpu vme de pse tsc msr pae mce cx8 apic sep mtrr pge mca cmov pat pse36 clflush mmx fxsr sse sse2 ss ht syscall nx pdpe1gb rdtscp lm constant_tsc arch_perfmon rep_good nopl xtopology nonstop_tsc cpuid aperfmperf tsc_known_freq pni pclmulqdq monitor ssse3 fma cx16 pcid sse4_1 sse4_2 x2apic movbe popcnt tsc_deadline_timer aes xsave avx f16c rdrand hypervisor lahf_lm abm 3dnowprefetch invpcid_single pti fsgsbase tsc_adjust bmi1 avx2 smep bmi2 erms invpcid mpx avx512f avx512dq rdseed adx smap clflushopt clwb avx512cd avx512bw avx512vl xsaveopt xsavec xgetbv1 xsaves ida arat pku ospke avx512_vnni 2025-05-07T19:43:00.2522463Z bugs : cpu_meltdown spectre_v1 spectre_v2 spec_store_bypass l1tf mds swapgs itlb_multihit mmio_stale_data retbleed gds bhi 2025-05-07T19:43:00.2523097Z bogomips : 5999.99 2025-05-07T19:43:00.2523339Z clflush size : 64 2025-05-07T19:43:00.2523607Z cache_alignment : 64 2025-05-07T19:43:00.2523936Z address sizes : 46 bits physical, 48 bits virtual 2025-05-07T19:43:00.2524371Z power management: 2025-05-07T19:43:00.2524507Z 2025-05-07T19:43:00.2524700Z processor : 7 2025-05-07T19:43:00.2524930Z vendor_id : GenuineIntel 2025-05-07T19:43:00.2525202Z cpu family : 6 2025-05-07T19:43:00.2525415Z model : 85 2025-05-07T19:43:00.2525724Z model name : Intel(R) Xeon(R) Platinum 8275CL CPU @ 3.00GHz 2025-05-07T19:43:00.2526137Z stepping : 7 2025-05-07T19:43:00.2526383Z microcode : 0x5003901 2025-05-07T19:43:00.2526616Z cpu MHz : 1429.211 2025-05-07T19:43:00.2526876Z cache size : 36608 KB 2025-05-07T19:43:00.2527106Z physical id : 0 2025-05-07T19:43:00.2527362Z siblings : 48 2025-05-07T19:43:00.2527606Z core id : 7 2025-05-07T19:43:00.2527819Z cpu cores : 24 2025-05-07T19:43:00.2528063Z apicid : 14 2025-05-07T19:43:00.2528280Z initial apicid : 14 2025-05-07T19:43:00.2528535Z fpu : yes 2025-05-07T19:43:00.2528751Z fpu_exception : yes 2025-05-07T19:43:00.2529016Z cpuid level : 13 2025-05-07T19:43:00.2529239Z wp : yes 2025-05-07T19:43:00.2531397Z flags : fpu vme de pse tsc msr pae mce cx8 apic sep mtrr pge mca cmov pat pse36 clflush mmx fxsr sse sse2 ss ht syscall nx pdpe1gb rdtscp lm constant_tsc arch_perfmon rep_good nopl xtopology nonstop_tsc cpuid aperfmperf tsc_known_freq pni pclmulqdq monitor ssse3 fma cx16 pcid sse4_1 sse4_2 x2apic movbe popcnt tsc_deadline_timer aes xsave avx f16c rdrand hypervisor lahf_lm abm 3dnowprefetch invpcid_single pti fsgsbase tsc_adjust bmi1 avx2 smep bmi2 erms invpcid mpx avx512f avx512dq rdseed adx smap clflushopt clwb avx512cd avx512bw avx512vl xsaveopt xsavec xgetbv1 xsaves ida arat pku ospke avx512_vnni 2025-05-07T19:43:00.2534147Z bugs : cpu_meltdown spectre_v1 spectre_v2 spec_store_bypass l1tf mds swapgs itlb_multihit mmio_stale_data retbleed gds bhi 2025-05-07T19:43:00.2535089Z bogomips : 5999.99 2025-05-07T19:43:00.2535361Z clflush size : 64 2025-05-07T19:43:00.2535633Z cache_alignment : 64 2025-05-07T19:43:00.2536067Z address sizes : 46 bits physical, 48 bits virtual 2025-05-07T19:43:00.2536440Z power management: 2025-05-07T19:43:00.2562216Z 2025-05-07T19:43:00.2562484Z processor : 8 2025-05-07T19:43:00.2562977Z vendor_id : GenuineIntel 2025-05-07T19:43:00.2563373Z cpu family : 6 2025-05-07T19:43:00.2563742Z model : 85 2025-05-07T19:43:00.2564092Z model name : Intel(R) Xeon(R) Platinum 8275CL CPU @ 3.00GHz 2025-05-07T19:43:00.2564485Z stepping : 7 2025-05-07T19:43:00.2565328Z microcode : 0x5003901 2025-05-07T19:43:00.2565587Z cpu MHz : 2999.996 2025-05-07T19:43:00.2565866Z cache size : 36608 KB 2025-05-07T19:43:00.2566123Z physical id : 0 2025-05-07T19:43:00.2566396Z siblings : 48 2025-05-07T19:43:00.2566630Z core id : 8 2025-05-07T19:43:00.2566894Z cpu cores : 24 2025-05-07T19:43:00.2567155Z apicid : 16 2025-05-07T19:43:00.2567392Z initial apicid : 16 2025-05-07T19:43:00.2567656Z fpu : yes 2025-05-07T19:43:00.2567914Z fpu_exception : yes 2025-05-07T19:43:00.2568189Z cpuid level : 13 2025-05-07T19:43:00.2568428Z wp : yes 2025-05-07T19:43:00.2570790Z flags : fpu vme de pse tsc msr pae mce cx8 apic sep mtrr pge mca cmov pat pse36 clflush mmx fxsr sse sse2 ss ht syscall nx pdpe1gb rdtscp lm constant_tsc arch_perfmon rep_good nopl xtopology nonstop_tsc cpuid aperfmperf tsc_known_freq pni pclmulqdq monitor ssse3 fma cx16 pcid sse4_1 sse4_2 x2apic movbe popcnt tsc_deadline_timer aes xsave avx f16c rdrand hypervisor lahf_lm abm 3dnowprefetch invpcid_single pti fsgsbase tsc_adjust bmi1 avx2 smep bmi2 erms invpcid mpx avx512f avx512dq rdseed adx smap clflushopt clwb avx512cd avx512bw avx512vl xsaveopt xsavec xgetbv1 xsaves ida arat pku ospke avx512_vnni 2025-05-07T19:43:00.2573540Z bugs : cpu_meltdown spectre_v1 spectre_v2 spec_store_bypass l1tf mds swapgs itlb_multihit mmio_stale_data retbleed gds bhi 2025-05-07T19:43:00.2574162Z bogomips : 5999.99 2025-05-07T19:43:00.2574431Z clflush size : 64 2025-05-07T19:43:00.2574676Z cache_alignment : 64 2025-05-07T19:43:00.2574999Z address sizes : 46 bits physical, 48 bits virtual 2025-05-07T19:43:00.2575347Z power management: 2025-05-07T19:43:00.2575518Z 2025-05-07T19:43:00.2575614Z processor : 9 2025-05-07T19:43:00.2575875Z vendor_id : GenuineIntel 2025-05-07T19:43:00.2576311Z cpu family : 6 2025-05-07T19:43:00.2576567Z model : 85 2025-05-07T19:43:00.2576867Z model name : Intel(R) Xeon(R) Platinum 8275CL CPU @ 3.00GHz 2025-05-07T19:43:00.2577379Z stepping : 7 2025-05-07T19:43:00.2577605Z microcode : 0x5003901 2025-05-07T19:43:00.2577958Z cpu MHz : 1199.129 2025-05-07T19:43:00.2578200Z cache size : 36608 KB 2025-05-07T19:43:00.2578479Z physical id : 0 2025-05-07T19:43:00.2578716Z siblings : 48 2025-05-07T19:43:00.2578964Z core id : 9 2025-05-07T19:43:00.2579187Z cpu cores : 24 2025-05-07T19:43:00.2579443Z apicid : 18 2025-05-07T19:43:00.2579696Z initial apicid : 18 2025-05-07T19:43:00.2580036Z fpu : yes 2025-05-07T19:43:00.2580272Z fpu_exception : yes 2025-05-07T19:43:00.2580505Z cpuid level : 13 2025-05-07T19:43:00.2580744Z wp : yes 2025-05-07T19:43:00.2582898Z flags : fpu vme de pse tsc msr pae mce cx8 apic sep mtrr pge mca cmov pat pse36 clflush mmx fxsr sse sse2 ss ht syscall nx pdpe1gb rdtscp lm constant_tsc arch_perfmon rep_good nopl xtopology nonstop_tsc cpuid aperfmperf tsc_known_freq pni pclmulqdq monitor ssse3 fma cx16 pcid sse4_1 sse4_2 x2apic movbe popcnt tsc_deadline_timer aes xsave avx f16c rdrand hypervisor lahf_lm abm 3dnowprefetch invpcid_single pti fsgsbase tsc_adjust bmi1 avx2 smep bmi2 erms invpcid mpx avx512f avx512dq rdseed adx smap clflushopt clwb avx512cd avx512bw avx512vl xsaveopt xsavec xgetbv1 xsaves ida arat pku ospke avx512_vnni 2025-05-07T19:43:00.2585412Z bugs : cpu_meltdown spectre_v1 spectre_v2 spec_store_bypass l1tf mds swapgs itlb_multihit mmio_stale_data retbleed gds bhi 2025-05-07T19:43:00.2586012Z bogomips : 5999.99 2025-05-07T19:43:00.2586239Z clflush size : 64 2025-05-07T19:43:00.2586495Z cache_alignment : 64 2025-05-07T19:43:00.2586778Z address sizes : 46 bits physical, 48 bits virtual 2025-05-07T19:43:00.2587165Z power management: 2025-05-07T19:43:00.2587309Z 2025-05-07T19:43:00.2587427Z processor : 10 2025-05-07T19:43:00.2587659Z vendor_id : GenuineIntel 2025-05-07T19:43:00.2587930Z cpu family : 6 2025-05-07T19:43:00.2588145Z model : 85 2025-05-07T19:43:00.2588462Z model name : Intel(R) Xeon(R) Platinum 8275CL CPU @ 3.00GHz 2025-05-07T19:43:00.2588820Z stepping : 7 2025-05-07T19:43:00.2589068Z microcode : 0x5003901 2025-05-07T19:43:00.2589305Z cpu MHz : 2999.996 2025-05-07T19:43:00.2589560Z cache size : 36608 KB 2025-05-07T19:43:00.2589799Z physical id : 0 2025-05-07T19:43:00.2590043Z siblings : 48 2025-05-07T19:43:00.2590260Z core id : 10 2025-05-07T19:43:00.2590501Z cpu cores : 24 2025-05-07T19:43:00.2590748Z apicid : 20 2025-05-07T19:43:00.2590959Z initial apicid : 20 2025-05-07T19:43:00.2591197Z fpu : yes 2025-05-07T19:43:00.2591497Z fpu_exception : yes 2025-05-07T19:43:00.2591751Z cpuid level : 13 2025-05-07T19:43:00.2592151Z wp : yes 2025-05-07T19:43:00.2594508Z flags : fpu vme de pse tsc msr pae mce cx8 apic sep mtrr pge mca cmov pat pse36 clflush mmx fxsr sse sse2 ss ht syscall nx pdpe1gb rdtscp lm constant_tsc arch_perfmon rep_good nopl xtopology nonstop_tsc cpuid aperfmperf tsc_known_freq pni pclmulqdq monitor ssse3 fma cx16 pcid sse4_1 sse4_2 x2apic movbe popcnt tsc_deadline_timer aes xsave avx f16c rdrand hypervisor lahf_lm abm 3dnowprefetch invpcid_single pti fsgsbase tsc_adjust bmi1 avx2 smep bmi2 erms invpcid mpx avx512f avx512dq rdseed adx smap clflushopt clwb avx512cd avx512bw avx512vl xsaveopt xsavec xgetbv1 xsaves ida arat pku ospke avx512_vnni 2025-05-07T19:43:00.2597225Z bugs : cpu_meltdown spectre_v1 spectre_v2 spec_store_bypass l1tf mds swapgs itlb_multihit mmio_stale_data retbleed gds bhi 2025-05-07T19:43:00.2597907Z bogomips : 5999.99 2025-05-07T19:43:00.2598169Z clflush size : 64 2025-05-07T19:43:00.2598403Z cache_alignment : 64 2025-05-07T19:43:00.2598723Z address sizes : 46 bits physical, 48 bits virtual 2025-05-07T19:43:00.2599069Z power management: 2025-05-07T19:43:00.2599239Z 2025-05-07T19:43:00.2599334Z processor : 11 2025-05-07T19:43:00.2599599Z vendor_id : GenuineIntel 2025-05-07T19:43:00.2599858Z cpu family : 6 2025-05-07T19:43:00.2600111Z model : 85 2025-05-07T19:43:00.2600412Z model name : Intel(R) Xeon(R) Platinum 8275CL CPU @ 3.00GHz 2025-05-07T19:43:00.2600929Z stepping : 7 2025-05-07T19:43:00.2601155Z microcode : 0x5003901 2025-05-07T19:43:00.2601431Z cpu MHz : 2999.996 2025-05-07T19:43:00.2601675Z cache size : 36608 KB 2025-05-07T19:43:00.2602019Z physical id : 0 2025-05-07T19:43:00.2602260Z siblings : 48 2025-05-07T19:43:00.2602514Z core id : 11 2025-05-07T19:43:00.2602742Z cpu cores : 24 2025-05-07T19:43:00.2602995Z apicid : 22 2025-05-07T19:43:00.2603250Z initial apicid : 22 2025-05-07T19:43:00.2603491Z fpu : yes 2025-05-07T19:43:00.2603744Z fpu_exception : yes 2025-05-07T19:43:00.2603984Z cpuid level : 13 2025-05-07T19:43:00.2604335Z wp : yes 2025-05-07T19:43:00.2606470Z flags : fpu vme de pse tsc msr pae mce cx8 apic sep mtrr pge mca cmov pat pse36 clflush mmx fxsr sse sse2 ss ht syscall nx pdpe1gb rdtscp lm constant_tsc arch_perfmon rep_good nopl xtopology nonstop_tsc cpuid aperfmperf tsc_known_freq pni pclmulqdq monitor ssse3 fma cx16 pcid sse4_1 sse4_2 x2apic movbe popcnt tsc_deadline_timer aes xsave avx f16c rdrand hypervisor lahf_lm abm 3dnowprefetch invpcid_single pti fsgsbase tsc_adjust bmi1 avx2 smep bmi2 erms invpcid mpx avx512f avx512dq rdseed adx smap clflushopt clwb avx512cd avx512bw avx512vl xsaveopt xsavec xgetbv1 xsaves ida arat pku ospke avx512_vnni 2025-05-07T19:43:00.2608982Z bugs : cpu_meltdown spectre_v1 spectre_v2 spec_store_bypass l1tf mds swapgs itlb_multihit mmio_stale_data retbleed gds bhi 2025-05-07T19:43:00.2609580Z bogomips : 5999.99 2025-05-07T19:43:00.2609810Z clflush size : 64 2025-05-07T19:43:00.2610062Z cache_alignment : 64 2025-05-07T19:43:00.2610335Z address sizes : 46 bits physical, 48 bits virtual 2025-05-07T19:43:00.2610686Z power management: 2025-05-07T19:43:00.2610823Z 2025-05-07T19:43:00.2610944Z processor : 12 2025-05-07T19:43:00.2611176Z vendor_id : GenuineIntel 2025-05-07T19:43:00.2611453Z cpu family : 6 2025-05-07T19:43:00.2611668Z model : 85 2025-05-07T19:43:00.2611967Z model name : Intel(R) Xeon(R) Platinum 8275CL CPU @ 3.00GHz 2025-05-07T19:43:00.2612324Z stepping : 7 2025-05-07T19:43:00.2612572Z microcode : 0x5003901 2025-05-07T19:43:00.2612812Z cpu MHz : 2999.996 2025-05-07T19:43:00.2613062Z cache size : 36608 KB 2025-05-07T19:43:00.2613285Z physical id : 0 2025-05-07T19:43:00.2613536Z siblings : 48 2025-05-07T19:43:00.2613758Z core id : 12 2025-05-07T19:43:00.2613995Z cpu cores : 24 2025-05-07T19:43:00.2614240Z apicid : 24 2025-05-07T19:43:00.2614459Z initial apicid : 24 2025-05-07T19:43:00.2614712Z fpu : yes 2025-05-07T19:43:00.2614922Z fpu_exception : yes 2025-05-07T19:43:00.2615173Z cpuid level : 13 2025-05-07T19:43:00.2615385Z wp : yes 2025-05-07T19:43:00.2617547Z flags : fpu vme de pse tsc msr pae mce cx8 apic sep mtrr pge mca cmov pat pse36 clflush mmx fxsr sse sse2 ss ht syscall nx pdpe1gb rdtscp lm constant_tsc arch_perfmon rep_good nopl xtopology nonstop_tsc cpuid aperfmperf tsc_known_freq pni pclmulqdq monitor ssse3 fma cx16 pcid sse4_1 sse4_2 x2apic movbe popcnt tsc_deadline_timer aes xsave avx f16c rdrand hypervisor lahf_lm abm 3dnowprefetch invpcid_single pti fsgsbase tsc_adjust bmi1 avx2 smep bmi2 erms invpcid mpx avx512f avx512dq rdseed adx smap clflushopt clwb avx512cd avx512bw avx512vl xsaveopt xsavec xgetbv1 xsaves ida arat pku ospke avx512_vnni 2025-05-07T19:43:00.2620049Z bugs : cpu_meltdown spectre_v1 spectre_v2 spec_store_bypass l1tf mds swapgs itlb_multihit mmio_stale_data retbleed gds bhi 2025-05-07T19:43:00.2620614Z bogomips : 5999.99 2025-05-07T19:43:00.2620856Z clflush size : 64 2025-05-07T19:43:00.2621073Z cache_alignment : 64 2025-05-07T19:43:00.2621367Z address sizes : 46 bits physical, 48 bits virtual 2025-05-07T19:43:00.2621694Z power management: 2025-05-07T19:43:00.2621855Z 2025-05-07T19:43:00.2621947Z processor : 13 2025-05-07T19:43:00.2622185Z vendor_id : GenuineIntel 2025-05-07T19:43:00.2622436Z cpu family : 6 2025-05-07T19:43:00.2622676Z model : 85 2025-05-07T19:43:00.2622955Z model name : Intel(R) Xeon(R) Platinum 8275CL CPU @ 3.00GHz 2025-05-07T19:43:00.2623336Z stepping : 7 2025-05-07T19:43:00.2623618Z microcode : 0x5003901 2025-05-07T19:43:00.2623873Z cpu MHz : 2999.996 2025-05-07T19:43:00.2624106Z cache size : 36608 KB 2025-05-07T19:43:00.2624351Z physical id : 0 2025-05-07T19:43:00.2624583Z siblings : 48 2025-05-07T19:43:00.2624838Z core id : 13 2025-05-07T19:43:00.2625057Z cpu cores : 24 2025-05-07T19:43:00.2625251Z apicid : 26 2025-05-07T19:43:00.2625458Z initial apicid : 26 2025-05-07T19:43:00.2625663Z fpu : yes 2025-05-07T19:43:00.2625876Z fpu_exception : yes 2025-05-07T19:43:00.2626082Z cpuid level : 13 2025-05-07T19:43:00.2626287Z wp : yes 2025-05-07T19:43:00.2628407Z flags : fpu vme de pse tsc msr pae mce cx8 apic sep mtrr pge mca cmov pat pse36 clflush mmx fxsr sse sse2 ss ht syscall nx pdpe1gb rdtscp lm constant_tsc arch_perfmon rep_good nopl xtopology nonstop_tsc cpuid aperfmperf tsc_known_freq pni pclmulqdq monitor ssse3 fma cx16 pcid sse4_1 sse4_2 x2apic movbe popcnt tsc_deadline_timer aes xsave avx f16c rdrand hypervisor lahf_lm abm 3dnowprefetch invpcid_single pti fsgsbase tsc_adjust bmi1 avx2 smep bmi2 erms invpcid mpx avx512f avx512dq rdseed adx smap clflushopt clwb avx512cd avx512bw avx512vl xsaveopt xsavec xgetbv1 xsaves ida arat pku ospke avx512_vnni 2025-05-07T19:43:00.2630879Z bugs : cpu_meltdown spectre_v1 spectre_v2 spec_store_bypass l1tf mds swapgs itlb_multihit mmio_stale_data retbleed gds bhi 2025-05-07T19:43:00.2631513Z bogomips : 5999.99 2025-05-07T19:43:00.2631731Z clflush size : 64 2025-05-07T19:43:00.2632106Z cache_alignment : 64 2025-05-07T19:43:00.2632401Z address sizes : 46 bits physical, 48 bits virtual 2025-05-07T19:43:00.2632725Z power management: 2025-05-07T19:43:00.2632887Z 2025-05-07T19:43:00.2632976Z processor : 14 2025-05-07T19:43:00.2633203Z vendor_id : GenuineIntel 2025-05-07T19:43:00.2633455Z cpu family : 6 2025-05-07T19:43:00.2633659Z model : 85 2025-05-07T19:43:00.2633943Z model name : Intel(R) Xeon(R) Platinum 8275CL CPU @ 3.00GHz 2025-05-07T19:43:00.2634301Z stepping : 7 2025-05-07T19:43:00.2634520Z microcode : 0x5003901 2025-05-07T19:43:00.2634765Z cpu MHz : 2999.996 2025-05-07T19:43:00.2634982Z cache size : 36608 KB 2025-05-07T19:43:00.2635217Z physical id : 0 2025-05-07T19:43:00.2635423Z siblings : 48 2025-05-07T19:43:00.2635638Z core id : 14 2025-05-07T19:43:00.2635839Z cpu cores : 24 2025-05-07T19:43:00.2636068Z apicid : 28 2025-05-07T19:43:00.2636281Z initial apicid : 28 2025-05-07T19:43:00.2636503Z fpu : yes 2025-05-07T19:43:00.2636706Z fpu_exception : yes 2025-05-07T19:43:00.2636945Z cpuid level : 13 2025-05-07T19:43:00.2637150Z wp : yes 2025-05-07T19:43:00.2639449Z flags : fpu vme de pse tsc msr pae mce cx8 apic sep mtrr pge mca cmov pat pse36 clflush mmx fxsr sse sse2 ss ht syscall nx pdpe1gb rdtscp lm constant_tsc arch_perfmon rep_good nopl xtopology nonstop_tsc cpuid aperfmperf tsc_known_freq pni pclmulqdq monitor ssse3 fma cx16 pcid sse4_1 sse4_2 x2apic movbe popcnt tsc_deadline_timer aes xsave avx f16c rdrand hypervisor lahf_lm abm 3dnowprefetch invpcid_single pti fsgsbase tsc_adjust bmi1 avx2 smep bmi2 erms invpcid mpx avx512f avx512dq rdseed adx smap clflushopt clwb avx512cd avx512bw avx512vl xsaveopt xsavec xgetbv1 xsaves ida arat pku ospke avx512_vnni 2025-05-07T19:43:00.2642119Z bugs : cpu_meltdown spectre_v1 spectre_v2 spec_store_bypass l1tf mds swapgs itlb_multihit mmio_stale_data retbleed gds bhi 2025-05-07T19:43:00.2642708Z bogomips : 5999.99 2025-05-07T19:43:00.2642915Z clflush size : 64 2025-05-07T19:43:00.2643131Z cache_alignment : 64 2025-05-07T19:43:00.2643396Z address sizes : 46 bits physical, 48 bits virtual 2025-05-07T19:43:00.2643721Z power management: 2025-05-07T19:43:00.2643893Z 2025-05-07T19:43:00.2643976Z processor : 15 2025-05-07T19:43:00.2644213Z vendor_id : GenuineIntel 2025-05-07T19:43:00.2644463Z cpu family : 6 2025-05-07T19:43:00.2644690Z model : 85 2025-05-07T19:43:00.2644965Z model name : Intel(R) Xeon(R) Platinum 8275CL CPU @ 3.00GHz 2025-05-07T19:43:00.2645324Z stepping : 7 2025-05-07T19:43:00.2645540Z microcode : 0x5003901 2025-05-07T19:43:00.2645764Z cpu MHz : 2999.996 2025-05-07T19:43:00.2646068Z cache size : 36608 KB 2025-05-07T19:43:00.2646308Z physical id : 0 2025-05-07T19:43:00.2646544Z siblings : 48 2025-05-07T19:43:00.2646760Z core id : 15 2025-05-07T19:43:00.2646983Z cpu cores : 24 2025-05-07T19:43:00.2647188Z apicid : 30 2025-05-07T19:43:00.2647461Z initial apicid : 30 2025-05-07T19:43:00.2647668Z fpu : yes 2025-05-07T19:43:00.2647889Z fpu_exception : yes 2025-05-07T19:43:00.2648117Z cpuid level : 13 2025-05-07T19:43:00.2648357Z wp : yes 2025-05-07T19:43:00.2650651Z flags : fpu vme de pse tsc msr pae mce cx8 apic sep mtrr pge mca cmov pat pse36 clflush mmx fxsr sse sse2 ss ht syscall nx pdpe1gb rdtscp lm constant_tsc arch_perfmon rep_good nopl xtopology nonstop_tsc cpuid aperfmperf tsc_known_freq pni pclmulqdq monitor ssse3 fma cx16 pcid sse4_1 sse4_2 x2apic movbe popcnt tsc_deadline_timer aes xsave avx f16c rdrand hypervisor lahf_lm abm 3dnowprefetch invpcid_single pti fsgsbase tsc_adjust bmi1 avx2 smep bmi2 erms invpcid mpx avx512f avx512dq rdseed adx smap clflushopt clwb avx512cd avx512bw avx512vl xsaveopt xsavec xgetbv1 xsaves ida arat pku ospke avx512_vnni 2025-05-07T19:43:00.2653328Z bugs : cpu_meltdown spectre_v1 spectre_v2 spec_store_bypass l1tf mds swapgs itlb_multihit mmio_stale_data retbleed gds bhi 2025-05-07T19:43:00.2653942Z bogomips : 5999.99 2025-05-07T19:43:00.2654168Z clflush size : 64 2025-05-07T19:43:00.2654384Z cache_alignment : 64 2025-05-07T19:43:00.2654828Z address sizes : 46 bits physical, 48 bits virtual 2025-05-07T19:43:00.2655162Z power management: 2025-05-07T19:43:00.2655311Z 2025-05-07T19:43:00.2655400Z processor : 16 2025-05-07T19:43:00.2655623Z vendor_id : GenuineIntel 2025-05-07T19:43:00.2655883Z cpu family : 6 2025-05-07T19:43:00.2656087Z model : 85 2025-05-07T19:43:00.2656376Z model name : Intel(R) Xeon(R) Platinum 8275CL CPU @ 3.00GHz 2025-05-07T19:43:00.2656748Z stepping : 7 2025-05-07T19:43:00.2656970Z microcode : 0x5003901 2025-05-07T19:43:00.2657218Z cpu MHz : 1199.809 2025-05-07T19:43:00.2657441Z cache size : 36608 KB 2025-05-07T19:43:00.2657680Z physical id : 0 2025-05-07T19:43:00.2657898Z siblings : 48 2025-05-07T19:43:00.2658111Z core id : 16 2025-05-07T19:43:00.2658323Z cpu cores : 24 2025-05-07T19:43:00.2658557Z apicid : 32 2025-05-07T19:43:00.2658766Z initial apicid : 32 2025-05-07T19:43:00.2659007Z fpu : yes 2025-05-07T19:43:00.2659215Z fpu_exception : yes 2025-05-07T19:43:00.2659457Z cpuid level : 13 2025-05-07T19:43:00.2659668Z wp : yes 2025-05-07T19:43:00.2662137Z flags : fpu vme de pse tsc msr pae mce cx8 apic sep mtrr pge mca cmov pat pse36 clflush mmx fxsr sse sse2 ss ht syscall nx pdpe1gb rdtscp lm constant_tsc arch_perfmon rep_good nopl xtopology nonstop_tsc cpuid aperfmperf tsc_known_freq pni pclmulqdq monitor ssse3 fma cx16 pcid sse4_1 sse4_2 x2apic movbe popcnt tsc_deadline_timer aes xsave avx f16c rdrand hypervisor lahf_lm abm 3dnowprefetch invpcid_single pti fsgsbase tsc_adjust bmi1 avx2 smep bmi2 erms invpcid mpx avx512f avx512dq rdseed adx smap clflushopt clwb avx512cd avx512bw avx512vl xsaveopt xsavec xgetbv1 xsaves ida arat pku ospke avx512_vnni 2025-05-07T19:43:00.2665036Z bugs : cpu_meltdown spectre_v1 spectre_v2 spec_store_bypass l1tf mds swapgs itlb_multihit mmio_stale_data retbleed gds bhi 2025-05-07T19:43:00.2665740Z bogomips : 5999.99 2025-05-07T19:43:00.2665955Z clflush size : 64 2025-05-07T19:43:00.2666184Z cache_alignment : 64 2025-05-07T19:43:00.2666458Z address sizes : 46 bits physical, 48 bits virtual 2025-05-07T19:43:00.2666790Z power management: 2025-05-07T19:43:00.2666928Z 2025-05-07T19:43:00.2667031Z processor : 17 2025-05-07T19:43:00.2667245Z vendor_id : GenuineIntel 2025-05-07T19:43:00.2667491Z cpu family : 6 2025-05-07T19:43:00.2667688Z model : 85 2025-05-07T19:43:00.2667965Z model name : Intel(R) Xeon(R) Platinum 8275CL CPU @ 3.00GHz 2025-05-07T19:43:00.2668315Z stepping : 7 2025-05-07T19:43:00.2668540Z microcode : 0x5003901 2025-05-07T19:43:00.2668766Z cpu MHz : 1199.852 2025-05-07T19:43:00.2669009Z cache size : 36608 KB 2025-05-07T19:43:00.2669233Z physical id : 0 2025-05-07T19:43:00.2669572Z siblings : 48 2025-05-07T19:43:00.2669775Z core id : 17 2025-05-07T19:43:00.2669986Z cpu cores : 24 2025-05-07T19:43:00.2670183Z apicid : 34 2025-05-07T19:43:00.2670407Z initial apicid : 34 2025-05-07T19:43:00.2670638Z fpu : yes 2025-05-07T19:43:00.2670913Z fpu_exception : yes 2025-05-07T19:43:00.2671159Z cpuid level : 13 2025-05-07T19:43:00.2671506Z wp : yes 2025-05-07T19:43:00.2673825Z flags : fpu vme de pse tsc msr pae mce cx8 apic sep mtrr pge mca cmov pat pse36 clflush mmx fxsr sse sse2 ss ht syscall nx pdpe1gb rdtscp lm constant_tsc arch_perfmon rep_good nopl xtopology nonstop_tsc cpuid aperfmperf tsc_known_freq pni pclmulqdq monitor ssse3 fma cx16 pcid sse4_1 sse4_2 x2apic movbe popcnt tsc_deadline_timer aes xsave avx f16c rdrand hypervisor lahf_lm abm 3dnowprefetch invpcid_single pti fsgsbase tsc_adjust bmi1 avx2 smep bmi2 erms invpcid mpx avx512f avx512dq rdseed adx smap clflushopt clwb avx512cd avx512bw avx512vl xsaveopt xsavec xgetbv1 xsaves ida arat pku ospke avx512_vnni 2025-05-07T19:43:00.2676555Z bugs : cpu_meltdown spectre_v1 spectre_v2 spec_store_bypass l1tf mds swapgs itlb_multihit mmio_stale_data retbleed gds bhi 2025-05-07T19:43:00.2677188Z bogomips : 5999.99 2025-05-07T19:43:00.2677423Z clflush size : 64 2025-05-07T19:43:00.2677684Z cache_alignment : 64 2025-05-07T19:43:00.2677976Z address sizes : 46 bits physical, 48 bits virtual 2025-05-07T19:43:00.2678338Z power management: 2025-05-07T19:43:00.2678482Z 2025-05-07T19:43:00.2678577Z processor : 18 2025-05-07T19:43:00.2678841Z vendor_id : GenuineIntel 2025-05-07T19:43:00.2679098Z cpu family : 6 2025-05-07T19:43:00.2679342Z model : 85 2025-05-07T19:43:00.2679656Z model name : Intel(R) Xeon(R) Platinum 8275CL CPU @ 3.00GHz 2025-05-07T19:43:00.2680029Z stepping : 7 2025-05-07T19:43:00.2680279Z microcode : 0x5003901 2025-05-07T19:43:00.2680526Z cpu MHz : 1199.362 2025-05-07T19:43:00.2680790Z cache size : 36608 KB 2025-05-07T19:43:00.2681038Z physical id : 0 2025-05-07T19:43:00.2681290Z siblings : 48 2025-05-07T19:43:00.2681522Z core id : 18 2025-05-07T19:43:00.2681779Z cpu cores : 24 2025-05-07T19:43:00.2682008Z apicid : 36 2025-05-07T19:43:00.2682265Z initial apicid : 36 2025-05-07T19:43:00.2682496Z fpu : yes 2025-05-07T19:43:00.2682745Z fpu_exception : yes 2025-05-07T19:43:00.2682986Z cpuid level : 13 2025-05-07T19:43:00.2683234Z wp : yes 2025-05-07T19:43:00.2685502Z flags : fpu vme de pse tsc msr pae mce cx8 apic sep mtrr pge mca cmov pat pse36 clflush mmx fxsr sse sse2 ss ht syscall nx pdpe1gb rdtscp lm constant_tsc arch_perfmon rep_good nopl xtopology nonstop_tsc cpuid aperfmperf tsc_known_freq pni pclmulqdq monitor ssse3 fma cx16 pcid sse4_1 sse4_2 x2apic movbe popcnt tsc_deadline_timer aes xsave avx f16c rdrand hypervisor lahf_lm abm 3dnowprefetch invpcid_single pti fsgsbase tsc_adjust bmi1 avx2 smep bmi2 erms invpcid mpx avx512f avx512dq rdseed adx smap clflushopt clwb avx512cd avx512bw avx512vl xsaveopt xsavec xgetbv1 xsaves ida arat pku ospke avx512_vnni 2025-05-07T19:43:00.2688014Z bugs : cpu_meltdown spectre_v1 spectre_v2 spec_store_bypass l1tf mds swapgs itlb_multihit mmio_stale_data retbleed gds bhi 2025-05-07T19:43:00.2688588Z bogomips : 5999.99 2025-05-07T19:43:00.2688837Z clflush size : 64 2025-05-07T19:43:00.2689064Z cache_alignment : 64 2025-05-07T19:43:00.2689373Z address sizes : 46 bits physical, 48 bits virtual 2025-05-07T19:43:00.2689700Z power management: 2025-05-07T19:43:00.2689860Z 2025-05-07T19:43:00.2689952Z processor : 19 2025-05-07T19:43:00.2690179Z vendor_id : GenuineIntel 2025-05-07T19:43:00.2690448Z cpu family : 6 2025-05-07T19:43:00.2690658Z model : 85 2025-05-07T19:43:00.2690964Z model name : Intel(R) Xeon(R) Platinum 8275CL CPU @ 3.00GHz 2025-05-07T19:43:00.2691330Z stepping : 7 2025-05-07T19:43:00.2691543Z microcode : 0x5003901 2025-05-07T19:43:00.2691795Z cpu MHz : 2999.996 2025-05-07T19:43:00.2692015Z cache size : 36608 KB 2025-05-07T19:43:00.2692270Z physical id : 0 2025-05-07T19:43:00.2692484Z siblings : 48 2025-05-07T19:43:00.2692719Z core id : 19 2025-05-07T19:43:00.2692930Z cpu cores : 24 2025-05-07T19:43:00.2693267Z apicid : 38 2025-05-07T19:43:00.2693482Z initial apicid : 38 2025-05-07T19:43:00.2693729Z fpu : yes 2025-05-07T19:43:00.2693940Z fpu_exception : yes 2025-05-07T19:43:00.2694193Z cpuid level : 13 2025-05-07T19:43:00.2694434Z wp : yes 2025-05-07T19:43:00.2696622Z flags : fpu vme de pse tsc msr pae mce cx8 apic sep mtrr pge mca cmov pat pse36 clflush mmx fxsr sse sse2 ss ht syscall nx pdpe1gb rdtscp lm constant_tsc arch_perfmon rep_good nopl xtopology nonstop_tsc cpuid aperfmperf tsc_known_freq pni pclmulqdq monitor ssse3 fma cx16 pcid sse4_1 sse4_2 x2apic movbe popcnt tsc_deadline_timer aes xsave avx f16c rdrand hypervisor lahf_lm abm 3dnowprefetch invpcid_single pti fsgsbase tsc_adjust bmi1 avx2 smep bmi2 erms invpcid mpx avx512f avx512dq rdseed adx smap clflushopt clwb avx512cd avx512bw avx512vl xsaveopt xsavec xgetbv1 xsaves ida arat pku ospke avx512_vnni 2025-05-07T19:43:00.2699143Z bugs : cpu_meltdown spectre_v1 spectre_v2 spec_store_bypass l1tf mds swapgs itlb_multihit mmio_stale_data retbleed gds bhi 2025-05-07T19:43:00.2699744Z bogomips : 5999.99 2025-05-07T19:43:00.2699969Z clflush size : 64 2025-05-07T19:43:00.2700221Z cache_alignment : 64 2025-05-07T19:43:00.2700495Z address sizes : 46 bits physical, 48 bits virtual 2025-05-07T19:43:00.2700848Z power management: 2025-05-07T19:43:00.2700984Z 2025-05-07T19:43:00.2701077Z processor : 20 2025-05-07T19:43:00.2701326Z vendor_id : GenuineIntel 2025-05-07T19:43:00.2701571Z cpu family : 6 2025-05-07T19:43:00.2701806Z model : 85 2025-05-07T19:43:00.2702107Z model name : Intel(R) Xeon(R) Platinum 8275CL CPU @ 3.00GHz 2025-05-07T19:43:00.2702456Z stepping : 7 2025-05-07T19:43:00.2702703Z microcode : 0x5003901 2025-05-07T19:43:00.2702936Z cpu MHz : 2999.996 2025-05-07T19:43:00.2703186Z cache size : 36608 KB 2025-05-07T19:43:00.2703422Z physical id : 0 2025-05-07T19:43:00.2703668Z siblings : 48 2025-05-07T19:43:00.2703879Z core id : 20 2025-05-07T19:43:00.2704119Z cpu cores : 24 2025-05-07T19:43:00.2704330Z apicid : 40 2025-05-07T19:43:00.2704580Z initial apicid : 40 2025-05-07T19:43:00.2704801Z fpu : yes 2025-05-07T19:43:00.2705036Z fpu_exception : yes 2025-05-07T19:43:00.2705263Z cpuid level : 13 2025-05-07T19:43:00.2705506Z wp : yes 2025-05-07T19:43:00.2707677Z flags : fpu vme de pse tsc msr pae mce cx8 apic sep mtrr pge mca cmov pat pse36 clflush mmx fxsr sse sse2 ss ht syscall nx pdpe1gb rdtscp lm constant_tsc arch_perfmon rep_good nopl xtopology nonstop_tsc cpuid aperfmperf tsc_known_freq pni pclmulqdq monitor ssse3 fma cx16 pcid sse4_1 sse4_2 x2apic movbe popcnt tsc_deadline_timer aes xsave avx f16c rdrand hypervisor lahf_lm abm 3dnowprefetch invpcid_single pti fsgsbase tsc_adjust bmi1 avx2 smep bmi2 erms invpcid mpx avx512f avx512dq rdseed adx smap clflushopt clwb avx512cd avx512bw avx512vl xsaveopt xsavec xgetbv1 xsaves ida arat pku ospke avx512_vnni 2025-05-07T19:43:00.2710181Z bugs : cpu_meltdown spectre_v1 spectre_v2 spec_store_bypass l1tf mds swapgs itlb_multihit mmio_stale_data retbleed gds bhi 2025-05-07T19:43:00.2710757Z bogomips : 5999.99 2025-05-07T19:43:00.2711006Z clflush size : 64 2025-05-07T19:43:00.2711240Z cache_alignment : 64 2025-05-07T19:43:00.2711618Z address sizes : 46 bits physical, 48 bits virtual 2025-05-07T19:43:00.2712126Z power management: 2025-05-07T19:43:00.2712308Z 2025-05-07T19:43:00.2712409Z processor : 21 2025-05-07T19:43:00.2712662Z vendor_id : GenuineIntel 2025-05-07T19:43:00.2712991Z cpu family : 6 2025-05-07T19:43:00.2713224Z model : 85 2025-05-07T19:43:00.2713551Z model name : Intel(R) Xeon(R) Platinum 8275CL CPU @ 3.00GHz 2025-05-07T19:43:00.2713947Z stepping : 7 2025-05-07T19:43:00.2714177Z microcode : 0x5003901 2025-05-07T19:43:00.2714449Z cpu MHz : 1199.180 2025-05-07T19:43:00.2714686Z cache size : 36608 KB 2025-05-07T19:43:00.2714960Z physical id : 0 2025-05-07T19:43:00.2715188Z siblings : 48 2025-05-07T19:43:00.2715440Z core id : 21 2025-05-07T19:43:00.2715663Z cpu cores : 24 2025-05-07T19:43:00.2715914Z apicid : 42 2025-05-07T19:43:00.2716140Z initial apicid : 42 2025-05-07T19:43:00.2716479Z fpu : yes 2025-05-07T19:43:00.2716704Z fpu_exception : yes 2025-05-07T19:43:00.2716973Z cpuid level : 13 2025-05-07T19:43:00.2717229Z wp : yes 2025-05-07T19:43:00.2719583Z flags : fpu vme de pse tsc msr pae mce cx8 apic sep mtrr pge mca cmov pat pse36 clflush mmx fxsr sse sse2 ss ht syscall nx pdpe1gb rdtscp lm constant_tsc arch_perfmon rep_good nopl xtopology nonstop_tsc cpuid aperfmperf tsc_known_freq pni pclmulqdq monitor ssse3 fma cx16 pcid sse4_1 sse4_2 x2apic movbe popcnt tsc_deadline_timer aes xsave avx f16c rdrand hypervisor lahf_lm abm 3dnowprefetch invpcid_single pti fsgsbase tsc_adjust bmi1 avx2 smep bmi2 erms invpcid mpx avx512f avx512dq rdseed adx smap clflushopt clwb avx512cd avx512bw avx512vl xsaveopt xsavec xgetbv1 xsaves ida arat pku ospke avx512_vnni 2025-05-07T19:43:00.2722278Z bugs : cpu_meltdown spectre_v1 spectre_v2 spec_store_bypass l1tf mds swapgs itlb_multihit mmio_stale_data retbleed gds bhi 2025-05-07T19:43:00.2722913Z bogomips : 5999.99 2025-05-07T19:43:00.2723157Z clflush size : 64 2025-05-07T19:43:00.2723420Z cache_alignment : 64 2025-05-07T19:43:00.2723716Z address sizes : 46 bits physical, 48 bits virtual 2025-05-07T19:43:00.2724195Z power management: 2025-05-07T19:43:00.2724333Z 2025-05-07T19:43:00.2724429Z processor : 22 2025-05-07T19:43:00.2724675Z vendor_id : GenuineIntel 2025-05-07T19:43:00.2724919Z cpu family : 6 2025-05-07T19:43:00.2725155Z model : 85 2025-05-07T19:43:00.2725459Z model name : Intel(R) Xeon(R) Platinum 8275CL CPU @ 3.00GHz 2025-05-07T19:43:00.2725813Z stepping : 7 2025-05-07T19:43:00.2726056Z microcode : 0x5003901 2025-05-07T19:43:00.2726302Z cpu MHz : 1200.581 2025-05-07T19:43:00.2726558Z cache size : 36608 KB 2025-05-07T19:43:00.2726790Z physical id : 0 2025-05-07T19:43:00.2727040Z siblings : 48 2025-05-07T19:43:00.2727249Z core id : 22 2025-05-07T19:43:00.2727479Z cpu cores : 24 2025-05-07T19:43:00.2727717Z apicid : 44 2025-05-07T19:43:00.2727952Z initial apicid : 44 2025-05-07T19:43:00.2728172Z fpu : yes 2025-05-07T19:43:00.2728410Z fpu_exception : yes 2025-05-07T19:43:00.2728642Z cpuid level : 13 2025-05-07T19:43:00.2728921Z wp : yes 2025-05-07T19:43:00.2731099Z flags : fpu vme de pse tsc msr pae mce cx8 apic sep mtrr pge mca cmov pat pse36 clflush mmx fxsr sse sse2 ss ht syscall nx pdpe1gb rdtscp lm constant_tsc arch_perfmon rep_good nopl xtopology nonstop_tsc cpuid aperfmperf tsc_known_freq pni pclmulqdq monitor ssse3 fma cx16 pcid sse4_1 sse4_2 x2apic movbe popcnt tsc_deadline_timer aes xsave avx f16c rdrand hypervisor lahf_lm abm 3dnowprefetch invpcid_single pti fsgsbase tsc_adjust bmi1 avx2 smep bmi2 erms invpcid mpx avx512f avx512dq rdseed adx smap clflushopt clwb avx512cd avx512bw avx512vl xsaveopt xsavec xgetbv1 xsaves ida arat pku ospke avx512_vnni 2025-05-07T19:43:00.2733936Z bugs : cpu_meltdown spectre_v1 spectre_v2 spec_store_bypass l1tf mds swapgs itlb_multihit mmio_stale_data retbleed gds bhi 2025-05-07T19:43:00.2734597Z bogomips : 5999.99 2025-05-07T19:43:00.2734856Z clflush size : 64 2025-05-07T19:43:00.2735084Z cache_alignment : 64 2025-05-07T19:43:00.2735385Z address sizes : 46 bits physical, 48 bits virtual 2025-05-07T19:43:00.2735709Z power management: 2025-05-07T19:43:00.2735866Z 2025-05-07T19:43:00.2735956Z processor : 23 2025-05-07T19:43:00.2736207Z vendor_id : GenuineIntel 2025-05-07T19:43:00.2736446Z cpu family : 6 2025-05-07T19:43:00.2736675Z model : 85 2025-05-07T19:43:00.2736948Z model name : Intel(R) Xeon(R) Platinum 8275CL CPU @ 3.00GHz 2025-05-07T19:43:00.2737324Z stepping : 7 2025-05-07T19:43:00.2737535Z microcode : 0x5003901 2025-05-07T19:43:00.2737789Z cpu MHz : 2999.996 2025-05-07T19:43:00.2738006Z cache size : 36608 KB 2025-05-07T19:43:00.2738257Z physical id : 0 2025-05-07T19:43:00.2738488Z siblings : 48 2025-05-07T19:43:00.2738690Z core id : 23 2025-05-07T19:43:00.2738918Z cpu cores : 24 2025-05-07T19:43:00.2739126Z apicid : 46 2025-05-07T19:43:00.2739336Z initial apicid : 46 2025-05-07T19:43:00.2739538Z fpu : yes 2025-05-07T19:43:00.2739742Z fpu_exception : yes 2025-05-07T19:43:00.2739947Z cpuid level : 13 2025-05-07T19:43:00.2742372Z wp : yes 2025-05-07T19:43:00.2744587Z flags : fpu vme de pse tsc msr pae mce cx8 apic sep mtrr pge mca cmov pat pse36 clflush mmx fxsr sse sse2 ss ht syscall nx pdpe1gb rdtscp lm constant_tsc arch_perfmon rep_good nopl xtopology nonstop_tsc cpuid aperfmperf tsc_known_freq pni pclmulqdq monitor ssse3 fma cx16 pcid sse4_1 sse4_2 x2apic movbe popcnt tsc_deadline_timer aes xsave avx f16c rdrand hypervisor lahf_lm abm 3dnowprefetch invpcid_single pti fsgsbase tsc_adjust bmi1 avx2 smep bmi2 erms invpcid mpx avx512f avx512dq rdseed adx smap clflushopt clwb avx512cd avx512bw avx512vl xsaveopt xsavec xgetbv1 xsaves ida arat pku ospke avx512_vnni 2025-05-07T19:43:00.2747050Z bugs : cpu_meltdown spectre_v1 spectre_v2 spec_store_bypass l1tf mds swapgs itlb_multihit mmio_stale_data retbleed gds bhi 2025-05-07T19:43:00.2747634Z bogomips : 5999.99 2025-05-07T19:43:00.2747863Z clflush size : 64 2025-05-07T19:43:00.2748071Z cache_alignment : 64 2025-05-07T19:43:00.2748343Z address sizes : 46 bits physical, 48 bits virtual 2025-05-07T19:43:00.2748657Z power management: 2025-05-07T19:43:00.2748782Z 2025-05-07T19:43:00.2748872Z processor : 24 2025-05-07T19:43:00.2749081Z vendor_id : GenuineIntel 2025-05-07T19:43:00.2749341Z cpu family : 6 2025-05-07T19:43:00.2749540Z model : 85 2025-05-07T19:43:00.2749822Z model name : Intel(R) Xeon(R) Platinum 8275CL CPU @ 3.00GHz 2025-05-07T19:43:00.2750159Z stepping : 7 2025-05-07T19:43:00.2750384Z microcode : 0x5003901 2025-05-07T19:43:00.2750618Z cpu MHz : 2999.996 2025-05-07T19:43:00.2750839Z cache size : 36608 KB 2025-05-07T19:43:00.2751064Z physical id : 1 2025-05-07T19:43:00.2751263Z siblings : 48 2025-05-07T19:43:00.2751562Z core id : 0 2025-05-07T19:43:00.2751921Z cpu cores : 24 2025-05-07T19:43:00.2752148Z apicid : 64 2025-05-07T19:43:00.2752359Z initial apicid : 64 2025-05-07T19:43:00.2752590Z fpu : yes 2025-05-07T19:43:00.2752850Z fpu_exception : yes 2025-05-07T19:43:00.2753094Z cpuid level : 13 2025-05-07T19:43:00.2753308Z wp : yes 2025-05-07T19:43:00.2755617Z flags : fpu vme de pse tsc msr pae mce cx8 apic sep mtrr pge mca cmov pat pse36 clflush mmx fxsr sse sse2 ss ht syscall nx pdpe1gb rdtscp lm constant_tsc arch_perfmon rep_good nopl xtopology nonstop_tsc cpuid aperfmperf tsc_known_freq pni pclmulqdq monitor ssse3 fma cx16 pcid sse4_1 sse4_2 x2apic movbe popcnt tsc_deadline_timer aes xsave avx f16c rdrand hypervisor lahf_lm abm 3dnowprefetch invpcid_single pti fsgsbase tsc_adjust bmi1 avx2 smep bmi2 erms invpcid mpx avx512f avx512dq rdseed adx smap clflushopt clwb avx512cd avx512bw avx512vl xsaveopt xsavec xgetbv1 xsaves ida arat pku ospke avx512_vnni 2025-05-07T19:43:00.2758287Z bugs : cpu_meltdown spectre_v1 spectre_v2 spec_store_bypass l1tf mds swapgs itlb_multihit mmio_stale_data retbleed gds bhi 2025-05-07T19:43:00.2758882Z bogomips : 5999.99 2025-05-07T19:43:00.2759124Z clflush size : 64 2025-05-07T19:43:00.2759363Z cache_alignment : 64 2025-05-07T19:43:00.2759655Z address sizes : 46 bits physical, 48 bits virtual 2025-05-07T19:43:00.2760008Z power management: 2025-05-07T19:43:00.2760144Z 2025-05-07T19:43:00.2760230Z processor : 25 2025-05-07T19:43:00.2760466Z vendor_id : GenuineIntel 2025-05-07T19:43:00.2760711Z cpu family : 6 2025-05-07T19:43:00.2760930Z model : 85 2025-05-07T19:43:00.2761218Z model name : Intel(R) Xeon(R) Platinum 8275CL CPU @ 3.00GHz 2025-05-07T19:43:00.2761589Z stepping : 7 2025-05-07T19:43:00.2761801Z microcode : 0x5003901 2025-05-07T19:43:00.2762045Z cpu MHz : 2999.996 2025-05-07T19:43:00.2762282Z cache size : 36608 KB 2025-05-07T19:43:00.2762508Z physical id : 1 2025-05-07T19:43:00.2762737Z siblings : 48 2025-05-07T19:43:00.2762941Z core id : 1 2025-05-07T19:43:00.2763156Z cpu cores : 24 2025-05-07T19:43:00.2763352Z apicid : 66 2025-05-07T19:43:00.2763572Z initial apicid : 66 2025-05-07T19:43:00.2763798Z fpu : yes 2025-05-07T19:43:00.2763992Z fpu_exception : yes 2025-05-07T19:43:00.2764227Z cpuid level : 13 2025-05-07T19:43:00.2764450Z wp : yes 2025-05-07T19:43:00.2767034Z flags : fpu vme de pse tsc msr pae mce cx8 apic sep mtrr pge mca cmov pat pse36 clflush mmx fxsr sse sse2 ss ht syscall nx pdpe1gb rdtscp lm constant_tsc arch_perfmon rep_good nopl xtopology nonstop_tsc cpuid aperfmperf tsc_known_freq pni pclmulqdq monitor ssse3 fma cx16 pcid sse4_1 sse4_2 x2apic movbe popcnt tsc_deadline_timer aes xsave avx f16c rdrand hypervisor lahf_lm abm 3dnowprefetch invpcid_single pti fsgsbase tsc_adjust bmi1 avx2 smep bmi2 erms invpcid mpx avx512f avx512dq rdseed adx smap clflushopt clwb avx512cd avx512bw avx512vl xsaveopt xsavec xgetbv1 xsaves ida arat pku ospke avx512_vnni 2025-05-07T19:43:00.2769841Z bugs : cpu_meltdown spectre_v1 spectre_v2 spec_store_bypass l1tf mds swapgs itlb_multihit mmio_stale_data retbleed gds bhi 2025-05-07T19:43:00.2770462Z bogomips : 5999.99 2025-05-07T19:43:00.2770732Z clflush size : 64 2025-05-07T19:43:00.2770972Z cache_alignment : 64 2025-05-07T19:43:00.2771294Z address sizes : 46 bits physical, 48 bits virtual 2025-05-07T19:43:00.2771647Z power management: 2025-05-07T19:43:00.2771824Z 2025-05-07T19:43:00.2771926Z processor : 26 2025-05-07T19:43:00.2772195Z vendor_id : GenuineIntel 2025-05-07T19:43:00.2772460Z cpu family : 6 2025-05-07T19:43:00.2772713Z model : 85 2025-05-07T19:43:00.2773018Z model name : Intel(R) Xeon(R) Platinum 8275CL CPU @ 3.00GHz 2025-05-07T19:43:00.2773411Z stepping : 7 2025-05-07T19:43:00.2773642Z microcode : 0x5003901 2025-05-07T19:43:00.2773918Z cpu MHz : 2999.996 2025-05-07T19:43:00.2774153Z cache size : 36608 KB 2025-05-07T19:43:00.2774421Z physical id : 1 2025-05-07T19:43:00.2774657Z siblings : 48 2025-05-07T19:43:00.2774904Z core id : 2 2025-05-07T19:43:00.2775128Z cpu cores : 24 2025-05-07T19:43:00.2775372Z apicid : 68 2025-05-07T19:43:00.2775626Z initial apicid : 68 2025-05-07T19:43:00.2775868Z fpu : yes 2025-05-07T19:43:00.2776109Z fpu_exception : yes 2025-05-07T19:43:00.2776346Z cpuid level : 13 2025-05-07T19:43:00.2776591Z wp : yes 2025-05-07T19:43:00.2778820Z flags : fpu vme de pse tsc msr pae mce cx8 apic sep mtrr pge mca cmov pat pse36 clflush mmx fxsr sse sse2 ss ht syscall nx pdpe1gb rdtscp lm constant_tsc arch_perfmon rep_good nopl xtopology nonstop_tsc cpuid aperfmperf tsc_known_freq pni pclmulqdq monitor ssse3 fma cx16 pcid sse4_1 sse4_2 x2apic movbe popcnt tsc_deadline_timer aes xsave avx f16c rdrand hypervisor lahf_lm abm 3dnowprefetch invpcid_single pti fsgsbase tsc_adjust bmi1 avx2 smep bmi2 erms invpcid mpx avx512f avx512dq rdseed adx smap clflushopt clwb avx512cd avx512bw avx512vl xsaveopt xsavec xgetbv1 xsaves ida arat pku ospke avx512_vnni 2025-05-07T19:43:00.2781339Z bugs : cpu_meltdown spectre_v1 spectre_v2 spec_store_bypass l1tf mds swapgs itlb_multihit mmio_stale_data retbleed gds bhi 2025-05-07T19:43:00.2781926Z bogomips : 5999.99 2025-05-07T19:43:00.2782151Z clflush size : 64 2025-05-07T19:43:00.2782397Z cache_alignment : 64 2025-05-07T19:43:00.2782670Z address sizes : 46 bits physical, 48 bits virtual 2025-05-07T19:43:00.2783012Z power management: 2025-05-07T19:43:00.2783150Z 2025-05-07T19:43:00.2783273Z processor : 27 2025-05-07T19:43:00.2783501Z vendor_id : GenuineIntel 2025-05-07T19:43:00.2783767Z cpu family : 6 2025-05-07T19:43:00.2783973Z model : 85 2025-05-07T19:43:00.2784271Z model name : Intel(R) Xeon(R) Platinum 8275CL CPU @ 3.00GHz 2025-05-07T19:43:00.2784623Z stepping : 7 2025-05-07T19:43:00.2784860Z microcode : 0x5003901 2025-05-07T19:43:00.2785090Z cpu MHz : 2999.996 2025-05-07T19:43:00.2785331Z cache size : 36608 KB 2025-05-07T19:43:00.2785563Z physical id : 1 2025-05-07T19:43:00.2785805Z siblings : 48 2025-05-07T19:43:00.2786014Z core id : 3 2025-05-07T19:43:00.2786245Z cpu cores : 24 2025-05-07T19:43:00.2786456Z apicid : 70 2025-05-07T19:43:00.2786698Z initial apicid : 70 2025-05-07T19:43:00.2786945Z fpu : yes 2025-05-07T19:43:00.2787154Z fpu_exception : yes 2025-05-07T19:43:00.2787412Z cpuid level : 13 2025-05-07T19:43:00.2787631Z wp : yes 2025-05-07T19:43:00.2789861Z flags : fpu vme de pse tsc msr pae mce cx8 apic sep mtrr pge mca cmov pat pse36 clflush mmx fxsr sse sse2 ss ht syscall nx pdpe1gb rdtscp lm constant_tsc arch_perfmon rep_good nopl xtopology nonstop_tsc cpuid aperfmperf tsc_known_freq pni pclmulqdq monitor ssse3 fma cx16 pcid sse4_1 sse4_2 x2apic movbe popcnt tsc_deadline_timer aes xsave avx f16c rdrand hypervisor lahf_lm abm 3dnowprefetch invpcid_single pti fsgsbase tsc_adjust bmi1 avx2 smep bmi2 erms invpcid mpx avx512f avx512dq rdseed adx smap clflushopt clwb avx512cd avx512bw avx512vl xsaveopt xsavec xgetbv1 xsaves ida arat pku ospke avx512_vnni 2025-05-07T19:43:00.2792723Z bugs : cpu_meltdown spectre_v1 spectre_v2 spec_store_bypass l1tf mds swapgs itlb_multihit mmio_stale_data retbleed gds bhi 2025-05-07T19:43:00.2793333Z bogomips : 5999.99 2025-05-07T19:43:00.2793593Z clflush size : 64 2025-05-07T19:43:00.2793838Z cache_alignment : 64 2025-05-07T19:43:00.2794160Z address sizes : 46 bits physical, 48 bits virtual 2025-05-07T19:43:00.2794513Z power management: 2025-05-07T19:43:00.2794682Z 2025-05-07T19:43:00.2794778Z processor : 28 2025-05-07T19:43:00.2795043Z vendor_id : GenuineIntel 2025-05-07T19:43:00.2795307Z cpu family : 6 2025-05-07T19:43:00.2795554Z model : 85 2025-05-07T19:43:00.2795854Z model name : Intel(R) Xeon(R) Platinum 8275CL CPU @ 3.00GHz 2025-05-07T19:43:00.2796254Z stepping : 7 2025-05-07T19:43:00.2796491Z microcode : 0x5003901 2025-05-07T19:43:00.2796764Z cpu MHz : 3183.208 2025-05-07T19:43:00.2797000Z cache size : 36608 KB 2025-05-07T19:43:00.2797271Z physical id : 1 2025-05-07T19:43:00.2797503Z siblings : 48 2025-05-07T19:43:00.2797749Z core id : 4 2025-05-07T19:43:00.2797966Z cpu cores : 24 2025-05-07T19:43:00.2798213Z apicid : 72 2025-05-07T19:43:00.2798466Z initial apicid : 72 2025-05-07T19:43:00.2798697Z fpu : yes 2025-05-07T19:43:00.2798932Z fpu_exception : yes 2025-05-07T19:43:00.2799167Z cpuid level : 13 2025-05-07T19:43:00.2799421Z wp : yes 2025-05-07T19:43:00.2801735Z flags : fpu vme de pse tsc msr pae mce cx8 apic sep mtrr pge mca cmov pat pse36 clflush mmx fxsr sse sse2 ss ht syscall nx pdpe1gb rdtscp lm constant_tsc arch_perfmon rep_good nopl xtopology nonstop_tsc cpuid aperfmperf tsc_known_freq pni pclmulqdq monitor ssse3 fma cx16 pcid sse4_1 sse4_2 x2apic movbe popcnt tsc_deadline_timer aes xsave avx f16c rdrand hypervisor lahf_lm abm 3dnowprefetch invpcid_single pti fsgsbase tsc_adjust bmi1 avx2 smep bmi2 erms invpcid mpx avx512f avx512dq rdseed adx smap clflushopt clwb avx512cd avx512bw avx512vl xsaveopt xsavec xgetbv1 xsaves ida arat pku ospke avx512_vnni 2025-05-07T19:43:00.2804524Z bugs : cpu_meltdown spectre_v1 spectre_v2 spec_store_bypass l1tf mds swapgs itlb_multihit mmio_stale_data retbleed gds bhi 2025-05-07T19:43:00.2805099Z bogomips : 5999.99 2025-05-07T19:43:00.2805326Z clflush size : 64 2025-05-07T19:43:00.2805555Z cache_alignment : 64 2025-05-07T19:43:00.2805815Z address sizes : 46 bits physical, 48 bits virtual 2025-05-07T19:43:00.2806139Z power management: 2025-05-07T19:43:00.2806267Z 2025-05-07T19:43:00.2806351Z processor : 29 2025-05-07T19:43:00.2806568Z vendor_id : GenuineIntel 2025-05-07T19:43:00.2806799Z cpu family : 6 2025-05-07T19:43:00.2807014Z model : 85 2025-05-07T19:43:00.2807280Z model name : Intel(R) Xeon(R) Platinum 8275CL CPU @ 3.00GHz 2025-05-07T19:43:00.2807632Z stepping : 7 2025-05-07T19:43:00.2807841Z microcode : 0x5003901 2025-05-07T19:43:00.2808075Z cpu MHz : 2999.996 2025-05-07T19:43:00.2808307Z cache size : 36608 KB 2025-05-07T19:43:00.2808525Z physical id : 1 2025-05-07T19:43:00.2808737Z siblings : 48 2025-05-07T19:43:00.2808940Z core id : 5 2025-05-07T19:43:00.2809151Z cpu cores : 24 2025-05-07T19:43:00.2809352Z apicid : 74 2025-05-07T19:43:00.2809576Z initial apicid : 74 2025-05-07T19:43:00.2809792Z fpu : yes 2025-05-07T19:43:00.2810012Z fpu_exception : yes 2025-05-07T19:43:00.2810225Z cpuid level : 13 2025-05-07T19:43:00.2810449Z wp : yes 2025-05-07T19:43:00.2812677Z flags : fpu vme de pse tsc msr pae mce cx8 apic sep mtrr pge mca cmov pat pse36 clflush mmx fxsr sse sse2 ss ht syscall nx pdpe1gb rdtscp lm constant_tsc arch_perfmon rep_good nopl xtopology nonstop_tsc cpuid aperfmperf tsc_known_freq pni pclmulqdq monitor ssse3 fma cx16 pcid sse4_1 sse4_2 x2apic movbe popcnt tsc_deadline_timer aes xsave avx f16c rdrand hypervisor lahf_lm abm 3dnowprefetch invpcid_single pti fsgsbase tsc_adjust bmi1 avx2 smep bmi2 erms invpcid mpx avx512f avx512dq rdseed adx smap clflushopt clwb avx512cd avx512bw avx512vl xsaveopt xsavec xgetbv1 xsaves ida arat pku ospke avx512_vnni 2025-05-07T19:43:00.2815191Z bugs : cpu_meltdown spectre_v1 spectre_v2 spec_store_bypass l1tf mds swapgs itlb_multihit mmio_stale_data retbleed gds bhi 2025-05-07T19:43:00.2815765Z bogomips : 5999.99 2025-05-07T19:43:00.2815986Z clflush size : 64 2025-05-07T19:43:00.2816199Z cache_alignment : 64 2025-05-07T19:43:00.2816476Z address sizes : 46 bits physical, 48 bits virtual 2025-05-07T19:43:00.2816782Z power management: 2025-05-07T19:43:00.2816913Z 2025-05-07T19:43:00.2817011Z processor : 30 2025-05-07T19:43:00.2817221Z vendor_id : GenuineIntel 2025-05-07T19:43:00.2817456Z cpu family : 6 2025-05-07T19:43:00.2817650Z model : 85 2025-05-07T19:43:00.2817933Z model name : Intel(R) Xeon(R) Platinum 8275CL CPU @ 3.00GHz 2025-05-07T19:43:00.2818262Z stepping : 7 2025-05-07T19:43:00.2818480Z microcode : 0x5003901 2025-05-07T19:43:00.2818687Z cpu MHz : 2999.996 2025-05-07T19:43:00.2818892Z cache size : 36608 KB 2025-05-07T19:43:00.2819118Z physical id : 1 2025-05-07T19:43:00.2819324Z siblings : 48 2025-05-07T19:43:00.2819544Z core id : 6 2025-05-07T19:43:00.2819735Z cpu cores : 24 2025-05-07T19:43:00.2819950Z apicid : 76 2025-05-07T19:43:00.2820145Z initial apicid : 76 2025-05-07T19:43:00.2820374Z fpu : yes 2025-05-07T19:43:00.2820568Z fpu_exception : yes 2025-05-07T19:43:00.2820799Z cpuid level : 13 2025-05-07T19:43:00.2820997Z wp : yes 2025-05-07T19:43:00.2823168Z flags : fpu vme de pse tsc msr pae mce cx8 apic sep mtrr pge mca cmov pat pse36 clflush mmx fxsr sse sse2 ss ht syscall nx pdpe1gb rdtscp lm constant_tsc arch_perfmon rep_good nopl xtopology nonstop_tsc cpuid aperfmperf tsc_known_freq pni pclmulqdq monitor ssse3 fma cx16 pcid sse4_1 sse4_2 x2apic movbe popcnt tsc_deadline_timer aes xsave avx f16c rdrand hypervisor lahf_lm abm 3dnowprefetch invpcid_single pti fsgsbase tsc_adjust bmi1 avx2 smep bmi2 erms invpcid mpx avx512f avx512dq rdseed adx smap clflushopt clwb avx512cd avx512bw avx512vl xsaveopt xsavec xgetbv1 xsaves ida arat pku ospke avx512_vnni 2025-05-07T19:43:00.2825658Z bugs : cpu_meltdown spectre_v1 spectre_v2 spec_store_bypass l1tf mds swapgs itlb_multihit mmio_stale_data retbleed gds bhi 2025-05-07T19:43:00.2826217Z bogomips : 5999.99 2025-05-07T19:43:00.2826445Z clflush size : 64 2025-05-07T19:43:00.2826649Z cache_alignment : 64 2025-05-07T19:43:00.2826892Z address sizes : 46 bits physical, 48 bits virtual 2025-05-07T19:43:00.2827217Z power management: 2025-05-07T19:43:00.2827341Z 2025-05-07T19:43:00.2827417Z processor : 31 2025-05-07T19:43:00.2827621Z vendor_id : GenuineIntel 2025-05-07T19:43:00.2827848Z cpu family : 6 2025-05-07T19:43:00.2828045Z model : 85 2025-05-07T19:43:00.2828302Z model name : Intel(R) Xeon(R) Platinum 8275CL CPU @ 3.00GHz 2025-05-07T19:43:00.2828649Z stepping : 7 2025-05-07T19:43:00.2828844Z microcode : 0x5003901 2025-05-07T19:43:00.2829061Z cpu MHz : 2999.996 2025-05-07T19:43:00.2829277Z cache size : 36608 KB 2025-05-07T19:43:00.2829492Z physical id : 1 2025-05-07T19:43:00.2829709Z siblings : 48 2025-05-07T19:43:00.2829894Z core id : 7 2025-05-07T19:43:00.2830100Z cpu cores : 24 2025-05-07T19:43:00.2830290Z apicid : 78 2025-05-07T19:43:00.2830496Z initial apicid : 78 2025-05-07T19:43:00.2830690Z fpu : yes 2025-05-07T19:43:00.2830896Z fpu_exception : yes 2025-05-07T19:43:00.2831094Z cpuid level : 13 2025-05-07T19:43:00.2831302Z wp : yes 2025-05-07T19:43:00.2833897Z flags : fpu vme de pse tsc msr pae mce cx8 apic sep mtrr pge mca cmov pat pse36 clflush mmx fxsr sse sse2 ss ht syscall nx pdpe1gb rdtscp lm constant_tsc arch_perfmon rep_good nopl xtopology nonstop_tsc cpuid aperfmperf tsc_known_freq pni pclmulqdq monitor ssse3 fma cx16 pcid sse4_1 sse4_2 x2apic movbe popcnt tsc_deadline_timer aes xsave avx f16c rdrand hypervisor lahf_lm abm 3dnowprefetch invpcid_single pti fsgsbase tsc_adjust bmi1 avx2 smep bmi2 erms invpcid mpx avx512f avx512dq rdseed adx smap clflushopt clwb avx512cd avx512bw avx512vl xsaveopt xsavec xgetbv1 xsaves ida arat pku ospke avx512_vnni 2025-05-07T19:43:00.2836622Z bugs : cpu_meltdown spectre_v1 spectre_v2 spec_store_bypass l1tf mds swapgs itlb_multihit mmio_stale_data retbleed gds bhi 2025-05-07T19:43:00.2837227Z bogomips : 5999.99 2025-05-07T19:43:00.2837464Z clflush size : 64 2025-05-07T19:43:00.2837677Z cache_alignment : 64 2025-05-07T19:43:00.2837979Z address sizes : 46 bits physical, 48 bits virtual 2025-05-07T19:43:00.2838302Z power management: 2025-05-07T19:43:00.2838440Z 2025-05-07T19:43:00.2838535Z processor : 32 2025-05-07T19:43:00.2838751Z vendor_id : GenuineIntel 2025-05-07T19:43:00.2839002Z cpu family : 6 2025-05-07T19:43:00.2839203Z model : 85 2025-05-07T19:43:00.2839499Z model name : Intel(R) Xeon(R) Platinum 8275CL CPU @ 3.00GHz 2025-05-07T19:43:00.2839850Z stepping : 7 2025-05-07T19:43:00.2840081Z microcode : 0x5003901 2025-05-07T19:43:00.2840321Z cpu MHz : 2999.996 2025-05-07T19:43:00.2840533Z cache size : 36608 KB 2025-05-07T19:43:00.2840766Z physical id : 1 2025-05-07T19:43:00.2840982Z siblings : 48 2025-05-07T19:43:00.2841198Z core id : 8 2025-05-07T19:43:00.2841407Z cpu cores : 24 2025-05-07T19:43:00.2841633Z apicid : 80 2025-05-07T19:43:00.2841830Z initial apicid : 80 2025-05-07T19:43:00.2842058Z fpu : yes 2025-05-07T19:43:00.2842261Z fpu_exception : yes 2025-05-07T19:43:00.2842499Z cpuid level : 13 2025-05-07T19:43:00.2842715Z wp : yes 2025-05-07T19:43:00.2845062Z flags : fpu vme de pse tsc msr pae mce cx8 apic sep mtrr pge mca cmov pat pse36 clflush mmx fxsr sse sse2 ss ht syscall nx pdpe1gb rdtscp lm constant_tsc arch_perfmon rep_good nopl xtopology nonstop_tsc cpuid aperfmperf tsc_known_freq pni pclmulqdq monitor ssse3 fma cx16 pcid sse4_1 sse4_2 x2apic movbe popcnt tsc_deadline_timer aes xsave avx f16c rdrand hypervisor lahf_lm abm 3dnowprefetch invpcid_single pti fsgsbase tsc_adjust bmi1 avx2 smep bmi2 erms invpcid mpx avx512f avx512dq rdseed adx smap clflushopt clwb avx512cd avx512bw avx512vl xsaveopt xsavec xgetbv1 xsaves ida arat pku ospke avx512_vnni 2025-05-07T19:43:00.2847526Z bugs : cpu_meltdown spectre_v1 spectre_v2 spec_store_bypass l1tf mds swapgs itlb_multihit mmio_stale_data retbleed gds bhi 2025-05-07T19:43:00.2848069Z bogomips : 5999.99 2025-05-07T19:43:00.2848278Z clflush size : 64 2025-05-07T19:43:00.2848482Z cache_alignment : 64 2025-05-07T19:43:00.2848728Z address sizes : 46 bits physical, 48 bits virtual 2025-05-07T19:43:00.2849035Z power management: 2025-05-07T19:43:00.2849161Z 2025-05-07T19:43:00.2849237Z processor : 33 2025-05-07T19:43:00.2849437Z vendor_id : GenuineIntel 2025-05-07T19:43:00.2849654Z cpu family : 6 2025-05-07T19:43:00.2849843Z model : 85 2025-05-07T19:43:00.2850094Z model name : Intel(R) Xeon(R) Platinum 8275CL CPU @ 3.00GHz 2025-05-07T19:43:00.2850422Z stepping : 7 2025-05-07T19:43:00.2850602Z microcode : 0x5003901 2025-05-07T19:43:00.2850823Z cpu MHz : 2999.996 2025-05-07T19:43:00.2851027Z cache size : 36608 KB 2025-05-07T19:43:00.2851226Z physical id : 1 2025-05-07T19:43:00.2851430Z siblings : 48 2025-05-07T19:43:00.2851626Z core id : 9 2025-05-07T19:43:00.2851827Z cpu cores : 24 2025-05-07T19:43:00.2852018Z apicid : 82 2025-05-07T19:43:00.2852214Z initial apicid : 82 2025-05-07T19:43:00.2852415Z fpu : yes 2025-05-07T19:43:00.2852620Z fpu_exception : yes 2025-05-07T19:43:00.2852820Z cpuid level : 13 2025-05-07T19:43:00.2853009Z wp : yes 2025-05-07T19:43:00.2855109Z flags : fpu vme de pse tsc msr pae mce cx8 apic sep mtrr pge mca cmov pat pse36 clflush mmx fxsr sse sse2 ss ht syscall nx pdpe1gb rdtscp lm constant_tsc arch_perfmon rep_good nopl xtopology nonstop_tsc cpuid aperfmperf tsc_known_freq pni pclmulqdq monitor ssse3 fma cx16 pcid sse4_1 sse4_2 x2apic movbe popcnt tsc_deadline_timer aes xsave avx f16c rdrand hypervisor lahf_lm abm 3dnowprefetch invpcid_single pti fsgsbase tsc_adjust bmi1 avx2 smep bmi2 erms invpcid mpx avx512f avx512dq rdseed adx smap clflushopt clwb avx512cd avx512bw avx512vl xsaveopt xsavec xgetbv1 xsaves ida arat pku ospke avx512_vnni 2025-05-07T19:43:00.2857665Z bugs : cpu_meltdown spectre_v1 spectre_v2 spec_store_bypass l1tf mds swapgs itlb_multihit mmio_stale_data retbleed gds bhi 2025-05-07T19:43:00.2858211Z bogomips : 5999.99 2025-05-07T19:43:00.2858416Z clflush size : 64 2025-05-07T19:43:00.2858608Z cache_alignment : 64 2025-05-07T19:43:00.2858858Z address sizes : 46 bits physical, 48 bits virtual 2025-05-07T19:43:00.2859148Z power management: 2025-05-07T19:43:00.2859271Z 2025-05-07T19:43:00.2859355Z processor : 34 2025-05-07T19:43:00.2859550Z vendor_id : GenuineIntel 2025-05-07T19:43:00.2859763Z cpu family : 6 2025-05-07T19:43:00.2859945Z model : 85 2025-05-07T19:43:00.2860204Z model name : Intel(R) Xeon(R) Platinum 8275CL CPU @ 3.00GHz 2025-05-07T19:43:00.2860522Z stepping : 7 2025-05-07T19:43:00.2860720Z microcode : 0x5003901 2025-05-07T19:43:00.2860931Z cpu MHz : 2999.996 2025-05-07T19:43:00.2861128Z cache size : 36608 KB 2025-05-07T19:43:00.2861343Z physical id : 1 2025-05-07T19:43:00.2861529Z siblings : 48 2025-05-07T19:43:00.2861726Z core id : 10 2025-05-07T19:43:00.2861915Z cpu cores : 24 2025-05-07T19:43:00.2862138Z apicid : 84 2025-05-07T19:43:00.2862334Z initial apicid : 84 2025-05-07T19:43:00.2862545Z fpu : yes 2025-05-07T19:43:00.2862731Z fpu_exception : yes 2025-05-07T19:43:00.2862955Z cpuid level : 13 2025-05-07T19:43:00.2863036Z wp : yes 2025-05-07T19:43:00.2865369Z flags : fpu vme de pse tsc msr pae mce cx8 apic sep mtrr pge mca cmov pat pse36 clflush mmx fxsr sse sse2 ss ht syscall nx pdpe1gb rdtscp lm constant_tsc arch_perfmon rep_good nopl xtopology nonstop_tsc cpuid aperfmperf tsc_known_freq pni pclmulqdq monitor ssse3 fma cx16 pcid sse4_1 sse4_2 x2apic movbe popcnt tsc_deadline_timer aes xsave avx f16c rdrand hypervisor lahf_lm abm 3dnowprefetch invpcid_single pti fsgsbase tsc_adjust bmi1 avx2 smep bmi2 erms invpcid mpx avx512f avx512dq rdseed adx smap clflushopt clwb avx512cd avx512bw avx512vl xsaveopt xsavec xgetbv1 xsaves ida arat pku ospke avx512_vnni 2025-05-07T19:43:00.2865784Z bugs : cpu_meltdown spectre_v1 spectre_v2 spec_store_bypass l1tf mds swapgs itlb_multihit mmio_stale_data retbleed gds bhi 2025-05-07T19:43:00.2865879Z bogomips : 5999.99 2025-05-07T19:43:00.2866071Z clflush size : 64 2025-05-07T19:43:00.2866161Z cache_alignment : 64 2025-05-07T19:43:00.2866296Z address sizes : 46 bits physical, 48 bits virtual 2025-05-07T19:43:00.2866386Z power management: 2025-05-07T19:43:00.2866390Z 2025-05-07T19:43:00.2866490Z processor : 35 2025-05-07T19:43:00.2866589Z vendor_id : GenuineIntel 2025-05-07T19:43:00.2866671Z cpu family : 6 2025-05-07T19:43:00.2866767Z model : 85 2025-05-07T19:43:00.2866938Z model name : Intel(R) Xeon(R) Platinum 8275CL CPU @ 3.00GHz 2025-05-07T19:43:00.2867020Z stepping : 7 2025-05-07T19:43:00.2867110Z microcode : 0x5003901 2025-05-07T19:43:00.2867206Z cpu MHz : 2999.996 2025-05-07T19:43:00.2867303Z cache size : 36608 KB 2025-05-07T19:43:00.2867393Z physical id : 1 2025-05-07T19:43:00.2867492Z siblings : 48 2025-05-07T19:43:00.2867574Z core id : 11 2025-05-07T19:43:00.2867662Z cpu cores : 24 2025-05-07T19:43:00.2867742Z apicid : 86 2025-05-07T19:43:00.2867850Z initial apicid : 86 2025-05-07T19:43:00.2867935Z fpu : yes 2025-05-07T19:43:00.2868028Z fpu_exception : yes 2025-05-07T19:43:00.2868131Z cpuid level : 13 2025-05-07T19:43:00.2868217Z wp : yes 2025-05-07T19:43:00.2870399Z flags : fpu vme de pse tsc msr pae mce cx8 apic sep mtrr pge mca cmov pat pse36 clflush mmx fxsr sse sse2 ss ht syscall nx pdpe1gb rdtscp lm constant_tsc arch_perfmon rep_good nopl xtopology nonstop_tsc cpuid aperfmperf tsc_known_freq pni pclmulqdq monitor ssse3 fma cx16 pcid sse4_1 sse4_2 x2apic movbe popcnt tsc_deadline_timer aes xsave avx f16c rdrand hypervisor lahf_lm abm 3dnowprefetch invpcid_single pti fsgsbase tsc_adjust bmi1 avx2 smep bmi2 erms invpcid mpx avx512f avx512dq rdseed adx smap clflushopt clwb avx512cd avx512bw avx512vl xsaveopt xsavec xgetbv1 xsaves ida arat pku ospke avx512_vnni 2025-05-07T19:43:00.2870907Z bugs : cpu_meltdown spectre_v1 spectre_v2 spec_store_bypass l1tf mds swapgs itlb_multihit mmio_stale_data retbleed gds bhi 2025-05-07T19:43:00.2870996Z bogomips : 5999.99 2025-05-07T19:43:00.2871145Z clflush size : 64 2025-05-07T19:43:00.2871261Z cache_alignment : 64 2025-05-07T19:43:00.2871468Z address sizes : 46 bits physical, 48 bits virtual 2025-05-07T19:43:00.2871560Z power management: 2025-05-07T19:43:00.2871564Z 2025-05-07T19:43:00.2871658Z processor : 36 2025-05-07T19:43:00.2871774Z vendor_id : GenuineIntel 2025-05-07T19:43:00.2871857Z cpu family : 6 2025-05-07T19:43:00.2871943Z model : 85 2025-05-07T19:43:00.2872136Z model name : Intel(R) Xeon(R) Platinum 8275CL CPU @ 3.00GHz 2025-05-07T19:43:00.2872225Z stepping : 7 2025-05-07T19:43:00.2872315Z microcode : 0x5003901 2025-05-07T19:43:00.2872404Z cpu MHz : 2999.996 2025-05-07T19:43:00.2872516Z cache size : 36608 KB 2025-05-07T19:43:00.2872605Z physical id : 1 2025-05-07T19:43:00.2872695Z siblings : 48 2025-05-07T19:43:00.2872798Z core id : 12 2025-05-07T19:43:00.2872884Z cpu cores : 24 2025-05-07T19:43:00.2872970Z apicid : 88 2025-05-07T19:43:00.2873063Z initial apicid : 88 2025-05-07T19:43:00.2873161Z fpu : yes 2025-05-07T19:43:00.2873251Z fpu_exception : yes 2025-05-07T19:43:00.2873337Z cpuid level : 13 2025-05-07T19:43:00.2873438Z wp : yes 2025-05-07T19:43:00.2875606Z flags : fpu vme de pse tsc msr pae mce cx8 apic sep mtrr pge mca cmov pat pse36 clflush mmx fxsr sse sse2 ss ht syscall nx pdpe1gb rdtscp lm constant_tsc arch_perfmon rep_good nopl xtopology nonstop_tsc cpuid aperfmperf tsc_known_freq pni pclmulqdq monitor ssse3 fma cx16 pcid sse4_1 sse4_2 x2apic movbe popcnt tsc_deadline_timer aes xsave avx f16c rdrand hypervisor lahf_lm abm 3dnowprefetch invpcid_single pti fsgsbase tsc_adjust bmi1 avx2 smep bmi2 erms invpcid mpx avx512f avx512dq rdseed adx smap clflushopt clwb avx512cd avx512bw avx512vl xsaveopt xsavec xgetbv1 xsaves ida arat pku ospke avx512_vnni 2025-05-07T19:43:00.2876010Z bugs : cpu_meltdown spectre_v1 spectre_v2 spec_store_bypass l1tf mds swapgs itlb_multihit mmio_stale_data retbleed gds bhi 2025-05-07T19:43:00.2876118Z bogomips : 5999.99 2025-05-07T19:43:00.2876203Z clflush size : 64 2025-05-07T19:43:00.2876299Z cache_alignment : 64 2025-05-07T19:43:00.2876452Z address sizes : 46 bits physical, 48 bits virtual 2025-05-07T19:43:00.2876545Z power management: 2025-05-07T19:43:00.2876550Z 2025-05-07T19:43:00.2876634Z processor : 37 2025-05-07T19:43:00.2876724Z vendor_id : GenuineIntel 2025-05-07T19:43:00.2876826Z cpu family : 6 2025-05-07T19:43:00.2876906Z model : 85 2025-05-07T19:43:00.2877074Z model name : Intel(R) Xeon(R) Platinum 8275CL CPU @ 3.00GHz 2025-05-07T19:43:00.2877172Z stepping : 7 2025-05-07T19:43:00.2877267Z microcode : 0x5003901 2025-05-07T19:43:00.2877350Z cpu MHz : 3354.414 2025-05-07T19:43:00.2877437Z cache size : 36608 KB 2025-05-07T19:43:00.2877538Z physical id : 1 2025-05-07T19:43:00.2877622Z siblings : 48 2025-05-07T19:43:00.2877709Z core id : 13 2025-05-07T19:43:00.2877815Z cpu cores : 24 2025-05-07T19:43:00.2877898Z apicid : 90 2025-05-07T19:43:00.2877984Z initial apicid : 90 2025-05-07T19:43:00.2878067Z fpu : yes 2025-05-07T19:43:00.2878171Z fpu_exception : yes 2025-05-07T19:43:00.2878260Z cpuid level : 13 2025-05-07T19:43:00.2878339Z wp : yes 2025-05-07T19:43:00.2880527Z flags : fpu vme de pse tsc msr pae mce cx8 apic sep mtrr pge mca cmov pat pse36 clflush mmx fxsr sse sse2 ss ht syscall nx pdpe1gb rdtscp lm constant_tsc arch_perfmon rep_good nopl xtopology nonstop_tsc cpuid aperfmperf tsc_known_freq pni pclmulqdq monitor ssse3 fma cx16 pcid sse4_1 sse4_2 x2apic movbe popcnt tsc_deadline_timer aes xsave avx f16c rdrand hypervisor lahf_lm abm 3dnowprefetch invpcid_single pti fsgsbase tsc_adjust bmi1 avx2 smep bmi2 erms invpcid mpx avx512f avx512dq rdseed adx smap clflushopt clwb avx512cd avx512bw avx512vl xsaveopt xsavec xgetbv1 xsaves ida arat pku ospke avx512_vnni 2025-05-07T19:43:00.2880926Z bugs : cpu_meltdown spectre_v1 spectre_v2 spec_store_bypass l1tf mds swapgs itlb_multihit mmio_stale_data retbleed gds bhi 2025-05-07T19:43:00.2881077Z bogomips : 5999.99 2025-05-07T19:43:00.2881180Z clflush size : 64 2025-05-07T19:43:00.2881274Z cache_alignment : 64 2025-05-07T19:43:00.2881462Z address sizes : 46 bits physical, 48 bits virtual 2025-05-07T19:43:00.2881568Z power management: 2025-05-07T19:43:00.2881572Z 2025-05-07T19:43:00.2881661Z processor : 38 2025-05-07T19:43:00.2881755Z vendor_id : GenuineIntel 2025-05-07T19:43:00.2881846Z cpu family : 6 2025-05-07T19:43:00.2881945Z model : 85 2025-05-07T19:43:00.2882110Z model name : Intel(R) Xeon(R) Platinum 8275CL CPU @ 3.00GHz 2025-05-07T19:43:00.2882200Z stepping : 7 2025-05-07T19:43:00.2882328Z microcode : 0x5003901 2025-05-07T19:43:00.2882425Z cpu MHz : 2999.996 2025-05-07T19:43:00.2882523Z cache size : 36608 KB 2025-05-07T19:43:00.2882621Z physical id : 1 2025-05-07T19:43:00.2882749Z siblings : 48 2025-05-07T19:43:00.2882843Z core id : 14 2025-05-07T19:43:00.2882942Z cpu cores : 24 2025-05-07T19:43:00.2883055Z apicid : 92 2025-05-07T19:43:00.2883149Z initial apicid : 92 2025-05-07T19:43:00.2883242Z fpu : yes 2025-05-07T19:43:00.2883445Z fpu_exception : yes 2025-05-07T19:43:00.2883558Z cpuid level : 13 2025-05-07T19:43:00.2883646Z wp : yes 2025-05-07T19:43:00.2885658Z flags : fpu vme de pse tsc msr pae mce cx8 apic sep mtrr pge mca cmov pat pse36 clflush mmx fxsr sse sse2 ss ht syscall nx pdpe1gb rdtscp lm constant_tsc arch_perfmon rep_good nopl xtopology nonstop_tsc cpuid aperfmperf tsc_known_freq pni pclmulqdq monitor ssse3 fma cx16 pcid sse4_1 sse4_2 x2apic movbe popcnt tsc_deadline_timer aes xsave avx f16c rdrand hypervisor lahf_lm abm 3dnowprefetch invpcid_single pti fsgsbase tsc_adjust bmi1 avx2 smep bmi2 erms invpcid mpx avx512f avx512dq rdseed adx smap clflushopt clwb avx512cd avx512bw avx512vl xsaveopt xsavec xgetbv1 xsaves ida arat pku ospke avx512_vnni 2025-05-07T19:43:00.2886053Z bugs : cpu_meltdown spectre_v1 spectre_v2 spec_store_bypass l1tf mds swapgs itlb_multihit mmio_stale_data retbleed gds bhi 2025-05-07T19:43:00.2886148Z bogomips : 5999.99 2025-05-07T19:43:00.2886237Z clflush size : 64 2025-05-07T19:43:00.2886355Z cache_alignment : 64 2025-05-07T19:43:00.2886491Z address sizes : 46 bits physical, 48 bits virtual 2025-05-07T19:43:00.2886587Z power management: 2025-05-07T19:43:00.2886590Z 2025-05-07T19:43:00.2886714Z processor : 39 2025-05-07T19:43:00.2886816Z vendor_id : GenuineIntel 2025-05-07T19:43:00.2886908Z cpu family : 6 2025-05-07T19:43:00.2886998Z model : 85 2025-05-07T19:43:00.2887194Z model name : Intel(R) Xeon(R) Platinum 8275CL CPU @ 3.00GHz 2025-05-07T19:43:00.2887286Z stepping : 7 2025-05-07T19:43:00.2887382Z microcode : 0x5003901 2025-05-07T19:43:00.2887506Z cpu MHz : 2999.996 2025-05-07T19:43:00.2887602Z cache size : 36608 KB 2025-05-07T19:43:00.2887697Z physical id : 1 2025-05-07T19:43:00.2887787Z siblings : 48 2025-05-07T19:43:00.2887905Z core id : 15 2025-05-07T19:43:00.2887999Z cpu cores : 24 2025-05-07T19:43:00.2888095Z apicid : 94 2025-05-07T19:43:00.2888197Z initial apicid : 94 2025-05-07T19:43:00.2888312Z fpu : yes 2025-05-07T19:43:00.2888404Z fpu_exception : yes 2025-05-07T19:43:00.2888500Z cpuid level : 13 2025-05-07T19:43:00.2888613Z wp : yes 2025-05-07T19:43:00.2890641Z flags : fpu vme de pse tsc msr pae mce cx8 apic sep mtrr pge mca cmov pat pse36 clflush mmx fxsr sse sse2 ss ht syscall nx pdpe1gb rdtscp lm constant_tsc arch_perfmon rep_good nopl xtopology nonstop_tsc cpuid aperfmperf tsc_known_freq pni pclmulqdq monitor ssse3 fma cx16 pcid sse4_1 sse4_2 x2apic movbe popcnt tsc_deadline_timer aes xsave avx f16c rdrand hypervisor lahf_lm abm 3dnowprefetch invpcid_single pti fsgsbase tsc_adjust bmi1 avx2 smep bmi2 erms invpcid mpx avx512f avx512dq rdseed adx smap clflushopt clwb avx512cd avx512bw avx512vl xsaveopt xsavec xgetbv1 xsaves ida arat pku ospke avx512_vnni 2025-05-07T19:43:00.2891021Z bugs : cpu_meltdown spectre_v1 spectre_v2 spec_store_bypass l1tf mds swapgs itlb_multihit mmio_stale_data retbleed gds bhi 2025-05-07T19:43:00.2891189Z bogomips : 5999.99 2025-05-07T19:43:00.2891285Z clflush size : 64 2025-05-07T19:43:00.2891381Z cache_alignment : 64 2025-05-07T19:43:00.2891555Z address sizes : 46 bits physical, 48 bits virtual 2025-05-07T19:43:00.2891699Z power management: 2025-05-07T19:43:00.2891703Z 2025-05-07T19:43:00.2891798Z processor : 40 2025-05-07T19:43:00.2891925Z vendor_id : GenuineIntel 2025-05-07T19:43:00.2892022Z cpu family : 6 2025-05-07T19:43:00.2892107Z model : 85 2025-05-07T19:43:00.2892272Z model name : Intel(R) Xeon(R) Platinum 8275CL CPU @ 3.00GHz 2025-05-07T19:43:00.2892389Z stepping : 7 2025-05-07T19:43:00.2892482Z microcode : 0x5003901 2025-05-07T19:43:00.2892570Z cpu MHz : 2999.996 2025-05-07T19:43:00.2892665Z cache size : 36608 KB 2025-05-07T19:43:00.2892778Z physical id : 1 2025-05-07T19:43:00.2892866Z siblings : 48 2025-05-07T19:43:00.2892955Z core id : 16 2025-05-07T19:43:00.2893065Z cpu cores : 24 2025-05-07T19:43:00.2893148Z apicid : 96 2025-05-07T19:43:00.2893239Z initial apicid : 96 2025-05-07T19:43:00.2893327Z fpu : yes 2025-05-07T19:43:00.2893444Z fpu_exception : yes 2025-05-07T19:43:00.2893535Z cpuid level : 13 2025-05-07T19:43:00.2893621Z wp : yes 2025-05-07T19:43:00.2894166Z lspci: Unable to load libkmod resources: error -2 2025-05-07T19:43:00.2896198Z flags : fpu vme de pse tsc msr pae mce cx8 apic sep mtrr pge mca cmov pat pse36 clflush mmx fxsr sse sse2 ss ht syscall nx pdpe1gb rdtscp lm constant_tsc arch_perfmon rep_good nopl xtopology nonstop_tsc cpuid aperfmperf tsc_known_freq pni pclmulqdq monitor ssse3 fma cx16 pcid sse4_1 sse4_2 x2apic movbe popcnt tsc_deadline_timer aes xsave avx f16c rdrand hypervisor lahf_lm abm 3dnowprefetch invpcid_single pti fsgsbase tsc_adjust bmi1 avx2 smep bmi2 erms invpcid mpx avx512f avx512dq rdseed adx smap clflushopt clwb avx512cd avx512bw avx512vl xsaveopt xsavec xgetbv1 xsaves ida arat pku ospke avx512_vnni 2025-05-07T19:43:00.2896600Z bugs : cpu_meltdown spectre_v1 spectre_v2 spec_store_bypass l1tf mds swapgs itlb_multihit mmio_stale_data retbleed gds bhi 2025-05-07T19:43:00.2896699Z bogomips : 5999.99 2025-05-07T19:43:00.2896793Z clflush size : 64 2025-05-07T19:43:00.2896887Z cache_alignment : 64 2025-05-07T19:43:00.2897046Z address sizes : 46 bits physical, 48 bits virtual 2025-05-07T19:43:00.2897140Z power management: 2025-05-07T19:43:00.2897145Z 2025-05-07T19:43:00.2897237Z processor : 41 2025-05-07T19:43:00.2897362Z vendor_id : GenuineIntel 2025-05-07T19:43:00.2897456Z cpu family : 6 2025-05-07T19:43:00.2897543Z model : 85 2025-05-07T19:43:00.2897709Z model name : Intel(R) Xeon(R) Platinum 8275CL CPU @ 3.00GHz 2025-05-07T19:43:00.2897821Z stepping : 7 2025-05-07T19:43:00.2897915Z microcode : 0x5003901 2025-05-07T19:43:00.2898009Z cpu MHz : 3391.275 2025-05-07T19:43:00.2898124Z cache size : 36608 KB 2025-05-07T19:43:00.2898213Z physical id : 1 2025-05-07T19:43:00.2898300Z siblings : 48 2025-05-07T19:43:00.2898388Z core id : 17 2025-05-07T19:43:00.2898501Z cpu cores : 24 2025-05-07T19:43:00.2898592Z apicid : 98 2025-05-07T19:43:00.2898689Z initial apicid : 98 2025-05-07T19:43:00.2898776Z fpu : yes 2025-05-07T19:43:00.2898897Z fpu_exception : yes 2025-05-07T19:43:00.2898990Z cpuid level : 13 2025-05-07T19:43:00.2899076Z wp : yes 2025-05-07T19:43:00.2901139Z flags : fpu vme de pse tsc msr pae mce cx8 apic sep mtrr pge mca cmov pat pse36 clflush mmx fxsr sse sse2 ss ht syscall nx pdpe1gb rdtscp lm constant_tsc arch_perfmon rep_good nopl xtopology nonstop_tsc cpuid aperfmperf tsc_known_freq pni pclmulqdq monitor ssse3 fma cx16 pcid sse4_1 sse4_2 x2apic movbe popcnt tsc_deadline_timer aes xsave avx f16c rdrand hypervisor lahf_lm abm 3dnowprefetch invpcid_single pti fsgsbase tsc_adjust bmi1 avx2 smep bmi2 erms invpcid mpx avx512f avx512dq rdseed adx smap clflushopt clwb avx512cd avx512bw avx512vl xsaveopt xsavec xgetbv1 xsaves ida arat pku ospke avx512_vnni 2025-05-07T19:43:00.2901518Z bugs : cpu_meltdown spectre_v1 spectre_v2 spec_store_bypass l1tf mds swapgs itlb_multihit mmio_stale_data retbleed gds bhi 2025-05-07T19:43:00.2901713Z bogomips : 5999.99 2025-05-07T19:43:00.2901808Z clflush size : 64 2025-05-07T19:43:00.2901908Z cache_alignment : 64 2025-05-07T19:43:00.2902048Z address sizes : 46 bits physical, 48 bits virtual 2025-05-07T19:43:00.2902174Z power management: 2025-05-07T19:43:00.2902222Z 2025-05-07T19:43:00.2902321Z processor : 42 2025-05-07T19:43:00.2902425Z vendor_id : GenuineIntel 2025-05-07T19:43:00.2902547Z cpu family : 6 2025-05-07T19:43:00.2902637Z model : 85 2025-05-07T19:43:00.2902805Z model name : Intel(R) Xeon(R) Platinum 8275CL CPU @ 3.00GHz 2025-05-07T19:43:00.2902894Z stepping : 7 2025-05-07T19:43:00.2903015Z microcode : 0x5003901 2025-05-07T19:43:00.2903105Z cpu MHz : 3244.043 2025-05-07T19:43:00.2903196Z cache size : 36608 KB 2025-05-07T19:43:00.2903313Z physical id : 1 2025-05-07T19:43:00.2903405Z siblings : 48 2025-05-07T19:43:00.2903495Z core id : 18 2025-05-07T19:43:00.2903587Z cpu cores : 24 2025-05-07T19:43:00.2903707Z apicid : 100 2025-05-07T19:43:00.2903807Z initial apicid : 100 2025-05-07T19:43:00.2903902Z fpu : yes 2025-05-07T19:43:00.2903996Z fpu_exception : yes 2025-05-07T19:43:00.2904108Z cpuid level : 13 2025-05-07T19:43:00.2904196Z wp : yes 2025-05-07T19:43:00.2906207Z flags : fpu vme de pse tsc msr pae mce cx8 apic sep mtrr pge mca cmov pat pse36 clflush mmx fxsr sse sse2 ss ht syscall nx pdpe1gb rdtscp lm constant_tsc arch_perfmon rep_good nopl xtopology nonstop_tsc cpuid aperfmperf tsc_known_freq pni pclmulqdq monitor ssse3 fma cx16 pcid sse4_1 sse4_2 x2apic movbe popcnt tsc_deadline_timer aes xsave avx f16c rdrand hypervisor lahf_lm abm 3dnowprefetch invpcid_single pti fsgsbase tsc_adjust bmi1 avx2 smep bmi2 erms invpcid mpx avx512f avx512dq rdseed adx smap clflushopt clwb avx512cd avx512bw avx512vl xsaveopt xsavec xgetbv1 xsaves ida arat pku ospke avx512_vnni 2025-05-07T19:43:00.2906604Z bugs : cpu_meltdown spectre_v1 spectre_v2 spec_store_bypass l1tf mds swapgs itlb_multihit mmio_stale_data retbleed gds bhi 2025-05-07T19:43:00.2906698Z bogomips : 5999.99 2025-05-07T19:43:00.2906792Z clflush size : 64 2025-05-07T19:43:00.2906907Z cache_alignment : 64 2025-05-07T19:43:00.2907041Z address sizes : 46 bits physical, 48 bits virtual 2025-05-07T19:43:00.2907133Z power management: 2025-05-07T19:43:00.2907137Z 2025-05-07T19:43:00.2907251Z processor : 43 2025-05-07T19:43:00.2907348Z vendor_id : GenuineIntel 2025-05-07T19:43:00.2907434Z cpu family : 6 2025-05-07T19:43:00.2907547Z model : 85 2025-05-07T19:43:00.2907708Z model name : Intel(R) Xeon(R) Platinum 8275CL CPU @ 3.00GHz 2025-05-07T19:43:00.2907799Z stepping : 7 2025-05-07T19:43:00.2907893Z microcode : 0x5003901 2025-05-07T19:43:00.2908007Z cpu MHz : 2999.996 2025-05-07T19:43:00.2908098Z cache size : 36608 KB 2025-05-07T19:43:00.2908188Z physical id : 1 2025-05-07T19:43:00.2908278Z siblings : 48 2025-05-07T19:43:00.2908391Z core id : 19 2025-05-07T19:43:00.2908480Z cpu cores : 24 2025-05-07T19:43:00.2908572Z apicid : 102 2025-05-07T19:43:00.2908691Z initial apicid : 102 2025-05-07T19:43:00.2908776Z fpu : yes 2025-05-07T19:43:00.2908876Z fpu_exception : yes 2025-05-07T19:43:00.2908967Z cpuid level : 13 2025-05-07T19:43:00.2909076Z wp : yes 2025-05-07T19:43:00.2911110Z flags : fpu vme de pse tsc msr pae mce cx8 apic sep mtrr pge mca cmov pat pse36 clflush mmx fxsr sse sse2 ss ht syscall nx pdpe1gb rdtscp lm constant_tsc arch_perfmon rep_good nopl xtopology nonstop_tsc cpuid aperfmperf tsc_known_freq pni pclmulqdq monitor ssse3 fma cx16 pcid sse4_1 sse4_2 x2apic movbe popcnt tsc_deadline_timer aes xsave avx f16c rdrand hypervisor lahf_lm abm 3dnowprefetch invpcid_single pti fsgsbase tsc_adjust bmi1 avx2 smep bmi2 erms invpcid mpx avx512f avx512dq rdseed adx smap clflushopt clwb avx512cd avx512bw avx512vl xsaveopt xsavec xgetbv1 xsaves ida arat pku ospke avx512_vnni 2025-05-07T19:43:00.2911592Z bugs : cpu_meltdown spectre_v1 spectre_v2 spec_store_bypass l1tf mds swapgs itlb_multihit mmio_stale_data retbleed gds bhi 2025-05-07T19:43:00.2911685Z bogomips : 5999.99 2025-05-07T19:43:00.2911943Z clflush size : 64 2025-05-07T19:43:00.2912109Z cache_alignment : 64 2025-05-07T19:43:00.2912277Z address sizes : 46 bits physical, 48 bits virtual 2025-05-07T19:43:00.2912376Z power management: 2025-05-07T19:43:00.2912380Z 2025-05-07T19:43:00.2912477Z processor : 44 2025-05-07T19:43:00.2912655Z vendor_id : GenuineIntel 2025-05-07T19:43:00.2912770Z cpu family : 6 2025-05-07T19:43:00.2912863Z model : 85 2025-05-07T19:43:00.2913040Z model name : Intel(R) Xeon(R) Platinum 8275CL CPU @ 3.00GHz 2025-05-07T19:43:00.2913162Z stepping : 7 2025-05-07T19:43:00.2913260Z microcode : 0x5003901 2025-05-07T19:43:00.2913355Z cpu MHz : 2999.996 2025-05-07T19:43:00.2913478Z cache size : 36608 KB 2025-05-07T19:43:00.2913574Z physical id : 1 2025-05-07T19:43:00.2913669Z siblings : 48 2025-05-07T19:43:00.2913760Z core id : 20 2025-05-07T19:43:00.2913878Z cpu cores : 24 2025-05-07T19:43:00.2913973Z apicid : 104 2025-05-07T19:43:00.2914072Z initial apicid : 104 2025-05-07T19:43:00.2914189Z fpu : yes 2025-05-07T19:43:00.2914290Z fpu_exception : yes 2025-05-07T19:43:00.2914392Z cpuid level : 13 2025-05-07T19:43:00.2914488Z wp : yes 2025-05-07T19:43:00.2916706Z flags : fpu vme de pse tsc msr pae mce cx8 apic sep mtrr pge mca cmov pat pse36 clflush mmx fxsr sse sse2 ss ht syscall nx pdpe1gb rdtscp lm constant_tsc arch_perfmon rep_good nopl xtopology nonstop_tsc cpuid aperfmperf tsc_known_freq pni pclmulqdq monitor ssse3 fma cx16 pcid sse4_1 sse4_2 x2apic movbe popcnt tsc_deadline_timer aes xsave avx f16c rdrand hypervisor lahf_lm abm 3dnowprefetch invpcid_single pti fsgsbase tsc_adjust bmi1 avx2 smep bmi2 erms invpcid mpx avx512f avx512dq rdseed adx smap clflushopt clwb avx512cd avx512bw avx512vl xsaveopt xsavec xgetbv1 xsaves ida arat pku ospke avx512_vnni 2025-05-07T19:43:00.2917118Z bugs : cpu_meltdown spectre_v1 spectre_v2 spec_store_bypass l1tf mds swapgs itlb_multihit mmio_stale_data retbleed gds bhi 2025-05-07T19:43:00.2917244Z bogomips : 5999.99 2025-05-07T19:43:00.2917342Z clflush size : 64 2025-05-07T19:43:00.2917447Z cache_alignment : 64 2025-05-07T19:43:00.2917597Z address sizes : 46 bits physical, 48 bits virtual 2025-05-07T19:43:00.2917723Z power management: 2025-05-07T19:43:00.2917727Z 2025-05-07T19:43:00.2917826Z processor : 45 2025-05-07T19:43:00.2917931Z vendor_id : GenuineIntel 2025-05-07T19:43:00.2918054Z cpu family : 6 2025-05-07T19:43:00.2918147Z model : 85 2025-05-07T19:43:00.2918321Z model name : Intel(R) Xeon(R) Platinum 8275CL CPU @ 3.00GHz 2025-05-07T19:43:00.2918417Z stepping : 7 2025-05-07T19:43:00.2918541Z microcode : 0x5003901 2025-05-07T19:43:00.2918637Z cpu MHz : 3337.893 2025-05-07T19:43:00.2918735Z cache size : 36608 KB 2025-05-07T19:43:00.2918857Z physical id : 1 2025-05-07T19:43:00.2918950Z siblings : 48 2025-05-07T19:43:00.2919044Z core id : 21 2025-05-07T19:43:00.2919139Z cpu cores : 24 2025-05-07T19:43:00.2919260Z apicid : 106 2025-05-07T19:43:00.2919363Z initial apicid : 106 2025-05-07T19:43:00.2919453Z fpu : yes 2025-05-07T19:43:00.2919575Z fpu_exception : yes 2025-05-07T19:43:00.2919672Z cpuid level : 13 2025-05-07T19:43:00.2919769Z wp : yes 2025-05-07T19:43:00.2921983Z flags : fpu vme de pse tsc msr pae mce cx8 apic sep mtrr pge mca cmov pat pse36 clflush mmx fxsr sse sse2 ss ht syscall nx pdpe1gb rdtscp lm constant_tsc arch_perfmon rep_good nopl xtopology nonstop_tsc cpuid aperfmperf tsc_known_freq pni pclmulqdq monitor ssse3 fma cx16 pcid sse4_1 sse4_2 x2apic movbe popcnt tsc_deadline_timer aes xsave avx f16c rdrand hypervisor lahf_lm abm 3dnowprefetch invpcid_single pti fsgsbase tsc_adjust bmi1 avx2 smep bmi2 erms invpcid mpx avx512f avx512dq rdseed adx smap clflushopt clwb avx512cd avx512bw avx512vl xsaveopt xsavec xgetbv1 xsaves ida arat pku ospke avx512_vnni 2025-05-07T19:43:00.2922394Z bugs : cpu_meltdown spectre_v1 spectre_v2 spec_store_bypass l1tf mds swapgs itlb_multihit mmio_stale_data retbleed gds bhi 2025-05-07T19:43:00.2922495Z bogomips : 5999.99 2025-05-07T19:43:00.2922618Z clflush size : 64 2025-05-07T19:43:00.2922720Z cache_alignment : 64 2025-05-07T19:43:00.2922865Z address sizes : 46 bits physical, 48 bits virtual 2025-05-07T19:43:00.2923021Z power management: 2025-05-07T19:43:00.2923025Z 2025-05-07T19:43:00.2923152Z processor : 46 2025-05-07T19:43:00.2923299Z vendor_id : GenuineIntel 2025-05-07T19:43:00.2923399Z cpu family : 6 2025-05-07T19:43:00.2923526Z model : 85 2025-05-07T19:43:00.2923751Z model name : Intel(R) Xeon(R) Platinum 8275CL CPU @ 3.00GHz 2025-05-07T19:43:00.2923850Z stepping : 7 2025-05-07T19:43:00.2923949Z microcode : 0x5003901 2025-05-07T19:43:00.2924185Z cpu MHz : 2999.996 2025-05-07T19:43:00.2924282Z cache size : 36608 KB 2025-05-07T19:43:00.2924378Z physical id : 1 2025-05-07T19:43:00.2924504Z siblings : 48 2025-05-07T19:43:00.2924598Z core id : 22 2025-05-07T19:43:00.2924692Z cpu cores : 24 2025-05-07T19:43:00.2924786Z apicid : 108 2025-05-07T19:43:00.2924917Z initial apicid : 108 2025-05-07T19:43:00.2925012Z fpu : yes 2025-05-07T19:43:00.2925113Z fpu_exception : yes 2025-05-07T19:43:00.2925239Z cpuid level : 13 2025-05-07T19:43:00.2925336Z wp : yes 2025-05-07T19:43:00.2927363Z flags : fpu vme de pse tsc msr pae mce cx8 apic sep mtrr pge mca cmov pat pse36 clflush mmx fxsr sse sse2 ss ht syscall nx pdpe1gb rdtscp lm constant_tsc arch_perfmon rep_good nopl xtopology nonstop_tsc cpuid aperfmperf tsc_known_freq pni pclmulqdq monitor ssse3 fma cx16 pcid sse4_1 sse4_2 x2apic movbe popcnt tsc_deadline_timer aes xsave avx f16c rdrand hypervisor lahf_lm abm 3dnowprefetch invpcid_single pti fsgsbase tsc_adjust bmi1 avx2 smep bmi2 erms invpcid mpx avx512f avx512dq rdseed adx smap clflushopt clwb avx512cd avx512bw avx512vl xsaveopt xsavec xgetbv1 xsaves ida arat pku ospke avx512_vnni 2025-05-07T19:43:00.2927774Z bugs : cpu_meltdown spectre_v1 spectre_v2 spec_store_bypass l1tf mds swapgs itlb_multihit mmio_stale_data retbleed gds bhi 2025-05-07T19:43:00.2927871Z bogomips : 5999.99 2025-05-07T19:43:00.2927967Z clflush size : 64 2025-05-07T19:43:00.2928098Z cache_alignment : 64 2025-05-07T19:43:00.2928237Z address sizes : 46 bits physical, 48 bits virtual 2025-05-07T19:43:00.2928336Z power management: 2025-05-07T19:43:00.2928343Z 2025-05-07T19:43:00.2928439Z processor : 47 2025-05-07T19:43:00.2928569Z vendor_id : GenuineIntel 2025-05-07T19:43:00.2928662Z cpu family : 6 2025-05-07T19:43:00.2928756Z model : 85 2025-05-07T19:43:00.2928954Z model name : Intel(R) Xeon(R) Platinum 8275CL CPU @ 3.00GHz 2025-05-07T19:43:00.2929048Z stepping : 7 2025-05-07T19:43:00.2929147Z microcode : 0x5003901 2025-05-07T19:43:00.2929243Z cpu MHz : 2999.996 2025-05-07T19:43:00.2929373Z cache size : 36608 KB 2025-05-07T19:43:00.2929472Z physical id : 1 2025-05-07T19:43:00.2929568Z siblings : 48 2025-05-07T19:43:00.2929680Z core id : 23 2025-05-07T19:43:00.2929770Z cpu cores : 24 2025-05-07T19:43:00.2929865Z apicid : 110 2025-05-07T19:43:00.2929966Z initial apicid : 110 2025-05-07T19:43:00.2930085Z fpu : yes 2025-05-07T19:43:00.2930184Z fpu_exception : yes 2025-05-07T19:43:00.2930275Z cpuid level : 13 2025-05-07T19:43:00.2930369Z wp : yes 2025-05-07T19:43:00.2932416Z flags : fpu vme de pse tsc msr pae mce cx8 apic sep mtrr pge mca cmov pat pse36 clflush mmx fxsr sse sse2 ss ht syscall nx pdpe1gb rdtscp lm constant_tsc arch_perfmon rep_good nopl xtopology nonstop_tsc cpuid aperfmperf tsc_known_freq pni pclmulqdq monitor ssse3 fma cx16 pcid sse4_1 sse4_2 x2apic movbe popcnt tsc_deadline_timer aes xsave avx f16c rdrand hypervisor lahf_lm abm 3dnowprefetch invpcid_single pti fsgsbase tsc_adjust bmi1 avx2 smep bmi2 erms invpcid mpx avx512f avx512dq rdseed adx smap clflushopt clwb avx512cd avx512bw avx512vl xsaveopt xsavec xgetbv1 xsaves ida arat pku ospke avx512_vnni 2025-05-07T19:43:00.2932797Z bugs : cpu_meltdown spectre_v1 spectre_v2 spec_store_bypass l1tf mds swapgs itlb_multihit mmio_stale_data retbleed gds bhi 2025-05-07T19:43:00.2932913Z bogomips : 5999.99 2025-05-07T19:43:00.2933005Z clflush size : 64 2025-05-07T19:43:00.2933096Z cache_alignment : 64 2025-05-07T19:43:00.2933253Z address sizes : 46 bits physical, 48 bits virtual 2025-05-07T19:43:00.2933345Z power management: 2025-05-07T19:43:00.2933349Z 2025-05-07T19:43:00.2933488Z processor : 48 2025-05-07T19:43:00.2933585Z vendor_id : GenuineIntel 2025-05-07T19:43:00.2933697Z cpu family : 6 2025-05-07T19:43:00.2933782Z model : 85 2025-05-07T19:43:00.2933943Z model name : Intel(R) Xeon(R) Platinum 8275CL CPU @ 3.00GHz 2025-05-07T19:43:00.2934264Z stepping : 7 2025-05-07T19:43:00.2934363Z microcode : 0x5003901 2025-05-07T19:43:00.2934453Z cpu MHz : 2999.996 2025-05-07T19:43:00.2934547Z cache size : 36608 KB 2025-05-07T19:43:00.2934668Z physical id : 0 2025-05-07T19:43:00.2934760Z siblings : 48 2025-05-07T19:43:00.2934849Z core id : 0 2025-05-07T19:43:00.2934964Z cpu cores : 24 2025-05-07T19:43:00.2935053Z apicid : 1 2025-05-07T19:43:00.2935147Z initial apicid : 1 2025-05-07T19:43:00.2935235Z fpu : yes 2025-05-07T19:43:00.2935369Z fpu_exception : yes 2025-05-07T19:43:00.2935462Z cpuid level : 13 2025-05-07T19:43:00.2935557Z wp : yes 2025-05-07T19:43:00.2937600Z flags : fpu vme de pse tsc msr pae mce cx8 apic sep mtrr pge mca cmov pat pse36 clflush mmx fxsr sse sse2 ss ht syscall nx pdpe1gb rdtscp lm constant_tsc arch_perfmon rep_good nopl xtopology nonstop_tsc cpuid aperfmperf tsc_known_freq pni pclmulqdq monitor ssse3 fma cx16 pcid sse4_1 sse4_2 x2apic movbe popcnt tsc_deadline_timer aes xsave avx f16c rdrand hypervisor lahf_lm abm 3dnowprefetch invpcid_single pti fsgsbase tsc_adjust bmi1 avx2 smep bmi2 erms invpcid mpx avx512f avx512dq rdseed adx smap clflushopt clwb avx512cd avx512bw avx512vl xsaveopt xsavec xgetbv1 xsaves ida arat pku ospke avx512_vnni 2025-05-07T19:43:00.2937984Z bugs : cpu_meltdown spectre_v1 spectre_v2 spec_store_bypass l1tf mds swapgs itlb_multihit mmio_stale_data retbleed gds bhi 2025-05-07T19:43:00.2938076Z bogomips : 5999.99 2025-05-07T19:43:00.2955383Z clflush size : 64 2025-05-07T19:43:00.2955548Z cache_alignment : 64 2025-05-07T19:43:00.2955701Z address sizes : 46 bits physical, 48 bits virtual 2025-05-07T19:43:00.2955798Z power management: 2025-05-07T19:43:00.2955805Z 2025-05-07T19:43:00.2955899Z processor : 49 2025-05-07T19:43:00.2956007Z vendor_id : GenuineIntel 2025-05-07T19:43:00.2956108Z cpu family : 6 2025-05-07T19:43:00.2956198Z model : 85 2025-05-07T19:43:00.2956389Z model name : Intel(R) Xeon(R) Platinum 8275CL CPU @ 3.00GHz 2025-05-07T19:43:00.2956475Z stepping : 7 2025-05-07T19:43:00.2956577Z microcode : 0x5003901 2025-05-07T19:43:00.2956667Z cpu MHz : 2999.996 2025-05-07T19:43:00.2956774Z cache size : 36608 KB 2025-05-07T19:43:00.2956865Z physical id : 0 2025-05-07T19:43:00.2956960Z siblings : 48 2025-05-07T19:43:00.2957060Z core id : 1 2025-05-07T19:43:00.2957143Z cpu cores : 24 2025-05-07T19:43:00.2957229Z apicid : 3 2025-05-07T19:43:00.2957326Z initial apicid : 3 2025-05-07T19:43:00.2957426Z fpu : yes 2025-05-07T19:43:00.2957514Z fpu_exception : yes 2025-05-07T19:43:00.2957598Z cpuid level : 13 2025-05-07T19:43:00.2957696Z wp : yes 2025-05-07T19:43:00.2959919Z flags : fpu vme de pse tsc msr pae mce cx8 apic sep mtrr pge mca cmov pat pse36 clflush mmx fxsr sse sse2 ss ht syscall nx pdpe1gb rdtscp lm constant_tsc arch_perfmon rep_good nopl xtopology nonstop_tsc cpuid aperfmperf tsc_known_freq pni pclmulqdq monitor ssse3 fma cx16 pcid sse4_1 sse4_2 x2apic movbe popcnt tsc_deadline_timer aes xsave avx f16c rdrand hypervisor lahf_lm abm 3dnowprefetch invpcid_single pti fsgsbase tsc_adjust bmi1 avx2 smep bmi2 erms invpcid mpx avx512f avx512dq rdseed adx smap clflushopt clwb avx512cd avx512bw avx512vl xsaveopt xsavec xgetbv1 xsaves ida arat pku ospke avx512_vnni 2025-05-07T19:43:00.2960319Z bugs : cpu_meltdown spectre_v1 spectre_v2 spec_store_bypass l1tf mds swapgs itlb_multihit mmio_stale_data retbleed gds bhi 2025-05-07T19:43:00.2960430Z bogomips : 5999.99 2025-05-07T19:43:00.2960523Z clflush size : 64 2025-05-07T19:43:00.2960610Z cache_alignment : 64 2025-05-07T19:43:00.2960766Z address sizes : 46 bits physical, 48 bits virtual 2025-05-07T19:43:00.2960862Z power management: 2025-05-07T19:43:00.2960867Z 2025-05-07T19:43:00.2960955Z processor : 50 2025-05-07T19:43:00.2961053Z vendor_id : GenuineIntel 2025-05-07T19:43:00.2961148Z cpu family : 6 2025-05-07T19:43:00.2961373Z model : 85 2025-05-07T19:43:00.2961545Z model name : Intel(R) Xeon(R) Platinum 8275CL CPU @ 3.00GHz 2025-05-07T19:43:00.2961644Z stepping : 7 2025-05-07T19:43:00.2961743Z microcode : 0x5003901 2025-05-07T19:43:00.2961892Z cpu MHz : 2999.996 2025-05-07T19:43:00.2961983Z cache size : 36608 KB 2025-05-07T19:43:00.2962088Z physical id : 0 2025-05-07T19:43:00.2962174Z siblings : 48 2025-05-07T19:43:00.2962260Z core id : 2 2025-05-07T19:43:00.2962359Z cpu cores : 24 2025-05-07T19:43:00.2962443Z apicid : 5 2025-05-07T19:43:00.2962535Z initial apicid : 5 2025-05-07T19:43:00.2962614Z fpu : yes 2025-05-07T19:43:00.2962718Z fpu_exception : yes 2025-05-07T19:43:00.2962807Z cpuid level : 13 2025-05-07T19:43:00.2962888Z wp : yes 2025-05-07T19:43:00.2965273Z flags : fpu vme de pse tsc msr pae mce cx8 apic sep mtrr pge mca cmov pat pse36 clflush mmx fxsr sse sse2 ss ht syscall nx pdpe1gb rdtscp lm constant_tsc arch_perfmon rep_good nopl xtopology nonstop_tsc cpuid aperfmperf tsc_known_freq pni pclmulqdq monitor ssse3 fma cx16 pcid sse4_1 sse4_2 x2apic movbe popcnt tsc_deadline_timer aes xsave avx f16c rdrand hypervisor lahf_lm abm 3dnowprefetch invpcid_single pti fsgsbase tsc_adjust bmi1 avx2 smep bmi2 erms invpcid mpx avx512f avx512dq rdseed adx smap clflushopt clwb avx512cd avx512bw avx512vl xsaveopt xsavec xgetbv1 xsaves ida arat pku ospke avx512_vnni 2025-05-07T19:43:00.2965688Z bugs : cpu_meltdown spectre_v1 spectre_v2 spec_store_bypass l1tf mds swapgs itlb_multihit mmio_stale_data retbleed gds bhi 2025-05-07T19:43:00.2965775Z bogomips : 5999.99 2025-05-07T19:43:00.2965893Z clflush size : 64 2025-05-07T19:43:00.2965988Z cache_alignment : 64 2025-05-07T19:43:00.2966134Z address sizes : 46 bits physical, 48 bits virtual 2025-05-07T19:43:00.2966242Z power management: 2025-05-07T19:43:00.2966247Z 2025-05-07T19:43:00.2966337Z processor : 51 2025-05-07T19:43:00.2966440Z vendor_id : GenuineIntel 2025-05-07T19:43:00.2966528Z cpu family : 6 2025-05-07T19:43:00.2966630Z model : 85 2025-05-07T19:43:00.2966801Z model name : Intel(R) Xeon(R) Platinum 8275CL CPU @ 3.00GHz 2025-05-07T19:43:00.2966893Z stepping : 7 2025-05-07T19:43:00.2966997Z microcode : 0x5003901 2025-05-07T19:43:00.2967079Z cpu MHz : 1198.821 2025-05-07T19:43:00.2967170Z cache size : 36608 KB 2025-05-07T19:43:00.2967254Z physical id : 0 2025-05-07T19:43:00.2967353Z siblings : 48 2025-05-07T19:43:00.2967434Z core id : 3 2025-05-07T19:43:00.2967527Z cpu cores : 24 2025-05-07T19:43:00.2967622Z apicid : 7 2025-05-07T19:43:00.2967711Z initial apicid : 7 2025-05-07T19:43:00.2967796Z fpu : yes 2025-05-07T19:43:00.2967893Z fpu_exception : yes 2025-05-07T19:43:00.2967983Z cpuid level : 13 2025-05-07T19:43:00.2968069Z wp : yes 2025-05-07T19:43:00.2970247Z flags : fpu vme de pse tsc msr pae mce cx8 apic sep mtrr pge mca cmov pat pse36 clflush mmx fxsr sse sse2 ss ht syscall nx pdpe1gb rdtscp lm constant_tsc arch_perfmon rep_good nopl xtopology nonstop_tsc cpuid aperfmperf tsc_known_freq pni pclmulqdq monitor ssse3 fma cx16 pcid sse4_1 sse4_2 x2apic movbe popcnt tsc_deadline_timer aes xsave avx f16c rdrand hypervisor lahf_lm abm 3dnowprefetch invpcid_single pti fsgsbase tsc_adjust bmi1 avx2 smep bmi2 erms invpcid mpx avx512f avx512dq rdseed adx smap clflushopt clwb avx512cd avx512bw avx512vl xsaveopt xsavec xgetbv1 xsaves ida arat pku ospke avx512_vnni 2025-05-07T19:43:00.2970662Z bugs : cpu_meltdown spectre_v1 spectre_v2 spec_store_bypass l1tf mds swapgs itlb_multihit mmio_stale_data retbleed gds bhi 2025-05-07T19:43:00.2970749Z bogomips : 5999.99 2025-05-07T19:43:00.2970837Z clflush size : 64 2025-05-07T19:43:00.2970951Z cache_alignment : 64 2025-05-07T19:43:00.2971083Z address sizes : 46 bits physical, 48 bits virtual 2025-05-07T19:43:00.2971174Z power management: 2025-05-07T19:43:00.2971178Z 2025-05-07T19:43:00.2971278Z processor : 52 2025-05-07T19:43:00.2971374Z vendor_id : GenuineIntel 2025-05-07T19:43:00.2971457Z cpu family : 6 2025-05-07T19:43:00.2971538Z model : 85 2025-05-07T19:43:00.2971712Z model name : Intel(R) Xeon(R) Platinum 8275CL CPU @ 3.00GHz 2025-05-07T19:43:00.2971900Z stepping : 7 2025-05-07T19:43:00.2971989Z microcode : 0x5003901 2025-05-07T19:43:00.2972079Z cpu MHz : 2999.996 2025-05-07T19:43:00.2972162Z cache size : 36608 KB 2025-05-07T19:43:00.2972315Z physical id : 0 2025-05-07T19:43:00.2972395Z siblings : 48 2025-05-07T19:43:00.2972484Z core id : 4 2025-05-07T19:43:00.2972566Z cpu cores : 24 2025-05-07T19:43:00.2972646Z apicid : 9 2025-05-07T19:43:00.2972727Z initial apicid : 9 2025-05-07T19:43:00.2972819Z fpu : yes 2025-05-07T19:43:00.2972903Z fpu_exception : yes 2025-05-07T19:43:00.2972987Z cpuid level : 13 2025-05-07T19:43:00.2973077Z wp : yes 2025-05-07T19:43:00.2975256Z flags : fpu vme de pse tsc msr pae mce cx8 apic sep mtrr pge mca cmov pat pse36 clflush mmx fxsr sse sse2 ss ht syscall nx pdpe1gb rdtscp lm constant_tsc arch_perfmon rep_good nopl xtopology nonstop_tsc cpuid aperfmperf tsc_known_freq pni pclmulqdq monitor ssse3 fma cx16 pcid sse4_1 sse4_2 x2apic movbe popcnt tsc_deadline_timer aes xsave avx f16c rdrand hypervisor lahf_lm abm 3dnowprefetch invpcid_single pti fsgsbase tsc_adjust bmi1 avx2 smep bmi2 erms invpcid mpx avx512f avx512dq rdseed adx smap clflushopt clwb avx512cd avx512bw avx512vl xsaveopt xsavec xgetbv1 xsaves ida arat pku ospke avx512_vnni 2025-05-07T19:43:00.2975659Z bugs : cpu_meltdown spectre_v1 spectre_v2 spec_store_bypass l1tf mds swapgs itlb_multihit mmio_stale_data retbleed gds bhi 2025-05-07T19:43:00.2975753Z bogomips : 5999.99 2025-05-07T19:43:00.2975837Z clflush size : 64 2025-05-07T19:43:00.2975922Z cache_alignment : 64 2025-05-07T19:43:00.2976064Z address sizes : 46 bits physical, 48 bits virtual 2025-05-07T19:43:00.2976153Z power management: 2025-05-07T19:43:00.2976158Z 2025-05-07T19:43:00.2976242Z processor : 53 2025-05-07T19:43:00.2976354Z vendor_id : GenuineIntel 2025-05-07T19:43:00.2976447Z cpu family : 6 2025-05-07T19:43:00.2976527Z model : 85 2025-05-07T19:43:00.2976688Z model name : Intel(R) Xeon(R) Platinum 8275CL CPU @ 3.00GHz 2025-05-07T19:43:00.2976902Z stepping : 7 2025-05-07T19:43:00.2976988Z microcode : 0x5003901 2025-05-07T19:43:00.2977069Z cpu MHz : 1201.719 2025-05-07T19:43:00.2977161Z cache size : 36608 KB 2025-05-07T19:43:00.2977246Z physical id : 0 2025-05-07T19:43:00.2977323Z siblings : 48 2025-05-07T19:43:00.2977401Z core id : 5 2025-05-07T19:43:00.2977500Z cpu cores : 24 2025-05-07T19:43:00.2977585Z apicid : 11 2025-05-07T19:43:00.2977671Z initial apicid : 11 2025-05-07T19:43:00.2977751Z fpu : yes 2025-05-07T19:43:00.2977846Z fpu_exception : yes 2025-05-07T19:43:00.2977928Z cpuid level : 13 2025-05-07T19:43:00.2978001Z wp : yes 2025-05-07T19:43:00.2980028Z flags : fpu vme de pse tsc msr pae mce cx8 apic sep mtrr pge mca cmov pat pse36 clflush mmx fxsr sse sse2 ss ht syscall nx pdpe1gb rdtscp lm constant_tsc arch_perfmon rep_good nopl xtopology nonstop_tsc cpuid aperfmperf tsc_known_freq pni pclmulqdq monitor ssse3 fma cx16 pcid sse4_1 sse4_2 x2apic movbe popcnt tsc_deadline_timer aes xsave avx f16c rdrand hypervisor lahf_lm abm 3dnowprefetch invpcid_single pti fsgsbase tsc_adjust bmi1 avx2 smep bmi2 erms invpcid mpx avx512f avx512dq rdseed adx smap clflushopt clwb avx512cd avx512bw avx512vl xsaveopt xsavec xgetbv1 xsaves ida arat pku ospke avx512_vnni 2025-05-07T19:43:00.2980402Z bugs : cpu_meltdown spectre_v1 spectre_v2 spec_store_bypass l1tf mds swapgs itlb_multihit mmio_stale_data retbleed gds bhi 2025-05-07T19:43:00.2980489Z bogomips : 5999.99 2025-05-07T19:43:00.2980583Z clflush size : 64 2025-05-07T19:43:00.2980667Z cache_alignment : 64 2025-05-07T19:43:00.2980805Z address sizes : 46 bits physical, 48 bits virtual 2025-05-07T19:43:00.2980904Z power management: 2025-05-07T19:43:00.2980908Z 2025-05-07T19:43:00.2980988Z processor : 54 2025-05-07T19:43:00.2981074Z vendor_id : GenuineIntel 2025-05-07T19:43:00.2981157Z cpu family : 6 2025-05-07T19:43:00.2981250Z model : 85 2025-05-07T19:43:00.2981409Z model name : Intel(R) Xeon(R) Platinum 8275CL CPU @ 3.00GHz 2025-05-07T19:43:00.2981494Z stepping : 7 2025-05-07T19:43:00.2981600Z microcode : 0x5003901 2025-05-07T19:43:00.2981732Z cpu MHz : 1199.534 2025-05-07T19:43:00.2981814Z cache size : 36608 KB 2025-05-07T19:43:00.2981899Z physical id : 0 2025-05-07T19:43:00.2981990Z siblings : 48 2025-05-07T19:43:00.2982065Z core id : 6 2025-05-07T19:43:00.2982193Z cpu cores : 24 2025-05-07T19:43:00.2982284Z apicid : 13 2025-05-07T19:43:00.2982371Z initial apicid : 13 2025-05-07T19:43:00.2982444Z fpu : yes 2025-05-07T19:43:00.2982535Z fpu_exception : yes 2025-05-07T19:43:00.2982638Z cpuid level : 13 2025-05-07T19:43:00.2982716Z wp : yes 2025-05-07T19:43:00.2984725Z flags : fpu vme de pse tsc msr pae mce cx8 apic sep mtrr pge mca cmov pat pse36 clflush mmx fxsr sse sse2 ss ht syscall nx pdpe1gb rdtscp lm constant_tsc arch_perfmon rep_good nopl xtopology nonstop_tsc cpuid aperfmperf tsc_known_freq pni pclmulqdq monitor ssse3 fma cx16 pcid sse4_1 sse4_2 x2apic movbe popcnt tsc_deadline_timer aes xsave avx f16c rdrand hypervisor lahf_lm abm 3dnowprefetch invpcid_single pti fsgsbase tsc_adjust bmi1 avx2 smep bmi2 erms invpcid mpx avx512f avx512dq rdseed adx smap clflushopt clwb avx512cd avx512bw avx512vl xsaveopt xsavec xgetbv1 xsaves ida arat pku ospke avx512_vnni 2025-05-07T19:43:00.2985112Z bugs : cpu_meltdown spectre_v1 spectre_v2 spec_store_bypass l1tf mds swapgs itlb_multihit mmio_stale_data retbleed gds bhi 2025-05-07T19:43:00.2985194Z bogomips : 5999.99 2025-05-07T19:43:00.2985277Z clflush size : 64 2025-05-07T19:43:00.2985382Z cache_alignment : 64 2025-05-07T19:43:00.2985505Z address sizes : 46 bits physical, 48 bits virtual 2025-05-07T19:43:00.2985589Z power management: 2025-05-07T19:43:00.2985593Z 2025-05-07T19:43:00.2985677Z processor : 55 2025-05-07T19:43:00.2985760Z vendor_id : GenuineIntel 2025-05-07T19:43:00.2985833Z cpu family : 6 2025-05-07T19:43:00.2985909Z model : 85 2025-05-07T19:43:00.2986067Z model name : Intel(R) Xeon(R) Platinum 8275CL CPU @ 3.00GHz 2025-05-07T19:43:00.2986143Z stepping : 7 2025-05-07T19:43:00.2986220Z microcode : 0x5003901 2025-05-07T19:43:00.2986309Z cpu MHz : 1204.615 2025-05-07T19:43:00.2986391Z cache size : 36608 KB 2025-05-07T19:43:00.2986467Z physical id : 0 2025-05-07T19:43:00.2986542Z siblings : 48 2025-05-07T19:43:00.2986630Z core id : 7 2025-05-07T19:43:00.2986704Z cpu cores : 24 2025-05-07T19:43:00.2986785Z apicid : 15 2025-05-07T19:43:00.2986890Z initial apicid : 15 2025-05-07T19:43:00.2986968Z fpu : yes 2025-05-07T19:43:00.2987049Z fpu_exception : yes 2025-05-07T19:43:00.2987136Z cpuid level : 13 2025-05-07T19:43:00.2987228Z wp : yes 2025-05-07T19:43:00.2989223Z flags : fpu vme de pse tsc msr pae mce cx8 apic sep mtrr pge mca cmov pat pse36 clflush mmx fxsr sse sse2 ss ht syscall nx pdpe1gb rdtscp lm constant_tsc arch_perfmon rep_good nopl xtopology nonstop_tsc cpuid aperfmperf tsc_known_freq pni pclmulqdq monitor ssse3 fma cx16 pcid sse4_1 sse4_2 x2apic movbe popcnt tsc_deadline_timer aes xsave avx f16c rdrand hypervisor lahf_lm abm 3dnowprefetch invpcid_single pti fsgsbase tsc_adjust bmi1 avx2 smep bmi2 erms invpcid mpx avx512f avx512dq rdseed adx smap clflushopt clwb avx512cd avx512bw avx512vl xsaveopt xsavec xgetbv1 xsaves ida arat pku ospke avx512_vnni 2025-05-07T19:43:00.2989609Z bugs : cpu_meltdown spectre_v1 spectre_v2 spec_store_bypass l1tf mds swapgs itlb_multihit mmio_stale_data retbleed gds bhi 2025-05-07T19:43:00.2989695Z bogomips : 5999.99 2025-05-07T19:43:00.2989784Z clflush size : 64 2025-05-07T19:43:00.2989865Z cache_alignment : 64 2025-05-07T19:43:00.2990002Z address sizes : 46 bits physical, 48 bits virtual 2025-05-07T19:43:00.2990090Z power management: 2025-05-07T19:43:00.2990094Z 2025-05-07T19:43:00.2990170Z processor : 56 2025-05-07T19:43:00.2990275Z vendor_id : GenuineIntel 2025-05-07T19:43:00.2990353Z cpu family : 6 2025-05-07T19:43:00.2990432Z model : 85 2025-05-07T19:43:00.2990582Z model name : Intel(R) Xeon(R) Platinum 8275CL CPU @ 3.00GHz 2025-05-07T19:43:00.2990672Z stepping : 7 2025-05-07T19:43:00.2990755Z microcode : 0x5003901 2025-05-07T19:43:00.2990830Z cpu MHz : 1198.743 2025-05-07T19:43:00.2990921Z cache size : 36608 KB 2025-05-07T19:43:00.2991050Z physical id : 0 2025-05-07T19:43:00.2991124Z siblings : 48 2025-05-07T19:43:00.2991197Z core id : 8 2025-05-07T19:43:00.2991288Z cpu cores : 24 2025-05-07T19:43:00.2991507Z apicid : 17 2025-05-07T19:43:00.2991657Z initial apicid : 17 2025-05-07T19:43:00.2991904Z fpu : yes 2025-05-07T19:43:00.2991990Z fpu_exception : yes 2025-05-07T19:43:00.2992070Z cpuid level : 13 2025-05-07T19:43:00.2992145Z wp : yes 2025-05-07T19:43:00.2994324Z flags : fpu vme de pse tsc msr pae mce cx8 apic sep mtrr pge mca cmov pat pse36 clflush mmx fxsr sse sse2 ss ht syscall nx pdpe1gb rdtscp lm constant_tsc arch_perfmon rep_good nopl xtopology nonstop_tsc cpuid aperfmperf tsc_known_freq pni pclmulqdq monitor ssse3 fma cx16 pcid sse4_1 sse4_2 x2apic movbe popcnt tsc_deadline_timer aes xsave avx f16c rdrand hypervisor lahf_lm abm 3dnowprefetch invpcid_single pti fsgsbase tsc_adjust bmi1 avx2 smep bmi2 erms invpcid mpx avx512f avx512dq rdseed adx smap clflushopt clwb avx512cd avx512bw avx512vl xsaveopt xsavec xgetbv1 xsaves ida arat pku ospke avx512_vnni 2025-05-07T19:43:00.2994720Z bugs : cpu_meltdown spectre_v1 spectre_v2 spec_store_bypass l1tf mds swapgs itlb_multihit mmio_stale_data retbleed gds bhi 2025-05-07T19:43:00.2994813Z bogomips : 5999.99 2025-05-07T19:43:00.2994899Z clflush size : 64 2025-05-07T19:43:00.2994985Z cache_alignment : 64 2025-05-07T19:43:00.2995120Z address sizes : 46 bits physical, 48 bits virtual 2025-05-07T19:43:00.2995216Z power management: 2025-05-07T19:43:00.2995220Z 2025-05-07T19:43:00.2995301Z processor : 57 2025-05-07T19:43:00.2995397Z vendor_id : GenuineIntel 2025-05-07T19:43:00.2995494Z cpu family : 6 2025-05-07T19:43:00.2995574Z model : 85 2025-05-07T19:43:00.2995741Z model name : Intel(R) Xeon(R) Platinum 8275CL CPU @ 3.00GHz 2025-05-07T19:43:00.2995826Z stepping : 7 2025-05-07T19:43:00.2995934Z microcode : 0x5003901 2025-05-07T19:43:00.2996018Z cpu MHz : 1199.432 2025-05-07T19:43:00.2996111Z cache size : 36608 KB 2025-05-07T19:43:00.2996214Z physical id : 0 2025-05-07T19:43:00.2996302Z siblings : 48 2025-05-07T19:43:00.2996389Z core id : 9 2025-05-07T19:43:00.2996480Z cpu cores : 24 2025-05-07T19:43:00.2996568Z apicid : 19 2025-05-07T19:43:00.2996654Z initial apicid : 19 2025-05-07T19:43:00.2996736Z fpu : yes 2025-05-07T19:43:00.2996834Z fpu_exception : yes 2025-05-07T19:43:00.2996930Z cpuid level : 13 2025-05-07T19:43:00.2997011Z wp : yes 2025-05-07T19:43:00.2999202Z flags : fpu vme de pse tsc msr pae mce cx8 apic sep mtrr pge mca cmov pat pse36 clflush mmx fxsr sse sse2 ss ht syscall nx pdpe1gb rdtscp lm constant_tsc arch_perfmon rep_good nopl xtopology nonstop_tsc cpuid aperfmperf tsc_known_freq pni pclmulqdq monitor ssse3 fma cx16 pcid sse4_1 sse4_2 x2apic movbe popcnt tsc_deadline_timer aes xsave avx f16c rdrand hypervisor lahf_lm abm 3dnowprefetch invpcid_single pti fsgsbase tsc_adjust bmi1 avx2 smep bmi2 erms invpcid mpx avx512f avx512dq rdseed adx smap clflushopt clwb avx512cd avx512bw avx512vl xsaveopt xsavec xgetbv1 xsaves ida arat pku ospke avx512_vnni 2025-05-07T19:43:00.2999615Z bugs : cpu_meltdown spectre_v1 spectre_v2 spec_store_bypass l1tf mds swapgs itlb_multihit mmio_stale_data retbleed gds bhi 2025-05-07T19:43:00.2999705Z bogomips : 5999.99 2025-05-07T19:43:00.2999804Z clflush size : 64 2025-05-07T19:43:00.2999897Z cache_alignment : 64 2025-05-07T19:43:00.3000029Z address sizes : 46 bits physical, 48 bits virtual 2025-05-07T19:43:00.3000117Z power management: 2025-05-07T19:43:00.3000121Z 2025-05-07T19:43:00.3000228Z processor : 58 2025-05-07T19:43:00.3000320Z vendor_id : GenuineIntel 2025-05-07T19:43:00.3000408Z cpu family : 6 2025-05-07T19:43:00.3000501Z model : 85 2025-05-07T19:43:00.3000671Z model name : Intel(R) Xeon(R) Platinum 8275CL CPU @ 3.00GHz 2025-05-07T19:43:00.3000759Z stepping : 7 2025-05-07T19:43:00.3000847Z microcode : 0x5003901 2025-05-07T19:43:00.3000947Z cpu MHz : 1199.520 2025-05-07T19:43:00.3001033Z cache size : 36608 KB 2025-05-07T19:43:00.3001118Z physical id : 0 2025-05-07T19:43:00.3001226Z siblings : 48 2025-05-07T19:43:00.3001356Z core id : 10 2025-05-07T19:43:00.3001438Z cpu cores : 24 2025-05-07T19:43:00.3001518Z apicid : 21 2025-05-07T19:43:00.3001626Z initial apicid : 21 2025-05-07T19:43:00.3001706Z fpu : yes 2025-05-07T19:43:00.3001795Z fpu_exception : yes 2025-05-07T19:43:00.3001927Z cpuid level : 13 2025-05-07T19:43:00.3002024Z wp : yes 2025-05-07T19:43:00.3004297Z flags : fpu vme de pse tsc msr pae mce cx8 apic sep mtrr pge mca cmov pat pse36 clflush mmx fxsr sse sse2 ss ht syscall nx pdpe1gb rdtscp lm constant_tsc arch_perfmon rep_good nopl xtopology nonstop_tsc cpuid aperfmperf tsc_known_freq pni pclmulqdq monitor ssse3 fma cx16 pcid sse4_1 sse4_2 x2apic movbe popcnt tsc_deadline_timer aes xsave avx f16c rdrand hypervisor lahf_lm abm 3dnowprefetch invpcid_single pti fsgsbase tsc_adjust bmi1 avx2 smep bmi2 erms invpcid mpx avx512f avx512dq rdseed adx smap clflushopt clwb avx512cd avx512bw avx512vl xsaveopt xsavec xgetbv1 xsaves ida arat pku ospke avx512_vnni 2025-05-07T19:43:00.3004676Z bugs : cpu_meltdown spectre_v1 spectre_v2 spec_store_bypass l1tf mds swapgs itlb_multihit mmio_stale_data retbleed gds bhi 2025-05-07T19:43:00.3004756Z bogomips : 5999.99 2025-05-07T19:43:00.3004840Z clflush size : 64 2025-05-07T19:43:00.3004928Z cache_alignment : 64 2025-05-07T19:43:00.3005067Z address sizes : 46 bits physical, 48 bits virtual 2025-05-07T19:43:00.3005155Z power management: 2025-05-07T19:43:00.3005159Z 2025-05-07T19:43:00.3005240Z processor : 59 2025-05-07T19:43:00.3005345Z vendor_id : GenuineIntel 2025-05-07T19:43:00.3005422Z cpu family : 6 2025-05-07T19:43:00.3005497Z model : 85 2025-05-07T19:43:00.3005660Z model name : Intel(R) Xeon(R) Platinum 8275CL CPU @ 3.00GHz 2025-05-07T19:43:00.3005737Z stepping : 7 2025-05-07T19:43:00.3005817Z microcode : 0x5003901 2025-05-07T19:43:00.3005897Z cpu MHz : 1199.974 2025-05-07T19:43:00.3005987Z cache size : 36608 KB 2025-05-07T19:43:00.3006069Z physical id : 0 2025-05-07T19:43:00.3006152Z siblings : 48 2025-05-07T19:43:00.3006230Z core id : 11 2025-05-07T19:43:00.3006321Z cpu cores : 24 2025-05-07T19:43:00.3006402Z apicid : 23 2025-05-07T19:43:00.3006482Z initial apicid : 23 2025-05-07T19:43:00.3006572Z fpu : yes 2025-05-07T19:43:00.3006651Z fpu_exception : yes 2025-05-07T19:43:00.3006725Z cpuid level : 13 2025-05-07T19:43:00.3006806Z wp : yes 2025-05-07T19:43:00.3008818Z flags : fpu vme de pse tsc msr pae mce cx8 apic sep mtrr pge mca cmov pat pse36 clflush mmx fxsr sse sse2 ss ht syscall nx pdpe1gb rdtscp lm constant_tsc arch_perfmon rep_good nopl xtopology nonstop_tsc cpuid aperfmperf tsc_known_freq pni pclmulqdq monitor ssse3 fma cx16 pcid sse4_1 sse4_2 x2apic movbe popcnt tsc_deadline_timer aes xsave avx f16c rdrand hypervisor lahf_lm abm 3dnowprefetch invpcid_single pti fsgsbase tsc_adjust bmi1 avx2 smep bmi2 erms invpcid mpx avx512f avx512dq rdseed adx smap clflushopt clwb avx512cd avx512bw avx512vl xsaveopt xsavec xgetbv1 xsaves ida arat pku ospke avx512_vnni 2025-05-07T19:43:00.3009180Z bugs : cpu_meltdown spectre_v1 spectre_v2 spec_store_bypass l1tf mds swapgs itlb_multihit mmio_stale_data retbleed gds bhi 2025-05-07T19:43:00.3009275Z bogomips : 5999.99 2025-05-07T19:43:00.3009356Z clflush size : 64 2025-05-07T19:43:00.3009438Z cache_alignment : 64 2025-05-07T19:43:00.3009563Z address sizes : 46 bits physical, 48 bits virtual 2025-05-07T19:43:00.3009660Z power management: 2025-05-07T19:43:00.3009664Z 2025-05-07T19:43:00.3009742Z processor : 60 2025-05-07T19:43:00.3009828Z vendor_id : GenuineIntel 2025-05-07T19:43:00.3009924Z cpu family : 6 2025-05-07T19:43:00.3009999Z model : 85 2025-05-07T19:43:00.3010170Z model name : Intel(R) Xeon(R) Platinum 8275CL CPU @ 3.00GHz 2025-05-07T19:43:00.3010256Z stepping : 7 2025-05-07T19:43:00.3010334Z microcode : 0x5003901 2025-05-07T19:43:00.3010409Z cpu MHz : 1200.573 2025-05-07T19:43:00.3010502Z cache size : 36608 KB 2025-05-07T19:43:00.3010582Z physical id : 0 2025-05-07T19:43:00.3010660Z siblings : 48 2025-05-07T19:43:00.3010734Z core id : 12 2025-05-07T19:43:00.3010830Z cpu cores : 24 2025-05-07T19:43:00.3010957Z apicid : 25 2025-05-07T19:43:00.3011033Z initial apicid : 25 2025-05-07T19:43:00.3011121Z fpu : yes 2025-05-07T19:43:00.3011204Z fpu_exception : yes 2025-05-07T19:43:00.3011285Z cpuid level : 13 2025-05-07T19:43:00.3011363Z wp : yes 2025-05-07T19:43:00.3013434Z flags : fpu vme de pse tsc msr pae mce cx8 apic sep mtrr pge mca cmov pat pse36 clflush mmx fxsr sse sse2 ss ht syscall nx pdpe1gb rdtscp lm constant_tsc arch_perfmon rep_good nopl xtopology nonstop_tsc cpuid aperfmperf tsc_known_freq pni pclmulqdq monitor ssse3 fma cx16 pcid sse4_1 sse4_2 x2apic movbe popcnt tsc_deadline_timer aes xsave avx f16c rdrand hypervisor lahf_lm abm 3dnowprefetch invpcid_single pti fsgsbase tsc_adjust bmi1 avx2 smep bmi2 erms invpcid mpx avx512f avx512dq rdseed adx smap clflushopt clwb avx512cd avx512bw avx512vl xsaveopt xsavec xgetbv1 xsaves ida arat pku ospke avx512_vnni 2025-05-07T19:43:00.3013803Z bugs : cpu_meltdown spectre_v1 spectre_v2 spec_store_bypass l1tf mds swapgs itlb_multihit mmio_stale_data retbleed gds bhi 2025-05-07T19:43:00.3013904Z bogomips : 5999.99 2025-05-07T19:43:00.3013984Z clflush size : 64 2025-05-07T19:43:00.3014066Z cache_alignment : 64 2025-05-07T19:43:00.3014189Z address sizes : 46 bits physical, 48 bits virtual 2025-05-07T19:43:00.3014293Z power management: 2025-05-07T19:43:00.3014297Z 2025-05-07T19:43:00.3014373Z processor : 61 2025-05-07T19:43:00.3014460Z vendor_id : GenuineIntel 2025-05-07T19:43:00.3014550Z cpu family : 6 2025-05-07T19:43:00.3014628Z model : 85 2025-05-07T19:43:00.3014776Z model name : Intel(R) Xeon(R) Platinum 8275CL CPU @ 3.00GHz 2025-05-07T19:43:00.3014865Z stepping : 7 2025-05-07T19:43:00.3014949Z microcode : 0x5003901 2025-05-07T19:43:00.3015028Z cpu MHz : 1201.063 2025-05-07T19:43:00.3015104Z cache size : 36608 KB 2025-05-07T19:43:00.3015196Z physical id : 0 2025-05-07T19:43:00.3015271Z siblings : 48 2025-05-07T19:43:00.3015346Z core id : 13 2025-05-07T19:43:00.3015427Z cpu cores : 24 2025-05-07T19:43:00.3015527Z apicid : 27 2025-05-07T19:43:00.3015604Z initial apicid : 27 2025-05-07T19:43:00.3015683Z fpu : yes 2025-05-07T19:43:00.3015779Z fpu_exception : yes 2025-05-07T19:43:00.3015858Z cpuid level : 13 2025-05-07T19:43:00.3015936Z wp : yes 2025-05-07T19:43:00.3017976Z flags : fpu vme de pse tsc msr pae mce cx8 apic sep mtrr pge mca cmov pat pse36 clflush mmx fxsr sse sse2 ss ht syscall nx pdpe1gb rdtscp lm constant_tsc arch_perfmon rep_good nopl xtopology nonstop_tsc cpuid aperfmperf tsc_known_freq pni pclmulqdq monitor ssse3 fma cx16 pcid sse4_1 sse4_2 x2apic movbe popcnt tsc_deadline_timer aes xsave avx f16c rdrand hypervisor lahf_lm abm 3dnowprefetch invpcid_single pti fsgsbase tsc_adjust bmi1 avx2 smep bmi2 erms invpcid mpx avx512f avx512dq rdseed adx smap clflushopt clwb avx512cd avx512bw avx512vl xsaveopt xsavec xgetbv1 xsaves ida arat pku ospke avx512_vnni 2025-05-07T19:43:00.3018342Z bugs : cpu_meltdown spectre_v1 spectre_v2 spec_store_bypass l1tf mds swapgs itlb_multihit mmio_stale_data retbleed gds bhi 2025-05-07T19:43:00.3018418Z bogomips : 5999.99 2025-05-07T19:43:00.3018528Z clflush size : 64 2025-05-07T19:43:00.3018613Z cache_alignment : 64 2025-05-07T19:43:00.3018735Z address sizes : 46 bits physical, 48 bits virtual 2025-05-07T19:43:00.3018813Z power management: 2025-05-07T19:43:00.3018826Z 2025-05-07T19:43:00.3018907Z processor : 62 2025-05-07T19:43:00.3018990Z vendor_id : GenuineIntel 2025-05-07T19:43:00.3019064Z cpu family : 6 2025-05-07T19:43:00.3019156Z model : 85 2025-05-07T19:43:00.3019306Z model name : Intel(R) Xeon(R) Platinum 8275CL CPU @ 3.00GHz 2025-05-07T19:43:00.3019386Z stepping : 7 2025-05-07T19:43:00.3019469Z microcode : 0x5003901 2025-05-07T19:43:00.3019572Z cpu MHz : 1201.023 2025-05-07T19:43:00.3019649Z cache size : 36608 KB 2025-05-07T19:43:00.3019727Z physical id : 0 2025-05-07T19:43:00.3019817Z siblings : 48 2025-05-07T19:43:00.3019891Z core id : 14 2025-05-07T19:43:00.3019968Z cpu cores : 24 2025-05-07T19:43:00.3020040Z apicid : 29 2025-05-07T19:43:00.3020139Z initial apicid : 29 2025-05-07T19:43:00.3020210Z fpu : yes 2025-05-07T19:43:00.3020332Z fpu_exception : yes 2025-05-07T19:43:00.3020422Z cpuid level : 13 2025-05-07T19:43:00.3020495Z wp : yes 2025-05-07T19:43:00.3022571Z flags : fpu vme de pse tsc msr pae mce cx8 apic sep mtrr pge mca cmov pat pse36 clflush mmx fxsr sse sse2 ss ht syscall nx pdpe1gb rdtscp lm constant_tsc arch_perfmon rep_good nopl xtopology nonstop_tsc cpuid aperfmperf tsc_known_freq pni pclmulqdq monitor ssse3 fma cx16 pcid sse4_1 sse4_2 x2apic movbe popcnt tsc_deadline_timer aes xsave avx f16c rdrand hypervisor lahf_lm abm 3dnowprefetch invpcid_single pti fsgsbase tsc_adjust bmi1 avx2 smep bmi2 erms invpcid mpx avx512f avx512dq rdseed adx smap clflushopt clwb avx512cd avx512bw avx512vl xsaveopt xsavec xgetbv1 xsaves ida arat pku ospke avx512_vnni 2025-05-07T19:43:00.3022958Z bugs : cpu_meltdown spectre_v1 spectre_v2 spec_store_bypass l1tf mds swapgs itlb_multihit mmio_stale_data retbleed gds bhi 2025-05-07T19:43:00.3023034Z bogomips : 5999.99 2025-05-07T19:43:00.3023113Z clflush size : 64 2025-05-07T19:43:00.3023224Z cache_alignment : 64 2025-05-07T19:43:00.3023349Z address sizes : 46 bits physical, 48 bits virtual 2025-05-07T19:43:00.3023425Z power management: 2025-05-07T19:43:00.3023429Z 2025-05-07T19:43:00.3023507Z processor : 63 2025-05-07T19:43:00.3023606Z vendor_id : GenuineIntel 2025-05-07T19:43:00.3023684Z cpu family : 6 2025-05-07T19:43:00.3023755Z model : 85 2025-05-07T19:43:00.3023930Z model name : Intel(R) Xeon(R) Platinum 8275CL CPU @ 3.00GHz 2025-05-07T19:43:00.3024005Z stepping : 7 2025-05-07T19:43:00.3024083Z microcode : 0x5003901 2025-05-07T19:43:00.3024163Z cpu MHz : 1200.415 2025-05-07T19:43:00.3024254Z cache size : 36608 KB 2025-05-07T19:43:00.3024333Z physical id : 0 2025-05-07T19:43:00.3024412Z siblings : 48 2025-05-07T19:43:00.3024504Z core id : 15 2025-05-07T19:43:00.3024577Z cpu cores : 24 2025-05-07T19:43:00.3024650Z apicid : 31 2025-05-07T19:43:00.3024733Z initial apicid : 31 2025-05-07T19:43:00.3024825Z fpu : yes 2025-05-07T19:43:00.3024907Z fpu_exception : yes 2025-05-07T19:43:00.3024989Z cpuid level : 13 2025-05-07T19:43:00.3025075Z wp : yes 2025-05-07T19:43:00.3027083Z flags : fpu vme de pse tsc msr pae mce cx8 apic sep mtrr pge mca cmov pat pse36 clflush mmx fxsr sse sse2 ss ht syscall nx pdpe1gb rdtscp lm constant_tsc arch_perfmon rep_good nopl xtopology nonstop_tsc cpuid aperfmperf tsc_known_freq pni pclmulqdq monitor ssse3 fma cx16 pcid sse4_1 sse4_2 x2apic movbe popcnt tsc_deadline_timer aes xsave avx f16c rdrand hypervisor lahf_lm abm 3dnowprefetch invpcid_single pti fsgsbase tsc_adjust bmi1 avx2 smep bmi2 erms invpcid mpx avx512f avx512dq rdseed adx smap clflushopt clwb avx512cd avx512bw avx512vl xsaveopt xsavec xgetbv1 xsaves ida arat pku ospke avx512_vnni 2025-05-07T19:43:00.3027455Z bugs : cpu_meltdown spectre_v1 spectre_v2 spec_store_bypass l1tf mds swapgs itlb_multihit mmio_stale_data retbleed gds bhi 2025-05-07T19:43:00.3027537Z bogomips : 5999.99 2025-05-07T19:43:00.3027614Z clflush size : 64 2025-05-07T19:43:00.3027692Z cache_alignment : 64 2025-05-07T19:43:00.3027824Z address sizes : 46 bits physical, 48 bits virtual 2025-05-07T19:43:00.3027901Z power management: 2025-05-07T19:43:00.3027905Z 2025-05-07T19:43:00.3027979Z processor : 64 2025-05-07T19:43:00.3028060Z vendor_id : GenuineIntel 2025-05-07T19:43:00.3028146Z cpu family : 6 2025-05-07T19:43:00.3028218Z model : 85 2025-05-07T19:43:00.3028374Z model name : Intel(R) Xeon(R) Platinum 8275CL CPU @ 3.00GHz 2025-05-07T19:43:00.3028476Z stepping : 7 2025-05-07T19:43:00.3028555Z microcode : 0x5003901 2025-05-07T19:43:00.3028629Z cpu MHz : 2999.996 2025-05-07T19:43:00.3028714Z cache size : 36608 KB 2025-05-07T19:43:00.3028800Z physical id : 0 2025-05-07T19:43:00.3028876Z siblings : 48 2025-05-07T19:43:00.3028947Z core id : 16 2025-05-07T19:43:00.3029028Z cpu cores : 24 2025-05-07T19:43:00.3029098Z apicid : 33 2025-05-07T19:43:00.3029179Z initial apicid : 33 2025-05-07T19:43:00.3029250Z fpu : yes 2025-05-07T19:43:00.3029353Z fpu_exception : yes 2025-05-07T19:43:00.3029429Z cpuid level : 13 2025-05-07T19:43:00.3029551Z wp : yes 2025-05-07T19:43:00.3031721Z flags : fpu vme de pse tsc msr pae mce cx8 apic sep mtrr pge mca cmov pat pse36 clflush mmx fxsr sse sse2 ss ht syscall nx pdpe1gb rdtscp lm constant_tsc arch_perfmon rep_good nopl xtopology nonstop_tsc cpuid aperfmperf tsc_known_freq pni pclmulqdq monitor ssse3 fma cx16 pcid sse4_1 sse4_2 x2apic movbe popcnt tsc_deadline_timer aes xsave avx f16c rdrand hypervisor lahf_lm abm 3dnowprefetch invpcid_single pti fsgsbase tsc_adjust bmi1 avx2 smep bmi2 erms invpcid mpx avx512f avx512dq rdseed adx smap clflushopt clwb avx512cd avx512bw avx512vl xsaveopt xsavec xgetbv1 xsaves ida arat pku ospke avx512_vnni 2025-05-07T19:43:00.3032287Z bugs : cpu_meltdown spectre_v1 spectre_v2 spec_store_bypass l1tf mds swapgs itlb_multihit mmio_stale_data retbleed gds bhi 2025-05-07T19:43:00.3032376Z bogomips : 5999.99 2025-05-07T19:43:00.3032483Z clflush size : 64 2025-05-07T19:43:00.3032574Z cache_alignment : 64 2025-05-07T19:43:00.3032713Z address sizes : 46 bits physical, 48 bits virtual 2025-05-07T19:43:00.3032818Z power management: 2025-05-07T19:43:00.3032822Z 2025-05-07T19:43:00.3032905Z processor : 65 2025-05-07T19:43:00.3032997Z vendor_id : GenuineIntel 2025-05-07T19:43:00.3033086Z cpu family : 6 2025-05-07T19:43:00.3033185Z model : 85 2025-05-07T19:43:00.3033353Z model name : Intel(R) Xeon(R) Platinum 8275CL CPU @ 3.00GHz 2025-05-07T19:43:00.3033440Z stepping : 7 2025-05-07T19:43:00.3033552Z microcode : 0x5003901 2025-05-07T19:43:00.3033636Z cpu MHz : 2236.208 2025-05-07T19:43:00.3033726Z cache size : 36608 KB 2025-05-07T19:43:00.3033811Z physical id : 0 2025-05-07T19:43:00.3033906Z siblings : 48 2025-05-07T19:43:00.3033986Z core id : 17 2025-05-07T19:43:00.3034075Z cpu cores : 24 2025-05-07T19:43:00.3034176Z apicid : 35 2025-05-07T19:43:00.3034269Z initial apicid : 35 2025-05-07T19:43:00.3034349Z fpu : yes 2025-05-07T19:43:00.3034440Z fpu_exception : yes 2025-05-07T19:43:00.3034541Z cpuid level : 13 2025-05-07T19:43:00.3034621Z wp : yes 2025-05-07T19:43:00.3036817Z flags : fpu vme de pse tsc msr pae mce cx8 apic sep mtrr pge mca cmov pat pse36 clflush mmx fxsr sse sse2 ss ht syscall nx pdpe1gb rdtscp lm constant_tsc arch_perfmon rep_good nopl xtopology nonstop_tsc cpuid aperfmperf tsc_known_freq pni pclmulqdq monitor ssse3 fma cx16 pcid sse4_1 sse4_2 x2apic movbe popcnt tsc_deadline_timer aes xsave avx f16c rdrand hypervisor lahf_lm abm 3dnowprefetch invpcid_single pti fsgsbase tsc_adjust bmi1 avx2 smep bmi2 erms invpcid mpx avx512f avx512dq rdseed adx smap clflushopt clwb avx512cd avx512bw avx512vl xsaveopt xsavec xgetbv1 xsaves ida arat pku ospke avx512_vnni 2025-05-07T19:43:00.3037225Z bugs : cpu_meltdown spectre_v1 spectre_v2 spec_store_bypass l1tf mds swapgs itlb_multihit mmio_stale_data retbleed gds bhi 2025-05-07T19:43:00.3037306Z bogomips : 5999.99 2025-05-07T19:43:00.3037386Z clflush size : 64 2025-05-07T19:43:00.3037486Z cache_alignment : 64 2025-05-07T19:43:00.3037617Z address sizes : 46 bits physical, 48 bits virtual 2025-05-07T19:43:00.3037700Z power management: 2025-05-07T19:43:00.3037708Z 2025-05-07T19:43:00.3037813Z processor : 66 2025-05-07T19:43:00.3037899Z vendor_id : GenuineIntel 2025-05-07T19:43:00.3037976Z cpu family : 6 2025-05-07T19:43:00.3038051Z model : 85 2025-05-07T19:43:00.3038231Z model name : Intel(R) Xeon(R) Platinum 8275CL CPU @ 3.00GHz 2025-05-07T19:43:00.3038308Z stepping : 7 2025-05-07T19:43:00.3038390Z microcode : 0x5003901 2025-05-07T19:43:00.3038499Z cpu MHz : 2999.996 2025-05-07T19:43:00.3038583Z cache size : 36608 KB 2025-05-07T19:43:00.3038671Z physical id : 0 2025-05-07T19:43:00.3038752Z siblings : 48 2025-05-07T19:43:00.3038857Z core id : 18 2025-05-07T19:43:00.3038934Z cpu cores : 24 2025-05-07T19:43:00.3039009Z apicid : 37 2025-05-07T19:43:00.3039094Z initial apicid : 37 2025-05-07T19:43:00.3039182Z fpu : yes 2025-05-07T19:43:00.3039265Z fpu_exception : yes 2025-05-07T19:43:00.3039344Z cpuid level : 13 2025-05-07T19:43:00.3039433Z wp : yes 2025-05-07T19:43:00.3041671Z flags : fpu vme de pse tsc msr pae mce cx8 apic sep mtrr pge mca cmov pat pse36 clflush mmx fxsr sse sse2 ss ht syscall nx pdpe1gb rdtscp lm constant_tsc arch_perfmon rep_good nopl xtopology nonstop_tsc cpuid aperfmperf tsc_known_freq pni pclmulqdq monitor ssse3 fma cx16 pcid sse4_1 sse4_2 x2apic movbe popcnt tsc_deadline_timer aes xsave avx f16c rdrand hypervisor lahf_lm abm 3dnowprefetch invpcid_single pti fsgsbase tsc_adjust bmi1 avx2 smep bmi2 erms invpcid mpx avx512f avx512dq rdseed adx smap clflushopt clwb avx512cd avx512bw avx512vl xsaveopt xsavec xgetbv1 xsaves ida arat pku ospke avx512_vnni 2025-05-07T19:43:00.3042120Z bugs : cpu_meltdown spectre_v1 spectre_v2 spec_store_bypass l1tf mds swapgs itlb_multihit mmio_stale_data retbleed gds bhi 2025-05-07T19:43:00.3042213Z bogomips : 5999.99 2025-05-07T19:43:00.3042296Z clflush size : 64 2025-05-07T19:43:00.3042388Z cache_alignment : 64 2025-05-07T19:43:00.3042544Z address sizes : 46 bits physical, 48 bits virtual 2025-05-07T19:43:00.3042627Z power management: 2025-05-07T19:43:00.3042631Z 2025-05-07T19:43:00.3042715Z processor : 67 2025-05-07T19:43:00.3042826Z vendor_id : GenuineIntel 2025-05-07T19:43:00.3042905Z cpu family : 6 2025-05-07T19:43:00.3042988Z model : 85 2025-05-07T19:43:00.3043161Z model name : Intel(R) Xeon(R) Platinum 8275CL CPU @ 3.00GHz 2025-05-07T19:43:00.3043269Z stepping : 7 2025-05-07T19:43:00.3043356Z microcode : 0x5003901 2025-05-07T19:43:00.3043439Z cpu MHz : 1199.897 2025-05-07T19:43:00.3043549Z cache size : 36608 KB 2025-05-07T19:43:00.3043631Z physical id : 0 2025-05-07T19:43:00.3043712Z siblings : 48 2025-05-07T19:43:00.3043797Z core id : 19 2025-05-07T19:43:00.3043899Z cpu cores : 24 2025-05-07T19:43:00.3044086Z apicid : 39 2025-05-07T19:43:00.3044172Z initial apicid : 39 2025-05-07T19:43:00.3044244Z fpu : yes 2025-05-07T19:43:00.3044346Z fpu_exception : yes 2025-05-07T19:43:00.3044425Z cpuid level : 13 2025-05-07T19:43:00.3044503Z wp : yes 2025-05-07T19:43:00.3046542Z flags : fpu vme de pse tsc msr pae mce cx8 apic sep mtrr pge mca cmov pat pse36 clflush mmx fxsr sse sse2 ss ht syscall nx pdpe1gb rdtscp lm constant_tsc arch_perfmon rep_good nopl xtopology nonstop_tsc cpuid aperfmperf tsc_known_freq pni pclmulqdq monitor ssse3 fma cx16 pcid sse4_1 sse4_2 x2apic movbe popcnt tsc_deadline_timer aes xsave avx f16c rdrand hypervisor lahf_lm abm 3dnowprefetch invpcid_single pti fsgsbase tsc_adjust bmi1 avx2 smep bmi2 erms invpcid mpx avx512f avx512dq rdseed adx smap clflushopt clwb avx512cd avx512bw avx512vl xsaveopt xsavec xgetbv1 xsaves ida arat pku ospke avx512_vnni 2025-05-07T19:43:00.3046911Z bugs : cpu_meltdown spectre_v1 spectre_v2 spec_store_bypass l1tf mds swapgs itlb_multihit mmio_stale_data retbleed gds bhi 2025-05-07T19:43:00.3046992Z bogomips : 5999.99 2025-05-07T19:43:00.3047089Z clflush size : 64 2025-05-07T19:43:00.3047175Z cache_alignment : 64 2025-05-07T19:43:00.3047302Z address sizes : 46 bits physical, 48 bits virtual 2025-05-07T19:43:00.3047398Z power management: 2025-05-07T19:43:00.3047402Z 2025-05-07T19:43:00.3047479Z processor : 68 2025-05-07T19:43:00.3047567Z vendor_id : GenuineIntel 2025-05-07T19:43:00.3047645Z cpu family : 6 2025-05-07T19:43:00.3047731Z model : 85 2025-05-07T19:43:00.3047881Z model name : Intel(R) Xeon(R) Platinum 8275CL CPU @ 3.00GHz 2025-05-07T19:43:00.3047962Z stepping : 7 2025-05-07T19:43:00.3048066Z microcode : 0x5003901 2025-05-07T19:43:00.3048143Z cpu MHz : 1200.393 2025-05-07T19:43:00.3048220Z cache size : 36608 KB 2025-05-07T19:43:00.3048295Z physical id : 0 2025-05-07T19:43:00.3048386Z siblings : 48 2025-05-07T19:43:00.3048458Z core id : 20 2025-05-07T19:43:00.3048540Z cpu cores : 24 2025-05-07T19:43:00.3048623Z apicid : 41 2025-05-07T19:43:00.3048703Z initial apicid : 41 2025-05-07T19:43:00.3048776Z fpu : yes 2025-05-07T19:43:00.3048857Z fpu_exception : yes 2025-05-07T19:43:00.3048947Z cpuid level : 13 2025-05-07T19:43:00.3049020Z wp : yes 2025-05-07T19:43:00.3051078Z flags : fpu vme de pse tsc msr pae mce cx8 apic sep mtrr pge mca cmov pat pse36 clflush mmx fxsr sse sse2 ss ht syscall nx pdpe1gb rdtscp lm constant_tsc arch_perfmon rep_good nopl xtopology nonstop_tsc cpuid aperfmperf tsc_known_freq pni pclmulqdq monitor ssse3 fma cx16 pcid sse4_1 sse4_2 x2apic movbe popcnt tsc_deadline_timer aes xsave avx f16c rdrand hypervisor lahf_lm abm 3dnowprefetch invpcid_single pti fsgsbase tsc_adjust bmi1 avx2 smep bmi2 erms invpcid mpx avx512f avx512dq rdseed adx smap clflushopt clwb avx512cd avx512bw avx512vl xsaveopt xsavec xgetbv1 xsaves ida arat pku ospke avx512_vnni 2025-05-07T19:43:00.3051506Z bugs : cpu_meltdown spectre_v1 spectre_v2 spec_store_bypass l1tf mds swapgs itlb_multihit mmio_stale_data retbleed gds bhi 2025-05-07T19:43:00.3051587Z bogomips : 5999.99 2025-05-07T19:43:00.3051666Z clflush size : 64 2025-05-07T19:43:00.3051763Z cache_alignment : 64 2025-05-07T19:43:00.3051886Z address sizes : 46 bits physical, 48 bits virtual 2025-05-07T19:43:00.3051972Z power management: 2025-05-07T19:43:00.3051976Z 2025-05-07T19:43:00.3052069Z processor : 69 2025-05-07T19:43:00.3052157Z vendor_id : GenuineIntel 2025-05-07T19:43:00.3052242Z cpu family : 6 2025-05-07T19:43:00.3052319Z model : 85 2025-05-07T19:43:00.3052486Z model name : Intel(R) Xeon(R) Platinum 8275CL CPU @ 3.00GHz 2025-05-07T19:43:00.3052561Z stepping : 7 2025-05-07T19:43:00.3052646Z microcode : 0x5003901 2025-05-07T19:43:00.3052742Z cpu MHz : 2999.996 2025-05-07T19:43:00.3052819Z cache size : 36608 KB 2025-05-07T19:43:00.3052898Z physical id : 0 2025-05-07T19:43:00.3052978Z siblings : 48 2025-05-07T19:43:00.3053072Z core id : 21 2025-05-07T19:43:00.3053147Z cpu cores : 24 2025-05-07T19:43:00.3053217Z apicid : 43 2025-05-07T19:43:00.3053310Z initial apicid : 43 2025-05-07T19:43:00.3053380Z fpu : yes 2025-05-07T19:43:00.3053458Z fpu_exception : yes 2025-05-07T19:43:00.3053534Z cpuid level : 13 2025-05-07T19:43:00.3053610Z wp : yes 2025-05-07T19:43:00.3055625Z flags : fpu vme de pse tsc msr pae mce cx8 apic sep mtrr pge mca cmov pat pse36 clflush mmx fxsr sse sse2 ss ht syscall nx pdpe1gb rdtscp lm constant_tsc arch_perfmon rep_good nopl xtopology nonstop_tsc cpuid aperfmperf tsc_known_freq pni pclmulqdq monitor ssse3 fma cx16 pcid sse4_1 sse4_2 x2apic movbe popcnt tsc_deadline_timer aes xsave avx f16c rdrand hypervisor lahf_lm abm 3dnowprefetch invpcid_single pti fsgsbase tsc_adjust bmi1 avx2 smep bmi2 erms invpcid mpx avx512f avx512dq rdseed adx smap clflushopt clwb avx512cd avx512bw avx512vl xsaveopt xsavec xgetbv1 xsaves ida arat pku ospke avx512_vnni 2025-05-07T19:43:00.3055999Z bugs : cpu_meltdown spectre_v1 spectre_v2 spec_store_bypass l1tf mds swapgs itlb_multihit mmio_stale_data retbleed gds bhi 2025-05-07T19:43:00.3056074Z bogomips : 5999.99 2025-05-07T19:43:00.3056150Z clflush size : 64 2025-05-07T19:43:00.3056226Z cache_alignment : 64 2025-05-07T19:43:00.3056356Z address sizes : 46 bits physical, 48 bits virtual 2025-05-07T19:43:00.3056430Z power management: 2025-05-07T19:43:00.3056434Z 2025-05-07T19:43:00.3056505Z processor : 70 2025-05-07T19:43:00.3056594Z vendor_id : GenuineIntel 2025-05-07T19:43:00.3056666Z cpu family : 6 2025-05-07T19:43:00.3056739Z model : 85 2025-05-07T19:43:00.3056892Z model name : Intel(R) Xeon(R) Platinum 8275CL CPU @ 3.00GHz 2025-05-07T19:43:00.3056973Z stepping : 7 2025-05-07T19:43:00.3057049Z microcode : 0x5003901 2025-05-07T19:43:00.3057120Z cpu MHz : 2999.996 2025-05-07T19:43:00.3057205Z cache size : 36608 KB 2025-05-07T19:43:00.3057278Z physical id : 0 2025-05-07T19:43:00.3057347Z siblings : 48 2025-05-07T19:43:00.3057415Z core id : 22 2025-05-07T19:43:00.3057493Z cpu cores : 24 2025-05-07T19:43:00.3057562Z apicid : 45 2025-05-07T19:43:00.3057637Z initial apicid : 45 2025-05-07T19:43:00.3057713Z fpu : yes 2025-05-07T19:43:00.3057790Z fpu_exception : yes 2025-05-07T19:43:00.3057862Z cpuid level : 13 2025-05-07T19:43:00.3057931Z wp : yes 2025-05-07T19:43:00.3059992Z flags : fpu vme de pse tsc msr pae mce cx8 apic sep mtrr pge mca cmov pat pse36 clflush mmx fxsr sse sse2 ss ht syscall nx pdpe1gb rdtscp lm constant_tsc arch_perfmon rep_good nopl xtopology nonstop_tsc cpuid aperfmperf tsc_known_freq pni pclmulqdq monitor ssse3 fma cx16 pcid sse4_1 sse4_2 x2apic movbe popcnt tsc_deadline_timer aes xsave avx f16c rdrand hypervisor lahf_lm abm 3dnowprefetch invpcid_single pti fsgsbase tsc_adjust bmi1 avx2 smep bmi2 erms invpcid mpx avx512f avx512dq rdseed adx smap clflushopt clwb avx512cd avx512bw avx512vl xsaveopt xsavec xgetbv1 xsaves ida arat pku ospke avx512_vnni 2025-05-07T19:43:00.3060436Z bugs : cpu_meltdown spectre_v1 spectre_v2 spec_store_bypass l1tf mds swapgs itlb_multihit mmio_stale_data retbleed gds bhi 2025-05-07T19:43:00.3060533Z bogomips : 5999.99 2025-05-07T19:43:00.3060626Z clflush size : 64 2025-05-07T19:43:00.3060726Z cache_alignment : 64 2025-05-07T19:43:00.3060862Z address sizes : 46 bits physical, 48 bits virtual 2025-05-07T19:43:00.3060981Z power management: 2025-05-07T19:43:00.3060984Z 2025-05-07T19:43:00.3061077Z processor : 71 2025-05-07T19:43:00.3061177Z vendor_id : GenuineIntel 2025-05-07T19:43:00.3061295Z cpu family : 6 2025-05-07T19:43:00.3061387Z model : 85 2025-05-07T19:43:00.3061554Z model name : Intel(R) Xeon(R) Platinum 8275CL CPU @ 3.00GHz 2025-05-07T19:43:00.3061688Z stepping : 7 2025-05-07T19:43:00.3061815Z microcode : 0x5003901 2025-05-07T19:43:00.3061906Z cpu MHz : 1200.020 2025-05-07T19:43:00.3062003Z cache size : 36608 KB 2025-05-07T19:43:00.3062126Z physical id : 0 2025-05-07T19:43:00.3062218Z siblings : 48 2025-05-07T19:43:00.3062307Z core id : 23 2025-05-07T19:43:00.3062401Z cpu cores : 24 2025-05-07T19:43:00.3062519Z apicid : 47 2025-05-07T19:43:00.3062617Z initial apicid : 47 2025-05-07T19:43:00.3062709Z fpu : yes 2025-05-07T19:43:00.3062833Z fpu_exception : yes 2025-05-07T19:43:00.3062926Z cpuid level : 13 2025-05-07T19:43:00.3063019Z wp : yes 2025-05-07T19:43:00.3065369Z flags : fpu vme de pse tsc msr pae mce cx8 apic sep mtrr pge mca cmov pat pse36 clflush mmx fxsr sse sse2 ss ht syscall nx pdpe1gb rdtscp lm constant_tsc arch_perfmon rep_good nopl xtopology nonstop_tsc cpuid aperfmperf tsc_known_freq pni pclmulqdq monitor ssse3 fma cx16 pcid sse4_1 sse4_2 x2apic movbe popcnt tsc_deadline_timer aes xsave avx f16c rdrand hypervisor lahf_lm abm 3dnowprefetch invpcid_single pti fsgsbase tsc_adjust bmi1 avx2 smep bmi2 erms invpcid mpx avx512f avx512dq rdseed adx smap clflushopt clwb avx512cd avx512bw avx512vl xsaveopt xsavec xgetbv1 xsaves ida arat pku ospke avx512_vnni 2025-05-07T19:43:00.3065915Z bugs : cpu_meltdown spectre_v1 spectre_v2 spec_store_bypass l1tf mds swapgs itlb_multihit mmio_stale_data retbleed gds bhi 2025-05-07T19:43:00.3066017Z bogomips : 5999.99 2025-05-07T19:43:00.3066146Z clflush size : 64 2025-05-07T19:43:00.3066292Z cache_alignment : 64 2025-05-07T19:43:00.3066444Z address sizes : 46 bits physical, 48 bits virtual 2025-05-07T19:43:00.3066548Z power management: 2025-05-07T19:43:00.3066553Z 2025-05-07T19:43:00.3066682Z processor : 72 2025-05-07T19:43:00.3066790Z vendor_id : GenuineIntel 2025-05-07T19:43:00.3066888Z cpu family : 6 2025-05-07T19:43:00.3067007Z model : 85 2025-05-07T19:43:00.3067182Z model name : Intel(R) Xeon(R) Platinum 8275CL CPU @ 3.00GHz 2025-05-07T19:43:00.3067282Z stepping : 7 2025-05-07T19:43:00.3067382Z microcode : 0x5003901 2025-05-07T19:43:00.3067503Z cpu MHz : 2999.996 2025-05-07T19:43:00.3067602Z cache size : 36608 KB 2025-05-07T19:43:00.3067699Z physical id : 1 2025-05-07T19:43:00.3067821Z siblings : 48 2025-05-07T19:43:00.3067915Z core id : 0 2025-05-07T19:43:00.3068011Z cpu cores : 24 2025-05-07T19:43:00.3068103Z apicid : 65 2025-05-07T19:43:00.3068229Z initial apicid : 65 2025-05-07T19:43:00.3068321Z fpu : yes 2025-05-07T19:43:00.3068424Z fpu_exception : yes 2025-05-07T19:43:00.3068519Z cpuid level : 13 2025-05-07T19:43:00.3068635Z wp : yes 2025-05-07T19:43:00.3070820Z flags : fpu vme de pse tsc msr pae mce cx8 apic sep mtrr pge mca cmov pat pse36 clflush mmx fxsr sse sse2 ss ht syscall nx pdpe1gb rdtscp lm constant_tsc arch_perfmon rep_good nopl xtopology nonstop_tsc cpuid aperfmperf tsc_known_freq pni pclmulqdq monitor ssse3 fma cx16 pcid sse4_1 sse4_2 x2apic movbe popcnt tsc_deadline_timer aes xsave avx f16c rdrand hypervisor lahf_lm abm 3dnowprefetch invpcid_single pti fsgsbase tsc_adjust bmi1 avx2 smep bmi2 erms invpcid mpx avx512f avx512dq rdseed adx smap clflushopt clwb avx512cd avx512bw avx512vl xsaveopt xsavec xgetbv1 xsaves ida arat pku ospke avx512_vnni 2025-05-07T19:43:00.3072260Z bugs : cpu_meltdown spectre_v1 spectre_v2 spec_store_bypass l1tf mds swapgs itlb_multihit mmio_stale_data retbleed gds bhi 2025-05-07T19:43:00.3072371Z bogomips : 5999.99 2025-05-07T19:43:00.3072469Z clflush size : 64 2025-05-07T19:43:00.3072572Z cache_alignment : 64 2025-05-07T19:43:00.3072748Z address sizes : 46 bits physical, 48 bits virtual 2025-05-07T19:43:00.3072854Z power management: 2025-05-07T19:43:00.3072859Z 2025-05-07T19:43:00.3072957Z processor : 73 2025-05-07T19:43:00.3073089Z vendor_id : GenuineIntel 2025-05-07T19:43:00.3073186Z cpu family : 6 2025-05-07T19:43:00.3073280Z model : 85 2025-05-07T19:43:00.3073480Z model name : Intel(R) Xeon(R) Platinum 8275CL CPU @ 3.00GHz 2025-05-07T19:43:00.3073574Z stepping : 7 2025-05-07T19:43:00.3073678Z microcode : 0x5003901 2025-05-07T19:43:00.3073774Z cpu MHz : 2999.996 2025-05-07T19:43:00.3073898Z cache size : 36608 KB 2025-05-07T19:43:00.3073999Z physical id : 1 2025-05-07T19:43:00.3074094Z siblings : 48 2025-05-07T19:43:00.3074192Z core id : 1 2025-05-07T19:43:00.3074313Z cpu cores : 24 2025-05-07T19:43:00.3074410Z apicid : 67 2025-05-07T19:43:00.3074509Z initial apicid : 67 2025-05-07T19:43:00.3074624Z fpu : yes 2025-05-07T19:43:00.3074726Z fpu_exception : yes 2025-05-07T19:43:00.3074822Z cpuid level : 13 2025-05-07T19:43:00.3074913Z wp : yes 2025-05-07T19:43:00.3077144Z flags : fpu vme de pse tsc msr pae mce cx8 apic sep mtrr pge mca cmov pat pse36 clflush mmx fxsr sse sse2 ss ht syscall nx pdpe1gb rdtscp lm constant_tsc arch_perfmon rep_good nopl xtopology nonstop_tsc cpuid aperfmperf tsc_known_freq pni pclmulqdq monitor ssse3 fma cx16 pcid sse4_1 sse4_2 x2apic movbe popcnt tsc_deadline_timer aes xsave avx f16c rdrand hypervisor lahf_lm abm 3dnowprefetch invpcid_single pti fsgsbase tsc_adjust bmi1 avx2 smep bmi2 erms invpcid mpx avx512f avx512dq rdseed adx smap clflushopt clwb avx512cd avx512bw avx512vl xsaveopt xsavec xgetbv1 xsaves ida arat pku ospke avx512_vnni 2025-05-07T19:43:00.3077558Z bugs : cpu_meltdown spectre_v1 spectre_v2 spec_store_bypass l1tf mds swapgs itlb_multihit mmio_stale_data retbleed gds bhi 2025-05-07T19:43:00.3077680Z bogomips : 5999.99 2025-05-07T19:43:00.3077778Z clflush size : 64 2025-05-07T19:43:00.3077879Z cache_alignment : 64 2025-05-07T19:43:00.3078024Z address sizes : 46 bits physical, 48 bits virtual 2025-05-07T19:43:00.3078151Z power management: 2025-05-07T19:43:00.3078155Z 2025-05-07T19:43:00.3078251Z processor : 74 2025-05-07T19:43:00.3078355Z vendor_id : GenuineIntel 2025-05-07T19:43:00.3078475Z cpu family : 6 2025-05-07T19:43:00.3078568Z model : 85 2025-05-07T19:43:00.3078744Z model name : Intel(R) Xeon(R) Platinum 8275CL CPU @ 3.00GHz 2025-05-07T19:43:00.3078869Z stepping : 7 2025-05-07T19:43:00.3078970Z microcode : 0x5003901 2025-05-07T19:43:00.3079074Z cpu MHz : 2999.996 2025-05-07T19:43:00.3079174Z cache size : 36608 KB 2025-05-07T19:43:00.3079303Z physical id : 1 2025-05-07T19:43:00.3079399Z siblings : 48 2025-05-07T19:43:00.3079498Z core id : 2 2025-05-07T19:43:00.3079597Z cpu cores : 24 2025-05-07T19:43:00.3079718Z apicid : 69 2025-05-07T19:43:00.3079823Z initial apicid : 69 2025-05-07T19:43:00.3079917Z fpu : yes 2025-05-07T19:43:00.3080049Z fpu_exception : yes 2025-05-07T19:43:00.3080150Z cpuid level : 13 2025-05-07T19:43:00.3080247Z wp : yes 2025-05-07T19:43:00.3082462Z flags : fpu vme de pse tsc msr pae mce cx8 apic sep mtrr pge mca cmov pat pse36 clflush mmx fxsr sse sse2 ss ht syscall nx pdpe1gb rdtscp lm constant_tsc arch_perfmon rep_good nopl xtopology nonstop_tsc cpuid aperfmperf tsc_known_freq pni pclmulqdq monitor ssse3 fma cx16 pcid sse4_1 sse4_2 x2apic movbe popcnt tsc_deadline_timer aes xsave avx f16c rdrand hypervisor lahf_lm abm 3dnowprefetch invpcid_single pti fsgsbase tsc_adjust bmi1 avx2 smep bmi2 erms invpcid mpx avx512f avx512dq rdseed adx smap clflushopt clwb avx512cd avx512bw avx512vl xsaveopt xsavec xgetbv1 xsaves ida arat pku ospke avx512_vnni 2025-05-07T19:43:00.3082980Z bugs : cpu_meltdown spectre_v1 spectre_v2 spec_store_bypass l1tf mds swapgs itlb_multihit mmio_stale_data retbleed gds bhi 2025-05-07T19:43:00.3083083Z bogomips : 5999.99 2025-05-07T19:43:00.3083207Z clflush size : 64 2025-05-07T19:43:00.3083312Z cache_alignment : 64 2025-05-07T19:43:00.3083563Z address sizes : 46 bits physical, 48 bits virtual 2025-05-07T19:43:00.3083657Z power management: 2025-05-07T19:43:00.3083661Z 2025-05-07T19:43:00.3083777Z processor : 75 2025-05-07T19:43:00.3083870Z vendor_id : GenuineIntel 2025-05-07T19:43:00.3083961Z cpu family : 6 2025-05-07T19:43:00.3084078Z model : 85 2025-05-07T19:43:00.3084240Z model name : Intel(R) Xeon(R) Platinum 8275CL CPU @ 3.00GHz 2025-05-07T19:43:00.3084329Z stepping : 7 2025-05-07T19:43:00.3084418Z microcode : 0x5003901 2025-05-07T19:43:00.3084504Z cpu MHz : 3221.876 2025-05-07T19:43:00.3084585Z cache size : 36608 KB 2025-05-07T19:43:00.3084667Z physical id : 1 2025-05-07T19:43:00.3084756Z siblings : 48 2025-05-07T19:43:00.3084832Z core id : 3 2025-05-07T19:43:00.3084910Z cpu cores : 24 2025-05-07T19:43:00.3084981Z apicid : 71 2025-05-07T19:43:00.3085078Z initial apicid : 71 2025-05-07T19:43:00.3085153Z fpu : yes 2025-05-07T19:43:00.3085235Z fpu_exception : yes 2025-05-07T19:43:00.3085322Z cpuid level : 13 2025-05-07T19:43:00.3085391Z wp : yes 2025-05-07T19:43:00.3087404Z flags : fpu vme de pse tsc msr pae mce cx8 apic sep mtrr pge mca cmov pat pse36 clflush mmx fxsr sse sse2 ss ht syscall nx pdpe1gb rdtscp lm constant_tsc arch_perfmon rep_good nopl xtopology nonstop_tsc cpuid aperfmperf tsc_known_freq pni pclmulqdq monitor ssse3 fma cx16 pcid sse4_1 sse4_2 x2apic movbe popcnt tsc_deadline_timer aes xsave avx f16c rdrand hypervisor lahf_lm abm 3dnowprefetch invpcid_single pti fsgsbase tsc_adjust bmi1 avx2 smep bmi2 erms invpcid mpx avx512f avx512dq rdseed adx smap clflushopt clwb avx512cd avx512bw avx512vl xsaveopt xsavec xgetbv1 xsaves ida arat pku ospke avx512_vnni 2025-05-07T19:43:00.3087787Z bugs : cpu_meltdown spectre_v1 spectre_v2 spec_store_bypass l1tf mds swapgs itlb_multihit mmio_stale_data retbleed gds bhi 2025-05-07T19:43:00.3087870Z bogomips : 5999.99 2025-05-07T19:43:00.3087949Z clflush size : 64 2025-05-07T19:43:00.3088044Z cache_alignment : 64 2025-05-07T19:43:00.3088165Z address sizes : 46 bits physical, 48 bits virtual 2025-05-07T19:43:00.3088244Z power management: 2025-05-07T19:43:00.3088247Z 2025-05-07T19:43:00.3088321Z processor : 76 2025-05-07T19:43:00.3088421Z vendor_id : GenuineIntel 2025-05-07T19:43:00.3088494Z cpu family : 6 2025-05-07T19:43:00.3088566Z model : 85 2025-05-07T19:43:00.3088730Z model name : Intel(R) Xeon(R) Platinum 8275CL CPU @ 3.00GHz 2025-05-07T19:43:00.3088810Z stepping : 7 2025-05-07T19:43:00.3088884Z microcode : 0x5003901 2025-05-07T19:43:00.3088959Z cpu MHz : 3158.135 2025-05-07T19:43:00.3089048Z cache size : 36608 KB 2025-05-07T19:43:00.3089128Z physical id : 1 2025-05-07T19:43:00.3089202Z siblings : 48 2025-05-07T19:43:00.3089285Z core id : 4 2025-05-07T19:43:00.3089361Z cpu cores : 24 2025-05-07T19:43:00.3089436Z apicid : 73 2025-05-07T19:43:00.3089514Z initial apicid : 73 2025-05-07T19:43:00.3089597Z fpu : yes 2025-05-07T19:43:00.3089682Z fpu_exception : yes 2025-05-07T19:43:00.3089758Z cpuid level : 13 2025-05-07T19:43:00.3089838Z wp : yes 2025-05-07T19:43:00.3091838Z flags : fpu vme de pse tsc msr pae mce cx8 apic sep mtrr pge mca cmov pat pse36 clflush mmx fxsr sse sse2 ss ht syscall nx pdpe1gb rdtscp lm constant_tsc arch_perfmon rep_good nopl xtopology nonstop_tsc cpuid aperfmperf tsc_known_freq pni pclmulqdq monitor ssse3 fma cx16 pcid sse4_1 sse4_2 x2apic movbe popcnt tsc_deadline_timer aes xsave avx f16c rdrand hypervisor lahf_lm abm 3dnowprefetch invpcid_single pti fsgsbase tsc_adjust bmi1 avx2 smep bmi2 erms invpcid mpx avx512f avx512dq rdseed adx smap clflushopt clwb avx512cd avx512bw avx512vl xsaveopt xsavec xgetbv1 xsaves ida arat pku ospke avx512_vnni 2025-05-07T19:43:00.3092260Z bugs : cpu_meltdown spectre_v1 spectre_v2 spec_store_bypass l1tf mds swapgs itlb_multihit mmio_stale_data retbleed gds bhi 2025-05-07T19:43:00.3092346Z bogomips : 5999.99 2025-05-07T19:43:00.3092467Z clflush size : 64 2025-05-07T19:43:00.3092552Z cache_alignment : 64 2025-05-07T19:43:00.3092683Z address sizes : 46 bits physical, 48 bits virtual 2025-05-07T19:43:00.3092766Z power management: 2025-05-07T19:43:00.3092770Z 2025-05-07T19:43:00.3092845Z processor : 77 2025-05-07T19:43:00.3092930Z vendor_id : GenuineIntel 2025-05-07T19:43:00.3093013Z cpu family : 6 2025-05-07T19:43:00.3093082Z model : 85 2025-05-07T19:43:00.3093227Z model name : Intel(R) Xeon(R) Platinum 8275CL CPU @ 3.00GHz 2025-05-07T19:43:00.3093318Z stepping : 7 2025-05-07T19:43:00.3093394Z microcode : 0x5003901 2025-05-07T19:43:00.3093471Z cpu MHz : 3238.835 2025-05-07T19:43:00.3093544Z cache size : 36608 KB 2025-05-07T19:43:00.3093632Z physical id : 1 2025-05-07T19:43:00.3093704Z siblings : 48 2025-05-07T19:43:00.3093774Z core id : 5 2025-05-07T19:43:00.3093857Z cpu cores : 24 2025-05-07T19:43:00.3093927Z apicid : 75 2025-05-07T19:43:00.3094009Z initial apicid : 75 2025-05-07T19:43:00.3094081Z fpu : yes 2025-05-07T19:43:00.3094174Z fpu_exception : yes 2025-05-07T19:43:00.3094251Z cpuid level : 13 2025-05-07T19:43:00.3094323Z wp : yes 2025-05-07T19:43:00.3096344Z flags : fpu vme de pse tsc msr pae mce cx8 apic sep mtrr pge mca cmov pat pse36 clflush mmx fxsr sse sse2 ss ht syscall nx pdpe1gb rdtscp lm constant_tsc arch_perfmon rep_good nopl xtopology nonstop_tsc cpuid aperfmperf tsc_known_freq pni pclmulqdq monitor ssse3 fma cx16 pcid sse4_1 sse4_2 x2apic movbe popcnt tsc_deadline_timer aes xsave avx f16c rdrand hypervisor lahf_lm abm 3dnowprefetch invpcid_single pti fsgsbase tsc_adjust bmi1 avx2 smep bmi2 erms invpcid mpx avx512f avx512dq rdseed adx smap clflushopt clwb avx512cd avx512bw avx512vl xsaveopt xsavec xgetbv1 xsaves ida arat pku ospke avx512_vnni 2025-05-07T19:43:00.3096710Z bugs : cpu_meltdown spectre_v1 spectre_v2 spec_store_bypass l1tf mds swapgs itlb_multihit mmio_stale_data retbleed gds bhi 2025-05-07T19:43:00.3096789Z bogomips : 5999.99 2025-05-07T19:43:00.3096881Z clflush size : 64 2025-05-07T19:43:00.3096963Z cache_alignment : 64 2025-05-07T19:43:00.3097083Z address sizes : 46 bits physical, 48 bits virtual 2025-05-07T19:43:00.3097171Z power management: 2025-05-07T19:43:00.3097175Z 2025-05-07T19:43:00.3097251Z processor : 78 2025-05-07T19:43:00.3097340Z vendor_id : GenuineIntel 2025-05-07T19:43:00.3097410Z cpu family : 6 2025-05-07T19:43:00.3097488Z model : 85 2025-05-07T19:43:00.3097636Z model name : Intel(R) Xeon(R) Platinum 8275CL CPU @ 3.00GHz 2025-05-07T19:43:00.3097709Z stepping : 7 2025-05-07T19:43:00.3097800Z microcode : 0x5003901 2025-05-07T19:43:00.3097873Z cpu MHz : 3386.825 2025-05-07T19:43:00.3097949Z cache size : 36608 KB 2025-05-07T19:43:00.3098034Z physical id : 1 2025-05-07T19:43:00.3098122Z siblings : 48 2025-05-07T19:43:00.3098196Z core id : 6 2025-05-07T19:43:00.3098272Z cpu cores : 24 2025-05-07T19:43:00.3098366Z apicid : 77 2025-05-07T19:43:00.3098449Z initial apicid : 77 2025-05-07T19:43:00.3098519Z fpu : yes 2025-05-07T19:43:00.3098600Z fpu_exception : yes 2025-05-07T19:43:00.3098690Z cpuid level : 13 2025-05-07T19:43:00.3098765Z wp : yes 2025-05-07T19:43:00.3100766Z flags : fpu vme de pse tsc msr pae mce cx8 apic sep mtrr pge mca cmov pat pse36 clflush mmx fxsr sse sse2 ss ht syscall nx pdpe1gb rdtscp lm constant_tsc arch_perfmon rep_good nopl xtopology nonstop_tsc cpuid aperfmperf tsc_known_freq pni pclmulqdq monitor ssse3 fma cx16 pcid sse4_1 sse4_2 x2apic movbe popcnt tsc_deadline_timer aes xsave avx f16c rdrand hypervisor lahf_lm abm 3dnowprefetch invpcid_single pti fsgsbase tsc_adjust bmi1 avx2 smep bmi2 erms invpcid mpx avx512f avx512dq rdseed adx smap clflushopt clwb avx512cd avx512bw avx512vl xsaveopt xsavec xgetbv1 xsaves ida arat pku ospke avx512_vnni 2025-05-07T19:43:00.3101137Z bugs : cpu_meltdown spectre_v1 spectre_v2 spec_store_bypass l1tf mds swapgs itlb_multihit mmio_stale_data retbleed gds bhi 2025-05-07T19:43:00.3101267Z bogomips : 5999.99 2025-05-07T19:43:00.3101344Z clflush size : 64 2025-05-07T19:43:00.3101436Z cache_alignment : 64 2025-05-07T19:43:00.3101607Z address sizes : 46 bits physical, 48 bits virtual 2025-05-07T19:43:00.3101699Z power management: 2025-05-07T19:43:00.3101702Z 2025-05-07T19:43:00.3101791Z processor : 79 2025-05-07T19:43:00.3101875Z vendor_id : GenuineIntel 2025-05-07T19:43:00.3101950Z cpu family : 6 2025-05-07T19:43:00.3102028Z model : 85 2025-05-07T19:43:00.3102190Z model name : Intel(R) Xeon(R) Platinum 8275CL CPU @ 3.00GHz 2025-05-07T19:43:00.3102264Z stepping : 7 2025-05-07T19:43:00.3102344Z microcode : 0x5003901 2025-05-07T19:43:00.3102430Z cpu MHz : 2999.996 2025-05-07T19:43:00.3102509Z cache size : 36608 KB 2025-05-07T19:43:00.3102584Z physical id : 1 2025-05-07T19:43:00.3102655Z siblings : 48 2025-05-07T19:43:00.3102737Z core id : 7 2025-05-07T19:43:00.3102815Z cpu cores : 24 2025-05-07T19:43:00.3102889Z apicid : 79 2025-05-07T19:43:00.3102966Z initial apicid : 79 2025-05-07T19:43:00.3103043Z fpu : yes 2025-05-07T19:43:00.3103118Z fpu_exception : yes 2025-05-07T19:43:00.3103193Z cpuid level : 13 2025-05-07T19:43:00.3103280Z wp : yes 2025-05-07T19:43:00.3105286Z flags : fpu vme de pse tsc msr pae mce cx8 apic sep mtrr pge mca cmov pat pse36 clflush mmx fxsr sse sse2 ss ht syscall nx pdpe1gb rdtscp lm constant_tsc arch_perfmon rep_good nopl xtopology nonstop_tsc cpuid aperfmperf tsc_known_freq pni pclmulqdq monitor ssse3 fma cx16 pcid sse4_1 sse4_2 x2apic movbe popcnt tsc_deadline_timer aes xsave avx f16c rdrand hypervisor lahf_lm abm 3dnowprefetch invpcid_single pti fsgsbase tsc_adjust bmi1 avx2 smep bmi2 erms invpcid mpx avx512f avx512dq rdseed adx smap clflushopt clwb avx512cd avx512bw avx512vl xsaveopt xsavec xgetbv1 xsaves ida arat pku ospke avx512_vnni 2025-05-07T19:43:00.3105645Z bugs : cpu_meltdown spectre_v1 spectre_v2 spec_store_bypass l1tf mds swapgs itlb_multihit mmio_stale_data retbleed gds bhi 2025-05-07T19:43:00.3105733Z bogomips : 5999.99 2025-05-07T19:43:00.3105815Z clflush size : 64 2025-05-07T19:43:00.3105895Z cache_alignment : 64 2025-05-07T19:43:00.3106021Z address sizes : 46 bits physical, 48 bits virtual 2025-05-07T19:43:00.3106101Z power management: 2025-05-07T19:43:00.3106106Z 2025-05-07T19:43:00.3106179Z processor : 80 2025-05-07T19:43:00.3106273Z vendor_id : GenuineIntel 2025-05-07T19:43:00.3106348Z cpu family : 6 2025-05-07T19:43:00.3106418Z model : 85 2025-05-07T19:43:00.3106568Z model name : Intel(R) Xeon(R) Platinum 8275CL CPU @ 3.00GHz 2025-05-07T19:43:00.3106655Z stepping : 7 2025-05-07T19:43:00.3106729Z microcode : 0x5003901 2025-05-07T19:43:00.3106809Z cpu MHz : 3330.313 2025-05-07T19:43:00.3106889Z cache size : 36608 KB 2025-05-07T19:43:00.3106970Z physical id : 1 2025-05-07T19:43:00.3107044Z siblings : 48 2025-05-07T19:43:00.3107114Z core id : 8 2025-05-07T19:43:00.3107202Z cpu cores : 24 2025-05-07T19:43:00.3107273Z apicid : 81 2025-05-07T19:43:00.3107354Z initial apicid : 81 2025-05-07T19:43:00.3107427Z fpu : yes 2025-05-07T19:43:00.3107520Z fpu_exception : yes 2025-05-07T19:43:00.3107598Z cpuid level : 13 2025-05-07T19:43:00.3107668Z wp : yes 2025-05-07T19:43:00.3109685Z flags : fpu vme de pse tsc msr pae mce cx8 apic sep mtrr pge mca cmov pat pse36 clflush mmx fxsr sse sse2 ss ht syscall nx pdpe1gb rdtscp lm constant_tsc arch_perfmon rep_good nopl xtopology nonstop_tsc cpuid aperfmperf tsc_known_freq pni pclmulqdq monitor ssse3 fma cx16 pcid sse4_1 sse4_2 x2apic movbe popcnt tsc_deadline_timer aes xsave avx f16c rdrand hypervisor lahf_lm abm 3dnowprefetch invpcid_single pti fsgsbase tsc_adjust bmi1 avx2 smep bmi2 erms invpcid mpx avx512f avx512dq rdseed adx smap clflushopt clwb avx512cd avx512bw avx512vl xsaveopt xsavec xgetbv1 xsaves ida arat pku ospke avx512_vnni 2025-05-07T19:43:00.3110048Z bugs : cpu_meltdown spectre_v1 spectre_v2 spec_store_bypass l1tf mds swapgs itlb_multihit mmio_stale_data retbleed gds bhi 2025-05-07T19:43:00.3110188Z bogomips : 5999.99 2025-05-07T19:43:00.3110275Z clflush size : 64 2025-05-07T19:43:00.3110351Z cache_alignment : 64 2025-05-07T19:43:00.3110469Z address sizes : 46 bits physical, 48 bits virtual 2025-05-07T19:43:00.3110555Z power management: 2025-05-07T19:43:00.3110607Z 2025-05-07T19:43:00.3110681Z processor : 81 2025-05-07T19:43:00.3110761Z vendor_id : GenuineIntel 2025-05-07T19:43:00.3110842Z cpu family : 6 2025-05-07T19:43:00.3110923Z model : 85 2025-05-07T19:43:00.3111070Z model name : Intel(R) Xeon(R) Platinum 8275CL CPU @ 3.00GHz 2025-05-07T19:43:00.3111149Z stepping : 7 2025-05-07T19:43:00.3111240Z microcode : 0x5003901 2025-05-07T19:43:00.3111312Z cpu MHz : 3356.629 2025-05-07T19:43:00.3111463Z cache size : 36608 KB 2025-05-07T19:43:00.3111539Z physical id : 1 2025-05-07T19:43:00.3111620Z siblings : 48 2025-05-07T19:43:00.3111688Z core id : 9 2025-05-07T19:43:00.3111927Z cpu cores : 24 2025-05-07T19:43:00.3112014Z apicid : 83 2025-05-07T19:43:00.3112095Z initial apicid : 83 2025-05-07T19:43:00.3112174Z fpu : yes 2025-05-07T19:43:00.3112260Z fpu_exception : yes 2025-05-07T19:43:00.3112352Z cpuid level : 13 2025-05-07T19:43:00.3112428Z wp : yes 2025-05-07T19:43:00.3114606Z flags : fpu vme de pse tsc msr pae mce cx8 apic sep mtrr pge mca cmov pat pse36 clflush mmx fxsr sse sse2 ss ht syscall nx pdpe1gb rdtscp lm constant_tsc arch_perfmon rep_good nopl xtopology nonstop_tsc cpuid aperfmperf tsc_known_freq pni pclmulqdq monitor ssse3 fma cx16 pcid sse4_1 sse4_2 x2apic movbe popcnt tsc_deadline_timer aes xsave avx f16c rdrand hypervisor lahf_lm abm 3dnowprefetch invpcid_single pti fsgsbase tsc_adjust bmi1 avx2 smep bmi2 erms invpcid mpx avx512f avx512dq rdseed adx smap clflushopt clwb avx512cd avx512bw avx512vl xsaveopt xsavec xgetbv1 xsaves ida arat pku ospke avx512_vnni 2025-05-07T19:43:00.3115012Z bugs : cpu_meltdown spectre_v1 spectre_v2 spec_store_bypass l1tf mds swapgs itlb_multihit mmio_stale_data retbleed gds bhi 2025-05-07T19:43:00.3115095Z bogomips : 5999.99 2025-05-07T19:43:00.3115185Z clflush size : 64 2025-05-07T19:43:00.3115285Z cache_alignment : 64 2025-05-07T19:43:00.3115415Z address sizes : 46 bits physical, 48 bits virtual 2025-05-07T19:43:00.3115498Z power management: 2025-05-07T19:43:00.3115503Z 2025-05-07T19:43:00.3115599Z processor : 82 2025-05-07T19:43:00.3115689Z vendor_id : GenuineIntel 2025-05-07T19:43:00.3115767Z cpu family : 6 2025-05-07T19:43:00.3115843Z model : 85 2025-05-07T19:43:00.3116008Z model name : Intel(R) Xeon(R) Platinum 8275CL CPU @ 3.00GHz 2025-05-07T19:43:00.3116093Z stepping : 7 2025-05-07T19:43:00.3116175Z microcode : 0x5003901 2025-05-07T19:43:00.3116266Z cpu MHz : 3194.102 2025-05-07T19:43:00.3116354Z cache size : 36608 KB 2025-05-07T19:43:00.3116438Z physical id : 1 2025-05-07T19:43:00.3116516Z siblings : 48 2025-05-07T19:43:00.3116604Z core id : 10 2025-05-07T19:43:00.3116689Z cpu cores : 24 2025-05-07T19:43:00.3116769Z apicid : 85 2025-05-07T19:43:00.3116859Z initial apicid : 85 2025-05-07T19:43:00.3116938Z fpu : yes 2025-05-07T19:43:00.3117032Z fpu_exception : yes 2025-05-07T19:43:00.3117109Z cpuid level : 13 2025-05-07T19:43:00.3117192Z wp : yes 2025-05-07T19:43:00.3119369Z flags : fpu vme de pse tsc msr pae mce cx8 apic sep mtrr pge mca cmov pat pse36 clflush mmx fxsr sse sse2 ss ht syscall nx pdpe1gb rdtscp lm constant_tsc arch_perfmon rep_good nopl xtopology nonstop_tsc cpuid aperfmperf tsc_known_freq pni pclmulqdq monitor ssse3 fma cx16 pcid sse4_1 sse4_2 x2apic movbe popcnt tsc_deadline_timer aes xsave avx f16c rdrand hypervisor lahf_lm abm 3dnowprefetch invpcid_single pti fsgsbase tsc_adjust bmi1 avx2 smep bmi2 erms invpcid mpx avx512f avx512dq rdseed adx smap clflushopt clwb avx512cd avx512bw avx512vl xsaveopt xsavec xgetbv1 xsaves ida arat pku ospke avx512_vnni 2025-05-07T19:43:00.3119769Z bugs : cpu_meltdown spectre_v1 spectre_v2 spec_store_bypass l1tf mds swapgs itlb_multihit mmio_stale_data retbleed gds bhi 2025-05-07T19:43:00.3119854Z bogomips : 5999.99 2025-05-07T19:43:00.3119938Z clflush size : 64 2025-05-07T19:43:00.3120081Z cache_alignment : 64 2025-05-07T19:43:00.3120224Z address sizes : 46 bits physical, 48 bits virtual 2025-05-07T19:43:00.3120308Z power management: 2025-05-07T19:43:00.3120313Z 2025-05-07T19:43:00.3120394Z processor : 83 2025-05-07T19:43:00.3120534Z vendor_id : GenuineIntel 2025-05-07T19:43:00.3120613Z cpu family : 6 2025-05-07T19:43:00.3120690Z model : 85 2025-05-07T19:43:00.3120848Z model name : Intel(R) Xeon(R) Platinum 8275CL CPU @ 3.00GHz 2025-05-07T19:43:00.3120933Z stepping : 7 2025-05-07T19:43:00.3121017Z microcode : 0x5003901 2025-05-07T19:43:00.3121094Z cpu MHz : 3197.882 2025-05-07T19:43:00.3121180Z cache size : 36608 KB 2025-05-07T19:43:00.3121259Z physical id : 1 2025-05-07T19:43:00.3121337Z siblings : 48 2025-05-07T19:43:00.3121412Z core id : 11 2025-05-07T19:43:00.3121495Z cpu cores : 24 2025-05-07T19:43:00.3121570Z apicid : 87 2025-05-07T19:43:00.3121653Z initial apicid : 87 2025-05-07T19:43:00.3121733Z fpu : yes 2025-05-07T19:43:00.3121815Z fpu_exception : yes 2025-05-07T19:43:00.3121900Z cpuid level : 13 2025-05-07T19:43:00.3121972Z wp : yes 2025-05-07T19:43:00.3124242Z flags : fpu vme de pse tsc msr pae mce cx8 apic sep mtrr pge mca cmov pat pse36 clflush mmx fxsr sse sse2 ss ht syscall nx pdpe1gb rdtscp lm constant_tsc arch_perfmon rep_good nopl xtopology nonstop_tsc cpuid aperfmperf tsc_known_freq pni pclmulqdq monitor ssse3 fma cx16 pcid sse4_1 sse4_2 x2apic movbe popcnt tsc_deadline_timer aes xsave avx f16c rdrand hypervisor lahf_lm abm 3dnowprefetch invpcid_single pti fsgsbase tsc_adjust bmi1 avx2 smep bmi2 erms invpcid mpx avx512f avx512dq rdseed adx smap clflushopt clwb avx512cd avx512bw avx512vl xsaveopt xsavec xgetbv1 xsaves ida arat pku ospke avx512_vnni 2025-05-07T19:43:00.3124598Z bugs : cpu_meltdown spectre_v1 spectre_v2 spec_store_bypass l1tf mds swapgs itlb_multihit mmio_stale_data retbleed gds bhi 2025-05-07T19:43:00.3124680Z bogomips : 5999.99 2025-05-07T19:43:00.3124754Z clflush size : 64 2025-05-07T19:43:00.3124832Z cache_alignment : 64 2025-05-07T19:43:00.3124949Z address sizes : 46 bits physical, 48 bits virtual 2025-05-07T19:43:00.3125034Z power management: 2025-05-07T19:43:00.3125038Z 2025-05-07T19:43:00.3125111Z processor : 84 2025-05-07T19:43:00.3125193Z vendor_id : GenuineIntel 2025-05-07T19:43:00.3125280Z cpu family : 6 2025-05-07T19:43:00.3125348Z model : 85 2025-05-07T19:43:00.3125491Z model name : Intel(R) Xeon(R) Platinum 8275CL CPU @ 3.00GHz 2025-05-07T19:43:00.3125562Z stepping : 7 2025-05-07T19:43:00.3125650Z microcode : 0x5003901 2025-05-07T19:43:00.3125721Z cpu MHz : 3768.566 2025-05-07T19:43:00.3125804Z cache size : 36608 KB 2025-05-07T19:43:00.3125892Z physical id : 1 2025-05-07T19:43:00.3125964Z siblings : 48 2025-05-07T19:43:00.3126034Z core id : 12 2025-05-07T19:43:00.3126108Z cpu cores : 24 2025-05-07T19:43:00.3126188Z apicid : 89 2025-05-07T19:43:00.3126263Z initial apicid : 89 2025-05-07T19:43:00.3126331Z fpu : yes 2025-05-07T19:43:00.3126410Z fpu_exception : yes 2025-05-07T19:43:00.3126496Z cpuid level : 13 2025-05-07T19:43:00.3126573Z wp : yes 2025-05-07T19:43:00.3128575Z flags : fpu vme de pse tsc msr pae mce cx8 apic sep mtrr pge mca cmov pat pse36 clflush mmx fxsr sse sse2 ss ht syscall nx pdpe1gb rdtscp lm constant_tsc arch_perfmon rep_good nopl xtopology nonstop_tsc cpuid aperfmperf tsc_known_freq pni pclmulqdq monitor ssse3 fma cx16 pcid sse4_1 sse4_2 x2apic movbe popcnt tsc_deadline_timer aes xsave avx f16c rdrand hypervisor lahf_lm abm 3dnowprefetch invpcid_single pti fsgsbase tsc_adjust bmi1 avx2 smep bmi2 erms invpcid mpx avx512f avx512dq rdseed adx smap clflushopt clwb avx512cd avx512bw avx512vl xsaveopt xsavec xgetbv1 xsaves ida arat pku ospke avx512_vnni 2025-05-07T19:43:00.3128948Z bugs : cpu_meltdown spectre_v1 spectre_v2 spec_store_bypass l1tf mds swapgs itlb_multihit mmio_stale_data retbleed gds bhi 2025-05-07T19:43:00.3129025Z bogomips : 5999.99 2025-05-07T19:43:00.3129105Z clflush size : 64 2025-05-07T19:43:00.3129180Z cache_alignment : 64 2025-05-07T19:43:00.3129299Z address sizes : 46 bits physical, 48 bits virtual 2025-05-07T19:43:00.3129427Z power management: 2025-05-07T19:43:00.3129431Z 2025-05-07T19:43:00.3129506Z processor : 85 2025-05-07T19:43:00.3129584Z vendor_id : GenuineIntel 2025-05-07T19:43:00.3129655Z cpu family : 6 2025-05-07T19:43:00.3129727Z model : 85 2025-05-07T19:43:00.3129919Z model name : Intel(R) Xeon(R) Platinum 8275CL CPU @ 3.00GHz 2025-05-07T19:43:00.3129991Z stepping : 7 2025-05-07T19:43:00.3130067Z microcode : 0x5003901 2025-05-07T19:43:00.3130142Z cpu MHz : 2999.996 2025-05-07T19:43:00.3130216Z cache size : 36608 KB 2025-05-07T19:43:00.3130288Z physical id : 1 2025-05-07T19:43:00.3130365Z siblings : 48 2025-05-07T19:43:00.3130434Z core id : 13 2025-05-07T19:43:00.3130504Z cpu cores : 24 2025-05-07T19:43:00.3130572Z apicid : 91 2025-05-07T19:43:00.3130655Z initial apicid : 91 2025-05-07T19:43:00.3130725Z fpu : yes 2025-05-07T19:43:00.3130801Z fpu_exception : yes 2025-05-07T19:43:00.3130871Z cpuid level : 13 2025-05-07T19:43:00.3130943Z wp : yes 2025-05-07T19:43:00.3132937Z flags : fpu vme de pse tsc msr pae mce cx8 apic sep mtrr pge mca cmov pat pse36 clflush mmx fxsr sse sse2 ss ht syscall nx pdpe1gb rdtscp lm constant_tsc arch_perfmon rep_good nopl xtopology nonstop_tsc cpuid aperfmperf tsc_known_freq pni pclmulqdq monitor ssse3 fma cx16 pcid sse4_1 sse4_2 x2apic movbe popcnt tsc_deadline_timer aes xsave avx f16c rdrand hypervisor lahf_lm abm 3dnowprefetch invpcid_single pti fsgsbase tsc_adjust bmi1 avx2 smep bmi2 erms invpcid mpx avx512f avx512dq rdseed adx smap clflushopt clwb avx512cd avx512bw avx512vl xsaveopt xsavec xgetbv1 xsaves ida arat pku ospke avx512_vnni 2025-05-07T19:43:00.3133303Z bugs : cpu_meltdown spectre_v1 spectre_v2 spec_store_bypass l1tf mds swapgs itlb_multihit mmio_stale_data retbleed gds bhi 2025-05-07T19:43:00.3133377Z bogomips : 5999.99 2025-05-07T19:43:00.3133450Z clflush size : 64 2025-05-07T19:43:00.3133527Z cache_alignment : 64 2025-05-07T19:43:00.3133650Z address sizes : 46 bits physical, 48 bits virtual 2025-05-07T19:43:00.3133726Z power management: 2025-05-07T19:43:00.3133733Z 2025-05-07T19:43:00.3133804Z processor : 86 2025-05-07T19:43:00.3133891Z vendor_id : GenuineIntel 2025-05-07T19:43:00.3133962Z cpu family : 6 2025-05-07T19:43:00.3134031Z model : 85 2025-05-07T19:43:00.3134187Z model name : Intel(R) Xeon(R) Platinum 8275CL CPU @ 3.00GHz 2025-05-07T19:43:00.3134259Z stepping : 7 2025-05-07T19:43:00.3134334Z microcode : 0x5003901 2025-05-07T19:43:00.3134406Z cpu MHz : 2999.996 2025-05-07T19:43:00.3134487Z cache size : 36608 KB 2025-05-07T19:43:00.3134560Z physical id : 1 2025-05-07T19:43:00.3134633Z siblings : 48 2025-05-07T19:43:00.3134709Z core id : 14 2025-05-07T19:43:00.3134796Z cpu cores : 24 2025-05-07T19:43:00.3134868Z apicid : 93 2025-05-07T19:43:00.3134946Z initial apicid : 93 2025-05-07T19:43:00.3135035Z fpu : yes 2025-05-07T19:43:00.3135113Z fpu_exception : yes 2025-05-07T19:43:00.3135186Z cpuid level : 13 2025-05-07T19:43:00.3135255Z wp : yes 2025-05-07T19:43:00.3137283Z flags : fpu vme de pse tsc msr pae mce cx8 apic sep mtrr pge mca cmov pat pse36 clflush mmx fxsr sse sse2 ss ht syscall nx pdpe1gb rdtscp lm constant_tsc arch_perfmon rep_good nopl xtopology nonstop_tsc cpuid aperfmperf tsc_known_freq pni pclmulqdq monitor ssse3 fma cx16 pcid sse4_1 sse4_2 x2apic movbe popcnt tsc_deadline_timer aes xsave avx f16c rdrand hypervisor lahf_lm abm 3dnowprefetch invpcid_single pti fsgsbase tsc_adjust bmi1 avx2 smep bmi2 erms invpcid mpx avx512f avx512dq rdseed adx smap clflushopt clwb avx512cd avx512bw avx512vl xsaveopt xsavec xgetbv1 xsaves ida arat pku ospke avx512_vnni 2025-05-07T19:43:00.3137649Z bugs : cpu_meltdown spectre_v1 spectre_v2 spec_store_bypass l1tf mds swapgs itlb_multihit mmio_stale_data retbleed gds bhi 2025-05-07T19:43:00.3137744Z bogomips : 5999.99 2025-05-07T19:43:00.3137819Z clflush size : 64 2025-05-07T19:43:00.3137897Z cache_alignment : 64 2025-05-07T19:43:00.3138016Z address sizes : 46 bits physical, 48 bits virtual 2025-05-07T19:43:00.3138102Z power management: 2025-05-07T19:43:00.3138106Z 2025-05-07T19:43:00.3138181Z processor : 87 2025-05-07T19:43:00.3138312Z vendor_id : GenuineIntel 2025-05-07T19:43:00.3138395Z cpu family : 6 2025-05-07T19:43:00.3138466Z model : 85 2025-05-07T19:43:00.3138611Z model name : Intel(R) Xeon(R) Platinum 8275CL CPU @ 3.00GHz 2025-05-07T19:43:00.3138742Z stepping : 7 2025-05-07T19:43:00.3138829Z microcode : 0x5003901 2025-05-07T19:43:00.3138903Z cpu MHz : 2999.996 2025-05-07T19:43:00.3138976Z cache size : 36608 KB 2025-05-07T19:43:00.3139060Z physical id : 1 2025-05-07T19:43:00.3139133Z siblings : 48 2025-05-07T19:43:00.3139209Z core id : 15 2025-05-07T19:43:00.3139285Z cpu cores : 24 2025-05-07T19:43:00.3139366Z apicid : 95 2025-05-07T19:43:00.3139443Z initial apicid : 95 2025-05-07T19:43:00.3139519Z fpu : yes 2025-05-07T19:43:00.3139613Z fpu_exception : yes 2025-05-07T19:43:00.3139689Z cpuid level : 13 2025-05-07T19:43:00.3139760Z wp : yes 2025-05-07T19:43:00.3141814Z flags : fpu vme de pse tsc msr pae mce cx8 apic sep mtrr pge mca cmov pat pse36 clflush mmx fxsr sse sse2 ss ht syscall nx pdpe1gb rdtscp lm constant_tsc arch_perfmon rep_good nopl xtopology nonstop_tsc cpuid aperfmperf tsc_known_freq pni pclmulqdq monitor ssse3 fma cx16 pcid sse4_1 sse4_2 x2apic movbe popcnt tsc_deadline_timer aes xsave avx f16c rdrand hypervisor lahf_lm abm 3dnowprefetch invpcid_single pti fsgsbase tsc_adjust bmi1 avx2 smep bmi2 erms invpcid mpx avx512f avx512dq rdseed adx smap clflushopt clwb avx512cd avx512bw avx512vl xsaveopt xsavec xgetbv1 xsaves ida arat pku ospke avx512_vnni 2025-05-07T19:43:00.3142178Z bugs : cpu_meltdown spectre_v1 spectre_v2 spec_store_bypass l1tf mds swapgs itlb_multihit mmio_stale_data retbleed gds bhi 2025-05-07T19:43:00.3142257Z bogomips : 5999.99 2025-05-07T19:43:00.3142350Z clflush size : 64 2025-05-07T19:43:00.3142428Z cache_alignment : 64 2025-05-07T19:43:00.3142547Z address sizes : 46 bits physical, 48 bits virtual 2025-05-07T19:43:00.3142620Z power management: 2025-05-07T19:43:00.3142624Z 2025-05-07T19:43:00.3142713Z processor : 88 2025-05-07T19:43:00.3142792Z vendor_id : GenuineIntel 2025-05-07T19:43:00.3142875Z cpu family : 6 2025-05-07T19:43:00.3142956Z model : 85 2025-05-07T19:43:00.3143102Z model name : Intel(R) Xeon(R) Platinum 8275CL CPU @ 3.00GHz 2025-05-07T19:43:00.3143172Z stepping : 7 2025-05-07T19:43:00.3143252Z microcode : 0x5003901 2025-05-07T19:43:00.3143335Z cpu MHz : 2999.996 2025-05-07T19:43:00.3143414Z cache size : 36608 KB 2025-05-07T19:43:00.3143490Z physical id : 1 2025-05-07T19:43:00.3143581Z siblings : 48 2025-05-07T19:43:00.3143652Z core id : 16 2025-05-07T19:43:00.3143722Z cpu cores : 24 2025-05-07T19:43:00.3143796Z apicid : 97 2025-05-07T19:43:00.3143890Z initial apicid : 97 2025-05-07T19:43:00.3143961Z fpu : yes 2025-05-07T19:43:00.3144044Z fpu_exception : yes 2025-05-07T19:43:00.3144126Z cpuid level : 13 2025-05-07T19:43:00.3144198Z wp : yes 2025-05-07T19:43:00.3146209Z flags : fpu vme de pse tsc msr pae mce cx8 apic sep mtrr pge mca cmov pat pse36 clflush mmx fxsr sse sse2 ss ht syscall nx pdpe1gb rdtscp lm constant_tsc arch_perfmon rep_good nopl xtopology nonstop_tsc cpuid aperfmperf tsc_known_freq pni pclmulqdq monitor ssse3 fma cx16 pcid sse4_1 sse4_2 x2apic movbe popcnt tsc_deadline_timer aes xsave avx f16c rdrand hypervisor lahf_lm abm 3dnowprefetch invpcid_single pti fsgsbase tsc_adjust bmi1 avx2 smep bmi2 erms invpcid mpx avx512f avx512dq rdseed adx smap clflushopt clwb avx512cd avx512bw avx512vl xsaveopt xsavec xgetbv1 xsaves ida arat pku ospke avx512_vnni 2025-05-07T19:43:00.3146590Z bugs : cpu_meltdown spectre_v1 spectre_v2 spec_store_bypass l1tf mds swapgs itlb_multihit mmio_stale_data retbleed gds bhi 2025-05-07T19:43:00.3146666Z bogomips : 5999.99 2025-05-07T19:43:00.3146742Z clflush size : 64 2025-05-07T19:43:00.3146841Z cache_alignment : 64 2025-05-07T19:43:00.3146962Z address sizes : 46 bits physical, 48 bits virtual 2025-05-07T19:43:00.3147038Z power management: 2025-05-07T19:43:00.3147042Z 2025-05-07T19:43:00.3147120Z processor : 89 2025-05-07T19:43:00.3147220Z vendor_id : GenuineIntel 2025-05-07T19:43:00.3147296Z cpu family : 6 2025-05-07T19:43:00.3147422Z model : 85 2025-05-07T19:43:00.3147576Z model name : Intel(R) Xeon(R) Platinum 8275CL CPU @ 3.00GHz 2025-05-07T19:43:00.3147651Z stepping : 7 2025-05-07T19:43:00.3147729Z microcode : 0x5003901 2025-05-07T19:43:00.3147850Z cpu MHz : 3412.810 2025-05-07T19:43:00.3147933Z cache size : 36608 KB 2025-05-07T19:43:00.3148010Z physical id : 1 2025-05-07T19:43:00.3148082Z siblings : 48 2025-05-07T19:43:00.3148168Z core id : 17 2025-05-07T19:43:00.3148240Z cpu cores : 24 2025-05-07T19:43:00.3148317Z apicid : 99 2025-05-07T19:43:00.3148396Z initial apicid : 99 2025-05-07T19:43:00.3148475Z fpu : yes 2025-05-07T19:43:00.3148554Z fpu_exception : yes 2025-05-07T19:43:00.3148627Z cpuid level : 13 2025-05-07T19:43:00.3148710Z wp : yes 2025-05-07T19:43:00.3150716Z flags : fpu vme de pse tsc msr pae mce cx8 apic sep mtrr pge mca cmov pat pse36 clflush mmx fxsr sse sse2 ss ht syscall nx pdpe1gb rdtscp lm constant_tsc arch_perfmon rep_good nopl xtopology nonstop_tsc cpuid aperfmperf tsc_known_freq pni pclmulqdq monitor ssse3 fma cx16 pcid sse4_1 sse4_2 x2apic movbe popcnt tsc_deadline_timer aes xsave avx f16c rdrand hypervisor lahf_lm abm 3dnowprefetch invpcid_single pti fsgsbase tsc_adjust bmi1 avx2 smep bmi2 erms invpcid mpx avx512f avx512dq rdseed adx smap clflushopt clwb avx512cd avx512bw avx512vl xsaveopt xsavec xgetbv1 xsaves ida arat pku ospke avx512_vnni 2025-05-07T19:43:00.3151078Z bugs : cpu_meltdown spectre_v1 spectre_v2 spec_store_bypass l1tf mds swapgs itlb_multihit mmio_stale_data retbleed gds bhi 2025-05-07T19:43:00.3151161Z bogomips : 5999.99 2025-05-07T19:43:00.3151237Z clflush size : 64 2025-05-07T19:43:00.3151384Z cache_alignment : 64 2025-05-07T19:43:00.3151533Z address sizes : 46 bits physical, 48 bits virtual 2025-05-07T19:43:00.3151612Z power management: 2025-05-07T19:43:00.3151616Z 2025-05-07T19:43:00.3151694Z processor : 90 2025-05-07T19:43:00.3151947Z vendor_id : GenuineIntel 2025-05-07T19:43:00.3152041Z cpu family : 6 2025-05-07T19:43:00.3152118Z model : 85 2025-05-07T19:43:00.3152281Z model name : Intel(R) Xeon(R) Platinum 8275CL CPU @ 3.00GHz 2025-05-07T19:43:00.3152369Z stepping : 7 2025-05-07T19:43:00.3152451Z microcode : 0x5003901 2025-05-07T19:43:00.3152531Z cpu MHz : 2999.996 2025-05-07T19:43:00.3152622Z cache size : 36608 KB 2025-05-07T19:43:00.3152796Z physical id : 1 2025-05-07T19:43:00.3152877Z siblings : 48 2025-05-07T19:43:00.3152960Z core id : 18 2025-05-07T19:43:00.3153059Z cpu cores : 24 2025-05-07T19:43:00.3153140Z apicid : 101 2025-05-07T19:43:00.3153234Z initial apicid : 101 2025-05-07T19:43:00.3153314Z fpu : yes 2025-05-07T19:43:00.3153417Z fpu_exception : yes 2025-05-07T19:43:00.3153500Z cpuid level : 13 2025-05-07T19:43:00.3153578Z wp : yes 2025-05-07T19:43:00.3155790Z flags : fpu vme de pse tsc msr pae mce cx8 apic sep mtrr pge mca cmov pat pse36 clflush mmx fxsr sse sse2 ss ht syscall nx pdpe1gb rdtscp lm constant_tsc arch_perfmon rep_good nopl xtopology nonstop_tsc cpuid aperfmperf tsc_known_freq pni pclmulqdq monitor ssse3 fma cx16 pcid sse4_1 sse4_2 x2apic movbe popcnt tsc_deadline_timer aes xsave avx f16c rdrand hypervisor lahf_lm abm 3dnowprefetch invpcid_single pti fsgsbase tsc_adjust bmi1 avx2 smep bmi2 erms invpcid mpx avx512f avx512dq rdseed adx smap clflushopt clwb avx512cd avx512bw avx512vl xsaveopt xsavec xgetbv1 xsaves ida arat pku ospke avx512_vnni 2025-05-07T19:43:00.3156185Z bugs : cpu_meltdown spectre_v1 spectre_v2 spec_store_bypass l1tf mds swapgs itlb_multihit mmio_stale_data retbleed gds bhi 2025-05-07T19:43:00.3156273Z bogomips : 5999.99 2025-05-07T19:43:00.3156370Z clflush size : 64 2025-05-07T19:43:00.3156457Z cache_alignment : 64 2025-05-07T19:43:00.3156591Z address sizes : 46 bits physical, 48 bits virtual 2025-05-07T19:43:00.3156692Z power management: 2025-05-07T19:43:00.3156696Z 2025-05-07T19:43:00.3156780Z processor : 91 2025-05-07T19:43:00.3156876Z vendor_id : GenuineIntel 2025-05-07T19:43:00.3156966Z cpu family : 6 2025-05-07T19:43:00.3157053Z model : 85 2025-05-07T19:43:00.3157220Z model name : Intel(R) Xeon(R) Platinum 8275CL CPU @ 3.00GHz 2025-05-07T19:43:00.3157364Z stepping : 7 2025-05-07T19:43:00.3157474Z microcode : 0x5003901 2025-05-07T19:43:00.3157556Z cpu MHz : 3110.686 2025-05-07T19:43:00.3157643Z cache size : 36608 KB 2025-05-07T19:43:00.3157782Z physical id : 1 2025-05-07T19:43:00.3157878Z siblings : 48 2025-05-07T19:43:00.3157958Z core id : 19 2025-05-07T19:43:00.3158042Z cpu cores : 24 2025-05-07T19:43:00.3158145Z apicid : 103 2025-05-07T19:43:00.3158233Z initial apicid : 103 2025-05-07T19:43:00.3158315Z fpu : yes 2025-05-07T19:43:00.3158410Z fpu_exception : yes 2025-05-07T19:43:00.3158515Z cpuid level : 13 2025-05-07T19:43:00.3158593Z wp : yes 2025-05-07T19:43:00.3160753Z flags : fpu vme de pse tsc msr pae mce cx8 apic sep mtrr pge mca cmov pat pse36 clflush mmx fxsr sse sse2 ss ht syscall nx pdpe1gb rdtscp lm constant_tsc arch_perfmon rep_good nopl xtopology nonstop_tsc cpuid aperfmperf tsc_known_freq pni pclmulqdq monitor ssse3 fma cx16 pcid sse4_1 sse4_2 x2apic movbe popcnt tsc_deadline_timer aes xsave avx f16c rdrand hypervisor lahf_lm abm 3dnowprefetch invpcid_single pti fsgsbase tsc_adjust bmi1 avx2 smep bmi2 erms invpcid mpx avx512f avx512dq rdseed adx smap clflushopt clwb avx512cd avx512bw avx512vl xsaveopt xsavec xgetbv1 xsaves ida arat pku ospke avx512_vnni 2025-05-07T19:43:00.3161163Z bugs : cpu_meltdown spectre_v1 spectre_v2 spec_store_bypass l1tf mds swapgs itlb_multihit mmio_stale_data retbleed gds bhi 2025-05-07T19:43:00.3161245Z bogomips : 5999.99 2025-05-07T19:43:00.3161326Z clflush size : 64 2025-05-07T19:43:00.3161431Z cache_alignment : 64 2025-05-07T19:43:00.3161561Z address sizes : 46 bits physical, 48 bits virtual 2025-05-07T19:43:00.3161646Z power management: 2025-05-07T19:43:00.3161650Z 2025-05-07T19:43:00.3161750Z processor : 92 2025-05-07T19:43:00.3161848Z vendor_id : GenuineIntel 2025-05-07T19:43:00.3161926Z cpu family : 6 2025-05-07T19:43:00.3162003Z model : 85 2025-05-07T19:43:00.3162182Z model name : Intel(R) Xeon(R) Platinum 8275CL CPU @ 3.00GHz 2025-05-07T19:43:00.3162267Z stepping : 7 2025-05-07T19:43:00.3162353Z microcode : 0x5003901 2025-05-07T19:43:00.3162448Z cpu MHz : 2999.996 2025-05-07T19:43:00.3162532Z cache size : 36608 KB 2025-05-07T19:43:00.3162612Z physical id : 1 2025-05-07T19:43:00.3162694Z siblings : 48 2025-05-07T19:43:00.3162802Z core id : 20 2025-05-07T19:43:00.3162883Z cpu cores : 24 2025-05-07T19:43:00.3162959Z apicid : 105 2025-05-07T19:43:00.3163044Z initial apicid : 105 2025-05-07T19:43:00.3163134Z fpu : yes 2025-05-07T19:43:00.3163220Z fpu_exception : yes 2025-05-07T19:43:00.3163297Z cpuid level : 13 2025-05-07T19:43:00.3163386Z wp : yes 2025-05-07T19:43:00.3165724Z flags : fpu vme de pse tsc msr pae mce cx8 apic sep mtrr pge mca cmov pat pse36 clflush mmx fxsr sse sse2 ss ht syscall nx pdpe1gb rdtscp lm constant_tsc arch_perfmon rep_good nopl xtopology nonstop_tsc cpuid aperfmperf tsc_known_freq pni pclmulqdq monitor ssse3 fma cx16 pcid sse4_1 sse4_2 x2apic movbe popcnt tsc_deadline_timer aes xsave avx f16c rdrand hypervisor lahf_lm abm 3dnowprefetch invpcid_single pti fsgsbase tsc_adjust bmi1 avx2 smep bmi2 erms invpcid mpx avx512f avx512dq rdseed adx smap clflushopt clwb avx512cd avx512bw avx512vl xsaveopt xsavec xgetbv1 xsaves ida arat pku ospke avx512_vnni 2025-05-07T19:43:00.3166124Z bugs : cpu_meltdown spectre_v1 spectre_v2 spec_store_bypass l1tf mds swapgs itlb_multihit mmio_stale_data retbleed gds bhi 2025-05-07T19:43:00.3166222Z bogomips : 5999.99 2025-05-07T19:43:00.3166305Z clflush size : 64 2025-05-07T19:43:00.3166391Z cache_alignment : 64 2025-05-07T19:43:00.3166533Z address sizes : 46 bits physical, 48 bits virtual 2025-05-07T19:43:00.3166622Z power management: 2025-05-07T19:43:00.3166627Z 2025-05-07T19:43:00.3166705Z processor : 93 2025-05-07T19:43:00.3166796Z vendor_id : GenuineIntel 2025-05-07T19:43:00.3166876Z cpu family : 6 2025-05-07T19:43:00.3166953Z model : 85 2025-05-07T19:43:00.3167109Z model name : Intel(R) Xeon(R) Platinum 8275CL CPU @ 3.00GHz 2025-05-07T19:43:00.3167195Z stepping : 7 2025-05-07T19:43:00.3167368Z microcode : 0x5003901 2025-05-07T19:43:00.3167449Z cpu MHz : 2999.996 2025-05-07T19:43:00.3167537Z cache size : 36608 KB 2025-05-07T19:43:00.3167617Z physical id : 1 2025-05-07T19:43:00.3167693Z siblings : 48 2025-05-07T19:43:00.3167767Z core id : 21 2025-05-07T19:43:00.3167916Z cpu cores : 24 2025-05-07T19:43:00.3167991Z apicid : 107 2025-05-07T19:43:00.3168072Z initial apicid : 107 2025-05-07T19:43:00.3168150Z fpu : yes 2025-05-07T19:43:00.3168245Z fpu_exception : yes 2025-05-07T19:43:00.3168328Z cpuid level : 13 2025-05-07T19:43:00.3168412Z wp : yes 2025-05-07T19:43:00.3170599Z flags : fpu vme de pse tsc msr pae mce cx8 apic sep mtrr pge mca cmov pat pse36 clflush mmx fxsr sse sse2 ss ht syscall nx pdpe1gb rdtscp lm constant_tsc arch_perfmon rep_good nopl xtopology nonstop_tsc cpuid aperfmperf tsc_known_freq pni pclmulqdq monitor ssse3 fma cx16 pcid sse4_1 sse4_2 x2apic movbe popcnt tsc_deadline_timer aes xsave avx f16c rdrand hypervisor lahf_lm abm 3dnowprefetch invpcid_single pti fsgsbase tsc_adjust bmi1 avx2 smep bmi2 erms invpcid mpx avx512f avx512dq rdseed adx smap clflushopt clwb avx512cd avx512bw avx512vl xsaveopt xsavec xgetbv1 xsaves ida arat pku ospke avx512_vnni 2025-05-07T19:43:00.3171002Z bugs : cpu_meltdown spectre_v1 spectre_v2 spec_store_bypass l1tf mds swapgs itlb_multihit mmio_stale_data retbleed gds bhi 2025-05-07T19:43:00.3171093Z bogomips : 5999.99 2025-05-07T19:43:00.3171196Z clflush size : 64 2025-05-07T19:43:00.3171278Z cache_alignment : 64 2025-05-07T19:43:00.3171411Z address sizes : 46 bits physical, 48 bits virtual 2025-05-07T19:43:00.3171513Z power management: 2025-05-07T19:43:00.3171517Z 2025-05-07T19:43:00.3171603Z processor : 94 2025-05-07T19:43:00.3171690Z vendor_id : GenuineIntel 2025-05-07T19:43:00.3171780Z cpu family : 6 2025-05-07T19:43:00.3171866Z model : 85 2025-05-07T19:43:00.3172032Z model name : Intel(R) Xeon(R) Platinum 8275CL CPU @ 3.00GHz 2025-05-07T19:43:00.3172113Z stepping : 7 2025-05-07T19:43:00.3172215Z microcode : 0x5003901 2025-05-07T19:43:00.3172305Z cpu MHz : 2999.996 2025-05-07T19:43:00.3172388Z cache size : 36608 KB 2025-05-07T19:43:00.3172475Z physical id : 1 2025-05-07T19:43:00.3172576Z siblings : 48 2025-05-07T19:43:00.3172665Z core id : 22 2025-05-07T19:43:00.3172744Z cpu cores : 24 2025-05-07T19:43:00.3172836Z apicid : 109 2025-05-07T19:43:00.3172928Z initial apicid : 109 2025-05-07T19:43:00.3173013Z fpu : yes 2025-05-07T19:43:00.3173101Z fpu_exception : yes 2025-05-07T19:43:00.3173189Z cpuid level : 13 2025-05-07T19:43:00.3173264Z wp : yes 2025-05-07T19:43:00.3175434Z flags : fpu vme de pse tsc msr pae mce cx8 apic sep mtrr pge mca cmov pat pse36 clflush mmx fxsr sse sse2 ss ht syscall nx pdpe1gb rdtscp lm constant_tsc arch_perfmon rep_good nopl xtopology nonstop_tsc cpuid aperfmperf tsc_known_freq pni pclmulqdq monitor ssse3 fma cx16 pcid sse4_1 sse4_2 x2apic movbe popcnt tsc_deadline_timer aes xsave avx f16c rdrand hypervisor lahf_lm abm 3dnowprefetch invpcid_single pti fsgsbase tsc_adjust bmi1 avx2 smep bmi2 erms invpcid mpx avx512f avx512dq rdseed adx smap clflushopt clwb avx512cd avx512bw avx512vl xsaveopt xsavec xgetbv1 xsaves ida arat pku ospke avx512_vnni 2025-05-07T19:43:00.3175841Z bugs : cpu_meltdown spectre_v1 spectre_v2 spec_store_bypass l1tf mds swapgs itlb_multihit mmio_stale_data retbleed gds bhi 2025-05-07T19:43:00.3175929Z bogomips : 5999.99 2025-05-07T19:43:00.3176015Z clflush size : 64 2025-05-07T19:43:00.3176109Z cache_alignment : 64 2025-05-07T19:43:00.3176238Z address sizes : 46 bits physical, 48 bits virtual 2025-05-07T19:43:00.3176330Z power management: 2025-05-07T19:43:00.3176334Z 2025-05-07T19:43:00.3176424Z processor : 95 2025-05-07T19:43:00.3176521Z vendor_id : GenuineIntel 2025-05-07T19:43:00.3176603Z cpu family : 6 2025-05-07T19:43:00.3176680Z model : 85 2025-05-07T19:43:00.3176952Z model name : Intel(R) Xeon(R) Platinum 8275CL CPU @ 3.00GHz 2025-05-07T19:43:00.3177030Z stepping : 7 2025-05-07T19:43:00.3177110Z microcode : 0x5003901 2025-05-07T19:43:00.3177190Z cpu MHz : 2999.996 2025-05-07T19:43:00.3177321Z cache size : 36608 KB 2025-05-07T19:43:00.3177396Z physical id : 1 2025-05-07T19:43:00.3177467Z siblings : 48 2025-05-07T19:43:00.3177555Z core id : 23 2025-05-07T19:43:00.3177628Z cpu cores : 24 2025-05-07T19:43:00.3177702Z apicid : 111 2025-05-07T19:43:00.3177861Z initial apicid : 111 2025-05-07T19:43:00.3177938Z fpu : yes 2025-05-07T19:43:00.3178017Z fpu_exception : yes 2025-05-07T19:43:00.3178092Z cpuid level : 13 2025-05-07T19:43:00.3178172Z wp : yes 2025-05-07T19:43:00.3180170Z flags : fpu vme de pse tsc msr pae mce cx8 apic sep mtrr pge mca cmov pat pse36 clflush mmx fxsr sse sse2 ss ht syscall nx pdpe1gb rdtscp lm constant_tsc arch_perfmon rep_good nopl xtopology nonstop_tsc cpuid aperfmperf tsc_known_freq pni pclmulqdq monitor ssse3 fma cx16 pcid sse4_1 sse4_2 x2apic movbe popcnt tsc_deadline_timer aes xsave avx f16c rdrand hypervisor lahf_lm abm 3dnowprefetch invpcid_single pti fsgsbase tsc_adjust bmi1 avx2 smep bmi2 erms invpcid mpx avx512f avx512dq rdseed adx smap clflushopt clwb avx512cd avx512bw avx512vl xsaveopt xsavec xgetbv1 xsaves ida arat pku ospke avx512_vnni 2025-05-07T19:43:00.3180552Z bugs : cpu_meltdown spectre_v1 spectre_v2 spec_store_bypass l1tf mds swapgs itlb_multihit mmio_stale_data retbleed gds bhi 2025-05-07T19:43:00.3180634Z bogomips : 5999.99 2025-05-07T19:43:00.3180712Z clflush size : 64 2025-05-07T19:43:00.3180791Z cache_alignment : 64 2025-05-07T19:43:00.3180932Z address sizes : 46 bits physical, 48 bits virtual 2025-05-07T19:43:00.3181012Z power management: 2025-05-07T19:43:00.3181016Z 2025-05-07T19:43:00.3181020Z 2025-05-07T19:43:00.3181129Z ################################################################################ 2025-05-07T19:43:00.3181246Z [INFO] Print PCI info ... 2025-05-07T19:43:00.3181324Z + lspci -v 2025-05-07T19:43:00.3181328Z 2025-05-07T19:43:00.3181506Z 00:00.0 Host bridge: Intel Corporation 440FX - 82441FX PMC [Natoma] 2025-05-07T19:43:00.3181626Z Subsystem: Amazon.com, Inc. Device 1237 2025-05-07T19:43:00.3181733Z Flags: bus master, medium devsel, latency 0 2025-05-07T19:43:00.3181741Z 2025-05-07T19:43:00.3181925Z 00:01.0 ISA bridge: Intel Corporation 82371SB PIIX3 ISA [Natoma/Triton II] 2025-05-07T19:43:00.3182021Z Physical Slot: 1 2025-05-07T19:43:00.3182130Z Flags: bus master, fast devsel, latency 0 2025-05-07T19:43:00.3182135Z 2025-05-07T19:43:00.3182378Z 00:01.3 Non-VGA unclassified device: Intel Corporation 82371AB/EB/MB PIIX4 ACPI (rev 08) 2025-05-07T19:43:00.3182454Z Physical Slot: 1 2025-05-07T19:43:00.3182594Z Flags: bus master, fast devsel, latency 0, IRQ 9 2025-05-07T19:43:00.3182598Z 2025-05-07T19:43:00.3182854Z 00:03.0 VGA compatible controller: Amazon.com, Inc. Device 1111 (prog-if 00 [VGA controller]) 2025-05-07T19:43:00.3182935Z Physical Slot: 3 2025-05-07T19:43:00.3183059Z Flags: bus master, fast devsel, latency 0 2025-05-07T19:43:00.3183186Z Memory at c0000000 (32-bit, prefetchable) [size=4M] 2025-05-07T19:43:00.3183305Z Expansion ROM at 000c0000 [disabled] [size=128K] 2025-05-07T19:43:00.3183308Z 2025-05-07T19:43:00.3183621Z 00:04.0 Non-Volatile memory controller: Amazon.com, Inc. NVMe EBS Controller (prog-if 02 [NVM Express]) 2025-05-07T19:43:00.3183721Z Subsystem: Amazon.com, Inc. Device 0000 2025-05-07T19:43:00.3183801Z Physical Slot: 4 2025-05-07T19:43:00.3183946Z Flags: bus master, fast devsel, latency 0, IRQ 11 2025-05-07T19:43:00.3184090Z Memory at c0514000 (32-bit, non-prefetchable) [size=16K] 2025-05-07T19:43:00.3184190Z Capabilities: 2025-05-07T19:43:00.3184282Z Kernel driver in use: nvme 2025-05-07T19:43:00.3184287Z 2025-05-07T19:43:00.3184503Z 00:05.0 Ethernet controller: Amazon.com, Inc. Elastic Network Adapter (ENA) 2025-05-07T19:43:00.3184588Z Physical Slot: 5 2025-05-07T19:43:00.3184700Z Flags: bus master, fast devsel, latency 0 2025-05-07T19:43:00.3184860Z Memory at c0510000 (32-bit, non-prefetchable) [size=16K] 2025-05-07T19:43:00.3184995Z Memory at c0400000 (32-bit, prefetchable) [size=1M] 2025-05-07T19:43:00.3185131Z Memory at c0500000 (32-bit, non-prefetchable) [size=64K] 2025-05-07T19:43:00.3185281Z Capabilities: 2025-05-07T19:43:00.3185389Z Kernel driver in use: ena 2025-05-07T19:43:00.3185393Z 2025-05-07T19:43:00.3185396Z 2025-05-07T19:43:00.3185543Z ################################################################################ 2025-05-07T19:43:00.3185648Z [INFO] Print Linux distribution info ... 2025-05-07T19:43:00.3185733Z + uname -a 2025-05-07T19:43:00.3185737Z 2025-05-07T19:43:00.3186108Z Linux 2c96c3f709dd 6.1.130-139.222.amzn2023.x86_64 #1 SMP PREEMPT_DYNAMIC Tue Mar 11 01:10:58 UTC 2025 x86_64 x86_64 x86_64 GNU/Linux 2025-05-07T19:43:00.3186112Z 2025-05-07T19:43:00.3186184Z + uname -m 2025-05-07T19:43:00.3186188Z 2025-05-07T19:43:00.3186268Z x86_64 2025-05-07T19:43:00.3186272Z 2025-05-07T19:43:00.3186349Z + cat /proc/version 2025-05-07T19:43:00.3186353Z 2025-05-07T19:43:00.3186903Z Linux version 6.1.130-139.222.amzn2023.x86_64 (mockbuild@ip-10-0-55-76) (gcc (GCC) 11.5.0 20240719 (Red Hat 11.5.0-5), GNU ld version 2.39-6.amzn2023.0.11) #1 SMP PREEMPT_DYNAMIC Tue Mar 11 01:10:58 UTC 2025 2025-05-07T19:43:00.3186920Z 2025-05-07T19:43:00.3187005Z + cat /etc/os-release 2025-05-07T19:43:00.3187009Z 2025-05-07T19:43:00.3187086Z NAME="Amazon Linux" 2025-05-07T19:43:00.3187164Z VERSION="2023" 2025-05-07T19:43:00.3187249Z ID="amzn" 2025-05-07T19:43:00.3187329Z ID_LIKE="fedora" 2025-05-07T19:43:00.3187415Z VERSION_ID="2023" 2025-05-07T19:43:00.3187521Z PLATFORM_ID="platform:al2023" 2025-05-07T19:43:00.3187622Z PRETTY_NAME="Amazon Linux 2023.7.20250428" 2025-05-07T19:43:00.3187694Z ANSI_COLOR="0;33" 2025-05-07T19:43:00.3187806Z CPE_NAME="cpe:2.3:o:amazon:amazon_linux:2023" 2025-05-07T19:43:00.3187990Z HOME_URL="https://aws.amazon.com/linux/amazon-linux-2023/" 2025-05-07T19:43:00.3188148Z DOCUMENTATION_URL="https://docs.aws.amazon.com/linux/" 2025-05-07T19:43:00.3188294Z SUPPORT_URL="https://aws.amazon.com/premiumsupport/" 2025-05-07T19:43:00.3188493Z BUG_REPORT_URL="https://github.com/amazonlinux/amazon-linux-2023" 2025-05-07T19:43:00.3188571Z VENDOR_NAME="AWS" 2025-05-07T19:43:00.3188688Z VENDOR_URL="https://aws.amazon.com/" 2025-05-07T19:43:00.3188781Z SUPPORT_END="2029-06-30" 2025-05-07T19:43:00.3188799Z 2025-05-07T19:43:00.3225314Z ##[group]Run . $PRELUDE; print_gpu_info 2025-05-07T19:43:00.3225467Z . $PRELUDE; print_gpu_info 2025-05-07T19:43:00.3225796Z shell: bash --noprofile --norc -e -o pipefail {0} 2025-05-07T19:43:00.3225876Z env: 2025-05-07T19:43:00.3225985Z PRELUDE: .github/scripts/setup_env.bash 2025-05-07T19:43:00.3226074Z BUILD_ENV: build_binary 2025-05-07T19:43:00.3226171Z BUILD_TARGET: default 2025-05-07T19:43:00.3226252Z BUILD_VARIANT: cuda 2025-05-07T19:43:00.3226338Z BUILD_CUDA_VERSION: 12.6.3 2025-05-07T19:43:00.3226431Z ##[endgroup] 2025-05-07T19:43:00.7655525Z ################################################################################ 2025-05-07T19:43:00.7656648Z [INFO] Printing general display info ... 2025-05-07T19:43:00.7669204Z [EXEC] [ATTEMPT 0/3] + wget -q --timeout 1 pypi.org -O /dev/null 2025-05-07T19:43:00.8541897Z [CHECK] Network does not appear to be blocked. 2025-05-07T19:43:00.8547047Z /usr/bin/sudo 2025-05-07T19:43:00.8555867Z which: no apt-get in (/usr/local/sbin:/usr/local/bin:/usr/sbin:/usr/bin:/sbin:/bin) 2025-05-07T19:43:00.8563782Z /usr/bin/yum 2025-05-07T19:43:00.8564089Z [INSTALL] Updating system repositories ... 2025-05-07T19:43:00.8586601Z [EXEC] [ATTEMPT 0/3] + sudo yum update -y 2025-05-07T19:43:01.0723821Z Last metadata expiration check: 0:00:18 ago on Wed May 7 19:42:43 2025. 2025-05-07T19:43:01.1675124Z Dependencies resolved. 2025-05-07T19:43:01.1892095Z Nothing to do. 2025-05-07T19:43:01.1892800Z Complete! 2025-05-07T19:43:01.2644031Z [INSTALL] Installing system package(s): hostname lshw ... 2025-05-07T19:43:01.2667193Z [EXEC] [ATTEMPT 0/3] + sudo yum install -y hostname lshw 2025-05-07T19:43:01.4772970Z Last metadata expiration check: 0:00:18 ago on Wed May 7 19:42:43 2025. 2025-05-07T19:43:01.5284794Z Dependencies resolved. 2025-05-07T19:43:01.5452130Z ================================================================================ 2025-05-07T19:43:01.5453756Z Package Arch Version Repository Size 2025-05-07T19:43:01.5454180Z ================================================================================ 2025-05-07T19:43:01.5454524Z Installing: 2025-05-07T19:43:01.5454870Z hostname x86_64 3.23-4.amzn2023.0.3 amazonlinux 28 k 2025-05-07T19:43:01.5455348Z lshw x86_64 B.02.19.2-7.amzn2023.0.3 amazonlinux 319 k 2025-05-07T19:43:01.5455640Z 2025-05-07T19:43:01.5455753Z Transaction Summary 2025-05-07T19:43:01.5456011Z ================================================================================ 2025-05-07T19:43:01.5456360Z Install 2 Packages 2025-05-07T19:43:01.5456505Z 2025-05-07T19:43:01.5456612Z Total download size: 347 k 2025-05-07T19:43:01.5456899Z Installed size: 883 k 2025-05-07T19:43:01.5457167Z Downloading Packages: 2025-05-07T19:43:01.8612293Z (1/2): hostname-3.23-4.amzn2023.0.3.x86_64.rpm 1.3 MB/s | 28 kB 00:00 2025-05-07T19:43:01.8650046Z (2/2): lshw-B.02.19.2-7.amzn2023.0.3.x86_64.rpm 13 MB/s | 319 kB 00:00 2025-05-07T19:43:01.8665126Z -------------------------------------------------------------------------------- 2025-05-07T19:43:01.8668128Z Total 1.1 MB/s | 347 kB 00:00 2025-05-07T19:43:01.8914698Z Running transaction check 2025-05-07T19:43:01.8970035Z Transaction check succeeded. 2025-05-07T19:43:01.8970385Z Running transaction test 2025-05-07T19:43:01.9135240Z Transaction test succeeded. 2025-05-07T19:43:01.9135586Z Running transaction 2025-05-07T19:43:01.9425242Z Preparing : 1/1 2025-05-07T19:43:01.9503586Z Installing : lshw-B.02.19.2-7.amzn2023.0.3.x86_64 1/2 2025-05-07T19:43:01.9540583Z Installing : hostname-3.23-4.amzn2023.0.3.x86_64 2/2 2025-05-07T19:43:02.9991014Z Running scriptlet: hostname-3.23-4.amzn2023.0.3.x86_64 2/2 2025-05-07T19:43:02.9992198Z Verifying : hostname-3.23-4.amzn2023.0.3.x86_64 1/2 2025-05-07T19:43:03.0371028Z Verifying : lshw-B.02.19.2-7.amzn2023.0.3.x86_64 2/2 2025-05-07T19:43:03.0371526Z 2025-05-07T19:43:03.0371983Z Installed: 2025-05-07T19:43:03.0372400Z hostname-3.23-4.amzn2023.0.3.x86_64 lshw-B.02.19.2-7.amzn2023.0.3.x86_64 2025-05-07T19:43:03.0372751Z 2025-05-07T19:43:03.0372861Z Complete! 2025-05-07T19:43:03.0820320Z + hostname 2025-05-07T19:43:03.0820758Z 2025-05-07T19:43:03.0830556Z 2c96c3f709dd 2025-05-07T19:43:03.0830999Z 2025-05-07T19:43:03.0831991Z + sudo lshw -C display 2025-05-07T19:43:03.0832646Z 2025-05-07T19:43:03.2826576Z *-display UNCLAIMED 2025-05-07T19:43:03.2827497Z description: VGA compatible controller 2025-05-07T19:43:03.2828492Z product: Amazon.com, Inc. 2025-05-07T19:43:03.2829301Z vendor: Amazon.com, Inc. 2025-05-07T19:43:03.2830115Z physical id: 3 2025-05-07T19:43:03.2830802Z bus info: pci@0000:00:03.0 2025-05-07T19:43:03.2831851Z version: 00 2025-05-07T19:43:03.2832480Z width: 32 bits 2025-05-07T19:43:03.2833131Z clock: 33MHz 2025-05-07T19:43:03.2833609Z capabilities: vga_controller bus_master 2025-05-07T19:43:03.2833965Z configuration: latency=0 2025-05-07T19:43:03.2834313Z resources: memory:c0000000-c03fffff memory:c0000-dffff 2025-05-07T19:43:03.2846688Z 2025-05-07T19:43:03.2847282Z ################################################################################ 2025-05-07T19:43:03.2848368Z [INFO] Printing NVIDIA GPU info ... 2025-05-07T19:43:03.2954140Z lspci: Unable to load libkmod resources: error -2 2025-05-07T19:43:03.2976239Z which: no nvidia-smi in (/usr/local/sbin:/usr/local/bin:/usr/sbin:/usr/bin:/sbin:/bin) 2025-05-07T19:43:03.2977676Z [CHECK] nvidia-smi not found 2025-05-07T19:43:03.2978534Z ################################################################################ 2025-05-07T19:43:03.2980003Z [INFO] Printing AMD GPU info ... 2025-05-07T19:43:03.3078917Z lspci: Unable to load libkmod resources: error -2 2025-05-07T19:43:03.3104543Z which: no rocminfo in (/usr/local/sbin:/usr/local/bin:/usr/sbin:/usr/bin:/sbin:/bin) 2025-05-07T19:43:03.3105073Z [CHECK] rocminfo not found 2025-05-07T19:43:03.3109676Z which: no rocm-smi in (/usr/local/sbin:/usr/local/bin:/usr/sbin:/usr/bin:/sbin:/bin) 2025-05-07T19:43:03.3110172Z [CHECK] rocm-smi not found 2025-05-07T19:43:03.3202162Z ##[group]Run . $PRELUDE; setup_miniconda $HOME/miniconda 2025-05-07T19:43:03.3202697Z . $PRELUDE; setup_miniconda $HOME/miniconda 2025-05-07T19:43:03.3203312Z shell: bash --noprofile --norc -e -o pipefail {0} 2025-05-07T19:43:03.3203664Z env: 2025-05-07T19:43:03.3203942Z PRELUDE: .github/scripts/setup_env.bash 2025-05-07T19:43:03.3204367Z BUILD_ENV: build_binary 2025-05-07T19:43:03.3204651Z BUILD_TARGET: default 2025-05-07T19:43:03.3204893Z BUILD_VARIANT: cuda 2025-05-07T19:43:03.3205191Z BUILD_CUDA_VERSION: 12.6.3 2025-05-07T19:43:03.3205454Z ##[endgroup] 2025-05-07T19:43:03.7571412Z ################################################################################ 2025-05-07T19:43:03.7572471Z # Setup Miniconda 2025-05-07T19:43:03.7573123Z # 2025-05-07T19:43:03.7598147Z # [2025-05-07T19:43:03.759Z] + setup_miniconda /github/home/miniconda 2025-05-07T19:43:03.7598640Z ################################################################################ 2025-05-07T19:43:03.7598985Z 2025-05-07T19:43:03.7626074Z [EXEC] [ATTEMPT 0/3] + wget -q --timeout 1 pypi.org -O /dev/null 2025-05-07T19:43:03.8433971Z [CHECK] Network does not appear to be blocked. 2025-05-07T19:43:03.8434546Z + mkdir -p /github/home/miniconda 2025-05-07T19:43:03.8434799Z 2025-05-07T19:43:03.8453640Z 2025-05-07T19:43:03.8454104Z [SETUP] Downloading the Miniconda installer ... 2025-05-07T19:43:03.8486793Z [EXEC] [ATTEMPT 0/3] + wget -q https://repo.anaconda.com/miniconda/Miniconda3-latest-Linux-x86_64.sh -O miniconda.sh 2025-05-07T19:43:05.5238917Z [SETUP] Installing Miniconda ... 2025-05-07T19:43:05.5240021Z + bash miniconda.sh -b -p /github/home/miniconda -u 2025-05-07T19:43:05.5240815Z 2025-05-07T19:43:05.5391108Z PREFIX=/github/home/miniconda 2025-05-07T19:43:05.8910415Z Unpacking payload ... 2025-05-07T19:43:06.3751618Z entry_point.py:256: DeprecationWarning: Python 3.14 will, by default, filter extracted tar archives and reject files or modify their metadata. Use the filter argument to control this behavior. 2025-05-07T19:43:07.0477134Z entry_point.py:256: DeprecationWarning: Python 3.14 will, by default, filter extracted tar archives and reject files or modify their metadata. Use the filter argument to control this behavior. 2025-05-07T19:43:08.8930956Z 2025-05-07T19:43:08.8931573Z Installing base environment... 2025-05-07T19:43:08.8932187Z 2025-05-07T19:43:09.8940718Z Preparing transaction: ...working... done 2025-05-07T19:43:12.7755478Z Executing transaction: ...working... done 2025-05-07T19:43:13.3249182Z entry_point.py:256: DeprecationWarning: Python 3.14 will, by default, filter extracted tar archives and reject files or modify their metadata. Use the filter argument to control this behavior. 2025-05-07T19:43:13.3937094Z installation finished. 2025-05-07T19:43:13.3937677Z 2025-05-07T19:43:13.3938771Z + rm -f miniconda.sh 2025-05-07T19:43:13.3939342Z 2025-05-07T19:43:13.4110523Z 2025-05-07T19:43:13.4111125Z [SETUP] Reloading the bash configuration ... 2025-05-07T19:43:13.4112438Z + /github/home/miniconda/bin/conda init bash 2025-05-07T19:43:13.4113128Z 2025-05-07T19:43:13.7723120Z no change /github/home/miniconda/condabin/conda 2025-05-07T19:43:13.7724318Z no change /github/home/miniconda/bin/conda 2025-05-07T19:43:13.7725018Z no change /github/home/miniconda/bin/conda-env 2025-05-07T19:43:13.7725415Z no change /github/home/miniconda/bin/activate 2025-05-07T19:43:13.7725789Z no change /github/home/miniconda/bin/deactivate 2025-05-07T19:43:13.7726214Z no change /github/home/miniconda/etc/profile.d/conda.sh 2025-05-07T19:43:13.7727134Z no change /github/home/miniconda/etc/fish/conf.d/conda.fish 2025-05-07T19:43:13.7727587Z no change /github/home/miniconda/shell/condabin/Conda.psm1 2025-05-07T19:43:13.7728078Z no change /github/home/miniconda/shell/condabin/conda-hook.ps1 2025-05-07T19:43:13.7728629Z no change /github/home/miniconda/lib/python3.13/site-packages/xontrib/conda.xsh 2025-05-07T19:43:13.7729450Z no change /github/home/miniconda/etc/profile.d/conda.csh 2025-05-07T19:43:13.7729814Z modified /github/home/.bashrc 2025-05-07T19:43:13.7730027Z 2025-05-07T19:43:13.7730239Z ==> For changes to take effect, close and re-open your current shell. <== 2025-05-07T19:43:13.7730540Z 2025-05-07T19:43:13.8239051Z 2025-05-07T19:43:13.8239946Z + . /github/home/.bashrc 2025-05-07T19:43:13.8240519Z 2025-05-07T19:43:14.6071948Z 2025-05-07T19:43:14.6072423Z [SETUP] Installing libmamba-solver (required since Anaconda 2024.02-1) and libarchive ... 2025-05-07T19:43:14.6097887Z [EXEC] [ATTEMPT 0/3] + conda install --solver=classic -c conda-forge --override-channels -y conda-libmamba-solver libmamba libmambapy libarchive 2025-05-07T19:43:26.4096094Z Collecting package metadata (current_repodata.json): - \ | / - \ | / - \ | / - \ | / - \ | / - \ | / - \ | / - \ | / - \ | / - \ | / - \ | / - \ | / - \ | / - \ | / - \ | / - \ | / - \ | / - \ | / - \ | / - \ | / - \ | / - \ | / - \ | / - \ done 2025-05-07T19:43:27.8507981Z Solving environment: / - \ | / - \ | / - \ done 2025-05-07T19:43:27.9394833Z 2025-05-07T19:43:27.9395925Z ## Package Plan ## 2025-05-07T19:43:27.9396534Z 2025-05-07T19:43:27.9396929Z environment location: /github/home/miniconda 2025-05-07T19:43:27.9397651Z 2025-05-07T19:43:27.9397927Z added / updated specs: 2025-05-07T19:43:27.9398736Z - conda-libmamba-solver 2025-05-07T19:43:27.9399344Z - libarchive 2025-05-07T19:43:27.9399582Z - libmamba 2025-05-07T19:43:27.9399817Z - libmambapy 2025-05-07T19:43:27.9399950Z 2025-05-07T19:43:27.9399954Z 2025-05-07T19:43:27.9400083Z The following packages will be downloaded: 2025-05-07T19:43:27.9400309Z 2025-05-07T19:43:27.9400445Z package | build 2025-05-07T19:43:27.9400781Z ---------------------------|----------------- 2025-05-07T19:43:27.9401235Z ca-certificates-2025.4.26 | hbd8a1cb_0 149 KB conda-forge 2025-05-07T19:43:27.9401732Z certifi-2025.4.26 | pyhd8ed1ab_0 154 KB conda-forge 2025-05-07T19:43:27.9402190Z conda-25.3.1 | py313h78bf25f_1 1.1 MB conda-forge 2025-05-07T19:43:27.9402698Z conda-libmamba-solver-25.4.0| pyhd8ed1ab_0 41 KB conda-forge 2025-05-07T19:43:27.9403160Z ------------------------------------------------------------ 2025-05-07T19:43:27.9403534Z Total: 1.4 MB 2025-05-07T19:43:27.9403752Z 2025-05-07T19:43:27.9403870Z The following packages will be UPDATED: 2025-05-07T19:43:27.9404100Z 2025-05-07T19:43:27.9409875Z ca-certificates pkgs/main/linux-64::ca-certificates-2~ --> conda-forge/noarch::ca-certificates-2025.4.26-hbd8a1cb_0 2025-05-07T19:43:27.9410692Z conda pkgs/main::conda-25.3.1-py313h06a4308~ --> conda-forge::conda-25.3.1-py313h78bf25f_1 2025-05-07T19:43:27.9411119Z 2025-05-07T19:43:27.9411344Z The following packages will be SUPERSEDED by a higher-priority channel: 2025-05-07T19:43:27.9411668Z 2025-05-07T19:43:27.9412009Z certifi pkgs/main/linux-64::certifi-2025.4.26~ --> conda-forge/noarch::certifi-2025.4.26-pyhd8ed1ab_0 2025-05-07T19:43:27.9412813Z conda-libmamba-so~ pkgs/main::conda-libmamba-solver-25.4~ --> conda-forge::conda-libmamba-solver-25.4.0-pyhd8ed1ab_0 2025-05-07T19:43:27.9413324Z 2025-05-07T19:43:27.9413329Z 2025-05-07T19:43:27.9413627Z 2025-05-07T19:43:27.9413780Z Downloading and Extracting Packages: ...working... 2025-05-07T19:43:27.9414188Z conda-25.3.1 | 1.1 MB | | 0% 2025-05-07T19:43:27.9414414Z 2025-05-07T19:43:27.9414723Z certifi-2025.4.26 | 154 KB | | 0%  2025-05-07T19:43:27.9414984Z 2025-05-07T19:43:27.9414988Z 2025-05-07T19:43:27.9424055Z ca-certificates-2025 | 149 KB | | 0%  2025-05-07T19:43:27.9424413Z 2025-05-07T19:43:27.9424420Z 2025-05-07T19:43:27.9424774Z 2025-05-07T19:43:27.9850523Z conda-libmamba-solve | 41 KB | | 0%  2025-05-07T19:43:27.9850882Z 2025-05-07T19:43:27.9964909Z certifi-2025.4.26 | 154 KB | ########## | 100%  2025-05-07T19:43:27.9965465Z 2025-05-07T19:43:27.9976918Z certifi-2025.4.26 | 154 KB | ########## | 100%  2025-05-07T19:43:28.0040240Z conda-25.3.1 | 1.1 MB | ########## | 100% 2025-05-07T19:43:28.0041012Z 2025-05-07T19:43:28.0041045Z 2025-05-07T19:43:28.0144669Z ca-certificates-2025 | 149 KB | ########## | 100%  2025-05-07T19:43:28.0144981Z 2025-05-07T19:43:28.0145002Z 2025-05-07T19:43:28.0145008Z 2025-05-07T19:43:28.0151633Z conda-libmamba-solve | 41 KB | ########## | 100%  2025-05-07T19:43:28.0152262Z 2025-05-07T19:43:28.0152267Z 2025-05-07T19:43:28.0294979Z ca-certificates-2025 | 149 KB | ########## | 100%  2025-05-07T19:43:28.0295876Z 2025-05-07T19:43:28.0295890Z 2025-05-07T19:43:28.0295903Z 2025-05-07T19:43:28.1072545Z conda-libmamba-solve | 41 KB | ########## | 100%  2025-05-07T19:43:28.1074006Z conda-25.3.1 | 1.1 MB | ########## | 100% 2025-05-07T19:43:28.1074648Z conda-25.3.1 | 1.1 MB | ########## | 100% 2025-05-07T19:43:28.1075078Z 2025-05-07T19:43:28.1075319Z 2025-05-07T19:43:28.1075524Z  2025-05-07T19:43:28.1075795Z 2025-05-07T19:43:28.1075800Z 2025-05-07T19:43:28.1075993Z  2025-05-07T19:43:28.1076213Z 2025-05-07T19:43:28.1076236Z 2025-05-07T19:43:28.1076240Z 2025-05-07T19:43:28.1076489Z  done 2025-05-07T19:43:28.2090149Z Preparing transaction: / done 2025-05-07T19:43:28.3097588Z Verifying transaction: \ done 2025-05-07T19:43:29.6126083Z Executing transaction: / - \ | / - \ | / - \ | / done 2025-05-07T19:43:31.1474363Z [SETUP] Updating Miniconda base packages ... 2025-05-07T19:43:31.1496694Z [EXEC] [ATTEMPT 0/3] + conda update -n base -c defaults --update-deps -y conda 2025-05-07T19:43:31.8651372Z Channels: 2025-05-07T19:43:31.8651635Z - defaults 2025-05-07T19:43:31.8651871Z Platform: linux-64 2025-05-07T19:43:32.9390703Z Collecting package metadata (repodata.json): - \ | / - \ done 2025-05-07T19:43:33.0712032Z Solving environment: / - Channels: 2025-05-07T19:43:33.0713013Z - defaults 2025-05-07T19:43:33.0713694Z Platform: linux-64 2025-05-07T19:43:33.3486457Z Collecting package metadata (repodata.json): | / - \ done 2025-05-07T19:43:33.5682334Z Solving environment: / - \ done 2025-05-07T19:43:33.6836374Z | done 2025-05-07T19:43:33.7469365Z 2025-05-07T19:43:33.7470229Z ## Package Plan ## 2025-05-07T19:43:33.7470477Z 2025-05-07T19:43:33.7470629Z environment location: /github/home/miniconda 2025-05-07T19:43:33.7470903Z 2025-05-07T19:43:33.7471006Z added / updated specs: 2025-05-07T19:43:33.7471316Z - conda 2025-05-07T19:43:33.7471542Z 2025-05-07T19:43:33.7471547Z 2025-05-07T19:43:33.7471691Z The following packages will be downloaded: 2025-05-07T19:43:33.7471920Z 2025-05-07T19:43:33.7472042Z package | build 2025-05-07T19:43:33.7472389Z ---------------------------|----------------- 2025-05-07T19:43:33.7472745Z pip-25.1 | pyhc872135_2 1.3 MB 2025-05-07T19:43:33.7473172Z tzdata-2025b | h04d1e81_0 116 KB 2025-05-07T19:43:33.7473912Z ------------------------------------------------------------ 2025-05-07T19:43:33.7474276Z Total: 1.4 MB 2025-05-07T19:43:33.7474495Z 2025-05-07T19:43:33.7474632Z The following packages will be UPDATED: 2025-05-07T19:43:33.7474849Z 2025-05-07T19:43:33.7475179Z pip pkgs/main/linux-64::pip-25.0-py313h06~ --> pkgs/main/noarch::pip-25.1-pyhc872135_2 2025-05-07T19:43:33.7475868Z tzdata 2025a-h04d1e81_0 --> 2025b-h04d1e81_0 2025-05-07T19:43:33.7476136Z 2025-05-07T19:43:33.7476140Z 2025-05-07T19:43:33.7476143Z 2025-05-07T19:43:33.7476306Z Downloading and Extracting Packages: ...working... 2025-05-07T19:43:33.7476701Z pip-25.1 | 1.3 MB | | 0% 2025-05-07T19:43:33.7476943Z 2025-05-07T19:43:33.7867695Z tzdata-2025b | 116 KB | | 0%  2025-05-07T19:43:33.7867979Z 2025-05-07T19:43:33.8169785Z tzdata-2025b | 116 KB | ########## | 100%  2025-05-07T19:43:34.0484674Z pip-25.1 | 1.3 MB | ########## | 100% 2025-05-07T19:43:34.0485466Z 2025-05-07T19:43:34.0486771Z tzdata-2025b | 116 KB | ########## | 100%  2025-05-07T19:43:34.0487546Z 2025-05-07T19:43:34.0578410Z tzdata-2025b | 116 KB | ########## | 100%  2025-05-07T19:43:34.0579570Z pip-25.1 | 1.3 MB | ########## | 100% 2025-05-07T19:43:34.0581010Z pip-25.1 | 1.3 MB | ########## | 100% 2025-05-07T19:43:34.0581967Z 2025-05-07T19:43:34.0582576Z 2025-05-07T19:43:34.0583104Z  done 2025-05-07T19:43:34.1593914Z Preparing transaction: - done 2025-05-07T19:43:34.2611757Z Verifying transaction: | done 2025-05-07T19:43:36.2646927Z Executing transaction: - \ | / - \ | / - \ | / - \ | / - \ | / done 2025-05-07T19:43:36.8105038Z [SETUP] Cleaning up Conda packages ... 2025-05-07T19:43:36.8105556Z + conda clean --packages --tarball -y 2025-05-07T19:43:36.8105793Z 2025-05-07T19:43:37.2466697Z Will remove 99 (117.8 MB) tarball(s). 2025-05-07T19:43:37.2467683Z Will remove 11 (16.0 MB) package(s). 2025-05-07T19:43:37.3010974Z 2025-05-07T19:43:37.3013906Z + conda clean --all -y 2025-05-07T19:43:37.3014439Z 2025-05-07T19:43:37.7477983Z There are no unused tarball(s) to remove. 2025-05-07T19:43:37.7478408Z Will remove 1 index cache(s). 2025-05-07T19:43:37.7478709Z There are no unused package(s) to remove. 2025-05-07T19:43:37.7479163Z There are no tempfile(s) to remove. 2025-05-07T19:43:37.7479449Z There are no logfile(s) to remove. 2025-05-07T19:43:37.8026597Z 2025-05-07T19:43:37.8026986Z + conda info 2025-05-07T19:43:37.8027577Z 2025-05-07T19:43:38.3850033Z 2025-05-07T19:43:38.3850636Z active environment : base 2025-05-07T19:43:38.3851663Z active env location : /github/home/miniconda 2025-05-07T19:43:38.3852692Z shell level : 1 2025-05-07T19:43:38.3853591Z user config file : /github/home/.condarc 2025-05-07T19:43:38.3854755Z populated config files : /github/home/miniconda/.condarc 2025-05-07T19:43:38.3855880Z conda version : 25.3.1 2025-05-07T19:43:38.3856724Z conda-build version : not installed 2025-05-07T19:43:38.3857339Z python version : 3.13.2.final.0 2025-05-07T19:43:38.3857653Z solver : libmamba (default) 2025-05-07T19:43:38.3858045Z virtual packages : __archspec=1=cascadelake 2025-05-07T19:43:38.3858405Z __conda=25.3.1=0 2025-05-07T19:43:38.3858705Z __glibc=2.34=0 2025-05-07T19:43:38.3859022Z __linux=6.1.130=0 2025-05-07T19:43:38.3859315Z __unix=0=0 2025-05-07T19:43:38.3859684Z base environment : /github/home/miniconda (writable) 2025-05-07T19:43:38.3860098Z conda av data dir : /github/home/miniconda/etc/conda 2025-05-07T19:43:38.3860487Z conda av metadata url : None 2025-05-07T19:43:38.3861240Z channel URLs : https://repo.anaconda.com/pkgs/main/linux-64 2025-05-07T19:43:38.3861708Z https://repo.anaconda.com/pkgs/main/noarch 2025-05-07T19:43:38.3862123Z https://repo.anaconda.com/pkgs/r/linux-64 2025-05-07T19:43:38.3862513Z https://repo.anaconda.com/pkgs/r/noarch 2025-05-07T19:43:38.3862921Z package cache : /github/home/miniconda/pkgs 2025-05-07T19:43:38.3863426Z /github/home/.conda/pkgs 2025-05-07T19:43:38.3863815Z envs directories : /github/home/miniconda/envs 2025-05-07T19:43:38.3864172Z /github/home/.conda/envs 2025-05-07T19:43:38.3864525Z platform : linux-64 2025-05-07T19:43:38.3865860Z user-agent : conda/25.3.1 requests/2.32.3 CPython/3.13.2 Linux/6.1.130-139.222.amzn2023.x86_64 amzn/2023.7.20250428 glibc/2.34 solver/libmamba conda-libmamba-solver/25.4.0 libmambapy/2.0.5 aau/0.7.0 c/. s/. e/. 2025-05-07T19:43:38.3866803Z UID:GID : 0:0 2025-05-07T19:43:38.3867126Z netrc file : None 2025-05-07T19:43:38.3867424Z offline mode : False 2025-05-07T19:43:38.3867645Z 2025-05-07T19:43:38.4442670Z 2025-05-07T19:43:38.4443030Z [SETUP] Exporting Miniconda variables ... 2025-05-07T19:43:38.4443759Z [SETUP] Saving Miniconda variables to /__w/_temp/_runner_file_commands/add_path_951de4fe-6001-4b98-b744-4cfbaab1c1f0 ... 2025-05-07T19:43:38.4444508Z [SETUP] Successfully set up Miniconda at /github/home/miniconda 2025-05-07T19:43:38.4595135Z ##[group]Run . $PRELUDE; create_conda_environment $BUILD_ENV 3.12 2025-05-07T19:43:38.4595746Z . $PRELUDE; create_conda_environment $BUILD_ENV 3.12 2025-05-07T19:43:38.4596620Z shell: bash --noprofile --norc -e -o pipefail {0} 2025-05-07T19:43:38.4597001Z env: 2025-05-07T19:43:38.4597251Z PRELUDE: .github/scripts/setup_env.bash 2025-05-07T19:43:38.4597607Z BUILD_ENV: build_binary 2025-05-07T19:43:38.4597902Z BUILD_TARGET: default 2025-05-07T19:43:38.4598182Z BUILD_VARIANT: cuda 2025-05-07T19:43:38.4598439Z BUILD_CUDA_VERSION: 12.6.3 2025-05-07T19:43:38.4598737Z ##[endgroup] 2025-05-07T19:43:38.8939230Z ################################################################################ 2025-05-07T19:43:38.8939659Z # Create Conda Environment 2025-05-07T19:43:38.8939928Z # 2025-05-07T19:43:38.8952969Z # [2025-05-07T19:43:38.894Z] + create_conda_environment build_binary 3.12 2025-05-07T19:43:38.8953510Z ################################################################################ 2025-05-07T19:43:38.8953748Z 2025-05-07T19:43:38.8968332Z [EXEC] [ATTEMPT 0/3] + wget -q --timeout 1 pypi.org -O /dev/null 2025-05-07T19:43:38.9819269Z [CHECK] Network does not appear to be blocked. 2025-05-07T19:43:38.9820416Z [SETUP] Listing existing Conda environments ... 2025-05-07T19:43:38.9820762Z + conda info --envs 2025-05-07T19:43:38.9820899Z 2025-05-07T19:43:39.5639467Z 2025-05-07T19:43:39.5640016Z # conda environments: 2025-05-07T19:43:39.5640320Z # 2025-05-07T19:43:39.5641218Z base /github/home/miniconda 2025-05-07T19:43:39.5641494Z 2025-05-07T19:43:39.6246841Z 2025-05-07T19:43:39.6247655Z [SETUP] Deleting the prefix directory if it exists ... 2025-05-07T19:43:41.2620902Z + rm -rf /github/home/miniconda/envs/build_binary 2025-05-07T19:43:41.2621720Z 2025-05-07T19:43:41.2634675Z 2025-05-07T19:43:41.2642788Z [SETUP] Creating new Conda environment (Python 3.12) ... 2025-05-07T19:43:41.2667489Z [EXEC] [ATTEMPT 0/3] + conda create -y -n build_binary python=3.12 2025-05-07T19:43:41.8381455Z Channels: 2025-05-07T19:43:41.8382146Z - defaults 2025-05-07T19:43:41.8382775Z Platform: linux-64 2025-05-07T19:43:43.2477872Z Collecting package metadata (repodata.json): - \ | / - \ | / - done 2025-05-07T19:43:43.3479915Z Solving environment: | done 2025-05-07T19:43:43.3769904Z 2025-05-07T19:43:43.3770557Z ## Package Plan ## 2025-05-07T19:43:43.3770760Z 2025-05-07T19:43:43.3771550Z environment location: /github/home/miniconda/envs/build_binary 2025-05-07T19:43:43.3771922Z 2025-05-07T19:43:43.3772036Z added / updated specs: 2025-05-07T19:43:43.3772319Z - python=3.12 2025-05-07T19:43:43.3772473Z 2025-05-07T19:43:43.3772476Z 2025-05-07T19:43:43.3772614Z The following packages will be downloaded: 2025-05-07T19:43:43.3772888Z 2025-05-07T19:43:43.3773016Z package | build 2025-05-07T19:43:43.3773370Z ---------------------------|----------------- 2025-05-07T19:43:43.3773824Z _libgcc_mutex-0.1 | main 3 KB 2025-05-07T19:43:43.3774295Z _openmp_mutex-5.1 | 1_gnu 21 KB 2025-05-07T19:43:43.3774749Z ca-certificates-2025.2.25 | h06a4308_0 129 KB 2025-05-07T19:43:43.3775224Z python-3.12.9 | h5148396_0 34.7 MB 2025-05-07T19:43:43.3775654Z setuptools-78.1.1 | py312h06a4308_0 2.2 MB 2025-05-07T19:43:43.3776116Z wheel-0.45.1 | py312h06a4308_0 147 KB 2025-05-07T19:43:43.3776515Z ------------------------------------------------------------ 2025-05-07T19:43:43.3776908Z Total: 37.2 MB 2025-05-07T19:43:43.3777141Z 2025-05-07T19:43:43.3777311Z The following NEW packages will be INSTALLED: 2025-05-07T19:43:43.3777558Z 2025-05-07T19:43:43.3777789Z _libgcc_mutex pkgs/main/linux-64::_libgcc_mutex-0.1-main 2025-05-07T19:43:43.3778300Z _openmp_mutex pkgs/main/linux-64::_openmp_mutex-5.1-1_gnu 2025-05-07T19:43:43.3779013Z bzip2 pkgs/main/linux-64::bzip2-1.0.8-h5eee18b_6 2025-05-07T19:43:43.3779577Z ca-certificates pkgs/main/linux-64::ca-certificates-2025.2.25-h06a4308_0 2025-05-07T19:43:43.3780138Z expat pkgs/main/linux-64::expat-2.7.1-h6a678d5_0 2025-05-07T19:43:43.3780633Z ld_impl_linux-64 pkgs/main/linux-64::ld_impl_linux-64-2.40-h12ee557_0 2025-05-07T19:43:43.3781274Z libffi pkgs/main/linux-64::libffi-3.4.4-h6a678d5_1 2025-05-07T19:43:43.3781741Z libgcc-ng pkgs/main/linux-64::libgcc-ng-11.2.0-h1234567_1 2025-05-07T19:43:43.3782234Z libgomp pkgs/main/linux-64::libgomp-11.2.0-h1234567_1 2025-05-07T19:43:43.3782725Z libstdcxx-ng pkgs/main/linux-64::libstdcxx-ng-11.2.0-h1234567_1 2025-05-07T19:43:43.3783236Z libuuid pkgs/main/linux-64::libuuid-1.41.5-h5eee18b_0 2025-05-07T19:43:43.3783702Z ncurses pkgs/main/linux-64::ncurses-6.4-h6a678d5_0 2025-05-07T19:43:43.3784157Z openssl pkgs/main/linux-64::openssl-3.0.16-h5eee18b_0 2025-05-07T19:43:43.3784613Z pip pkgs/main/noarch::pip-25.1-pyhc872135_2 2025-05-07T19:43:43.3785039Z python pkgs/main/linux-64::python-3.12.9-h5148396_0 2025-05-07T19:43:43.3785512Z readline pkgs/main/linux-64::readline-8.2-h5eee18b_0 2025-05-07T19:43:43.3786033Z setuptools pkgs/main/linux-64::setuptools-78.1.1-py312h06a4308_0 2025-05-07T19:43:43.3786534Z sqlite pkgs/main/linux-64::sqlite-3.45.3-h5eee18b_0 2025-05-07T19:43:43.3786970Z tk pkgs/main/linux-64::tk-8.6.14-h39e8969_0 2025-05-07T19:43:43.3787377Z tzdata pkgs/main/noarch::tzdata-2025b-h04d1e81_0 2025-05-07T19:43:43.3787846Z wheel pkgs/main/linux-64::wheel-0.45.1-py312h06a4308_0 2025-05-07T19:43:43.3788290Z xz pkgs/main/linux-64::xz-5.6.4-h5eee18b_1 2025-05-07T19:43:43.3788688Z zlib pkgs/main/linux-64::zlib-1.2.13-h5eee18b_1 2025-05-07T19:43:43.3789004Z 2025-05-07T19:43:43.3789008Z 2025-05-07T19:43:43.3789012Z 2025-05-07T19:43:43.3789171Z Downloading and Extracting Packages: ...working... 2025-05-07T19:43:43.3789579Z python-3.12.9 | 34.7 MB | | 0% 2025-05-07T19:43:43.3789849Z 2025-05-07T19:43:43.3790182Z setuptools-78.1.1 | 2.2 MB | | 0%  2025-05-07T19:43:43.3790447Z 2025-05-07T19:43:43.3790451Z 2025-05-07T19:43:43.3790815Z wheel-0.45.1 | 147 KB | | 0%  2025-05-07T19:43:43.3791068Z 2025-05-07T19:43:43.3791073Z 2025-05-07T19:43:43.3791076Z 2025-05-07T19:43:43.3793225Z ca-certificates-2025 | 129 KB | | 0%  2025-05-07T19:43:43.3793552Z 2025-05-07T19:43:43.3793556Z 2025-05-07T19:43:43.3793559Z 2025-05-07T19:43:43.3793563Z 2025-05-07T19:43:43.3805533Z _openmp_mutex-5.1 | 21 KB | | 0%  2025-05-07T19:43:43.3805908Z 2025-05-07T19:43:43.3805913Z 2025-05-07T19:43:43.3805917Z 2025-05-07T19:43:43.3805921Z 2025-05-07T19:43:43.3805942Z 2025-05-07T19:43:43.4170054Z _libgcc_mutex-0.1 | 3 KB | | 0%  2025-05-07T19:43:43.4170387Z 2025-05-07T19:43:43.4170423Z 2025-05-07T19:43:43.4170427Z 2025-05-07T19:43:43.4170432Z 2025-05-07T19:43:43.4256810Z _openmp_mutex-5.1 | 21 KB | ########## | 100%  2025-05-07T19:43:43.4257146Z 2025-05-07T19:43:43.4257151Z 2025-05-07T19:43:43.4286549Z wheel-0.45.1 | 147 KB | ########## | 100%  2025-05-07T19:43:43.4286936Z 2025-05-07T19:43:43.4286941Z 2025-05-07T19:43:43.4286945Z 2025-05-07T19:43:43.4380944Z ca-certificates-2025 | 129 KB | ########## | 100%  2025-05-07T19:43:43.4381396Z 2025-05-07T19:43:43.4381504Z 2025-05-07T19:43:43.4381521Z 2025-05-07T19:43:43.4381527Z 2025-05-07T19:43:43.4381545Z 2025-05-07T19:43:43.4462109Z _libgcc_mutex-0.1 | 3 KB | ########## | 100%  2025-05-07T19:43:43.4462449Z 2025-05-07T19:43:43.4462465Z 2025-05-07T19:43:43.4462468Z 2025-05-07T19:43:43.4568672Z ca-certificates-2025 | 129 KB | ########## | 100%  2025-05-07T19:43:43.4569633Z 2025-05-07T19:43:43.4569666Z 2025-05-07T19:43:43.4569680Z 2025-05-07T19:43:43.4569692Z 2025-05-07T19:43:43.4569703Z 2025-05-07T19:43:43.4668556Z _libgcc_mutex-0.1 | 3 KB | ########## | 100%  2025-05-07T19:43:43.4668914Z 2025-05-07T19:43:43.4668930Z 2025-05-07T19:43:43.4668935Z 2025-05-07T19:43:43.4668974Z 2025-05-07T19:43:43.4754103Z _openmp_mutex-5.1 | 21 KB | ########## | 100%  2025-05-07T19:43:43.4755031Z 2025-05-07T19:43:43.4755725Z setuptools-78.1.1 | 2.2 MB | ########## | 100%  2025-05-07T19:43:43.4756498Z 2025-05-07T19:43:43.4756511Z 2025-05-07T19:43:43.4767093Z wheel-0.45.1 | 147 KB | ########## | 100%  2025-05-07T19:43:43.5803245Z python-3.12.9 | 34.7 MB | 4 | 5% 2025-05-07T19:43:43.7056967Z python-3.12.9 | 34.7 MB | #####5 | 55% 2025-05-07T19:43:43.7057784Z 2025-05-07T19:43:43.7058484Z setuptools-78.1.1 | 2.2 MB | ########## | 100%  2025-05-07T19:43:43.7058779Z 2025-05-07T19:43:43.7626620Z setuptools-78.1.1 | 2.2 MB | ########## | 100%  2025-05-07T19:43:43.7627096Z python-3.12.9 | 34.7 MB | ########## | 100% 2025-05-07T19:43:44.2987223Z python-3.12.9 | 34.7 MB | ########## | 100% 2025-05-07T19:43:44.2989516Z python-3.12.9 | 34.7 MB | ########## | 100% 2025-05-07T19:43:44.2990354Z 2025-05-07T19:43:44.2990631Z 2025-05-07T19:43:44.2990896Z  2025-05-07T19:43:44.2991128Z 2025-05-07T19:43:44.2991133Z 2025-05-07T19:43:44.2991453Z  2025-05-07T19:43:44.2991706Z 2025-05-07T19:43:44.2991710Z 2025-05-07T19:43:44.2991714Z 2025-05-07T19:43:44.2991913Z  2025-05-07T19:43:44.2992200Z 2025-05-07T19:43:44.2992204Z 2025-05-07T19:43:44.2992219Z 2025-05-07T19:43:44.2992224Z 2025-05-07T19:43:44.2992414Z  2025-05-07T19:43:44.2992686Z 2025-05-07T19:43:44.2992689Z 2025-05-07T19:43:44.2992693Z 2025-05-07T19:43:44.2992696Z 2025-05-07T19:43:44.2992700Z 2025-05-07T19:43:44.2992915Z  done 2025-05-07T19:43:44.5104799Z Preparing transaction: - \ done 2025-05-07T19:43:46.0373076Z Verifying transaction: / - \ | / - \ | / - \ | / - done 2025-05-07T19:43:48.2536537Z Executing transaction: | / - \ | / - \ | / - \ | / - \ | / - \ | / done 2025-05-07T19:43:48.2575776Z # 2025-05-07T19:43:48.2576179Z # To activate this environment, use 2025-05-07T19:43:48.2576809Z # 2025-05-07T19:43:48.2577369Z # $ conda activate build_binary 2025-05-07T19:43:48.2577666Z # 2025-05-07T19:43:48.2577931Z # To deactivate an active environment, use 2025-05-07T19:43:48.2578269Z # 2025-05-07T19:43:48.2578513Z # $ conda deactivate 2025-05-07T19:43:48.2578691Z 2025-05-07T19:43:48.3412611Z [SETUP] Upgrading PIP to latest ... 2025-05-07T19:43:48.3438136Z [EXEC] [ATTEMPT 0/3] + conda run -n build_binary pip install --upgrade pip 2025-05-07T19:43:51.3118431Z WARNING: Running pip as the 'root' user can result in broken permissions and conflicting behaviour with the system package manager, possibly rendering your system unusable. It is recommended to use a virtual environment instead: https://pip.pypa.io/warnings/venv. Use the --root-user-action option if you know what you are doing and want to suppress this warning. 2025-05-07T19:43:51.3120070Z 2025-05-07T19:43:51.3120540Z Requirement already satisfied: pip in /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages (25.1) 2025-05-07T19:43:51.3121216Z Collecting pip 2025-05-07T19:43:51.3121570Z Downloading pip-25.1.1-py3-none-any.whl.metadata (3.6 kB) 2025-05-07T19:43:51.3122325Z Downloading pip-25.1.1-py3-none-any.whl (1.8 MB) 2025-05-07T19:43:51.3123342Z ━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━ 1.8/1.8 MB 57.2 MB/s eta 0:00:00 2025-05-07T19:43:51.3123778Z Installing collected packages: pip 2025-05-07T19:43:51.3124148Z Attempting uninstall: pip 2025-05-07T19:43:51.3124465Z Found existing installation: pip 25.1 2025-05-07T19:43:51.3124835Z Uninstalling pip-25.1: 2025-05-07T19:43:51.3125147Z Successfully uninstalled pip-25.1 2025-05-07T19:43:51.3125537Z Successfully installed pip-25.1.1 2025-05-07T19:43:51.3125748Z 2025-05-07T19:43:51.3898139Z [SETUP] Upgrading pyOpenSSL ... 2025-05-07T19:43:51.3923521Z [EXEC] [ATTEMPT 0/3] + conda install -n build_binary -c conda-forge --override-channels -y pyOpenSSL>22.1.0 2025-05-07T19:43:52.0511730Z Channels: 2025-05-07T19:43:52.0512196Z - conda-forge 2025-05-07T19:43:52.0512506Z Platform: linux-64 2025-05-07T19:44:01.5893733Z Collecting package metadata (repodata.json): - \ | / - \ | / - \ | / - \ | / - done 2025-05-07T19:44:03.4928846Z Solving environment: | / - \ | done 2025-05-07T19:44:03.5389927Z 2025-05-07T19:44:03.5390408Z ## Package Plan ## 2025-05-07T19:44:03.5390635Z 2025-05-07T19:44:03.5390902Z environment location: /github/home/miniconda/envs/build_binary 2025-05-07T19:44:03.5391251Z 2025-05-07T19:44:03.5391468Z added / updated specs: 2025-05-07T19:44:03.5391802Z - pyopenssl[version='>22.1.0'] 2025-05-07T19:44:03.5392059Z 2025-05-07T19:44:03.5392063Z 2025-05-07T19:44:03.5392263Z The following packages will be downloaded: 2025-05-07T19:44:03.5392502Z 2025-05-07T19:44:03.5392636Z package | build 2025-05-07T19:44:03.5393015Z ---------------------------|----------------- 2025-05-07T19:44:03.5393415Z cffi-1.17.1 | py312h06ac9bb_0 288 KB conda-forge 2025-05-07T19:44:03.5393932Z cryptography-44.0.3 | py312hda17c39_0 1.5 MB conda-forge 2025-05-07T19:44:03.5394455Z expat-2.7.0 | h5888daf_0 137 KB conda-forge 2025-05-07T19:44:03.5394906Z libexpat-2.7.0 | h5888daf_0 73 KB conda-forge 2025-05-07T19:44:03.5395383Z libgcc-15.1.0 | h767d61c_2 810 KB conda-forge 2025-05-07T19:44:03.5395840Z libgcc-ng-15.1.0 | h69a702a_2 34 KB conda-forge 2025-05-07T19:44:03.5396332Z libgomp-15.1.0 | h767d61c_2 442 KB conda-forge 2025-05-07T19:44:03.5397117Z libnsl-2.0.1 | hd590300_0 33 KB conda-forge 2025-05-07T19:44:03.5397606Z libsqlite-3.46.0 | hde9e2c9_0 845 KB conda-forge 2025-05-07T19:44:03.5398096Z libuuid-2.38.1 | h0b41bf4_0 33 KB conda-forge 2025-05-07T19:44:03.5398559Z libxcrypt-4.4.36 | hd590300_1 98 KB conda-forge 2025-05-07T19:44:03.5399050Z libzlib-1.2.13 | h4ab18f5_6 60 KB conda-forge 2025-05-07T19:44:03.5399515Z openssl-3.5.0 | h7b32b05_1 3.0 MB conda-forge 2025-05-07T19:44:03.5400013Z pycparser-2.22 | pyh29332c3_1 108 KB conda-forge 2025-05-07T19:44:03.5400501Z pyopenssl-25.0.0 | pyhd8ed1ab_0 120 KB conda-forge 2025-05-07T19:44:03.5401014Z python-3.12.2 |hab00c5b_0_cpython 30.8 MB conda-forge 2025-05-07T19:44:03.5401519Z python_abi-3.12 | 7_cp312 7 KB conda-forge 2025-05-07T19:44:03.5402021Z typing-extensions-4.13.2 | h0e9735f_0 88 KB conda-forge 2025-05-07T19:44:03.5402581Z typing_extensions-4.13.2 | pyh29332c3_0 51 KB conda-forge 2025-05-07T19:44:03.5403059Z zlib-1.2.13 | h4ab18f5_6 91 KB conda-forge 2025-05-07T19:44:03.5403530Z ------------------------------------------------------------ 2025-05-07T19:44:03.5403941Z Total: 38.6 MB 2025-05-07T19:44:03.5404347Z 2025-05-07T19:44:03.5404529Z The following NEW packages will be INSTALLED: 2025-05-07T19:44:03.5404784Z 2025-05-07T19:44:03.5405026Z cffi conda-forge/linux-64::cffi-1.17.1-py312h06ac9bb_0 2025-05-07T19:44:03.5405624Z cryptography conda-forge/linux-64::cryptography-44.0.3-py312hda17c39_0 2025-05-07T19:44:03.5406191Z libexpat conda-forge/linux-64::libexpat-2.7.0-h5888daf_0 2025-05-07T19:44:03.5406716Z libgcc conda-forge/linux-64::libgcc-15.1.0-h767d61c_2 2025-05-07T19:44:03.5407214Z libnsl conda-forge/linux-64::libnsl-2.0.1-hd590300_0 2025-05-07T19:44:03.5407802Z libsqlite conda-forge/linux-64::libsqlite-3.46.0-hde9e2c9_0 2025-05-07T19:44:03.5408363Z libxcrypt conda-forge/linux-64::libxcrypt-4.4.36-hd590300_1 2025-05-07T19:44:03.5408869Z libzlib conda-forge/linux-64::libzlib-1.2.13-h4ab18f5_6 2025-05-07T19:44:03.5409416Z pycparser conda-forge/noarch::pycparser-2.22-pyh29332c3_1 2025-05-07T19:44:03.5409980Z pyopenssl conda-forge/noarch::pyopenssl-25.0.0-pyhd8ed1ab_0 2025-05-07T19:44:03.5410492Z python_abi conda-forge/noarch::python_abi-3.12-7_cp312 2025-05-07T19:44:03.5411095Z typing-extensions conda-forge/noarch::typing-extensions-4.13.2-h0e9735f_0 2025-05-07T19:44:03.5411746Z typing_extensions conda-forge/noarch::typing_extensions-4.13.2-pyh29332c3_0 2025-05-07T19:44:03.5412168Z 2025-05-07T19:44:03.5412301Z The following packages will be UPDATED: 2025-05-07T19:44:03.5412537Z 2025-05-07T19:44:03.5413003Z ca-certificates pkgs/main/linux-64::ca-certificates-2~ --> conda-forge/noarch::ca-certificates-2025.4.26-hbd8a1cb_0 2025-05-07T19:44:03.5413883Z libgcc-ng pkgs/main::libgcc-ng-11.2.0-h1234567_1 --> conda-forge::libgcc-ng-15.1.0-h69a702a_2 2025-05-07T19:44:03.5414688Z libgomp pkgs/main::libgomp-11.2.0-h1234567_1 --> conda-forge::libgomp-15.1.0-h767d61c_2 2025-05-07T19:44:03.5415425Z libuuid pkgs/main::libuuid-1.41.5-h5eee18b_0 --> conda-forge::libuuid-2.38.1-h0b41bf4_0 2025-05-07T19:44:03.5416122Z openssl pkgs/main::openssl-3.0.16-h5eee18b_0 --> conda-forge::openssl-3.5.0-h7b32b05_1 2025-05-07T19:44:03.5416818Z zlib pkgs/main::zlib-1.2.13-h5eee18b_1 --> conda-forge::zlib-1.2.13-h4ab18f5_6 2025-05-07T19:44:03.5417184Z 2025-05-07T19:44:03.5417436Z The following packages will be SUPERSEDED by a higher-priority channel: 2025-05-07T19:44:03.5417899Z 2025-05-07T19:44:03.5418168Z expat pkgs/main::expat-2.7.1-h6a678d5_0 --> conda-forge::expat-2.7.0-h5888daf_0 2025-05-07T19:44:03.5418880Z python pkgs/main::python-3.12.9-h5148396_0 --> conda-forge::python-3.12.2-hab00c5b_0_cpython 2025-05-07T19:44:03.5419304Z 2025-05-07T19:44:03.5419308Z 2025-05-07T19:44:03.5419312Z 2025-05-07T19:44:03.5419471Z Downloading and Extracting Packages: ...working... 2025-05-07T19:44:03.5419927Z python-3.12.2 | 30.8 MB | | 0% 2025-05-07T19:44:03.5420181Z 2025-05-07T19:44:03.5420681Z openssl-3.5.0 | 3.0 MB | | 0%  2025-05-07T19:44:03.5420942Z 2025-05-07T19:44:03.5420946Z 2025-05-07T19:44:03.5421186Z cryptography-44.0.3 | 1.5 MB | | 0%  2025-05-07T19:44:03.5421499Z 2025-05-07T19:44:03.5421503Z 2025-05-07T19:44:03.5421507Z 2025-05-07T19:44:03.5424869Z libsqlite-3.46.0 | 845 KB | | 0%  2025-05-07T19:44:03.5425161Z 2025-05-07T19:44:03.5425165Z 2025-05-07T19:44:03.5425247Z 2025-05-07T19:44:03.5425341Z 2025-05-07T19:44:03.5447876Z libgcc-15.1.0 | 810 KB | | 0%  2025-05-07T19:44:03.5448185Z 2025-05-07T19:44:03.5448322Z 2025-05-07T19:44:03.5448326Z 2025-05-07T19:44:03.5448329Z 2025-05-07T19:44:03.5448333Z 2025-05-07T19:44:03.5448625Z libgomp-15.1.0 | 442 KB | | 0%  2025-05-07T19:44:03.5448929Z 2025-05-07T19:44:03.5448933Z 2025-05-07T19:44:03.5448936Z 2025-05-07T19:44:03.5449102Z 2025-05-07T19:44:03.5449107Z 2025-05-07T19:44:03.5449111Z 2025-05-07T19:44:03.5449370Z cffi-1.17.1 | 288 KB | | 0%  2025-05-07T19:44:03.5449676Z 2025-05-07T19:44:03.5449679Z 2025-05-07T19:44:03.5449683Z 2025-05-07T19:44:03.5449686Z 2025-05-07T19:44:03.5449690Z 2025-05-07T19:44:03.5449694Z 2025-05-07T19:44:03.5449697Z 2025-05-07T19:44:03.5449939Z expat-2.7.0 | 137 KB | | 0%  2025-05-07T19:44:03.5450262Z 2025-05-07T19:44:03.5450266Z 2025-05-07T19:44:03.5450269Z 2025-05-07T19:44:03.5450273Z 2025-05-07T19:44:03.5450276Z 2025-05-07T19:44:03.5450280Z 2025-05-07T19:44:03.5450283Z 2025-05-07T19:44:03.5450287Z 2025-05-07T19:44:03.5450564Z pyopenssl-25.0.0 | 120 KB | | 0%  2025-05-07T19:44:03.5450894Z 2025-05-07T19:44:03.5450898Z 2025-05-07T19:44:03.5450901Z 2025-05-07T19:44:03.5450905Z 2025-05-07T19:44:03.5450908Z 2025-05-07T19:44:03.5450912Z 2025-05-07T19:44:03.5450920Z 2025-05-07T19:44:03.5450923Z 2025-05-07T19:44:03.5450927Z 2025-05-07T19:44:03.5451201Z pycparser-2.22 | 108 KB | | 0%  2025-05-07T19:44:03.5451533Z 2025-05-07T19:44:03.5451536Z 2025-05-07T19:44:03.5451540Z 2025-05-07T19:44:03.5451544Z 2025-05-07T19:44:03.5451547Z 2025-05-07T19:44:03.5451551Z 2025-05-07T19:44:03.5451554Z 2025-05-07T19:44:03.5451557Z 2025-05-07T19:44:03.5451564Z 2025-05-07T19:44:03.5451568Z 2025-05-07T19:44:03.5451917Z libxcrypt-4.4.36 | 98 KB | | 0%  2025-05-07T19:44:03.5452258Z 2025-05-07T19:44:03.5452261Z 2025-05-07T19:44:03.5452265Z 2025-05-07T19:44:03.5452268Z 2025-05-07T19:44:03.5452271Z 2025-05-07T19:44:03.5452275Z 2025-05-07T19:44:03.5452278Z 2025-05-07T19:44:03.5452282Z 2025-05-07T19:44:03.5452285Z 2025-05-07T19:44:03.5452289Z 2025-05-07T19:44:03.5452296Z 2025-05-07T19:44:03.5455083Z zlib-1.2.13 | 91 KB | | 0%  2025-05-07T19:44:03.5455370Z 2025-05-07T19:44:03.5455373Z 2025-05-07T19:44:03.5455377Z 2025-05-07T19:44:03.5455380Z 2025-05-07T19:44:03.5455384Z 2025-05-07T19:44:03.5455387Z 2025-05-07T19:44:03.5455391Z 2025-05-07T19:44:03.5455394Z 2025-05-07T19:44:03.5455398Z 2025-05-07T19:44:03.5455401Z 2025-05-07T19:44:03.5455405Z 2025-05-07T19:44:03.5455412Z 2025-05-07T19:44:03.5456144Z typing-extensions-4. | 88 KB | | 0%  2025-05-07T19:44:03.5456578Z 2025-05-07T19:44:03.5456582Z 2025-05-07T19:44:03.5456585Z 2025-05-07T19:44:03.5456595Z 2025-05-07T19:44:03.5456598Z 2025-05-07T19:44:03.5456602Z 2025-05-07T19:44:03.5456605Z 2025-05-07T19:44:03.5456609Z 2025-05-07T19:44:03.5456612Z 2025-05-07T19:44:03.5456644Z 2025-05-07T19:44:03.5456647Z 2025-05-07T19:44:03.5456651Z 2025-05-07T19:44:03.5457096Z 2025-05-07T19:44:03.5459309Z libexpat-2.7.0 | 73 KB | | 0%  2025-05-07T19:44:03.5459632Z 2025-05-07T19:44:03.5459642Z 2025-05-07T19:44:03.5459677Z 2025-05-07T19:44:03.5459680Z 2025-05-07T19:44:03.5459684Z 2025-05-07T19:44:03.5459687Z 2025-05-07T19:44:03.5459691Z 2025-05-07T19:44:03.5459694Z 2025-05-07T19:44:03.5459697Z 2025-05-07T19:44:03.5459701Z 2025-05-07T19:44:03.5459704Z 2025-05-07T19:44:03.5459708Z 2025-05-07T19:44:03.5459711Z 2025-05-07T19:44:03.5459714Z 2025-05-07T19:44:03.5460403Z libzlib-1.2.13 | 60 KB | | 0%  2025-05-07T19:44:03.5460743Z 2025-05-07T19:44:03.5460752Z 2025-05-07T19:44:03.5460756Z 2025-05-07T19:44:03.5460759Z 2025-05-07T19:44:03.5460763Z 2025-05-07T19:44:03.5460766Z 2025-05-07T19:44:03.5460770Z 2025-05-07T19:44:03.5460773Z 2025-05-07T19:44:03.5460777Z 2025-05-07T19:44:03.5460780Z 2025-05-07T19:44:03.5460784Z 2025-05-07T19:44:03.5460788Z 2025-05-07T19:44:03.5460791Z 2025-05-07T19:44:03.5460795Z 2025-05-07T19:44:03.5460798Z 2025-05-07T19:44:03.5461685Z typing_extensions-4. | 51 KB | | 0%  2025-05-07T19:44:03.5462034Z 2025-05-07T19:44:03.5462038Z 2025-05-07T19:44:03.5462042Z 2025-05-07T19:44:03.5462045Z 2025-05-07T19:44:03.5462049Z 2025-05-07T19:44:03.5462052Z 2025-05-07T19:44:03.5462056Z 2025-05-07T19:44:03.5462059Z 2025-05-07T19:44:03.5462063Z 2025-05-07T19:44:03.5462072Z 2025-05-07T19:44:03.5462076Z 2025-05-07T19:44:03.5462083Z 2025-05-07T19:44:03.5462119Z 2025-05-07T19:44:03.5462122Z 2025-05-07T19:44:03.5462126Z 2025-05-07T19:44:03.5462129Z 2025-05-07T19:44:03.5462582Z libgcc-ng-15.1.0 | 34 KB | | 0%  2025-05-07T19:44:03.5462902Z 2025-05-07T19:44:03.5462906Z 2025-05-07T19:44:03.5462909Z 2025-05-07T19:44:03.5462913Z 2025-05-07T19:44:03.5462946Z 2025-05-07T19:44:03.5462950Z 2025-05-07T19:44:03.5462953Z 2025-05-07T19:44:03.5462957Z 2025-05-07T19:44:03.5462960Z 2025-05-07T19:44:03.5462964Z 2025-05-07T19:44:03.5462972Z 2025-05-07T19:44:03.5462976Z 2025-05-07T19:44:03.5462979Z 2025-05-07T19:44:03.5462982Z 2025-05-07T19:44:03.5462990Z 2025-05-07T19:44:03.5462993Z 2025-05-07T19:44:03.5462997Z 2025-05-07T19:44:03.5463612Z libuuid-2.38.1 | 33 KB | | 0%  2025-05-07T19:44:03.5463962Z 2025-05-07T19:44:03.5463966Z 2025-05-07T19:44:03.5463969Z 2025-05-07T19:44:03.5463977Z 2025-05-07T19:44:03.5463980Z 2025-05-07T19:44:03.5463984Z 2025-05-07T19:44:03.5463987Z 2025-05-07T19:44:03.5463997Z 2025-05-07T19:44:03.5464001Z 2025-05-07T19:44:03.5464004Z 2025-05-07T19:44:03.5464007Z 2025-05-07T19:44:03.5464011Z 2025-05-07T19:44:03.5464014Z 2025-05-07T19:44:03.5464018Z 2025-05-07T19:44:03.5464021Z 2025-05-07T19:44:03.5464025Z 2025-05-07T19:44:03.5464028Z 2025-05-07T19:44:03.5464063Z 2025-05-07T19:44:03.5464903Z libnsl-2.0.1 | 33 KB | | 0%  2025-05-07T19:44:03.5465247Z 2025-05-07T19:44:03.5465250Z 2025-05-07T19:44:03.5465254Z 2025-05-07T19:44:03.5465257Z 2025-05-07T19:44:03.5465261Z 2025-05-07T19:44:03.5465296Z 2025-05-07T19:44:03.5465299Z 2025-05-07T19:44:03.5465303Z 2025-05-07T19:44:03.5465306Z 2025-05-07T19:44:03.5465310Z 2025-05-07T19:44:03.5465313Z 2025-05-07T19:44:03.5465316Z 2025-05-07T19:44:03.5465320Z 2025-05-07T19:44:03.5465323Z 2025-05-07T19:44:03.5465327Z 2025-05-07T19:44:03.5465458Z 2025-05-07T19:44:03.5465463Z 2025-05-07T19:44:03.5465467Z 2025-05-07T19:44:03.5465470Z 2025-05-07T19:44:03.6219193Z ... (more hidden) ... 2025-05-07T19:44:03.6219573Z 2025-05-07T19:44:03.6219577Z 2025-05-07T19:44:03.6219584Z 2025-05-07T19:44:03.6219590Z 2025-05-07T19:44:03.6393010Z libgcc-15.1.0 | 810 KB | ########## | 100%  2025-05-07T19:44:03.6396478Z python-3.12.2 | 30.8 MB | #5 | 16% 2025-05-07T19:44:03.6396754Z 2025-05-07T19:44:03.6400497Z openssl-3.5.0 | 3.0 MB | ##### | 51%  2025-05-07T19:44:03.6400788Z 2025-05-07T19:44:03.6401793Z 2025-05-07T19:44:03.6466045Z cryptography-44.0.3 | 1.5 MB | 9 | 9%  2025-05-07T19:44:03.6466377Z 2025-05-07T19:44:03.6466382Z 2025-05-07T19:44:03.6466387Z 2025-05-07T19:44:03.6466674Z libsqlite-3.46.0 | 845 KB | ########## | 100%  2025-05-07T19:44:03.6466962Z 2025-05-07T19:44:03.6466992Z 2025-05-07T19:44:03.6466996Z 2025-05-07T19:44:03.6664384Z libsqlite-3.46.0 | 845 KB | ########## | 100%  2025-05-07T19:44:03.6664901Z 2025-05-07T19:44:03.6664907Z 2025-05-07T19:44:03.6664911Z 2025-05-07T19:44:03.6664915Z 2025-05-07T19:44:03.6664919Z 2025-05-07T19:44:03.6779683Z libgomp-15.1.0 | 442 KB | 3 | 4%  2025-05-07T19:44:03.6780033Z 2025-05-07T19:44:03.6780038Z 2025-05-07T19:44:03.6780042Z 2025-05-07T19:44:03.6780046Z 2025-05-07T19:44:03.6780049Z 2025-05-07T19:44:03.6930477Z libgomp-15.1.0 | 442 KB | ########## | 100%  2025-05-07T19:44:03.6930818Z 2025-05-07T19:44:03.6930823Z 2025-05-07T19:44:03.6930856Z 2025-05-07T19:44:03.6930863Z 2025-05-07T19:44:03.6930866Z 2025-05-07T19:44:03.6930870Z 2025-05-07T19:44:03.6975556Z cffi-1.17.1 | 288 KB | 5 | 6%  2025-05-07T19:44:03.6975873Z 2025-05-07T19:44:03.6975878Z 2025-05-07T19:44:03.6975889Z 2025-05-07T19:44:03.7014821Z libsqlite-3.46.0 | 845 KB | ########## | 100%  2025-05-07T19:44:03.7015175Z 2025-05-07T19:44:03.7015179Z 2025-05-07T19:44:03.7015183Z 2025-05-07T19:44:03.7015187Z 2025-05-07T19:44:03.7015190Z 2025-05-07T19:44:03.7015194Z 2025-05-07T19:44:03.7075516Z cffi-1.17.1 | 288 KB | ########## | 100%  2025-05-07T19:44:03.7075828Z 2025-05-07T19:44:03.7075838Z 2025-05-07T19:44:03.7099469Z cryptography-44.0.3 | 1.5 MB | ########## | 100%  2025-05-07T19:44:03.7099866Z 2025-05-07T19:44:03.7209026Z openssl-3.5.0 | 3.0 MB | ########## | 100%  2025-05-07T19:44:03.7209344Z 2025-05-07T19:44:03.7209349Z 2025-05-07T19:44:03.7209378Z 2025-05-07T19:44:03.7209381Z 2025-05-07T19:44:03.7209628Z libgcc-15.1.0 | 810 KB | ########## | 100%  2025-05-07T19:44:03.7209901Z 2025-05-07T19:44:03.7209905Z 2025-05-07T19:44:03.7209909Z 2025-05-07T19:44:03.7209913Z 2025-05-07T19:44:03.7393583Z libgcc-15.1.0 | 810 KB | ########## | 100%  2025-05-07T19:44:03.7440563Z python-3.12.2 | 30.8 MB | ###9 | 40% 2025-05-07T19:44:03.7440913Z 2025-05-07T19:44:03.7440919Z 2025-05-07T19:44:03.7440924Z 2025-05-07T19:44:03.7440929Z 2025-05-07T19:44:03.7440935Z 2025-05-07T19:44:03.7440940Z 2025-05-07T19:44:03.7440945Z 2025-05-07T19:44:03.7468373Z expat-2.7.0 | 137 KB | #1 | 12%  2025-05-07T19:44:03.7468703Z 2025-05-07T19:44:03.7468707Z 2025-05-07T19:44:03.7468711Z 2025-05-07T19:44:03.7468715Z 2025-05-07T19:44:03.7468718Z 2025-05-07T19:44:03.7468723Z 2025-05-07T19:44:03.7468750Z 2025-05-07T19:44:03.7468759Z 2025-05-07T19:44:03.7481120Z pyopenssl-25.0.0 | 120 KB | #3 | 13%  2025-05-07T19:44:03.7481456Z 2025-05-07T19:44:03.7481460Z 2025-05-07T19:44:03.7481463Z 2025-05-07T19:44:03.7481467Z 2025-05-07T19:44:03.7481473Z 2025-05-07T19:44:03.7508918Z libgomp-15.1.0 | 442 KB | ########## | 100%  2025-05-07T19:44:03.7509472Z 2025-05-07T19:44:03.7509477Z 2025-05-07T19:44:03.7509503Z 2025-05-07T19:44:03.7509507Z 2025-05-07T19:44:03.7509510Z 2025-05-07T19:44:03.7509514Z 2025-05-07T19:44:03.7509517Z 2025-05-07T19:44:03.7509521Z 2025-05-07T19:44:03.7521348Z pyopenssl-25.0.0 | 120 KB | ########## | 100%  2025-05-07T19:44:03.7521663Z 2025-05-07T19:44:03.7521667Z 2025-05-07T19:44:03.7521671Z 2025-05-07T19:44:03.7521693Z 2025-05-07T19:44:03.7521697Z 2025-05-07T19:44:03.7521700Z 2025-05-07T19:44:03.7525137Z 2025-05-07T19:44:03.7554099Z expat-2.7.0 | 137 KB | ########## | 100%  2025-05-07T19:44:03.7554426Z 2025-05-07T19:44:03.7554431Z 2025-05-07T19:44:03.7554435Z 2025-05-07T19:44:03.7554462Z 2025-05-07T19:44:03.7554466Z 2025-05-07T19:44:03.7554469Z 2025-05-07T19:44:03.7554473Z 2025-05-07T19:44:03.7554476Z 2025-05-07T19:44:03.7554480Z 2025-05-07T19:44:03.7554483Z 2025-05-07T19:44:03.7603635Z libxcrypt-4.4.36 | 98 KB | #6 | 16%  2025-05-07T19:44:03.7604062Z 2025-05-07T19:44:03.7604068Z 2025-05-07T19:44:03.7604073Z 2025-05-07T19:44:03.7604077Z 2025-05-07T19:44:03.7604082Z 2025-05-07T19:44:03.7604087Z 2025-05-07T19:44:03.7604091Z 2025-05-07T19:44:03.7604096Z 2025-05-07T19:44:03.7604100Z 2025-05-07T19:44:03.7604105Z 2025-05-07T19:44:03.7631724Z libxcrypt-4.4.36 | 98 KB | ########## | 100%  2025-05-07T19:44:03.7632074Z 2025-05-07T19:44:03.7632079Z 2025-05-07T19:44:03.7632083Z 2025-05-07T19:44:03.7632087Z 2025-05-07T19:44:03.7632301Z 2025-05-07T19:44:03.7632307Z 2025-05-07T19:44:03.7632310Z 2025-05-07T19:44:03.7632314Z 2025-05-07T19:44:03.7632317Z 2025-05-07T19:44:03.7668888Z pycparser-2.22 | 108 KB | #4 | 15%  2025-05-07T19:44:03.7669219Z 2025-05-07T19:44:03.7669224Z 2025-05-07T19:44:03.7669228Z 2025-05-07T19:44:03.7669232Z 2025-05-07T19:44:03.7669236Z 2025-05-07T19:44:03.7669240Z 2025-05-07T19:44:03.7669256Z 2025-05-07T19:44:03.7669260Z 2025-05-07T19:44:03.7669263Z 2025-05-07T19:44:03.7889022Z pycparser-2.22 | 108 KB | ########## | 100%  2025-05-07T19:44:03.7889362Z 2025-05-07T19:44:03.7889368Z 2025-05-07T19:44:03.7889372Z 2025-05-07T19:44:03.7889377Z 2025-05-07T19:44:03.7889382Z 2025-05-07T19:44:03.7889385Z 2025-05-07T19:44:03.7953092Z cffi-1.17.1 | 288 KB | ########## | 100%  2025-05-07T19:44:03.7953402Z 2025-05-07T19:44:03.7953407Z 2025-05-07T19:44:03.7953411Z 2025-05-07T19:44:03.7953414Z 2025-05-07T19:44:03.7953434Z 2025-05-07T19:44:03.7953438Z 2025-05-07T19:44:03.7953441Z 2025-05-07T19:44:03.7953445Z 2025-05-07T19:44:03.7953448Z 2025-05-07T19:44:03.7953452Z 2025-05-07T19:44:03.7953475Z 2025-05-07T19:44:03.7953479Z 2025-05-07T19:44:03.7953483Z 2025-05-07T19:44:03.7986273Z libexpat-2.7.0 | 73 KB | ##2 | 22%  2025-05-07T19:44:03.7986613Z 2025-05-07T19:44:03.7986636Z 2025-05-07T19:44:03.7986640Z 2025-05-07T19:44:03.7986644Z 2025-05-07T19:44:03.7986672Z 2025-05-07T19:44:03.7986675Z 2025-05-07T19:44:03.7986679Z 2025-05-07T19:44:03.7986682Z 2025-05-07T19:44:03.7986686Z 2025-05-07T19:44:03.7986689Z 2025-05-07T19:44:03.7986693Z 2025-05-07T19:44:03.7986696Z 2025-05-07T19:44:03.7986700Z 2025-05-07T19:44:03.8089346Z libexpat-2.7.0 | 73 KB | ########## | 100%  2025-05-07T19:44:03.8089712Z 2025-05-07T19:44:03.8089716Z 2025-05-07T19:44:03.8089720Z 2025-05-07T19:44:03.8089724Z 2025-05-07T19:44:03.8089744Z 2025-05-07T19:44:03.8089748Z 2025-05-07T19:44:03.8089752Z 2025-05-07T19:44:03.8089755Z 2025-05-07T19:44:03.8089759Z 2025-05-07T19:44:03.8089762Z 2025-05-07T19:44:03.8089765Z 2025-05-07T19:44:03.8089769Z 2025-05-07T19:44:03.8089772Z 2025-05-07T19:44:03.8089776Z 2025-05-07T19:44:03.8096913Z libzlib-1.2.13 | 60 KB | ##6 | 27%  2025-05-07T19:44:03.8097437Z 2025-05-07T19:44:03.8097441Z 2025-05-07T19:44:03.8097444Z 2025-05-07T19:44:03.8097448Z 2025-05-07T19:44:03.8097451Z 2025-05-07T19:44:03.8097462Z 2025-05-07T19:44:03.8097465Z 2025-05-07T19:44:03.8097469Z 2025-05-07T19:44:03.8097472Z 2025-05-07T19:44:03.8097476Z 2025-05-07T19:44:03.8097479Z 2025-05-07T19:44:03.8097483Z 2025-05-07T19:44:03.8115182Z typing-extensions-4. | 88 KB | #8 | 18%  2025-05-07T19:44:03.8115552Z 2025-05-07T19:44:03.8115556Z 2025-05-07T19:44:03.8115560Z 2025-05-07T19:44:03.8115564Z 2025-05-07T19:44:03.8115580Z 2025-05-07T19:44:03.8115584Z 2025-05-07T19:44:03.8115587Z 2025-05-07T19:44:03.8115591Z 2025-05-07T19:44:03.8115595Z 2025-05-07T19:44:03.8115598Z 2025-05-07T19:44:03.8115602Z 2025-05-07T19:44:03.8120667Z zlib-1.2.13 | 91 KB | #7 | 18%  2025-05-07T19:44:03.8120950Z 2025-05-07T19:44:03.8120954Z 2025-05-07T19:44:03.8120957Z 2025-05-07T19:44:03.8120968Z 2025-05-07T19:44:03.8120972Z 2025-05-07T19:44:03.8120975Z 2025-05-07T19:44:03.8120979Z 2025-05-07T19:44:03.8120982Z 2025-05-07T19:44:03.8120986Z 2025-05-07T19:44:03.8121011Z 2025-05-07T19:44:03.8121015Z 2025-05-07T19:44:03.8121018Z 2025-05-07T19:44:03.8121022Z 2025-05-07T19:44:03.8121540Z 2025-05-07T19:44:03.8162796Z libzlib-1.2.13 | 60 KB | ########## | 100%  2025-05-07T19:44:03.8163143Z 2025-05-07T19:44:03.8163174Z 2025-05-07T19:44:03.8163178Z 2025-05-07T19:44:03.8163181Z 2025-05-07T19:44:03.8163185Z 2025-05-07T19:44:03.8163375Z 2025-05-07T19:44:03.8163380Z 2025-05-07T19:44:03.8163384Z 2025-05-07T19:44:03.8163387Z 2025-05-07T19:44:03.8163390Z 2025-05-07T19:44:03.8163394Z 2025-05-07T19:44:03.8163397Z 2025-05-07T19:44:03.8176207Z typing-extensions-4. | 88 KB | ########## | 100%  2025-05-07T19:44:03.8176589Z 2025-05-07T19:44:03.8176593Z 2025-05-07T19:44:03.8176597Z 2025-05-07T19:44:03.8176608Z 2025-05-07T19:44:03.8176612Z 2025-05-07T19:44:03.8176615Z 2025-05-07T19:44:03.8176618Z 2025-05-07T19:44:03.8176622Z 2025-05-07T19:44:03.8176625Z 2025-05-07T19:44:03.8176629Z 2025-05-07T19:44:03.8176636Z 2025-05-07T19:44:03.8321166Z zlib-1.2.13 | 91 KB | ########## | 100%  2025-05-07T19:44:03.8321512Z 2025-05-07T19:44:03.8321517Z 2025-05-07T19:44:03.8321521Z 2025-05-07T19:44:03.8321525Z 2025-05-07T19:44:03.8321528Z 2025-05-07T19:44:03.8321532Z 2025-05-07T19:44:03.8321535Z 2025-05-07T19:44:03.8321539Z 2025-05-07T19:44:03.8321558Z 2025-05-07T19:44:03.8321562Z 2025-05-07T19:44:03.8321566Z 2025-05-07T19:44:03.8321570Z 2025-05-07T19:44:03.8321573Z 2025-05-07T19:44:03.8321577Z 2025-05-07T19:44:03.8321580Z 2025-05-07T19:44:03.8351523Z typing_extensions-4. | 51 KB | ###1 | 31%  2025-05-07T19:44:03.8351917Z 2025-05-07T19:44:03.8351922Z 2025-05-07T19:44:03.8351926Z 2025-05-07T19:44:03.8351947Z 2025-05-07T19:44:03.8351951Z 2025-05-07T19:44:03.8351954Z 2025-05-07T19:44:03.8351958Z 2025-05-07T19:44:03.8351961Z 2025-05-07T19:44:03.8351965Z 2025-05-07T19:44:03.8351990Z 2025-05-07T19:44:03.8351994Z 2025-05-07T19:44:03.8351997Z 2025-05-07T19:44:03.8352001Z 2025-05-07T19:44:03.8352004Z 2025-05-07T19:44:03.8352008Z 2025-05-07T19:44:03.8394807Z typing_extensions-4. | 51 KB | ########## | 100%  2025-05-07T19:44:03.8396000Z python-3.12.2 | 30.8 MB | ######2 | 62% 2025-05-07T19:44:03.8396256Z 2025-05-07T19:44:03.8396284Z 2025-05-07T19:44:03.8396288Z 2025-05-07T19:44:03.8396291Z 2025-05-07T19:44:03.8396295Z 2025-05-07T19:44:03.8396298Z 2025-05-07T19:44:03.8396302Z 2025-05-07T19:44:03.8396413Z 2025-05-07T19:44:03.8698433Z pyopenssl-25.0.0 | 120 KB | ########## | 100%  2025-05-07T19:44:03.8698807Z 2025-05-07T19:44:03.8698813Z 2025-05-07T19:44:03.8698820Z 2025-05-07T19:44:03.8698826Z 2025-05-07T19:44:03.8699041Z 2025-05-07T19:44:03.8699045Z 2025-05-07T19:44:03.8699050Z 2025-05-07T19:44:03.8699054Z 2025-05-07T19:44:03.8699058Z 2025-05-07T19:44:03.8699063Z 2025-05-07T19:44:03.8699068Z 2025-05-07T19:44:03.8699071Z 2025-05-07T19:44:03.8699076Z 2025-05-07T19:44:03.8699079Z 2025-05-07T19:44:03.8699083Z 2025-05-07T19:44:03.8699086Z 2025-05-07T19:44:03.8717014Z libgcc-ng-15.1.0 | 34 KB | ####7 | 47%  2025-05-07T19:44:03.8717337Z 2025-05-07T19:44:03.8717341Z 2025-05-07T19:44:03.8717345Z 2025-05-07T19:44:03.8717363Z 2025-05-07T19:44:03.8717366Z 2025-05-07T19:44:03.8717370Z 2025-05-07T19:44:03.8717373Z 2025-05-07T19:44:03.8717377Z 2025-05-07T19:44:03.8717380Z 2025-05-07T19:44:03.8717384Z 2025-05-07T19:44:03.8717387Z 2025-05-07T19:44:03.8717391Z 2025-05-07T19:44:03.8717394Z 2025-05-07T19:44:03.8717398Z 2025-05-07T19:44:03.8717401Z 2025-05-07T19:44:03.8723013Z 2025-05-07T19:44:03.8739001Z libgcc-ng-15.1.0 | 34 KB | ########## | 100%  2025-05-07T19:44:03.8739368Z 2025-05-07T19:44:03.8739373Z 2025-05-07T19:44:03.8739377Z 2025-05-07T19:44:03.8739381Z 2025-05-07T19:44:03.8739384Z 2025-05-07T19:44:03.8739388Z 2025-05-07T19:44:03.8739404Z 2025-05-07T19:44:03.8742466Z expat-2.7.0 | 137 KB | ########## | 100%  2025-05-07T19:44:03.8742739Z 2025-05-07T19:44:03.8742743Z 2025-05-07T19:44:03.8742746Z 2025-05-07T19:44:03.8742750Z 2025-05-07T19:44:03.8742753Z 2025-05-07T19:44:03.8742757Z 2025-05-07T19:44:03.8742765Z 2025-05-07T19:44:03.8758336Z expat-2.7.0 | 137 KB | ########## | 100%  2025-05-07T19:44:03.8758623Z 2025-05-07T19:44:03.8758627Z 2025-05-07T19:44:03.8758630Z 2025-05-07T19:44:03.8758634Z 2025-05-07T19:44:03.8758637Z 2025-05-07T19:44:03.8758641Z 2025-05-07T19:44:03.8758644Z 2025-05-07T19:44:03.8758648Z 2025-05-07T19:44:03.8758672Z 2025-05-07T19:44:03.8758676Z 2025-05-07T19:44:03.8758679Z 2025-05-07T19:44:03.8758688Z 2025-05-07T19:44:03.8758691Z 2025-05-07T19:44:03.8758695Z 2025-05-07T19:44:03.8758698Z 2025-05-07T19:44:03.8758702Z 2025-05-07T19:44:03.8758705Z 2025-05-07T19:44:03.8758709Z 2025-05-07T19:44:03.8758890Z 2025-05-07T19:44:03.8764014Z ... (more hidden) ... 2025-05-07T19:44:03.8764317Z 2025-05-07T19:44:03.8764320Z 2025-05-07T19:44:03.8764324Z 2025-05-07T19:44:03.8764327Z 2025-05-07T19:44:03.8764331Z 2025-05-07T19:44:03.8764334Z 2025-05-07T19:44:03.8764338Z 2025-05-07T19:44:03.8764341Z 2025-05-07T19:44:03.8764350Z 2025-05-07T19:44:03.8764353Z 2025-05-07T19:44:03.8764357Z 2025-05-07T19:44:03.8764360Z 2025-05-07T19:44:03.8764364Z 2025-05-07T19:44:03.8764368Z 2025-05-07T19:44:03.8764371Z 2025-05-07T19:44:03.8764375Z 2025-05-07T19:44:03.8764378Z 2025-05-07T19:44:03.8764386Z 2025-05-07T19:44:03.8764403Z 2025-05-07T19:44:03.8881276Z ... (more hidden) ... 2025-05-07T19:44:03.8881629Z 2025-05-07T19:44:03.8881634Z 2025-05-07T19:44:03.8881638Z 2025-05-07T19:44:03.8881641Z 2025-05-07T19:44:03.8881645Z 2025-05-07T19:44:03.8881648Z 2025-05-07T19:44:03.8881667Z 2025-05-07T19:44:03.8881670Z 2025-05-07T19:44:03.8881674Z 2025-05-07T19:44:03.8881678Z 2025-05-07T19:44:03.8881681Z 2025-05-07T19:44:03.8881685Z 2025-05-07T19:44:03.8881688Z 2025-05-07T19:44:03.8881692Z 2025-05-07T19:44:03.8881695Z 2025-05-07T19:44:03.8881699Z 2025-05-07T19:44:03.8881703Z 2025-05-07T19:44:03.8882229Z 2025-05-07T19:44:03.8900322Z libnsl-2.0.1 | 33 KB | ####9 | 49%  2025-05-07T19:44:03.8900676Z 2025-05-07T19:44:03.8900794Z 2025-05-07T19:44:03.8900802Z 2025-05-07T19:44:03.8900807Z 2025-05-07T19:44:03.8900812Z 2025-05-07T19:44:03.8900816Z 2025-05-07T19:44:03.8900821Z 2025-05-07T19:44:03.8900825Z 2025-05-07T19:44:03.8900830Z 2025-05-07T19:44:03.8900861Z 2025-05-07T19:44:03.8900866Z 2025-05-07T19:44:03.8901054Z 2025-05-07T19:44:03.8901059Z 2025-05-07T19:44:03.8901064Z 2025-05-07T19:44:03.8901068Z 2025-05-07T19:44:03.8901073Z 2025-05-07T19:44:03.8901077Z 2025-05-07T19:44:03.8901082Z 2025-05-07T19:44:03.9070225Z libnsl-2.0.1 | 33 KB | ########## | 100%  2025-05-07T19:44:03.9070575Z 2025-05-07T19:44:03.9070579Z 2025-05-07T19:44:03.9070595Z 2025-05-07T19:44:03.9070599Z 2025-05-07T19:44:03.9070602Z 2025-05-07T19:44:03.9070606Z 2025-05-07T19:44:03.9070609Z 2025-05-07T19:44:03.9070613Z 2025-05-07T19:44:03.9070630Z 2025-05-07T19:44:03.9071098Z 2025-05-07T19:44:03.9076020Z libxcrypt-4.4.36 | 98 KB | ########## | 100%  2025-05-07T19:44:03.9076336Z 2025-05-07T19:44:03.9076340Z 2025-05-07T19:44:03.9076343Z 2025-05-07T19:44:03.9076346Z 2025-05-07T19:44:03.9076350Z 2025-05-07T19:44:03.9076353Z 2025-05-07T19:44:03.9076357Z 2025-05-07T19:44:03.9076361Z 2025-05-07T19:44:03.9076365Z 2025-05-07T19:44:03.9076381Z 2025-05-07T19:44:03.9315287Z libxcrypt-4.4.36 | 98 KB | ########## | 100%  2025-05-07T19:44:03.9315642Z 2025-05-07T19:44:03.9315647Z 2025-05-07T19:44:03.9315651Z 2025-05-07T19:44:03.9315654Z 2025-05-07T19:44:03.9315658Z 2025-05-07T19:44:03.9315661Z 2025-05-07T19:44:03.9315665Z 2025-05-07T19:44:03.9315668Z 2025-05-07T19:44:03.9315672Z 2025-05-07T19:44:03.9315675Z 2025-05-07T19:44:03.9315679Z 2025-05-07T19:44:03.9315682Z 2025-05-07T19:44:03.9315686Z 2025-05-07T19:44:03.9315689Z 2025-05-07T19:44:03.9315693Z 2025-05-07T19:44:03.9315919Z 2025-05-07T19:44:03.9315939Z 2025-05-07T19:44:03.9331480Z libuuid-2.38.1 | 33 KB | ####8 | 49%  2025-05-07T19:44:03.9331837Z 2025-05-07T19:44:03.9331842Z 2025-05-07T19:44:03.9331846Z 2025-05-07T19:44:03.9331850Z 2025-05-07T19:44:03.9331879Z 2025-05-07T19:44:03.9331883Z 2025-05-07T19:44:03.9331886Z 2025-05-07T19:44:03.9331890Z 2025-05-07T19:44:03.9331906Z 2025-05-07T19:44:03.9331909Z 2025-05-07T19:44:03.9331913Z 2025-05-07T19:44:03.9331916Z 2025-05-07T19:44:03.9331920Z 2025-05-07T19:44:03.9331924Z 2025-05-07T19:44:03.9331927Z 2025-05-07T19:44:03.9331931Z 2025-05-07T19:44:03.9331937Z 2025-05-07T19:44:03.9441030Z libuuid-2.38.1 | 33 KB | ########## | 100%  2025-05-07T19:44:04.0350326Z python-3.12.2 | 30.8 MB | ######### | 90% 2025-05-07T19:44:04.0350638Z 2025-05-07T19:44:04.0407353Z openssl-3.5.0 | 3.0 MB | ########## | 100%  2025-05-07T19:44:04.0407648Z 2025-05-07T19:44:04.0407654Z 2025-05-07T19:44:04.0407657Z 2025-05-07T19:44:04.0407661Z 2025-05-07T19:44:04.0407664Z 2025-05-07T19:44:04.0407668Z 2025-05-07T19:44:04.0407671Z 2025-05-07T19:44:04.0407675Z 2025-05-07T19:44:04.0407678Z 2025-05-07T19:44:04.0407986Z pycparser-2.22 | 108 KB | ########## | 100%  2025-05-07T19:44:04.0408295Z 2025-05-07T19:44:04.0408314Z 2025-05-07T19:44:04.0408318Z 2025-05-07T19:44:04.0408322Z 2025-05-07T19:44:04.0408325Z 2025-05-07T19:44:04.0408329Z 2025-05-07T19:44:04.0408332Z 2025-05-07T19:44:04.0408335Z 2025-05-07T19:44:04.0408339Z 2025-05-07T19:44:04.0493510Z pycparser-2.22 | 108 KB | ########## | 100%  2025-05-07T19:44:04.0493850Z 2025-05-07T19:44:04.0493855Z 2025-05-07T19:44:04.0493858Z 2025-05-07T19:44:04.0493862Z 2025-05-07T19:44:04.0493865Z 2025-05-07T19:44:04.0493869Z 2025-05-07T19:44:04.0493899Z 2025-05-07T19:44:04.0493903Z 2025-05-07T19:44:04.0493926Z 2025-05-07T19:44:04.0493963Z 2025-05-07T19:44:04.0493967Z 2025-05-07T19:44:04.0493970Z 2025-05-07T19:44:04.0493974Z 2025-05-07T19:44:04.0494275Z libexpat-2.7.0 | 73 KB | ########## | 100%  2025-05-07T19:44:04.0494590Z 2025-05-07T19:44:04.0494594Z 2025-05-07T19:44:04.0494597Z 2025-05-07T19:44:04.0494625Z 2025-05-07T19:44:04.0494628Z 2025-05-07T19:44:04.0494839Z 2025-05-07T19:44:04.0494843Z 2025-05-07T19:44:04.0494846Z 2025-05-07T19:44:04.0494849Z 2025-05-07T19:44:04.0494853Z 2025-05-07T19:44:04.0494856Z 2025-05-07T19:44:04.0494860Z 2025-05-07T19:44:04.0494867Z 2025-05-07T19:44:04.0556043Z libexpat-2.7.0 | 73 KB | ########## | 100%  2025-05-07T19:44:04.0556419Z 2025-05-07T19:44:04.0556424Z 2025-05-07T19:44:04.0556428Z 2025-05-07T19:44:04.0556431Z 2025-05-07T19:44:04.0556435Z 2025-05-07T19:44:04.0556439Z 2025-05-07T19:44:04.0556442Z 2025-05-07T19:44:04.0556446Z 2025-05-07T19:44:04.0556467Z 2025-05-07T19:44:04.0556471Z 2025-05-07T19:44:04.0556474Z 2025-05-07T19:44:04.0556478Z 2025-05-07T19:44:04.0556481Z 2025-05-07T19:44:04.0556485Z 2025-05-07T19:44:04.0558227Z libzlib-1.2.13 | 60 KB | ########## | 100%  2025-05-07T19:44:04.0558541Z 2025-05-07T19:44:04.0558545Z 2025-05-07T19:44:04.0558548Z 2025-05-07T19:44:04.0558552Z 2025-05-07T19:44:04.0558563Z 2025-05-07T19:44:04.0558566Z 2025-05-07T19:44:04.0558570Z 2025-05-07T19:44:04.0558573Z 2025-05-07T19:44:04.0558577Z 2025-05-07T19:44:04.0558580Z 2025-05-07T19:44:04.0558584Z 2025-05-07T19:44:04.0558592Z 2025-05-07T19:44:04.0558595Z 2025-05-07T19:44:04.0558599Z 2025-05-07T19:44:04.0621199Z libzlib-1.2.13 | 60 KB | ########## | 100%  2025-05-07T19:44:04.0621538Z 2025-05-07T19:44:04.0621543Z 2025-05-07T19:44:04.0621547Z 2025-05-07T19:44:04.0621550Z 2025-05-07T19:44:04.0621554Z 2025-05-07T19:44:04.0621557Z 2025-05-07T19:44:04.0621748Z 2025-05-07T19:44:04.0621769Z 2025-05-07T19:44:04.0621773Z 2025-05-07T19:44:04.0621776Z 2025-05-07T19:44:04.0621780Z 2025-05-07T19:44:04.0621783Z 2025-05-07T19:44:04.0622102Z typing-extensions-4. | 88 KB | ########## | 100%  2025-05-07T19:44:04.0622435Z 2025-05-07T19:44:04.0622446Z 2025-05-07T19:44:04.0622450Z 2025-05-07T19:44:04.0622453Z 2025-05-07T19:44:04.0622475Z 2025-05-07T19:44:04.0622479Z 2025-05-07T19:44:04.0622482Z 2025-05-07T19:44:04.0622486Z 2025-05-07T19:44:04.0622489Z 2025-05-07T19:44:04.0622493Z 2025-05-07T19:44:04.0622496Z 2025-05-07T19:44:04.0622500Z 2025-05-07T19:44:04.0789256Z typing-extensions-4. | 88 KB | ########## | 100%  2025-05-07T19:44:04.0789641Z 2025-05-07T19:44:04.0789645Z 2025-05-07T19:44:04.0789649Z 2025-05-07T19:44:04.0789652Z 2025-05-07T19:44:04.0789656Z 2025-05-07T19:44:04.0789660Z 2025-05-07T19:44:04.0789663Z 2025-05-07T19:44:04.0789667Z 2025-05-07T19:44:04.0789683Z 2025-05-07T19:44:04.0789686Z 2025-05-07T19:44:04.0789690Z 2025-05-07T19:44:04.0789934Z zlib-1.2.13 | 91 KB | ########## | 100%  2025-05-07T19:44:04.0790229Z 2025-05-07T19:44:04.0790232Z 2025-05-07T19:44:04.0790236Z 2025-05-07T19:44:04.0790239Z 2025-05-07T19:44:04.0790243Z 2025-05-07T19:44:04.0790246Z 2025-05-07T19:44:04.0790250Z 2025-05-07T19:44:04.0790253Z 2025-05-07T19:44:04.0790262Z 2025-05-07T19:44:04.0790265Z 2025-05-07T19:44:04.0790272Z 2025-05-07T19:44:04.0797334Z zlib-1.2.13 | 91 KB | ########## | 100%  2025-05-07T19:44:04.0797621Z 2025-05-07T19:44:04.0797624Z 2025-05-07T19:44:04.0797628Z 2025-05-07T19:44:04.0797631Z 2025-05-07T19:44:04.0797635Z 2025-05-07T19:44:04.0797646Z 2025-05-07T19:44:04.0797649Z 2025-05-07T19:44:04.0797653Z 2025-05-07T19:44:04.0797656Z 2025-05-07T19:44:04.0797660Z 2025-05-07T19:44:04.0797663Z 2025-05-07T19:44:04.0797667Z 2025-05-07T19:44:04.0797670Z 2025-05-07T19:44:04.0797680Z 2025-05-07T19:44:04.0797684Z 2025-05-07T19:44:04.0804681Z typing_extensions-4. | 51 KB | ########## | 100%  2025-05-07T19:44:04.0805062Z 2025-05-07T19:44:04.0805066Z 2025-05-07T19:44:04.0805071Z 2025-05-07T19:44:04.0805075Z 2025-05-07T19:44:04.0805080Z 2025-05-07T19:44:04.0805090Z 2025-05-07T19:44:04.0805095Z 2025-05-07T19:44:04.0805099Z 2025-05-07T19:44:04.0805285Z 2025-05-07T19:44:04.0805289Z 2025-05-07T19:44:04.0805293Z 2025-05-07T19:44:04.0805296Z 2025-05-07T19:44:04.0805300Z 2025-05-07T19:44:04.0805303Z 2025-05-07T19:44:04.0805306Z 2025-05-07T19:44:04.0831655Z typing_extensions-4. | 51 KB | ########## | 100%  2025-05-07T19:44:04.0832022Z 2025-05-07T19:44:04.0832039Z 2025-05-07T19:44:04.0833776Z cryptography-44.0.3 | 1.5 MB | ########## | 100%  2025-05-07T19:44:04.0834057Z 2025-05-07T19:44:04.0834064Z 2025-05-07T19:44:04.0953423Z cryptography-44.0.3 | 1.5 MB | ########## | 100%  2025-05-07T19:44:04.0953728Z 2025-05-07T19:44:04.0953733Z 2025-05-07T19:44:04.0953736Z 2025-05-07T19:44:04.0953740Z 2025-05-07T19:44:04.0953744Z 2025-05-07T19:44:04.0953747Z 2025-05-07T19:44:04.0953751Z 2025-05-07T19:44:04.0953754Z 2025-05-07T19:44:04.0953758Z 2025-05-07T19:44:04.0953761Z 2025-05-07T19:44:04.0953765Z 2025-05-07T19:44:04.0953768Z 2025-05-07T19:44:04.0953772Z 2025-05-07T19:44:04.0953782Z 2025-05-07T19:44:04.0953799Z 2025-05-07T19:44:04.0953802Z 2025-05-07T19:44:04.0953806Z 2025-05-07T19:44:04.0953809Z 2025-05-07T19:44:04.0953813Z 2025-05-07T19:44:04.1098753Z ... (more hidden) ... 2025-05-07T19:44:04.1099084Z 2025-05-07T19:44:04.1099089Z 2025-05-07T19:44:04.1099093Z 2025-05-07T19:44:04.1099096Z 2025-05-07T19:44:04.1099100Z 2025-05-07T19:44:04.1099103Z 2025-05-07T19:44:04.1099107Z 2025-05-07T19:44:04.1099110Z 2025-05-07T19:44:04.1099114Z 2025-05-07T19:44:04.1099117Z 2025-05-07T19:44:04.1099293Z 2025-05-07T19:44:04.1099314Z 2025-05-07T19:44:04.1099318Z 2025-05-07T19:44:04.1099321Z 2025-05-07T19:44:04.1099325Z 2025-05-07T19:44:04.1099328Z 2025-05-07T19:44:04.1099332Z 2025-05-07T19:44:04.1099339Z 2025-05-07T19:44:04.1105271Z libnsl-2.0.1 | 33 KB | ########## | 100%  2025-05-07T19:44:04.1105598Z 2025-05-07T19:44:04.1105602Z 2025-05-07T19:44:04.1105613Z 2025-05-07T19:44:04.1105616Z 2025-05-07T19:44:04.1105619Z 2025-05-07T19:44:04.1105623Z 2025-05-07T19:44:04.1105626Z 2025-05-07T19:44:04.1105636Z 2025-05-07T19:44:04.1105640Z 2025-05-07T19:44:04.1105643Z 2025-05-07T19:44:04.1105647Z 2025-05-07T19:44:04.1105650Z 2025-05-07T19:44:04.1105654Z 2025-05-07T19:44:04.1105657Z 2025-05-07T19:44:04.1105661Z 2025-05-07T19:44:04.1105664Z 2025-05-07T19:44:04.1105668Z 2025-05-07T19:44:04.1105671Z 2025-05-07T19:44:04.1150607Z libnsl-2.0.1 | 33 KB | ########## | 100%  2025-05-07T19:44:04.1150960Z 2025-05-07T19:44:04.1150965Z 2025-05-07T19:44:04.1150969Z 2025-05-07T19:44:04.1150973Z 2025-05-07T19:44:04.1150976Z 2025-05-07T19:44:04.1150980Z 2025-05-07T19:44:04.1150983Z 2025-05-07T19:44:04.1150987Z 2025-05-07T19:44:04.1150990Z 2025-05-07T19:44:04.1151014Z 2025-05-07T19:44:04.1151018Z 2025-05-07T19:44:04.1151022Z 2025-05-07T19:44:04.1151025Z 2025-05-07T19:44:04.1151034Z 2025-05-07T19:44:04.1151037Z 2025-05-07T19:44:04.1151041Z 2025-05-07T19:44:04.1151044Z 2025-05-07T19:44:04.1151448Z libuuid-2.38.1 | 33 KB | ########## | 100%  2025-05-07T19:44:04.1151777Z 2025-05-07T19:44:04.1151780Z 2025-05-07T19:44:04.1151807Z 2025-05-07T19:44:04.1151810Z 2025-05-07T19:44:04.1151814Z 2025-05-07T19:44:04.1151817Z 2025-05-07T19:44:04.1151821Z 2025-05-07T19:44:04.1151824Z 2025-05-07T19:44:04.1151828Z 2025-05-07T19:44:04.1151831Z 2025-05-07T19:44:04.1151835Z 2025-05-07T19:44:04.1151843Z 2025-05-07T19:44:04.1151846Z 2025-05-07T19:44:04.1151850Z 2025-05-07T19:44:04.1151853Z 2025-05-07T19:44:04.1151857Z 2025-05-07T19:44:04.1151860Z 2025-05-07T19:44:04.1192708Z libuuid-2.38.1 | 33 KB | ########## | 100%  2025-05-07T19:44:04.1193107Z 2025-05-07T19:44:04.1193112Z 2025-05-07T19:44:04.1193116Z 2025-05-07T19:44:04.1193119Z 2025-05-07T19:44:04.1193287Z 2025-05-07T19:44:04.1193290Z 2025-05-07T19:44:04.1193294Z 2025-05-07T19:44:04.1193297Z 2025-05-07T19:44:04.1193301Z 2025-05-07T19:44:04.1193304Z 2025-05-07T19:44:04.1193308Z 2025-05-07T19:44:04.1193311Z 2025-05-07T19:44:04.1193315Z 2025-05-07T19:44:04.1193318Z 2025-05-07T19:44:04.1193346Z 2025-05-07T19:44:04.1193349Z 2025-05-07T19:44:04.1193666Z libgcc-ng-15.1.0 | 34 KB | ########## | 100%  2025-05-07T19:44:04.1193988Z 2025-05-07T19:44:04.1193992Z 2025-05-07T19:44:04.1193995Z 2025-05-07T19:44:04.1193999Z 2025-05-07T19:44:04.1194008Z 2025-05-07T19:44:04.1194012Z 2025-05-07T19:44:04.1194015Z 2025-05-07T19:44:04.1194041Z 2025-05-07T19:44:04.1194045Z 2025-05-07T19:44:04.1194048Z 2025-05-07T19:44:04.1194051Z 2025-05-07T19:44:04.1194055Z 2025-05-07T19:44:04.1194059Z 2025-05-07T19:44:04.1194062Z 2025-05-07T19:44:04.1194066Z 2025-05-07T19:44:04.1194069Z 2025-05-07T19:44:04.1668651Z libgcc-ng-15.1.0 | 34 KB | ########## | 100%  2025-05-07T19:44:04.7825518Z python-3.12.2 | 30.8 MB | ########## | 100% 2025-05-07T19:44:04.7835148Z python-3.12.2 | 30.8 MB | ########## | 100% 2025-05-07T19:44:04.7835543Z 2025-05-07T19:44:04.7835547Z 2025-05-07T19:44:04.7835551Z 2025-05-07T19:44:04.7835555Z 2025-05-07T19:44:04.7835559Z 2025-05-07T19:44:04.7835562Z 2025-05-07T19:44:04.7835566Z 2025-05-07T19:44:04.7835570Z 2025-05-07T19:44:04.7835573Z 2025-05-07T19:44:04.7835577Z 2025-05-07T19:44:04.7835581Z 2025-05-07T19:44:04.7835584Z 2025-05-07T19:44:04.7835859Z 2025-05-07T19:44:04.7835864Z 2025-05-07T19:44:04.7835900Z 2025-05-07T19:44:04.7835904Z 2025-05-07T19:44:04.7835908Z 2025-05-07T19:44:04.7835911Z 2025-05-07T19:44:04.7835915Z 2025-05-07T19:44:04.7836024Z 2025-05-07T19:44:04.7836473Z  2025-05-07T19:44:04.7836866Z 2025-05-07T19:44:04.7837096Z 2025-05-07T19:44:04.7837303Z  2025-05-07T19:44:04.7837525Z 2025-05-07T19:44:04.7837529Z 2025-05-07T19:44:04.7837707Z  2025-05-07T19:44:04.7837960Z 2025-05-07T19:44:04.7837964Z 2025-05-07T19:44:04.7837968Z 2025-05-07T19:44:04.7838150Z  2025-05-07T19:44:04.7838376Z 2025-05-07T19:44:04.7838380Z 2025-05-07T19:44:04.7838383Z 2025-05-07T19:44:04.7838387Z 2025-05-07T19:44:04.7838600Z  2025-05-07T19:44:04.7838831Z 2025-05-07T19:44:04.7838834Z 2025-05-07T19:44:04.7838838Z 2025-05-07T19:44:04.7838841Z 2025-05-07T19:44:04.7838845Z 2025-05-07T19:44:04.7839057Z  2025-05-07T19:44:04.7839292Z 2025-05-07T19:44:04.7839296Z 2025-05-07T19:44:04.7839299Z 2025-05-07T19:44:04.7839303Z 2025-05-07T19:44:04.7839312Z 2025-05-07T19:44:04.7839315Z 2025-05-07T19:44:04.7839510Z  2025-05-07T19:44:04.7839768Z 2025-05-07T19:44:04.7839772Z 2025-05-07T19:44:04.7839776Z 2025-05-07T19:44:04.7839780Z 2025-05-07T19:44:04.7839783Z 2025-05-07T19:44:04.7839787Z 2025-05-07T19:44:04.7839790Z 2025-05-07T19:44:04.7839986Z  2025-05-07T19:44:04.7840246Z 2025-05-07T19:44:04.7840249Z 2025-05-07T19:44:04.7840252Z 2025-05-07T19:44:04.7840256Z 2025-05-07T19:44:04.7840263Z 2025-05-07T19:44:04.7840267Z 2025-05-07T19:44:04.7840271Z 2025-05-07T19:44:04.7840274Z 2025-05-07T19:44:04.7840470Z  2025-05-07T19:44:04.7840712Z 2025-05-07T19:44:04.7840741Z 2025-05-07T19:44:04.7840744Z 2025-05-07T19:44:04.7840747Z 2025-05-07T19:44:04.7840751Z 2025-05-07T19:44:04.7840754Z 2025-05-07T19:44:04.7840865Z 2025-05-07T19:44:04.7840869Z 2025-05-07T19:44:04.7840873Z 2025-05-07T19:44:04.7841078Z  2025-05-07T19:44:04.7841318Z 2025-05-07T19:44:04.7841321Z 2025-05-07T19:44:04.7841352Z 2025-05-07T19:44:04.7841355Z 2025-05-07T19:44:04.7841359Z 2025-05-07T19:44:04.7841362Z 2025-05-07T19:44:04.7841366Z 2025-05-07T19:44:04.7841369Z 2025-05-07T19:44:04.7841373Z 2025-05-07T19:44:04.7841377Z 2025-05-07T19:44:04.7841580Z  2025-05-07T19:44:04.7841827Z 2025-05-07T19:44:04.7841831Z 2025-05-07T19:44:04.7841859Z 2025-05-07T19:44:04.7841862Z 2025-05-07T19:44:04.7841866Z 2025-05-07T19:44:04.7841869Z 2025-05-07T19:44:04.7841872Z 2025-05-07T19:44:04.7841876Z 2025-05-07T19:44:04.7841879Z 2025-05-07T19:44:04.7841883Z 2025-05-07T19:44:04.7841886Z 2025-05-07T19:44:04.7842097Z  2025-05-07T19:44:04.7842350Z 2025-05-07T19:44:04.7842380Z 2025-05-07T19:44:04.7842384Z 2025-05-07T19:44:04.7842387Z 2025-05-07T19:44:04.7842391Z 2025-05-07T19:44:04.7842394Z 2025-05-07T19:44:04.7842398Z 2025-05-07T19:44:04.7842401Z 2025-05-07T19:44:04.7842404Z 2025-05-07T19:44:04.7842408Z 2025-05-07T19:44:04.7842411Z 2025-05-07T19:44:04.7842415Z 2025-05-07T19:44:04.7842685Z  2025-05-07T19:44:04.7842938Z 2025-05-07T19:44:04.7842942Z 2025-05-07T19:44:04.7842945Z 2025-05-07T19:44:04.7843020Z 2025-05-07T19:44:04.7843024Z 2025-05-07T19:44:04.7843028Z 2025-05-07T19:44:04.7843031Z 2025-05-07T19:44:04.7843034Z 2025-05-07T19:44:04.7843038Z 2025-05-07T19:44:04.7843042Z 2025-05-07T19:44:04.7843045Z 2025-05-07T19:44:04.7843049Z 2025-05-07T19:44:04.7843052Z 2025-05-07T19:44:04.7843295Z  2025-05-07T19:44:04.7843551Z 2025-05-07T19:44:04.7843554Z 2025-05-07T19:44:04.7843558Z 2025-05-07T19:44:04.7843561Z 2025-05-07T19:44:04.7843565Z 2025-05-07T19:44:04.7843569Z 2025-05-07T19:44:04.7843572Z 2025-05-07T19:44:04.7843575Z 2025-05-07T19:44:04.7843579Z 2025-05-07T19:44:04.7843582Z 2025-05-07T19:44:04.7843585Z 2025-05-07T19:44:04.7843589Z 2025-05-07T19:44:04.7843592Z 2025-05-07T19:44:04.7843621Z 2025-05-07T19:44:04.7843842Z  2025-05-07T19:44:04.7844096Z 2025-05-07T19:44:04.7844100Z 2025-05-07T19:44:04.7844108Z 2025-05-07T19:44:04.7844112Z 2025-05-07T19:44:04.7844115Z 2025-05-07T19:44:04.7844118Z 2025-05-07T19:44:04.7844122Z 2025-05-07T19:44:04.7844125Z 2025-05-07T19:44:04.7844155Z 2025-05-07T19:44:04.7844158Z 2025-05-07T19:44:04.7844161Z 2025-05-07T19:44:04.7844165Z 2025-05-07T19:44:04.7844169Z 2025-05-07T19:44:04.7844172Z 2025-05-07T19:44:04.7844176Z 2025-05-07T19:44:04.7844407Z  2025-05-07T19:44:04.7844676Z 2025-05-07T19:44:04.7844679Z 2025-05-07T19:44:04.7844683Z 2025-05-07T19:44:04.7844717Z 2025-05-07T19:44:04.7844720Z 2025-05-07T19:44:04.7844725Z 2025-05-07T19:44:04.7844728Z 2025-05-07T19:44:04.7844731Z 2025-05-07T19:44:04.7844735Z 2025-05-07T19:44:04.7844738Z 2025-05-07T19:44:04.7844741Z 2025-05-07T19:44:04.7844745Z 2025-05-07T19:44:04.7844749Z 2025-05-07T19:44:04.7844752Z 2025-05-07T19:44:04.7844755Z 2025-05-07T19:44:04.7844759Z 2025-05-07T19:44:04.7844992Z  2025-05-07T19:44:04.7845287Z 2025-05-07T19:44:04.7845290Z 2025-05-07T19:44:04.7845293Z 2025-05-07T19:44:04.7845297Z 2025-05-07T19:44:04.7845300Z 2025-05-07T19:44:04.7845304Z 2025-05-07T19:44:04.7845307Z 2025-05-07T19:44:04.7845311Z 2025-05-07T19:44:04.7845314Z 2025-05-07T19:44:04.7845317Z 2025-05-07T19:44:04.7845321Z 2025-05-07T19:44:04.7845387Z 2025-05-07T19:44:04.7845391Z 2025-05-07T19:44:04.7845395Z 2025-05-07T19:44:04.7845399Z 2025-05-07T19:44:04.7845402Z 2025-05-07T19:44:04.7845406Z 2025-05-07T19:44:04.7845670Z  2025-05-07T19:44:04.7845933Z 2025-05-07T19:44:04.7845936Z 2025-05-07T19:44:04.7845939Z 2025-05-07T19:44:04.7845943Z 2025-05-07T19:44:04.7845946Z 2025-05-07T19:44:04.7845950Z 2025-05-07T19:44:04.7845953Z 2025-05-07T19:44:04.7845957Z 2025-05-07T19:44:04.7845960Z 2025-05-07T19:44:04.7845967Z 2025-05-07T19:44:04.7845971Z 2025-05-07T19:44:04.7845975Z 2025-05-07T19:44:04.7845978Z 2025-05-07T19:44:04.7846005Z 2025-05-07T19:44:04.7846009Z 2025-05-07T19:44:04.7846012Z 2025-05-07T19:44:04.7846016Z 2025-05-07T19:44:04.7846019Z 2025-05-07T19:44:04.7846257Z  2025-05-07T19:44:04.7846527Z 2025-05-07T19:44:04.7846650Z done 2025-05-07T19:44:04.8850271Z Preparing transaction: - done 2025-05-07T19:44:05.6674594Z Verifying transaction: | / - \ | / - done 2025-05-07T19:44:07.1713702Z Executing transaction: | / - \ | / - \ | / - \ | / - done 2025-05-07T19:44:07.3794442Z [SETUP] Testing pyOpenSSL import ... 2025-05-07T19:44:09.0868131Z [CHECK] Python (sub-)package 'OpenSSL' found ... 2025-05-07T19:44:09.0877593Z [SETUP] Installing libxcrypt ... 2025-05-07T19:44:09.0904850Z [EXEC] [ATTEMPT 0/3] + conda install -n build_binary -c conda-forge --override-channels -y libxcrypt 2025-05-07T19:44:09.7570536Z Channels: 2025-05-07T19:44:09.7570943Z - conda-forge 2025-05-07T19:44:09.7571248Z Platform: linux-64 2025-05-07T19:44:12.8471683Z Collecting package metadata (repodata.json): - \ | / done 2025-05-07T19:44:13.2833523Z Solving environment: \ | done 2025-05-07T19:44:13.3055273Z 2025-05-07T19:44:13.3055983Z # All requested packages already installed. 2025-05-07T19:44:13.3056775Z 2025-05-07T19:44:16.5769787Z [SETUP] Copying over ... 2025-05-07T19:44:16.5771416Z + cp /github/home/miniconda/envs/build_binary/include/crypt.h /github/home/miniconda/envs/build_binary/include/python3.12/crypt.h 2025-05-07T19:44:16.5772162Z 2025-05-07T19:44:16.5806965Z 2025-05-07T19:44:18.1957923Z [SETUP] Installed Python version: Python 3.12.2 2025-05-07T19:44:18.1958843Z [SETUP] Successfully created Conda environment: build_binary 2025-05-07T19:44:18.2040619Z ##[group]Run . $PRELUDE; install_cxx_compiler $BUILD_ENV clang 2025-05-07T19:44:18.2041167Z . $PRELUDE; install_cxx_compiler $BUILD_ENV clang 2025-05-07T19:44:18.2041844Z shell: bash --noprofile --norc -e -o pipefail {0} 2025-05-07T19:44:18.2042227Z env: 2025-05-07T19:44:18.2042477Z PRELUDE: .github/scripts/setup_env.bash 2025-05-07T19:44:18.2042834Z BUILD_ENV: build_binary 2025-05-07T19:44:18.2043107Z BUILD_TARGET: default 2025-05-07T19:44:18.2043386Z BUILD_VARIANT: cuda 2025-05-07T19:44:18.2043665Z BUILD_CUDA_VERSION: 12.6.3 2025-05-07T19:44:18.2043967Z ##[endgroup] 2025-05-07T19:44:18.6632128Z ################################################################################ 2025-05-07T19:44:18.6633225Z # Install C/C++ Compilers 2025-05-07T19:44:18.6633581Z # 2025-05-07T19:44:18.6652220Z # [2025-05-07T19:44:18.664Z] + install_cxx_compiler build_binary clang 2025-05-07T19:44:18.6652920Z ################################################################################ 2025-05-07T19:44:18.6653170Z 2025-05-07T19:44:18.6677412Z [EXEC] [ATTEMPT 0/3] + wget -q --timeout 1 pypi.org -O /dev/null 2025-05-07T19:44:18.7549202Z [CHECK] Network does not appear to be blocked. 2025-05-07T19:44:18.7553303Z [INSTALL] Installing GLIBC (architecture = 64) ... 2025-05-07T19:44:18.7580346Z [EXEC] [ATTEMPT 0/3] + conda install -n build_binary -c conda-forge --override-channels -y sysroot_linux-64=2.17 2025-05-07T19:44:19.4299954Z Channels: 2025-05-07T19:44:19.4300281Z - conda-forge 2025-05-07T19:44:19.4300589Z Platform: linux-64 2025-05-07T19:44:22.4767172Z Collecting package metadata (repodata.json): - \ | / done 2025-05-07T19:44:22.9062711Z Solving environment: \ | done 2025-05-07T19:44:22.9538575Z 2025-05-07T19:44:22.9539085Z ## Package Plan ## 2025-05-07T19:44:22.9539355Z 2025-05-07T19:44:22.9539623Z environment location: /github/home/miniconda/envs/build_binary 2025-05-07T19:44:22.9539974Z 2025-05-07T19:44:22.9540123Z added / updated specs: 2025-05-07T19:44:22.9540480Z - sysroot_linux-64=2.17 2025-05-07T19:44:22.9540699Z 2025-05-07T19:44:22.9540704Z 2025-05-07T19:44:22.9540851Z The following packages will be downloaded: 2025-05-07T19:44:22.9541101Z 2025-05-07T19:44:22.9541238Z package | build 2025-05-07T19:44:22.9541638Z ---------------------------|----------------- 2025-05-07T19:44:22.9542150Z kernel-headers_linux-64-3.10.0| he073ed8_18 921 KB conda-forge 2025-05-07T19:44:22.9542698Z sysroot_linux-64-2.17 | h0157908_18 14.5 MB conda-forge 2025-05-07T19:44:22.9543215Z ------------------------------------------------------------ 2025-05-07T19:44:22.9543600Z Total: 15.4 MB 2025-05-07T19:44:22.9543873Z 2025-05-07T19:44:22.9544018Z The following NEW packages will be INSTALLED: 2025-05-07T19:44:22.9544269Z 2025-05-07T19:44:22.9544627Z kernel-headers_li~ conda-forge/noarch::kernel-headers_linux-64-3.10.0-he073ed8_18 2025-05-07T19:44:22.9545258Z sysroot_linux-64 conda-forge/noarch::sysroot_linux-64-2.17-h0157908_18 2025-05-07T19:44:22.9545601Z 2025-05-07T19:44:22.9545632Z 2025-05-07T19:44:22.9545636Z 2025-05-07T19:44:22.9545777Z Downloading and Extracting Packages: ...working... 2025-05-07T19:44:22.9568640Z sysroot_linux-64-2.1 | 14.5 MB | | 0% 2025-05-07T19:44:22.9568920Z 2025-05-07T19:44:23.1223514Z kernel-headers_linux | 921 KB | | 0%  2025-05-07T19:44:23.1933754Z sysroot_linux-64-2.1 | 14.5 MB | | 0% 2025-05-07T19:44:23.1934387Z 2025-05-07T19:44:23.2013530Z kernel-headers_linux | 921 KB | 1 | 2%  2025-05-07T19:44:23.2013948Z 2025-05-07T19:44:23.2235464Z kernel-headers_linux | 921 KB | ########## | 100%  2025-05-07T19:44:23.3246904Z sysroot_linux-64-2.1 | 14.5 MB | ###6 | 37% 2025-05-07T19:44:23.3939328Z sysroot_linux-64-2.1 | 14.5 MB | #########7 | 98% 2025-05-07T19:44:23.4029660Z sysroot_linux-64-2.1 | 14.5 MB | ########## | 100% 2025-05-07T19:44:23.4029972Z 2025-05-07T19:44:23.4030658Z kernel-headers_linux | 921 KB | ########## | 100%  2025-05-07T19:44:23.4030959Z 2025-05-07T19:44:23.8687272Z kernel-headers_linux | 921 KB | ########## | 100%  2025-05-07T19:44:23.8688280Z sysroot_linux-64-2.1 | 14.5 MB | ########## | 100% 2025-05-07T19:44:23.8688721Z 2025-05-07T19:44:23.8689024Z 2025-05-07T19:44:23.8689697Z  done 2025-05-07T19:44:23.9702605Z Preparing transaction: - done 2025-05-07T19:44:24.1722556Z Verifying transaction: | / done 2025-05-07T19:44:24.2732876Z Executing transaction: \ done 2025-05-07T19:44:24.3596152Z [CHECK] LD_LIBRARY_PATH = 2025-05-07T19:44:24.3596630Z [CHECK] CONDA_PREFIX is not set. 2025-05-07T19:44:26.0228435Z [CHECK] libstdc++.so.6 found in CONDA_PREFIX PATH (symbolic link): /github/home/miniconda/envs/build_binary/lib/libstdc++.so.6 2025-05-07T19:44:26.0235289Z [INSTALL] Installing GCC (11.4.0, 64) through Conda ... 2025-05-07T19:44:26.0260949Z [EXEC] [ATTEMPT 0/3] + conda install -n build_binary -c conda-forge --override-channels -y gxx_linux-64=11.4.0 2025-05-07T19:44:26.7195517Z Channels: 2025-05-07T19:44:26.7195815Z - conda-forge 2025-05-07T19:44:26.7196075Z Platform: linux-64 2025-05-07T19:44:29.7710866Z Collecting package metadata (repodata.json): - \ | / done 2025-05-07T19:44:30.9007104Z Solving environment: \ | / done 2025-05-07T19:44:30.9513076Z 2025-05-07T19:44:30.9513987Z ## Package Plan ## 2025-05-07T19:44:30.9514664Z 2025-05-07T19:44:30.9514911Z environment location: /github/home/miniconda/envs/build_binary 2025-05-07T19:44:30.9515280Z 2025-05-07T19:44:30.9515411Z added / updated specs: 2025-05-07T19:44:30.9515756Z - gxx_linux-64=11.4.0 2025-05-07T19:44:30.9515940Z 2025-05-07T19:44:30.9515996Z 2025-05-07T19:44:30.9516172Z The following packages will be downloaded: 2025-05-07T19:44:30.9516415Z 2025-05-07T19:44:30.9516577Z package | build 2025-05-07T19:44:30.9516977Z ---------------------------|----------------- 2025-05-07T19:44:30.9517470Z binutils_impl_linux-64-2.40| ha1999f0_7 6.0 MB conda-forge 2025-05-07T19:44:30.9518015Z binutils_linux-64-2.40 | hb3c18ed_4 28 KB conda-forge 2025-05-07T19:44:30.9518572Z gcc_impl_linux-64-11.4.0 | h00c12a0_13 53.0 MB conda-forge 2025-05-07T19:44:30.9519070Z gcc_linux-64-11.4.0 | ha077dfb_4 31 KB conda-forge 2025-05-07T19:44:30.9519603Z gxx_impl_linux-64-11.4.0 | h634f3ee_13 11.2 MB conda-forge 2025-05-07T19:44:30.9520089Z gxx_linux-64-11.4.0 | h35bfe5d_4 29 KB conda-forge 2025-05-07T19:44:30.9520604Z ld_impl_linux-64-2.40 | hf3520f5_7 691 KB conda-forge 2025-05-07T19:44:30.9521154Z libgcc-devel_linux-64-11.4.0| h8f596e0_113 2.3 MB conda-forge 2025-05-07T19:44:30.9521693Z libsanitizer-11.4.0 | h5763a12_13 3.5 MB conda-forge 2025-05-07T19:44:30.9522212Z libstdcxx-15.1.0 | h8f9b012_2 3.7 MB conda-forge 2025-05-07T19:44:30.9522741Z libstdcxx-devel_linux-64-11.4.0| h8f596e0_113 11.1 MB conda-forge 2025-05-07T19:44:30.9523304Z libstdcxx-ng-15.1.0 | h4852527_2 34 KB conda-forge 2025-05-07T19:44:30.9523760Z ------------------------------------------------------------ 2025-05-07T19:44:30.9524173Z Total: 91.6 MB 2025-05-07T19:44:30.9524417Z 2025-05-07T19:44:30.9524595Z The following NEW packages will be INSTALLED: 2025-05-07T19:44:30.9524847Z 2025-05-07T19:44:30.9525165Z binutils_impl_lin~ conda-forge/linux-64::binutils_impl_linux-64-2.40-ha1999f0_7 2025-05-07T19:44:30.9525836Z binutils_linux-64 conda-forge/linux-64::binutils_linux-64-2.40-hb3c18ed_4 2025-05-07T19:44:30.9526624Z gcc_impl_linux-64 conda-forge/linux-64::gcc_impl_linux-64-11.4.0-h00c12a0_13 2025-05-07T19:44:30.9527241Z gcc_linux-64 conda-forge/linux-64::gcc_linux-64-11.4.0-ha077dfb_4 2025-05-07T19:44:30.9527840Z gxx_impl_linux-64 conda-forge/linux-64::gxx_impl_linux-64-11.4.0-h634f3ee_13 2025-05-07T19:44:30.9528408Z gxx_linux-64 conda-forge/linux-64::gxx_linux-64-11.4.0-h35bfe5d_4 2025-05-07T19:44:30.9529034Z libgcc-devel_linu~ conda-forge/noarch::libgcc-devel_linux-64-11.4.0-h8f596e0_113 2025-05-07T19:44:30.9529663Z libsanitizer conda-forge/linux-64::libsanitizer-11.4.0-h5763a12_13 2025-05-07T19:44:30.9530260Z libstdcxx conda-forge/linux-64::libstdcxx-15.1.0-h8f9b012_2 2025-05-07T19:44:30.9530895Z libstdcxx-devel_l~ conda-forge/noarch::libstdcxx-devel_linux-64-11.4.0-h8f596e0_113 2025-05-07T19:44:30.9531301Z 2025-05-07T19:44:30.9531435Z The following packages will be UPDATED: 2025-05-07T19:44:30.9531696Z 2025-05-07T19:44:30.9532048Z ld_impl_linux-64 pkgs/main::ld_impl_linux-64-2.40-h12e~ --> conda-forge::ld_impl_linux-64-2.40-hf3520f5_7 2025-05-07T19:44:30.9532878Z libstdcxx-ng pkgs/main::libstdcxx-ng-11.2.0-h12345~ --> conda-forge::libstdcxx-ng-15.1.0-h4852527_2 2025-05-07T19:44:30.9533332Z 2025-05-07T19:44:30.9533336Z 2025-05-07T19:44:30.9533340Z 2025-05-07T19:44:30.9533501Z Downloading and Extracting Packages: ...working... 2025-05-07T19:44:30.9533948Z gcc_impl_linux-64-11 | 53.0 MB | | 0% 2025-05-07T19:44:30.9534204Z 2025-05-07T19:44:30.9534678Z gxx_impl_linux-64-11 | 11.2 MB | | 0%  2025-05-07T19:44:30.9534974Z 2025-05-07T19:44:30.9535071Z 2025-05-07T19:44:30.9535314Z libstdcxx-devel_linu | 11.1 MB | | 0%  2025-05-07T19:44:30.9535601Z 2025-05-07T19:44:30.9535604Z 2025-05-07T19:44:30.9535640Z 2025-05-07T19:44:30.9556484Z binutils_impl_linux- | 6.0 MB | | 0%  2025-05-07T19:44:30.9556806Z 2025-05-07T19:44:30.9556811Z 2025-05-07T19:44:30.9556815Z 2025-05-07T19:44:30.9556818Z 2025-05-07T19:44:30.9563343Z libstdcxx-15.1.0 | 3.7 MB | | 0%  2025-05-07T19:44:30.9564256Z 2025-05-07T19:44:30.9564275Z 2025-05-07T19:44:30.9564288Z 2025-05-07T19:44:30.9564300Z 2025-05-07T19:44:30.9599943Z 2025-05-07T19:44:30.9603329Z libsanitizer-11.4.0 | 3.5 MB | | 0%  2025-05-07T19:44:30.9603678Z 2025-05-07T19:44:30.9603684Z 2025-05-07T19:44:30.9603689Z 2025-05-07T19:44:30.9603695Z 2025-05-07T19:44:30.9603700Z 2025-05-07T19:44:30.9603713Z 2025-05-07T19:44:30.9608289Z libgcc-devel_linux-6 | 2.3 MB | | 0%  2025-05-07T19:44:30.9608637Z 2025-05-07T19:44:30.9608642Z 2025-05-07T19:44:30.9608647Z 2025-05-07T19:44:30.9608653Z 2025-05-07T19:44:30.9608656Z 2025-05-07T19:44:30.9608662Z 2025-05-07T19:44:30.9608669Z 2025-05-07T19:44:30.9609261Z ld_impl_linux-64-2.4 | 691 KB | | 0%  2025-05-07T19:44:30.9609576Z 2025-05-07T19:44:30.9609586Z 2025-05-07T19:44:30.9609591Z 2025-05-07T19:44:30.9609597Z 2025-05-07T19:44:30.9609617Z 2025-05-07T19:44:30.9609621Z 2025-05-07T19:44:30.9609624Z 2025-05-07T19:44:30.9609628Z 2025-05-07T19:44:30.9610492Z libstdcxx-ng-15.1.0 | 34 KB | | 0%  2025-05-07T19:44:30.9610825Z 2025-05-07T19:44:30.9610828Z 2025-05-07T19:44:30.9610832Z 2025-05-07T19:44:30.9610841Z 2025-05-07T19:44:30.9610845Z 2025-05-07T19:44:30.9610848Z 2025-05-07T19:44:30.9610851Z 2025-05-07T19:44:30.9610855Z 2025-05-07T19:44:30.9610858Z 2025-05-07T19:44:30.9611676Z gcc_linux-64-11.4.0 | 31 KB | | 0%  2025-05-07T19:44:30.9612000Z 2025-05-07T19:44:30.9612011Z 2025-05-07T19:44:30.9612015Z 2025-05-07T19:44:30.9612018Z 2025-05-07T19:44:30.9612021Z 2025-05-07T19:44:30.9612025Z 2025-05-07T19:44:30.9612028Z 2025-05-07T19:44:30.9612032Z 2025-05-07T19:44:30.9612035Z 2025-05-07T19:44:30.9612038Z 2025-05-07T19:44:30.9613334Z gxx_linux-64-11.4.0 | 29 KB | | 0%  2025-05-07T19:44:30.9615471Z 2025-05-07T19:44:30.9615479Z 2025-05-07T19:44:30.9615483Z 2025-05-07T19:44:30.9615487Z 2025-05-07T19:44:30.9615491Z 2025-05-07T19:44:30.9615495Z 2025-05-07T19:44:30.9615498Z 2025-05-07T19:44:30.9615501Z 2025-05-07T19:44:30.9615505Z 2025-05-07T19:44:30.9615508Z 2025-05-07T19:44:30.9615512Z 2025-05-07T19:44:31.0620334Z binutils_linux-64-2. | 28 KB | | 0%  2025-05-07T19:44:31.0816630Z 2025-05-07T19:44:31.0817238Z gxx_impl_linux-64-11 | 11.2 MB | 3 | 3%  2025-05-07T19:44:31.0817591Z 2025-05-07T19:44:31.0817602Z 2025-05-07T19:44:31.0817607Z 2025-05-07T19:44:31.0817882Z 2025-05-07T19:44:31.1874218Z libstdcxx-15.1.0 | 3.7 MB | 4 | 4%  2025-05-07T19:44:31.1875171Z 2025-05-07T19:44:31.1875186Z 2025-05-07T19:44:31.1875198Z 2025-05-07T19:44:31.1875210Z 2025-05-07T19:44:31.1893442Z libstdcxx-15.1.0 | 3.7 MB | 8 | 8%  2025-05-07T19:44:31.1894322Z 2025-05-07T19:44:31.2275150Z gxx_impl_linux-64-11 | 11.2 MB | 6 | 6%  2025-05-07T19:44:31.2275994Z 2025-05-07T19:44:31.2276009Z 2025-05-07T19:44:31.2430183Z libstdcxx-devel_linu | 11.1 MB | | 0%  2025-05-07T19:44:31.2431056Z 2025-05-07T19:44:31.2431070Z 2025-05-07T19:44:31.2431082Z 2025-05-07T19:44:31.2431092Z 2025-05-07T19:44:31.2456353Z libstdcxx-15.1.0 | 3.7 MB | ########## | 100%  2025-05-07T19:44:31.2457274Z 2025-05-07T19:44:31.2457289Z 2025-05-07T19:44:31.2457300Z 2025-05-07T19:44:31.2677960Z binutils_impl_linux- | 6.0 MB | | 0%  2025-05-07T19:44:31.2747719Z gcc_impl_linux-64-11 | 53.0 MB | | 0% 2025-05-07T19:44:31.2748611Z 2025-05-07T19:44:31.2748632Z 2025-05-07T19:44:31.2748636Z 2025-05-07T19:44:31.2748640Z 2025-05-07T19:44:31.2748643Z 2025-05-07T19:44:31.2894488Z libsanitizer-11.4.0 | 3.5 MB | | 0%  2025-05-07T19:44:31.2895589Z 2025-05-07T19:44:31.3202335Z gxx_impl_linux-64-11 | 11.2 MB | #####1 | 51%  2025-05-07T19:44:31.3203204Z 2025-05-07T19:44:31.3203218Z 2025-05-07T19:44:31.3203229Z 2025-05-07T19:44:31.3203239Z 2025-05-07T19:44:31.3203250Z 2025-05-07T19:44:31.3347596Z libsanitizer-11.4.0 | 3.5 MB | ########## | 100%  2025-05-07T19:44:31.3348540Z 2025-05-07T19:44:31.3348577Z 2025-05-07T19:44:31.3348588Z 2025-05-07T19:44:31.3375083Z binutils_impl_linux- | 6.0 MB | ########## | 100%  2025-05-07T19:44:31.3375978Z 2025-05-07T19:44:31.3375991Z 2025-05-07T19:44:31.3597503Z libstdcxx-devel_linu | 11.1 MB | ###2 | 33%  2025-05-07T19:44:31.3597888Z 2025-05-07T19:44:31.3597892Z 2025-05-07T19:44:31.3597896Z 2025-05-07T19:44:31.3597900Z 2025-05-07T19:44:31.3597903Z 2025-05-07T19:44:31.3597907Z 2025-05-07T19:44:31.3741158Z libgcc-devel_linux-6 | 2.3 MB | | 1%  2025-05-07T19:44:31.3742155Z 2025-05-07T19:44:31.3742168Z 2025-05-07T19:44:31.3742180Z 2025-05-07T19:44:31.3742190Z 2025-05-07T19:44:31.3742937Z libstdcxx-15.1.0 | 3.7 MB | ########## | 100%  2025-05-07T19:44:31.3743767Z 2025-05-07T19:44:31.3743778Z 2025-05-07T19:44:31.3743789Z 2025-05-07T19:44:31.3743800Z 2025-05-07T19:44:31.3823061Z libstdcxx-15.1.0 | 3.7 MB | ########## | 100%  2025-05-07T19:44:31.3823975Z 2025-05-07T19:44:31.3824017Z 2025-05-07T19:44:31.3824028Z 2025-05-07T19:44:31.3824039Z 2025-05-07T19:44:31.3824049Z 2025-05-07T19:44:31.3824060Z 2025-05-07T19:44:31.3824071Z 2025-05-07T19:44:31.3903503Z ld_impl_linux-64-2.4 | 691 KB | 2 | 2%  2025-05-07T19:44:31.3903999Z 2025-05-07T19:44:31.3904004Z 2025-05-07T19:44:31.3904027Z 2025-05-07T19:44:31.3904031Z 2025-05-07T19:44:31.3904034Z 2025-05-07T19:44:31.3904038Z 2025-05-07T19:44:31.3904041Z 2025-05-07T19:44:31.3933629Z ld_impl_linux-64-2.4 | 691 KB | ########## | 100%  2025-05-07T19:44:31.3934569Z 2025-05-07T19:44:31.4175505Z gxx_impl_linux-64-11 | 11.2 MB | #########9 | 99%  2025-05-07T19:44:31.4176376Z 2025-05-07T19:44:31.4176390Z 2025-05-07T19:44:31.4176401Z 2025-05-07T19:44:31.4176413Z 2025-05-07T19:44:31.4176424Z 2025-05-07T19:44:31.4176435Z 2025-05-07T19:44:31.4176445Z 2025-05-07T19:44:31.4394482Z ld_impl_linux-64-2.4 | 691 KB | ########## | 100%  2025-05-07T19:44:31.4395427Z 2025-05-07T19:44:31.4395440Z 2025-05-07T19:44:31.4480573Z libstdcxx-devel_linu | 11.1 MB | ######## | 80%  2025-05-07T19:44:31.4481078Z 2025-05-07T19:44:31.4532749Z gxx_impl_linux-64-11 | 11.2 MB | ########## | 100%  2025-05-07T19:44:31.4533086Z 2025-05-07T19:44:31.4533090Z 2025-05-07T19:44:31.4533114Z 2025-05-07T19:44:31.4533117Z 2025-05-07T19:44:31.4533121Z 2025-05-07T19:44:31.4533125Z 2025-05-07T19:44:31.4533128Z 2025-05-07T19:44:31.4533132Z 2025-05-07T19:44:31.4535283Z libstdcxx-ng-15.1.0 | 34 KB | ####7 | 47%  2025-05-07T19:44:31.4535619Z 2025-05-07T19:44:31.4535623Z 2025-05-07T19:44:31.4535636Z 2025-05-07T19:44:31.4535657Z 2025-05-07T19:44:31.4535826Z 2025-05-07T19:44:31.4540723Z libsanitizer-11.4.0 | 3.5 MB | ########## | 100%  2025-05-07T19:44:31.4541615Z 2025-05-07T19:44:31.4541626Z 2025-05-07T19:44:31.4541637Z 2025-05-07T19:44:31.4541647Z 2025-05-07T19:44:31.4541658Z 2025-05-07T19:44:31.4544727Z libsanitizer-11.4.0 | 3.5 MB | ########## | 100%  2025-05-07T19:44:31.4545021Z 2025-05-07T19:44:31.4545040Z 2025-05-07T19:44:31.4545044Z 2025-05-07T19:44:31.4545048Z 2025-05-07T19:44:31.4545274Z 2025-05-07T19:44:31.4545278Z 2025-05-07T19:44:31.4545281Z 2025-05-07T19:44:31.4545605Z 2025-05-07T19:44:31.4909121Z libstdcxx-ng-15.1.0 | 34 KB | ########## | 100%  2025-05-07T19:44:31.4909627Z 2025-05-07T19:44:31.4909632Z 2025-05-07T19:44:31.4909635Z 2025-05-07T19:44:31.4909639Z 2025-05-07T19:44:31.4909642Z 2025-05-07T19:44:31.4909646Z 2025-05-07T19:44:31.4909649Z 2025-05-07T19:44:31.4909670Z 2025-05-07T19:44:31.4952986Z libstdcxx-ng-15.1.0 | 34 KB | ########## | 100%  2025-05-07T19:44:31.4953980Z 2025-05-07T19:44:31.4953995Z 2025-05-07T19:44:31.4954007Z 2025-05-07T19:44:31.4954018Z 2025-05-07T19:44:31.4954028Z 2025-05-07T19:44:31.4954038Z 2025-05-07T19:44:31.4954049Z 2025-05-07T19:44:31.4954082Z 2025-05-07T19:44:31.4954092Z 2025-05-07T19:44:31.4961890Z gcc_linux-64-11.4.0 | 31 KB | #####2 | 52%  2025-05-07T19:44:31.4962784Z 2025-05-07T19:44:31.4962795Z 2025-05-07T19:44:31.4962826Z 2025-05-07T19:44:31.4962836Z 2025-05-07T19:44:31.4962846Z 2025-05-07T19:44:31.4962856Z 2025-05-07T19:44:31.4962887Z 2025-05-07T19:44:31.4962897Z 2025-05-07T19:44:31.4962907Z 2025-05-07T19:44:31.4997694Z gcc_linux-64-11.4.0 | 31 KB | ########## | 100%  2025-05-07T19:44:31.5246892Z gcc_impl_linux-64-11 | 53.0 MB | #4 | 15% 2025-05-07T19:44:31.5247700Z 2025-05-07T19:44:31.5247715Z 2025-05-07T19:44:31.5247729Z 2025-05-07T19:44:31.5247773Z 2025-05-07T19:44:31.5247785Z 2025-05-07T19:44:31.5247795Z 2025-05-07T19:44:31.5247806Z 2025-05-07T19:44:31.5247816Z 2025-05-07T19:44:31.5247827Z 2025-05-07T19:44:31.5265959Z gcc_linux-64-11.4.0 | 31 KB | ########## | 100%  2025-05-07T19:44:31.5267205Z 2025-05-07T19:44:31.5267220Z 2025-05-07T19:44:31.5267232Z 2025-05-07T19:44:31.5267242Z 2025-05-07T19:44:31.5267253Z 2025-05-07T19:44:31.5267263Z 2025-05-07T19:44:31.5267274Z 2025-05-07T19:44:31.5267284Z 2025-05-07T19:44:31.5267327Z 2025-05-07T19:44:31.5267338Z 2025-05-07T19:44:31.5267348Z 2025-05-07T19:44:31.5272154Z binutils_linux-64-2. | 28 KB | #####6 | 56%  2025-05-07T19:44:31.5273115Z 2025-05-07T19:44:31.5273126Z 2025-05-07T19:44:31.5273136Z 2025-05-07T19:44:31.5273147Z 2025-05-07T19:44:31.5273157Z 2025-05-07T19:44:31.5273167Z 2025-05-07T19:44:31.5273177Z 2025-05-07T19:44:31.5273188Z 2025-05-07T19:44:31.5273198Z 2025-05-07T19:44:31.5273639Z 2025-05-07T19:44:31.5273653Z 2025-05-07T19:44:31.5402330Z binutils_linux-64-2. | 28 KB | ########## | 100%  2025-05-07T19:44:31.5403348Z 2025-05-07T19:44:31.5403363Z 2025-05-07T19:44:31.5403374Z 2025-05-07T19:44:31.5403386Z 2025-05-07T19:44:31.5403396Z 2025-05-07T19:44:31.5403434Z 2025-05-07T19:44:31.5403445Z 2025-05-07T19:44:31.5403456Z 2025-05-07T19:44:31.5403466Z 2025-05-07T19:44:31.5403477Z 2025-05-07T19:44:31.5418486Z gxx_linux-64-11.4.0 | 29 KB | #####5 | 55%  2025-05-07T19:44:31.5418860Z 2025-05-07T19:44:31.5418864Z 2025-05-07T19:44:31.5418868Z 2025-05-07T19:44:31.5418892Z 2025-05-07T19:44:31.5418896Z 2025-05-07T19:44:31.5418899Z 2025-05-07T19:44:31.5418902Z 2025-05-07T19:44:31.5418906Z 2025-05-07T19:44:31.5418909Z 2025-05-07T19:44:31.5418912Z 2025-05-07T19:44:31.5564025Z gxx_linux-64-11.4.0 | 29 KB | ########## | 100%  2025-05-07T19:44:31.5565419Z 2025-05-07T19:44:31.5565467Z 2025-05-07T19:44:31.5565478Z 2025-05-07T19:44:31.5565489Z 2025-05-07T19:44:31.5565499Z 2025-05-07T19:44:31.5565510Z 2025-05-07T19:44:31.5565520Z 2025-05-07T19:44:31.5565531Z 2025-05-07T19:44:31.5565542Z 2025-05-07T19:44:31.5565552Z 2025-05-07T19:44:31.5565563Z 2025-05-07T19:44:31.5787005Z binutils_linux-64-2. | 28 KB | ########## | 100%  2025-05-07T19:44:31.5788016Z 2025-05-07T19:44:31.5788048Z 2025-05-07T19:44:31.5800428Z libstdcxx-devel_linu | 11.1 MB | ########## | 100%  2025-05-07T19:44:31.5801013Z 2025-05-07T19:44:31.5801018Z 2025-05-07T19:44:31.5801021Z 2025-05-07T19:44:31.5801048Z 2025-05-07T19:44:31.5801051Z 2025-05-07T19:44:31.5801055Z 2025-05-07T19:44:31.5801058Z 2025-05-07T19:44:31.5801061Z 2025-05-07T19:44:31.5801065Z 2025-05-07T19:44:31.5801068Z 2025-05-07T19:44:31.5960135Z gxx_linux-64-11.4.0 | 29 KB | ########## | 100%  2025-05-07T19:44:31.5961083Z 2025-05-07T19:44:31.5961099Z 2025-05-07T19:44:31.5961168Z 2025-05-07T19:44:31.5961180Z 2025-05-07T19:44:31.5961191Z 2025-05-07T19:44:31.5961222Z 2025-05-07T19:44:31.6232585Z libgcc-devel_linux-6 | 2.3 MB | #8 | 19%  2025-05-07T19:44:31.6233576Z 2025-05-07T19:44:31.6233590Z 2025-05-07T19:44:31.6233600Z 2025-05-07T19:44:31.6234321Z binutils_impl_linux- | 6.0 MB | ########## | 100%  2025-05-07T19:44:31.6235166Z 2025-05-07T19:44:31.6235178Z 2025-05-07T19:44:31.6235188Z 2025-05-07T19:44:31.6367772Z binutils_impl_linux- | 6.0 MB | ########## | 100%  2025-05-07T19:44:31.6769415Z gcc_impl_linux-64-11 | 53.0 MB | ### | 31% 2025-05-07T19:44:31.6770117Z 2025-05-07T19:44:31.6770132Z 2025-05-07T19:44:31.6770139Z 2025-05-07T19:44:31.6770145Z 2025-05-07T19:44:31.6770152Z 2025-05-07T19:44:31.6770158Z 2025-05-07T19:44:31.7346997Z libgcc-devel_linux-6 | 2.3 MB | ########## | 100%  2025-05-07T19:44:31.7347491Z 2025-05-07T19:44:31.7369205Z gxx_impl_linux-64-11 | 11.2 MB | ########## | 100%  2025-05-07T19:44:31.7549455Z gcc_impl_linux-64-11 | 53.0 MB | ####8 | 48% 2025-05-07T19:44:31.7550023Z 2025-05-07T19:44:31.7550039Z 2025-05-07T19:44:31.7550046Z 2025-05-07T19:44:31.7550054Z 2025-05-07T19:44:31.7550060Z 2025-05-07T19:44:31.7550064Z 2025-05-07T19:44:31.7551127Z libgcc-devel_linux-6 | 2.3 MB | ########## | 100%  2025-05-07T19:44:31.7551742Z 2025-05-07T19:44:31.7551747Z 2025-05-07T19:44:31.7551753Z 2025-05-07T19:44:31.7551758Z 2025-05-07T19:44:31.7551763Z 2025-05-07T19:44:31.7551812Z 2025-05-07T19:44:31.8911024Z libgcc-devel_linux-6 | 2.3 MB | ########## | 100%  2025-05-07T19:44:31.9394053Z gcc_impl_linux-64-11 | 53.0 MB | #####9 | 60% 2025-05-07T19:44:31.9394358Z 2025-05-07T19:44:31.9394365Z 2025-05-07T19:44:32.0084852Z libstdcxx-devel_linu | 11.1 MB | ########## | 100%  2025-05-07T19:44:32.1104804Z gcc_impl_linux-64-11 | 53.0 MB | ######9 | 70% 2025-05-07T19:44:32.2454363Z gcc_impl_linux-64-11 | 53.0 MB | #########5 | 96% 2025-05-07T19:44:32.7732737Z gcc_impl_linux-64-11 | 53.0 MB | ########## | 100% 2025-05-07T19:44:32.7736414Z gcc_impl_linux-64-11 | 53.0 MB | ########## | 100% 2025-05-07T19:44:32.7737458Z 2025-05-07T19:44:32.7738086Z 2025-05-07T19:44:32.7738664Z  2025-05-07T19:44:32.7738886Z 2025-05-07T19:44:32.7738892Z 2025-05-07T19:44:32.7739180Z  2025-05-07T19:44:32.7739441Z 2025-05-07T19:44:32.7739446Z 2025-05-07T19:44:32.7739449Z 2025-05-07T19:44:32.7739619Z  2025-05-07T19:44:32.7739836Z 2025-05-07T19:44:32.7739839Z 2025-05-07T19:44:32.7739843Z 2025-05-07T19:44:32.7739864Z 2025-05-07T19:44:32.7740043Z  2025-05-07T19:44:32.7740270Z 2025-05-07T19:44:32.7740281Z 2025-05-07T19:44:32.7740285Z 2025-05-07T19:44:32.7740288Z 2025-05-07T19:44:32.7740291Z 2025-05-07T19:44:32.7740487Z  2025-05-07T19:44:32.7740710Z 2025-05-07T19:44:32.7740714Z 2025-05-07T19:44:32.7740717Z 2025-05-07T19:44:32.7740721Z 2025-05-07T19:44:32.7740724Z 2025-05-07T19:44:32.7740728Z 2025-05-07T19:44:32.7740906Z  2025-05-07T19:44:32.7741146Z 2025-05-07T19:44:32.7741150Z 2025-05-07T19:44:32.7741376Z 2025-05-07T19:44:32.7741380Z 2025-05-07T19:44:32.7741384Z 2025-05-07T19:44:32.7741387Z 2025-05-07T19:44:32.7741391Z 2025-05-07T19:44:32.7741586Z  2025-05-07T19:44:32.7741830Z 2025-05-07T19:44:32.7741833Z 2025-05-07T19:44:32.7741837Z 2025-05-07T19:44:32.7741840Z 2025-05-07T19:44:32.7741843Z 2025-05-07T19:44:32.7741847Z 2025-05-07T19:44:32.7741850Z 2025-05-07T19:44:32.7741858Z 2025-05-07T19:44:32.7742072Z  2025-05-07T19:44:32.7742316Z 2025-05-07T19:44:32.7742319Z 2025-05-07T19:44:32.7742323Z 2025-05-07T19:44:32.7742326Z 2025-05-07T19:44:32.7742330Z 2025-05-07T19:44:32.7742333Z 2025-05-07T19:44:32.7742337Z 2025-05-07T19:44:32.7742340Z 2025-05-07T19:44:32.7742343Z 2025-05-07T19:44:32.7742531Z  2025-05-07T19:44:32.7742776Z 2025-05-07T19:44:32.7742780Z 2025-05-07T19:44:32.7742787Z 2025-05-07T19:44:32.7742791Z 2025-05-07T19:44:32.7742794Z 2025-05-07T19:44:32.7742797Z 2025-05-07T19:44:32.7742801Z 2025-05-07T19:44:32.7742805Z 2025-05-07T19:44:32.7742809Z 2025-05-07T19:44:32.7742812Z 2025-05-07T19:44:32.7743008Z  2025-05-07T19:44:32.7743257Z 2025-05-07T19:44:32.7743261Z 2025-05-07T19:44:32.7743264Z 2025-05-07T19:44:32.7743268Z 2025-05-07T19:44:32.7743275Z 2025-05-07T19:44:32.7743279Z 2025-05-07T19:44:32.7743282Z 2025-05-07T19:44:32.7743285Z 2025-05-07T19:44:32.7743289Z 2025-05-07T19:44:32.7743292Z 2025-05-07T19:44:32.7743296Z 2025-05-07T19:44:32.7743500Z  done 2025-05-07T19:44:32.8750732Z Preparing transaction: \ done 2025-05-07T19:44:33.0760696Z Verifying transaction: / - done 2025-05-07T19:44:33.1775416Z Executing transaction: | done 2025-05-07T19:44:33.2691146Z [INSTALL] Setting the C/C++ compiler symlinks ... 2025-05-07T19:44:37.0321798Z + ln -sf /github/home/miniconda/envs/build_binary/bin/x86_64-conda-linux-gnu-cc /github/home/miniconda/envs/build_binary/bin/cc 2025-05-07T19:44:37.0322665Z 2025-05-07T19:44:37.0334005Z 2025-05-07T19:44:37.0353928Z + ln -sf /github/home/miniconda/envs/build_binary/bin/x86_64-conda-linux-gnu-cc /github/home/miniconda/envs/build_binary/bin/gcc 2025-05-07T19:44:37.0355719Z 2025-05-07T19:44:37.0365800Z 2025-05-07T19:44:37.0394723Z + ln -sf /github/home/miniconda/envs/build_binary/bin/x86_64-conda-linux-gnu-c++ /github/home/miniconda/envs/build_binary/bin/c++ 2025-05-07T19:44:37.0395359Z 2025-05-07T19:44:37.0411906Z 2025-05-07T19:44:37.0433107Z + ln -sf /github/home/miniconda/envs/build_binary/bin/x86_64-conda-linux-gnu-c++ /github/home/miniconda/envs/build_binary/bin/g++ 2025-05-07T19:44:37.0433721Z 2025-05-07T19:44:37.0442817Z 2025-05-07T19:44:37.0451357Z [INSTALL] Installing Clang (16.0.6, 64) and relevant libraries through Conda ... 2025-05-07T19:44:37.0474634Z [EXEC] [ATTEMPT 0/3] + conda install -n build_binary -c conda-forge --override-channels -y clangxx=16.0.6 libcxx llvm-openmp=16.0.6 compiler-rt=16.0.6 2025-05-07T19:44:37.7551732Z Channels: 2025-05-07T19:44:37.7552407Z - conda-forge 2025-05-07T19:44:37.7552905Z Platform: linux-64 2025-05-07T19:44:40.7068979Z Collecting package metadata (repodata.json): - \ | / done 2025-05-07T19:44:41.2087731Z Solving environment: \ | done 2025-05-07T19:44:41.2641413Z 2025-05-07T19:44:41.2641735Z ## Package Plan ## 2025-05-07T19:44:41.2641934Z 2025-05-07T19:44:41.2642255Z environment location: /github/home/miniconda/envs/build_binary 2025-05-07T19:44:41.2642601Z 2025-05-07T19:44:41.2642759Z added / updated specs: 2025-05-07T19:44:41.2643020Z - clangxx=16.0.6 2025-05-07T19:44:41.2643282Z - compiler-rt=16.0.6 2025-05-07T19:44:41.2643536Z - libcxx 2025-05-07T19:44:41.2643776Z - llvm-openmp=16.0.6 2025-05-07T19:44:41.2643940Z 2025-05-07T19:44:41.2643944Z 2025-05-07T19:44:41.2644384Z The following packages will be downloaded: 2025-05-07T19:44:41.2644616Z 2025-05-07T19:44:41.2644741Z package | build 2025-05-07T19:44:41.2645096Z ---------------------------|----------------- 2025-05-07T19:44:41.2645498Z clang-16.0.6 |default_h9e3a008_14 110 KB conda-forge 2025-05-07T19:44:41.2645999Z clang-16-16.0.6 |default_hb5137d0_14 780 KB conda-forge 2025-05-07T19:44:41.2646474Z clangxx-16.0.6 |default_ha78316a_14 110 KB conda-forge 2025-05-07T19:44:41.2646966Z compiler-rt-16.0.6 | h00ab1b0_2 107 KB conda-forge 2025-05-07T19:44:41.2647479Z compiler-rt_linux-64-16.0.6| h00ab1b0_2 36.0 MB conda-forge 2025-05-07T19:44:41.2647934Z icu-73.2 | h59595ed_0 11.5 MB conda-forge 2025-05-07T19:44:41.2648529Z libclang-cpp16-16.0.6 |default_hb5137d0_14 17.3 MB conda-forge 2025-05-07T19:44:41.2649181Z libcxx-19.1.7 | h2713693_1 1000 KB conda-forge 2025-05-07T19:44:41.2649640Z libcxxabi-19.1.7 | hd85fd95_1 158 KB conda-forge 2025-05-07T19:44:41.2650105Z libiconv-1.18 | h4ce23a2_1 696 KB conda-forge 2025-05-07T19:44:41.2650553Z libllvm16-16.0.6 | hb3ce162_3 33.7 MB conda-forge 2025-05-07T19:44:41.2651024Z libxml2-2.12.7 | hc051c1a_1 688 KB conda-forge 2025-05-07T19:44:41.2651486Z llvm-openmp-16.0.6 | h4dfa4b3_0 39.9 MB conda-forge 2025-05-07T19:44:41.2651955Z zstd-1.5.6 | ha6fb4c9_0 542 KB conda-forge 2025-05-07T19:44:41.2652351Z ------------------------------------------------------------ 2025-05-07T19:44:41.2652730Z Total: 142.5 MB 2025-05-07T19:44:41.2652952Z 2025-05-07T19:44:41.2653106Z The following NEW packages will be INSTALLED: 2025-05-07T19:44:41.2653347Z 2025-05-07T19:44:41.2653589Z clang conda-forge/linux-64::clang-16.0.6-default_h9e3a008_14 2025-05-07T19:44:41.2654122Z clang-16 conda-forge/linux-64::clang-16-16.0.6-default_hb5137d0_14 2025-05-07T19:44:41.2654760Z clangxx conda-forge/linux-64::clangxx-16.0.6-default_ha78316a_14 2025-05-07T19:44:41.2655397Z compiler-rt conda-forge/linux-64::compiler-rt-16.0.6-h00ab1b0_2 2025-05-07T19:44:41.2656074Z compiler-rt_linux~ conda-forge/noarch::compiler-rt_linux-64-16.0.6-h00ab1b0_2 2025-05-07T19:44:41.2656559Z icu conda-forge/linux-64::icu-73.2-h59595ed_0 2025-05-07T19:44:41.2657315Z libclang-cpp16 conda-forge/linux-64::libclang-cpp16-16.0.6-default_hb5137d0_14 2025-05-07T19:44:41.2657867Z libcxx conda-forge/linux-64::libcxx-19.1.7-h2713693_1 2025-05-07T19:44:41.2658365Z libcxxabi conda-forge/linux-64::libcxxabi-19.1.7-hd85fd95_1 2025-05-07T19:44:41.2661310Z libiconv conda-forge/linux-64::libiconv-1.18-h4ce23a2_1 2025-05-07T19:44:41.2661813Z libllvm16 conda-forge/linux-64::libllvm16-16.0.6-hb3ce162_3 2025-05-07T19:44:41.2662319Z libxml2 conda-forge/linux-64::libxml2-2.12.7-hc051c1a_1 2025-05-07T19:44:41.2662815Z llvm-openmp conda-forge/linux-64::llvm-openmp-16.0.6-h4dfa4b3_0 2025-05-07T19:44:41.2663318Z zstd conda-forge/linux-64::zstd-1.5.6-ha6fb4c9_0 2025-05-07T19:44:41.2663588Z 2025-05-07T19:44:41.2663592Z 2025-05-07T19:44:41.2663596Z 2025-05-07T19:44:41.2663769Z Downloading and Extracting Packages: ...working... 2025-05-07T19:44:41.2664176Z llvm-openmp-16.0.6 | 39.9 MB | | 0% 2025-05-07T19:44:41.2664546Z 2025-05-07T19:44:41.2665078Z compiler-rt_linux-64 | 36.0 MB | | 0%  2025-05-07T19:44:41.2665344Z 2025-05-07T19:44:41.2665348Z 2025-05-07T19:44:41.2665595Z libllvm16-16.0.6 | 33.7 MB | | 0%  2025-05-07T19:44:41.2665860Z 2025-05-07T19:44:41.2666688Z 2025-05-07T19:44:41.2666692Z 2025-05-07T19:44:41.2666960Z libclang-cpp16-16.0. | 17.3 MB | | 0%  2025-05-07T19:44:41.2667265Z 2025-05-07T19:44:41.2667269Z 2025-05-07T19:44:41.2667272Z 2025-05-07T19:44:41.2669798Z 2025-05-07T19:44:41.2684892Z icu-73.2 | 11.5 MB | | 0%  2025-05-07T19:44:41.2685673Z 2025-05-07T19:44:41.2685687Z 2025-05-07T19:44:41.2685717Z 2025-05-07T19:44:41.2685750Z 2025-05-07T19:44:41.2685793Z 2025-05-07T19:44:41.2687608Z libcxx-19.1.7 | 1000 KB | | 0%  2025-05-07T19:44:41.2688495Z 2025-05-07T19:44:41.2688533Z 2025-05-07T19:44:41.2688545Z 2025-05-07T19:44:41.2688556Z 2025-05-07T19:44:41.2688567Z 2025-05-07T19:44:41.2688619Z 2025-05-07T19:44:41.2689341Z clang-16-16.0.6 | 780 KB | | 0%  2025-05-07T19:44:41.2690144Z 2025-05-07T19:44:41.2690284Z 2025-05-07T19:44:41.2690288Z 2025-05-07T19:44:41.2690311Z 2025-05-07T19:44:41.2690315Z 2025-05-07T19:44:41.2690345Z 2025-05-07T19:44:41.2690348Z 2025-05-07T19:44:41.2690596Z libiconv-1.18 | 696 KB | | 0%  2025-05-07T19:44:41.2690890Z 2025-05-07T19:44:41.2690894Z 2025-05-07T19:44:41.2690897Z 2025-05-07T19:44:41.2690901Z 2025-05-07T19:44:41.2690904Z 2025-05-07T19:44:41.2690908Z 2025-05-07T19:44:41.2690928Z 2025-05-07T19:44:41.2690947Z 2025-05-07T19:44:41.2691202Z libxml2-2.12.7 | 688 KB | | 0%  2025-05-07T19:44:41.2691488Z 2025-05-07T19:44:41.2691492Z 2025-05-07T19:44:41.2691495Z 2025-05-07T19:44:41.2691499Z 2025-05-07T19:44:41.2691502Z 2025-05-07T19:44:41.2691506Z 2025-05-07T19:44:41.2691527Z 2025-05-07T19:44:41.2691531Z 2025-05-07T19:44:41.2694067Z 2025-05-07T19:44:41.2694343Z zstd-1.5.6 | 542 KB | | 0%  2025-05-07T19:44:41.2694626Z 2025-05-07T19:44:41.2694629Z 2025-05-07T19:44:41.2694633Z 2025-05-07T19:44:41.2694651Z 2025-05-07T19:44:41.2694680Z 2025-05-07T19:44:41.2694683Z 2025-05-07T19:44:41.2694687Z 2025-05-07T19:44:41.2694690Z 2025-05-07T19:44:41.2694694Z 2025-05-07T19:44:41.2694698Z 2025-05-07T19:44:41.2694961Z libcxxabi-19.1.7 | 158 KB | | 0%  2025-05-07T19:44:41.2695265Z 2025-05-07T19:44:41.2695268Z 2025-05-07T19:44:41.2695272Z 2025-05-07T19:44:41.2695275Z 2025-05-07T19:44:41.2695298Z 2025-05-07T19:44:41.2695301Z 2025-05-07T19:44:41.2695526Z 2025-05-07T19:44:41.2695532Z 2025-05-07T19:44:41.2695535Z 2025-05-07T19:44:41.2695539Z 2025-05-07T19:44:41.2695542Z 2025-05-07T19:44:41.2695803Z clang-16.0.6 | 110 KB | | 0%  2025-05-07T19:44:41.2696091Z 2025-05-07T19:44:41.2696095Z 2025-05-07T19:44:41.2696098Z 2025-05-07T19:44:41.2696126Z 2025-05-07T19:44:41.2696130Z 2025-05-07T19:44:41.2696134Z 2025-05-07T19:44:41.2696137Z 2025-05-07T19:44:41.2696141Z 2025-05-07T19:44:41.2696145Z 2025-05-07T19:44:41.2696153Z 2025-05-07T19:44:41.2696156Z 2025-05-07T19:44:41.2696160Z 2025-05-07T19:44:41.2696450Z clangxx-16.0.6 | 110 KB | | 0%  2025-05-07T19:44:41.2696772Z 2025-05-07T19:44:41.2696776Z 2025-05-07T19:44:41.2696779Z 2025-05-07T19:44:41.2696783Z 2025-05-07T19:44:41.2696786Z 2025-05-07T19:44:41.2696790Z 2025-05-07T19:44:41.2696793Z 2025-05-07T19:44:41.2696797Z 2025-05-07T19:44:41.2696801Z 2025-05-07T19:44:41.2696808Z 2025-05-07T19:44:41.2696812Z 2025-05-07T19:44:41.2696816Z 2025-05-07T19:44:41.2696819Z 2025-05-07T19:44:41.4114615Z compiler-rt-16.0.6 | 107 KB | | 0%  2025-05-07T19:44:41.4115633Z 2025-05-07T19:44:41.4115664Z 2025-05-07T19:44:41.4115676Z 2025-05-07T19:44:41.4165392Z 2025-05-07T19:44:41.5257076Z icu-73.2 | 11.5 MB | | 0%  2025-05-07T19:44:41.5257543Z 2025-05-07T19:44:41.5257548Z 2025-05-07T19:44:41.5257551Z 2025-05-07T19:44:41.5367284Z 2025-05-07T19:44:41.6224356Z icu-73.2 | 11.5 MB | | 0%  2025-05-07T19:44:41.6225137Z 2025-05-07T19:44:41.6225152Z 2025-05-07T19:44:41.6268293Z libllvm16-16.0.6 | 33.7 MB | | 0%  2025-05-07T19:44:41.6268615Z 2025-05-07T19:44:41.6268635Z 2025-05-07T19:44:41.6268638Z 2025-05-07T19:44:41.6312925Z libclang-cpp16-16.0. | 17.3 MB | | 0%  2025-05-07T19:44:41.6313278Z 2025-05-07T19:44:41.6363215Z compiler-rt_linux-64 | 36.0 MB | | 0%  2025-05-07T19:44:41.6874578Z llvm-openmp-16.0.6 | 39.9 MB | | 0% 2025-05-07T19:44:41.6874862Z 2025-05-07T19:44:41.6874867Z 2025-05-07T19:44:41.6874871Z 2025-05-07T19:44:41.6874874Z 2025-05-07T19:44:41.7228061Z icu-73.2 | 11.5 MB | ##5 | 26%  2025-05-07T19:44:41.7228853Z 2025-05-07T19:44:41.7228881Z 2025-05-07T19:44:41.7270069Z libllvm16-16.0.6 | 33.7 MB | ###1 | 31%  2025-05-07T19:44:41.7270882Z 2025-05-07T19:44:41.7270927Z 2025-05-07T19:44:41.7270956Z 2025-05-07T19:44:41.7315843Z libclang-cpp16-16.0. | 17.3 MB | ####2 | 43%  2025-05-07T19:44:41.7316731Z 2025-05-07T19:44:41.7364674Z compiler-rt_linux-64 | 36.0 MB | ##5 | 25%  2025-05-07T19:44:41.7873810Z llvm-openmp-16.0.6 | 39.9 MB | #9 | 19% 2025-05-07T19:44:41.7874094Z 2025-05-07T19:44:41.7874099Z 2025-05-07T19:44:41.7874117Z 2025-05-07T19:44:41.7874121Z 2025-05-07T19:44:41.8227142Z icu-73.2 | 11.5 MB | ######5 | 66%  2025-05-07T19:44:41.8227928Z 2025-05-07T19:44:41.8227943Z 2025-05-07T19:44:41.8268967Z libllvm16-16.0.6 | 33.7 MB | #####1 | 52%  2025-05-07T19:44:41.8269262Z 2025-05-07T19:44:41.8269266Z 2025-05-07T19:44:41.8269270Z 2025-05-07T19:44:41.8348920Z libclang-cpp16-16.0. | 17.3 MB | #######7 | 78%  2025-05-07T19:44:41.8349820Z 2025-05-07T19:44:41.8367067Z compiler-rt_linux-64 | 36.0 MB | ####3 | 43%  2025-05-07T19:44:41.9226502Z llvm-openmp-16.0.6 | 39.9 MB | ###6 | 37% 2025-05-07T19:44:41.9226786Z 2025-05-07T19:44:41.9226791Z 2025-05-07T19:44:41.9389378Z libllvm16-16.0.6 | 33.7 MB | ######## | 81%  2025-05-07T19:44:41.9389738Z 2025-05-07T19:44:41.9396794Z compiler-rt_linux-64 | 36.0 MB | ######2 | 63%  2025-05-07T19:44:41.9397109Z 2025-05-07T19:44:41.9397124Z 2025-05-07T19:44:41.9397129Z 2025-05-07T19:44:41.9399476Z 2025-05-07T19:44:41.9400001Z icu-73.2 | 11.5 MB | ########## | 100%  2025-05-07T19:44:41.9400272Z 2025-05-07T19:44:41.9400277Z 2025-05-07T19:44:41.9400305Z 2025-05-07T19:44:41.9400309Z 2025-05-07T19:44:41.9437631Z icu-73.2 | 11.5 MB | ########## | 100%  2025-05-07T19:44:42.0318772Z llvm-openmp-16.0.6 | 39.9 MB | #####6 | 56% 2025-05-07T19:44:42.0319116Z 2025-05-07T19:44:42.0319122Z 2025-05-07T19:44:42.0320383Z 2025-05-07T19:44:42.0390969Z libclang-cpp16-16.0. | 17.3 MB | ########## | 100%  2025-05-07T19:44:42.0391506Z 2025-05-07T19:44:42.0437890Z compiler-rt_linux-64 | 36.0 MB | ########3 | 84%  2025-05-07T19:44:42.0751232Z llvm-openmp-16.0.6 | 39.9 MB | #######5 | 76% 2025-05-07T19:44:42.0751671Z 2025-05-07T19:44:42.0751677Z 2025-05-07T19:44:42.0751683Z 2025-05-07T19:44:42.0751690Z 2025-05-07T19:44:42.0751695Z 2025-05-07T19:44:42.0751700Z 2025-05-07T19:44:42.0920594Z clang-16-16.0.6 | 780 KB | 2 | 2%  2025-05-07T19:44:42.0920917Z 2025-05-07T19:44:42.0920922Z 2025-05-07T19:44:42.0920926Z 2025-05-07T19:44:42.0920929Z 2025-05-07T19:44:42.0920933Z 2025-05-07T19:44:42.0920936Z 2025-05-07T19:44:42.1073715Z clang-16-16.0.6 | 780 KB | ########## | 100%  2025-05-07T19:44:42.1074022Z 2025-05-07T19:44:42.1074026Z 2025-05-07T19:44:42.1074030Z 2025-05-07T19:44:42.1074034Z 2025-05-07T19:44:42.1074972Z 2025-05-07T19:44:42.1438841Z libcxx-19.1.7 | 1000 KB | 1 | 2%  2025-05-07T19:44:42.1439454Z 2025-05-07T19:44:42.1439460Z 2025-05-07T19:44:42.1439465Z 2025-05-07T19:44:42.1439469Z 2025-05-07T19:44:42.1439473Z 2025-05-07T19:44:42.1439477Z 2025-05-07T19:44:42.1439493Z 2025-05-07T19:44:42.1496267Z libiconv-1.18 | 696 KB | 2 | 2%  2025-05-07T19:44:42.1496581Z 2025-05-07T19:44:42.1496587Z 2025-05-07T19:44:42.1496592Z 2025-05-07T19:44:42.1496597Z 2025-05-07T19:44:42.1496602Z 2025-05-07T19:44:42.1705256Z libcxx-19.1.7 | 1000 KB | ########## | 100%  2025-05-07T19:44:42.1727488Z llvm-openmp-16.0.6 | 39.9 MB | #########2 | 93% 2025-05-07T19:44:42.1727828Z 2025-05-07T19:44:42.1727900Z 2025-05-07T19:44:42.1727905Z 2025-05-07T19:44:42.1727976Z 2025-05-07T19:44:42.1727981Z 2025-05-07T19:44:42.1727997Z 2025-05-07T19:44:42.1728010Z 2025-05-07T19:44:42.1993951Z libiconv-1.18 | 696 KB | ########## | 100%  2025-05-07T19:44:42.1994286Z 2025-05-07T19:44:42.1994291Z 2025-05-07T19:44:42.1994324Z 2025-05-07T19:44:42.1994328Z 2025-05-07T19:44:42.1994333Z 2025-05-07T19:44:42.1994336Z 2025-05-07T19:44:42.1994340Z 2025-05-07T19:44:42.1994343Z 2025-05-07T19:44:42.2158465Z libxml2-2.12.7 | 688 KB | 2 | 2%  2025-05-07T19:44:42.2158799Z 2025-05-07T19:44:42.2158803Z 2025-05-07T19:44:42.2158807Z 2025-05-07T19:44:42.2158811Z 2025-05-07T19:44:42.2158814Z 2025-05-07T19:44:42.2158817Z 2025-05-07T19:44:42.2158838Z 2025-05-07T19:44:42.2158841Z 2025-05-07T19:44:42.2169816Z libxml2-2.12.7 | 688 KB | ########## | 100%  2025-05-07T19:44:42.2170110Z 2025-05-07T19:44:42.2170114Z 2025-05-07T19:44:42.2170117Z 2025-05-07T19:44:42.2170121Z 2025-05-07T19:44:42.2170124Z 2025-05-07T19:44:42.2170128Z 2025-05-07T19:44:42.2170131Z 2025-05-07T19:44:42.2170134Z 2025-05-07T19:44:42.2171314Z 2025-05-07T19:44:42.2294765Z zstd-1.5.6 | 542 KB | 2 | 3%  2025-05-07T19:44:42.2295091Z 2025-05-07T19:44:42.2295096Z 2025-05-07T19:44:42.2295099Z 2025-05-07T19:44:42.2295103Z 2025-05-07T19:44:42.2295107Z 2025-05-07T19:44:42.2295110Z 2025-05-07T19:44:42.2295114Z 2025-05-07T19:44:42.2295117Z 2025-05-07T19:44:42.2295121Z 2025-05-07T19:44:42.2522266Z zstd-1.5.6 | 542 KB | ########## | 100%  2025-05-07T19:44:42.2522588Z 2025-05-07T19:44:42.2522593Z 2025-05-07T19:44:42.2522596Z 2025-05-07T19:44:42.2522863Z 2025-05-07T19:44:42.2522868Z 2025-05-07T19:44:42.2522872Z 2025-05-07T19:44:42.2522876Z 2025-05-07T19:44:42.2522879Z 2025-05-07T19:44:42.2522883Z 2025-05-07T19:44:42.2522890Z 2025-05-07T19:44:42.2553008Z libcxxabi-19.1.7 | 158 KB | # | 10%  2025-05-07T19:44:42.2553348Z 2025-05-07T19:44:42.2553353Z 2025-05-07T19:44:42.2553357Z 2025-05-07T19:44:42.2553360Z 2025-05-07T19:44:42.2553364Z 2025-05-07T19:44:42.2553367Z 2025-05-07T19:44:42.2553371Z 2025-05-07T19:44:42.2553374Z 2025-05-07T19:44:42.2553393Z 2025-05-07T19:44:42.2553396Z 2025-05-07T19:44:42.2706384Z libcxxabi-19.1.7 | 158 KB | ########## | 100%  2025-05-07T19:44:42.2706711Z 2025-05-07T19:44:42.2706932Z 2025-05-07T19:44:42.2706944Z 2025-05-07T19:44:42.2706950Z 2025-05-07T19:44:42.2706956Z 2025-05-07T19:44:42.2706962Z 2025-05-07T19:44:42.2707021Z 2025-05-07T19:44:42.2707027Z 2025-05-07T19:44:42.2707032Z 2025-05-07T19:44:42.2707037Z 2025-05-07T19:44:42.2707083Z 2025-05-07T19:44:42.2736345Z clang-16.0.6 | 110 KB | #4 | 15%  2025-05-07T19:44:42.2736742Z 2025-05-07T19:44:42.2736746Z 2025-05-07T19:44:42.2736774Z 2025-05-07T19:44:42.2736779Z 2025-05-07T19:44:42.2736783Z 2025-05-07T19:44:42.2736788Z 2025-05-07T19:44:42.2736792Z 2025-05-07T19:44:42.2736795Z 2025-05-07T19:44:42.2736800Z 2025-05-07T19:44:42.2736804Z 2025-05-07T19:44:42.2736807Z 2025-05-07T19:44:42.3137600Z clang-16.0.6 | 110 KB | ########## | 100%  2025-05-07T19:44:42.3138182Z 2025-05-07T19:44:42.3138188Z 2025-05-07T19:44:42.3138192Z 2025-05-07T19:44:42.3138196Z 2025-05-07T19:44:42.3138201Z 2025-05-07T19:44:42.3138205Z 2025-05-07T19:44:42.3138208Z 2025-05-07T19:44:42.3138213Z 2025-05-07T19:44:42.3138216Z 2025-05-07T19:44:42.3138220Z 2025-05-07T19:44:42.3138223Z 2025-05-07T19:44:42.3138228Z 2025-05-07T19:44:42.3138231Z 2025-05-07T19:44:42.3159605Z compiler-rt-16.0.6 | 107 KB | #4 | 15%  2025-05-07T19:44:42.3159936Z 2025-05-07T19:44:42.3159940Z 2025-05-07T19:44:42.3159944Z 2025-05-07T19:44:42.3159947Z 2025-05-07T19:44:42.3159950Z 2025-05-07T19:44:42.3159954Z 2025-05-07T19:44:42.3159957Z 2025-05-07T19:44:42.3159961Z 2025-05-07T19:44:42.3159964Z 2025-05-07T19:44:42.3159968Z 2025-05-07T19:44:42.3159971Z 2025-05-07T19:44:42.3159979Z 2025-05-07T19:44:42.3160801Z 2025-05-07T19:44:42.3271203Z compiler-rt-16.0.6 | 107 KB | ########## | 100%  2025-05-07T19:44:42.3271687Z 2025-05-07T19:44:42.3271692Z 2025-05-07T19:44:42.3271696Z 2025-05-07T19:44:42.3271699Z 2025-05-07T19:44:42.3271703Z 2025-05-07T19:44:42.3271706Z 2025-05-07T19:44:42.3271710Z 2025-05-07T19:44:42.3271713Z 2025-05-07T19:44:42.3271717Z 2025-05-07T19:44:42.3271720Z 2025-05-07T19:44:42.3271724Z 2025-05-07T19:44:42.3271844Z 2025-05-07T19:44:42.3305271Z clangxx-16.0.6 | 110 KB | #4 | 15%  2025-05-07T19:44:42.3305605Z 2025-05-07T19:44:42.3305609Z 2025-05-07T19:44:42.3305613Z 2025-05-07T19:44:42.3305616Z 2025-05-07T19:44:42.3305620Z 2025-05-07T19:44:42.3305623Z 2025-05-07T19:44:42.3305627Z 2025-05-07T19:44:42.3305630Z 2025-05-07T19:44:42.3305634Z 2025-05-07T19:44:42.3305637Z 2025-05-07T19:44:42.3305641Z 2025-05-07T19:44:42.3305662Z 2025-05-07T19:44:42.4005503Z clangxx-16.0.6 | 110 KB | ########## | 100%  2025-05-07T19:44:42.4006125Z 2025-05-07T19:44:42.4006230Z 2025-05-07T19:44:42.4006236Z 2025-05-07T19:44:42.4006242Z 2025-05-07T19:44:42.4006248Z 2025-05-07T19:44:42.4006269Z 2025-05-07T19:44:42.4007032Z clang-16-16.0.6 | 780 KB | ########## | 100%  2025-05-07T19:44:42.4007360Z 2025-05-07T19:44:42.4007365Z 2025-05-07T19:44:42.4007369Z 2025-05-07T19:44:42.4007374Z 2025-05-07T19:44:42.4007402Z 2025-05-07T19:44:42.4007407Z 2025-05-07T19:44:42.4212148Z clang-16-16.0.6 | 780 KB | ########## | 100%  2025-05-07T19:44:42.4212495Z 2025-05-07T19:44:42.4212499Z 2025-05-07T19:44:42.4431050Z libllvm16-16.0.6 | 33.7 MB | ########## | 100%  2025-05-07T19:44:42.4431498Z 2025-05-07T19:44:42.4431504Z 2025-05-07T19:44:42.4431510Z 2025-05-07T19:44:42.4431516Z 2025-05-07T19:44:42.4431521Z 2025-05-07T19:44:42.4431796Z libcxx-19.1.7 | 1000 KB | ########## | 100%  2025-05-07T19:44:42.4432167Z 2025-05-07T19:44:42.4432174Z 2025-05-07T19:44:42.4432180Z 2025-05-07T19:44:42.4432222Z 2025-05-07T19:44:42.4432225Z 2025-05-07T19:44:42.4721145Z libcxx-19.1.7 | 1000 KB | ########## | 100%  2025-05-07T19:44:42.4721516Z 2025-05-07T19:44:42.4831446Z compiler-rt_linux-64 | 36.0 MB | ########## | 100%  2025-05-07T19:44:42.4831787Z 2025-05-07T19:44:42.4831794Z 2025-05-07T19:44:42.4831803Z 2025-05-07T19:44:42.4831809Z 2025-05-07T19:44:42.4831841Z 2025-05-07T19:44:42.4831844Z 2025-05-07T19:44:42.4831848Z 2025-05-07T19:44:42.4832162Z libiconv-1.18 | 696 KB | ########## | 100%  2025-05-07T19:44:42.4832464Z 2025-05-07T19:44:42.4832469Z 2025-05-07T19:44:42.4832473Z 2025-05-07T19:44:42.4832476Z 2025-05-07T19:44:42.4832479Z 2025-05-07T19:44:42.4832484Z 2025-05-07T19:44:42.4832488Z 2025-05-07T19:44:42.4914913Z libiconv-1.18 | 696 KB | ########## | 100%  2025-05-07T19:44:42.4915228Z 2025-05-07T19:44:42.4915242Z 2025-05-07T19:44:42.4915448Z 2025-05-07T19:44:42.4915453Z 2025-05-07T19:44:42.5177823Z icu-73.2 | 11.5 MB | ########## | 100%  2025-05-07T19:44:42.5178139Z 2025-05-07T19:44:42.5178144Z 2025-05-07T19:44:42.5178147Z 2025-05-07T19:44:42.5178153Z 2025-05-07T19:44:42.5178156Z 2025-05-07T19:44:42.5178165Z 2025-05-07T19:44:42.5178168Z 2025-05-07T19:44:42.5178173Z 2025-05-07T19:44:42.5178176Z 2025-05-07T19:44:42.5178441Z zstd-1.5.6 | 542 KB | ########## | 100%  2025-05-07T19:44:42.5178754Z 2025-05-07T19:44:42.5178758Z 2025-05-07T19:44:42.5178763Z 2025-05-07T19:44:42.5178771Z 2025-05-07T19:44:42.5178776Z 2025-05-07T19:44:42.5178782Z 2025-05-07T19:44:42.5178787Z 2025-05-07T19:44:42.5178793Z 2025-05-07T19:44:42.5178797Z 2025-05-07T19:44:42.5229846Z zstd-1.5.6 | 542 KB | ########## | 100%  2025-05-07T19:44:42.5291548Z llvm-openmp-16.0.6 | 39.9 MB | ########## | 100% 2025-05-07T19:44:42.5292289Z 2025-05-07T19:44:42.5292295Z 2025-05-07T19:44:42.5292299Z 2025-05-07T19:44:42.5292333Z 2025-05-07T19:44:42.5292337Z 2025-05-07T19:44:42.5292340Z 2025-05-07T19:44:42.5292344Z 2025-05-07T19:44:42.5292348Z 2025-05-07T19:44:42.5292683Z libxml2-2.12.7 | 688 KB | ########## | 100%  2025-05-07T19:44:42.5292978Z 2025-05-07T19:44:42.5292998Z 2025-05-07T19:44:42.5293001Z 2025-05-07T19:44:42.5293005Z 2025-05-07T19:44:42.5293008Z 2025-05-07T19:44:42.5293012Z 2025-05-07T19:44:42.5293015Z 2025-05-07T19:44:42.5293027Z 2025-05-07T19:44:42.5403196Z libxml2-2.12.7 | 688 KB | ########## | 100%  2025-05-07T19:44:42.5403515Z 2025-05-07T19:44:42.5403692Z 2025-05-07T19:44:42.5403701Z 2025-05-07T19:44:42.5403705Z 2025-05-07T19:44:42.5403709Z 2025-05-07T19:44:42.5403712Z 2025-05-07T19:44:42.5403716Z 2025-05-07T19:44:42.5403719Z 2025-05-07T19:44:42.5403723Z 2025-05-07T19:44:42.5403726Z 2025-05-07T19:44:42.5404275Z libcxxabi-19.1.7 | 158 KB | ########## | 100%  2025-05-07T19:44:42.5404637Z 2025-05-07T19:44:42.5404641Z 2025-05-07T19:44:42.5404644Z 2025-05-07T19:44:42.5404648Z 2025-05-07T19:44:42.5404651Z 2025-05-07T19:44:42.5404655Z 2025-05-07T19:44:42.5404658Z 2025-05-07T19:44:42.5404662Z 2025-05-07T19:44:42.5404673Z 2025-05-07T19:44:42.5404677Z 2025-05-07T19:44:42.5658156Z libcxxabi-19.1.7 | 158 KB | ########## | 100%  2025-05-07T19:44:42.5659174Z 2025-05-07T19:44:42.5659188Z 2025-05-07T19:44:42.5659725Z 2025-05-07T19:44:42.5659732Z 2025-05-07T19:44:42.5659736Z 2025-05-07T19:44:42.5659740Z 2025-05-07T19:44:42.5659744Z 2025-05-07T19:44:42.5659747Z 2025-05-07T19:44:42.5659751Z 2025-05-07T19:44:42.5659755Z 2025-05-07T19:44:42.5659758Z 2025-05-07T19:44:42.5659762Z 2025-05-07T19:44:42.5659766Z 2025-05-07T19:44:42.5660134Z compiler-rt-16.0.6 | 107 KB | ########## | 100%  2025-05-07T19:44:42.5660473Z 2025-05-07T19:44:42.5660477Z 2025-05-07T19:44:42.5660480Z 2025-05-07T19:44:42.5660495Z 2025-05-07T19:44:42.5660499Z 2025-05-07T19:44:42.5660502Z 2025-05-07T19:44:42.5660506Z 2025-05-07T19:44:42.5660509Z 2025-05-07T19:44:42.5660513Z 2025-05-07T19:44:42.5660516Z 2025-05-07T19:44:42.5660520Z 2025-05-07T19:44:42.5660523Z 2025-05-07T19:44:42.5660527Z 2025-05-07T19:44:42.5903090Z compiler-rt-16.0.6 | 107 KB | ########## | 100%  2025-05-07T19:44:42.5903461Z 2025-05-07T19:44:42.5903465Z 2025-05-07T19:44:42.5903487Z 2025-05-07T19:44:42.5903491Z 2025-05-07T19:44:42.5903495Z 2025-05-07T19:44:42.5903499Z 2025-05-07T19:44:42.5903502Z 2025-05-07T19:44:42.5903522Z 2025-05-07T19:44:42.5903526Z 2025-05-07T19:44:42.5903529Z 2025-05-07T19:44:42.5903533Z 2025-05-07T19:44:42.5903874Z clang-16.0.6 | 110 KB | ########## | 100%  2025-05-07T19:44:42.5904163Z 2025-05-07T19:44:42.5904167Z 2025-05-07T19:44:42.5904171Z 2025-05-07T19:44:42.5904174Z 2025-05-07T19:44:42.5904178Z 2025-05-07T19:44:42.5904182Z 2025-05-07T19:44:42.5904623Z 2025-05-07T19:44:42.5904627Z 2025-05-07T19:44:42.5904630Z 2025-05-07T19:44:42.5904634Z 2025-05-07T19:44:42.5904637Z 2025-05-07T19:44:42.6256649Z clang-16.0.6 | 110 KB | ########## | 100%  2025-05-07T19:44:42.6257582Z 2025-05-07T19:44:42.6257597Z 2025-05-07T19:44:42.6257608Z 2025-05-07T19:44:42.6257618Z 2025-05-07T19:44:42.6257630Z 2025-05-07T19:44:42.6257641Z 2025-05-07T19:44:42.6257651Z 2025-05-07T19:44:42.6257692Z 2025-05-07T19:44:42.6257731Z 2025-05-07T19:44:42.6257741Z 2025-05-07T19:44:42.6257752Z 2025-05-07T19:44:42.6257761Z 2025-05-07T19:44:42.6258565Z clangxx-16.0.6 | 110 KB | ########## | 100%  2025-05-07T19:44:42.6259453Z 2025-05-07T19:44:42.6259464Z 2025-05-07T19:44:42.6259475Z 2025-05-07T19:44:42.6259485Z 2025-05-07T19:44:42.6259496Z 2025-05-07T19:44:42.6259506Z 2025-05-07T19:44:42.6259542Z 2025-05-07T19:44:42.6259552Z 2025-05-07T19:44:42.6259563Z 2025-05-07T19:44:42.6259590Z 2025-05-07T19:44:42.6259600Z 2025-05-07T19:44:42.6259611Z 2025-05-07T19:44:42.6389249Z clangxx-16.0.6 | 110 KB | ########## | 100%  2025-05-07T19:44:42.6390209Z 2025-05-07T19:44:42.6390224Z 2025-05-07T19:44:42.6390260Z 2025-05-07T19:44:43.0195087Z libclang-cpp16-16.0. | 17.3 MB | ########## | 100%  2025-05-07T19:44:43.0195993Z 2025-05-07T19:44:43.0545818Z compiler-rt_linux-64 | 36.0 MB | ########## | 100%  2025-05-07T19:44:43.0546142Z 2025-05-07T19:44:43.0546146Z 2025-05-07T19:44:43.1261689Z libllvm16-16.0.6 | 33.7 MB | ########## | 100%  2025-05-07T19:44:43.1267140Z llvm-openmp-16.0.6 | 39.9 MB | ########## | 100% 2025-05-07T19:44:43.1267590Z 2025-05-07T19:44:43.1267804Z 2025-05-07T19:44:43.1268011Z  2025-05-07T19:44:43.1268248Z 2025-05-07T19:44:43.1268252Z 2025-05-07T19:44:43.1268467Z  2025-05-07T19:44:43.1268707Z 2025-05-07T19:44:43.1268711Z 2025-05-07T19:44:43.1268714Z 2025-05-07T19:44:43.1268905Z  2025-05-07T19:44:43.1269125Z 2025-05-07T19:44:43.1269129Z 2025-05-07T19:44:43.1269132Z 2025-05-07T19:44:43.1269136Z 2025-05-07T19:44:43.1269315Z  2025-05-07T19:44:43.1269779Z 2025-05-07T19:44:43.1269785Z 2025-05-07T19:44:43.1269789Z 2025-05-07T19:44:43.1269792Z 2025-05-07T19:44:43.1269796Z 2025-05-07T19:44:43.1270958Z  2025-05-07T19:44:43.1271210Z 2025-05-07T19:44:43.1271218Z 2025-05-07T19:44:43.1271283Z 2025-05-07T19:44:43.1271304Z 2025-05-07T19:44:43.1271587Z 2025-05-07T19:44:43.1271602Z 2025-05-07T19:44:43.1272579Z  2025-05-07T19:44:43.1272909Z 2025-05-07T19:44:43.1272956Z 2025-05-07T19:44:43.1272960Z 2025-05-07T19:44:43.1272964Z 2025-05-07T19:44:43.1272967Z 2025-05-07T19:44:43.1272971Z 2025-05-07T19:44:43.1272976Z 2025-05-07T19:44:43.1273217Z  2025-05-07T19:44:43.1273514Z 2025-05-07T19:44:43.1273518Z 2025-05-07T19:44:43.1273523Z 2025-05-07T19:44:43.1273527Z 2025-05-07T19:44:43.1273532Z 2025-05-07T19:44:43.1273563Z 2025-05-07T19:44:43.1273591Z 2025-05-07T19:44:43.1273595Z 2025-05-07T19:44:43.1273827Z  2025-05-07T19:44:43.1274077Z 2025-05-07T19:44:43.1274081Z 2025-05-07T19:44:43.1274085Z 2025-05-07T19:44:43.1274120Z 2025-05-07T19:44:43.1274124Z 2025-05-07T19:44:43.1274127Z 2025-05-07T19:44:43.1274131Z 2025-05-07T19:44:43.1274134Z 2025-05-07T19:44:43.1274138Z 2025-05-07T19:44:43.1274349Z  2025-05-07T19:44:43.1274600Z 2025-05-07T19:44:43.1274873Z 2025-05-07T19:44:43.1274880Z 2025-05-07T19:44:43.1274883Z 2025-05-07T19:44:43.1274918Z 2025-05-07T19:44:43.1274923Z 2025-05-07T19:44:43.1274926Z 2025-05-07T19:44:43.1274930Z 2025-05-07T19:44:43.1274933Z 2025-05-07T19:44:43.1274937Z 2025-05-07T19:44:43.1275169Z  2025-05-07T19:44:43.1275430Z 2025-05-07T19:44:43.1275433Z 2025-05-07T19:44:43.1275437Z 2025-05-07T19:44:43.1275484Z 2025-05-07T19:44:43.1275488Z 2025-05-07T19:44:43.1275491Z 2025-05-07T19:44:43.1275495Z 2025-05-07T19:44:43.1275498Z 2025-05-07T19:44:43.1275502Z 2025-05-07T19:44:43.1275505Z 2025-05-07T19:44:43.1275509Z 2025-05-07T19:44:43.1275732Z  2025-05-07T19:44:43.1275991Z 2025-05-07T19:44:43.1275995Z 2025-05-07T19:44:43.1276028Z 2025-05-07T19:44:43.1276031Z 2025-05-07T19:44:43.1276035Z 2025-05-07T19:44:43.1276038Z 2025-05-07T19:44:43.1276045Z 2025-05-07T19:44:43.1276049Z 2025-05-07T19:44:43.1276052Z 2025-05-07T19:44:43.1276056Z 2025-05-07T19:44:43.1276059Z 2025-05-07T19:44:43.1276063Z 2025-05-07T19:44:43.1276283Z  2025-05-07T19:44:43.1276568Z 2025-05-07T19:44:43.1276572Z 2025-05-07T19:44:43.1276575Z 2025-05-07T19:44:43.1276579Z 2025-05-07T19:44:43.1276582Z 2025-05-07T19:44:43.1276586Z 2025-05-07T19:44:43.1276598Z 2025-05-07T19:44:43.1276602Z 2025-05-07T19:44:43.1276605Z 2025-05-07T19:44:43.1276609Z 2025-05-07T19:44:43.1276612Z 2025-05-07T19:44:43.1276616Z 2025-05-07T19:44:43.1276620Z 2025-05-07T19:44:43.1276855Z  done 2025-05-07T19:44:43.2277923Z Preparing transaction: - done 2025-05-07T19:44:43.3285855Z Verifying transaction: | done 2025-05-07T19:44:43.4299807Z Executing transaction: - done 2025-05-07T19:44:43.5170255Z [INSTALL] Setting the C/C++ compiler symlinks ... 2025-05-07T19:44:47.2626995Z + ln -sf /github/home/miniconda/envs/build_binary/bin/clang /github/home/miniconda/envs/build_binary/bin/cc 2025-05-07T19:44:47.2628549Z 2025-05-07T19:44:47.2651390Z 2025-05-07T19:44:47.2670066Z + ln -sf /github/home/miniconda/envs/build_binary/bin/clang /github/home/miniconda/envs/build_binary/bin/gcc 2025-05-07T19:44:47.2671798Z 2025-05-07T19:44:47.2686091Z 2025-05-07T19:44:47.2710505Z + ln -sf /github/home/miniconda/envs/build_binary/bin/clang++ /github/home/miniconda/envs/build_binary/bin/c++ 2025-05-07T19:44:47.2711120Z 2025-05-07T19:44:47.2724718Z 2025-05-07T19:44:47.2744292Z + ln -sf /github/home/miniconda/envs/build_binary/bin/clang++ /github/home/miniconda/envs/build_binary/bin/g++ 2025-05-07T19:44:47.2744834Z 2025-05-07T19:44:47.2757864Z 2025-05-07T19:44:47.2758187Z [INSTALL] Removing GCC package activation scripts ... 2025-05-07T19:44:49.1394997Z + ls -la /github/home/miniconda/envs/build_binary/etc/conda/activate.d 2025-05-07T19:44:49.1395459Z 2025-05-07T19:44:49.1415206Z total 28 2025-05-07T19:44:49.1415566Z drwxr-xr-x. 2 root root 134 May 7 19:44 . 2025-05-07T19:44:49.1415962Z drwxr-xr-x. 5 root root 62 May 7 19:44 .. 2025-05-07T19:44:49.1416419Z -rw-r--r--. 2 root root 3778 Jun 10 2024 activate-binutils_linux-64.sh 2025-05-07T19:44:49.1416909Z -rw-r--r--. 2 root root 11630 Jun 10 2024 activate-gcc_linux-64.sh 2025-05-07T19:44:49.1417399Z -rw-r--r--. 2 root root 5190 Jun 10 2024 activate-gxx_linux-64.sh 2025-05-07T19:44:49.1417834Z -rw-r--r--. 2 root root 873 Jun 5 2024 libxml2_activate.sh 2025-05-07T19:44:49.1418120Z 2025-05-07T19:44:49.1420070Z + rm -rf /github/home/miniconda/envs/build_binary/etc/conda/activate.d/activate-gcc_linux-64.sh 2025-05-07T19:44:49.1421525Z 2025-05-07T19:44:49.1432844Z 2025-05-07T19:44:49.1433555Z + rm -rf /github/home/miniconda/envs/build_binary/etc/conda/activate.d/activate-gxx_linux-64.sh 2025-05-07T19:44:49.1434018Z 2025-05-07T19:44:49.1455363Z 2025-05-07T19:44:49.1455937Z + conda env config vars set -n build_binary CC= 2025-05-07T19:44:49.1456200Z 2025-05-07T19:44:49.5642541Z 2025-05-07T19:44:49.5642892Z + conda env config vars set -n build_binary CXX= 2025-05-07T19:44:49.5643465Z 2025-05-07T19:44:49.9714584Z 2025-05-07T19:44:49.9715079Z + conda run -n build_binary printenv CC 2025-05-07T19:44:49.9715347Z 2025-05-07T19:44:51.5354155Z 2025-05-07T19:44:51.5354430Z 2025-05-07T19:44:51.5921339Z 2025-05-07T19:44:51.5922025Z + conda run -n build_binary printenv CXX 2025-05-07T19:44:51.5922722Z 2025-05-07T19:44:53.1512841Z 2025-05-07T19:44:53.1512967Z 2025-05-07T19:44:53.2079723Z 2025-05-07T19:44:54.8559252Z [ENV] Appending to LD_LIBRARY_PATH: /github/home/miniconda/envs/build_binary/lib ... 2025-05-07T19:44:56.4358559Z ERROR conda.cli.main_run:execute(125): `conda run printenv LD_LIBRARY_PATH` failed. (See above for error) 2025-05-07T19:44:56.4936024Z + conda env config vars set -n build_binary LD_LIBRARY_PATH=/github/home/miniconda/envs/build_binary/lib 2025-05-07T19:44:56.4936571Z 2025-05-07T19:44:56.9143949Z 2025-05-07T19:44:58.5108668Z /github/home/miniconda/envs/build_binary/bin/cc 2025-05-07T19:44:58.5109000Z 2025-05-07T19:44:58.5669972Z [CHECK] Binary cc found in PATH 2025-05-07T19:45:00.1449956Z /github/home/miniconda/envs/build_binary/bin/gcc 2025-05-07T19:45:00.1450296Z 2025-05-07T19:45:00.2025343Z [CHECK] Binary gcc found in PATH 2025-05-07T19:45:01.7776928Z /github/home/miniconda/envs/build_binary/bin/c++ 2025-05-07T19:45:01.7777799Z 2025-05-07T19:45:01.8353420Z [CHECK] Binary c++ found in PATH 2025-05-07T19:45:03.4268274Z /github/home/miniconda/envs/build_binary/bin/g++ 2025-05-07T19:45:03.4269110Z 2025-05-07T19:45:03.5034517Z [CHECK] Binary g++ found in PATH 2025-05-07T19:45:03.5036711Z [INFO] Printing out all preprocessor defines in the C compiler ... 2025-05-07T19:45:03.5037177Z + conda run -n build_binary cc -dM -E - 2025-05-07T19:45:03.5037395Z 2025-05-07T19:45:05.1454212Z #define _LP64 1 2025-05-07T19:45:05.1454534Z #define __ATOMIC_ACQUIRE 2 2025-05-07T19:45:05.1454845Z #define __ATOMIC_ACQ_REL 4 2025-05-07T19:45:05.1455124Z #define __ATOMIC_CONSUME 1 2025-05-07T19:45:05.1455396Z #define __ATOMIC_RELAXED 0 2025-05-07T19:45:05.1455649Z #define __ATOMIC_RELEASE 3 2025-05-07T19:45:05.1455921Z #define __ATOMIC_SEQ_CST 5 2025-05-07T19:45:05.1456189Z #define __BIGGEST_ALIGNMENT__ 16 2025-05-07T19:45:05.1456494Z #define __BITINT_MAXWIDTH__ 8388608 2025-05-07T19:45:05.1456786Z #define __BOOL_WIDTH__ 8 2025-05-07T19:45:05.1457446Z #define __BYTE_ORDER__ __ORDER_LITTLE_ENDIAN__ 2025-05-07T19:45:05.1457809Z #define __CHAR16_TYPE__ unsigned short 2025-05-07T19:45:05.1458184Z #define __CHAR32_TYPE__ unsigned int 2025-05-07T19:45:05.1458467Z #define __CHAR_BIT__ 8 2025-05-07T19:45:05.1458733Z #define __CLANG_ATOMIC_BOOL_LOCK_FREE 2 2025-05-07T19:45:05.1459082Z #define __CLANG_ATOMIC_CHAR16_T_LOCK_FREE 2 2025-05-07T19:45:05.1459420Z #define __CLANG_ATOMIC_CHAR32_T_LOCK_FREE 2 2025-05-07T19:45:05.1459750Z #define __CLANG_ATOMIC_CHAR_LOCK_FREE 2 2025-05-07T19:45:05.1460152Z #define __CLANG_ATOMIC_INT_LOCK_FREE 2 2025-05-07T19:45:05.1460459Z #define __CLANG_ATOMIC_LLONG_LOCK_FREE 2 2025-05-07T19:45:05.1460791Z #define __CLANG_ATOMIC_LONG_LOCK_FREE 2 2025-05-07T19:45:05.1461112Z #define __CLANG_ATOMIC_POINTER_LOCK_FREE 2 2025-05-07T19:45:05.1461457Z #define __CLANG_ATOMIC_SHORT_LOCK_FREE 2 2025-05-07T19:45:05.1461798Z #define __CLANG_ATOMIC_WCHAR_T_LOCK_FREE 2 2025-05-07T19:45:05.1462116Z #define __CONSTANT_CFSTRINGS__ 1 2025-05-07T19:45:05.1462427Z #define __DBL_DECIMAL_DIG__ 17 2025-05-07T19:45:05.1462727Z #define __DBL_DENORM_MIN__ 4.9406564584124654e-324 2025-05-07T19:45:05.1463062Z #define __DBL_DIG__ 15 2025-05-07T19:45:05.1463328Z #define __DBL_EPSILON__ 2.2204460492503131e-16 2025-05-07T19:45:05.1463659Z #define __DBL_HAS_DENORM__ 1 2025-05-07T19:45:05.1463927Z #define __DBL_HAS_INFINITY__ 1 2025-05-07T19:45:05.1464211Z #define __DBL_HAS_QUIET_NAN__ 1 2025-05-07T19:45:05.1464479Z #define __DBL_MANT_DIG__ 53 2025-05-07T19:45:05.1465385Z #define __DBL_MAX_10_EXP__ 308 2025-05-07T19:45:05.1465873Z #define __DBL_MAX_EXP__ 1024 2025-05-07T19:45:05.1466154Z #define __DBL_MAX__ 1.7976931348623157e+308 2025-05-07T19:45:05.1466518Z #define __DBL_MIN_10_EXP__ (-307) 2025-05-07T19:45:05.1466827Z #define __DBL_MIN_EXP__ (-1021) 2025-05-07T19:45:05.1467171Z #define __DBL_MIN__ 2.2250738585072014e-308 2025-05-07T19:45:05.1467531Z #define __DECIMAL_DIG__ __LDBL_DECIMAL_DIG__ 2025-05-07T19:45:05.1467901Z #define __ELF__ 1 2025-05-07T19:45:05.1468176Z #define __FINITE_MATH_ONLY__ 0 2025-05-07T19:45:05.1468514Z #define __FLOAT128__ 1 2025-05-07T19:45:05.1468798Z #define __FLT16_DECIMAL_DIG__ 5 2025-05-07T19:45:05.1469170Z #define __FLT16_DENORM_MIN__ 5.9604644775390625e-8F16 2025-05-07T19:45:05.1469572Z #define __FLT16_DIG__ 3 2025-05-07T19:45:05.1469866Z #define __FLT16_EPSILON__ 9.765625e-4F16 2025-05-07T19:45:05.1470228Z #define __FLT16_HAS_DENORM__ 1 2025-05-07T19:45:05.1470539Z #define __FLT16_HAS_INFINITY__ 1 2025-05-07T19:45:05.1470875Z #define __FLT16_HAS_QUIET_NAN__ 1 2025-05-07T19:45:05.1471166Z #define __FLT16_MANT_DIG__ 11 2025-05-07T19:45:05.1471554Z #define __FLT16_MAX_10_EXP__ 4 2025-05-07T19:45:05.1471827Z #define __FLT16_MAX_EXP__ 16 2025-05-07T19:45:05.1472114Z #define __FLT16_MAX__ 6.5504e+4F16 2025-05-07T19:45:05.1472399Z #define __FLT16_MIN_10_EXP__ (-4) 2025-05-07T19:45:05.1472698Z #define __FLT16_MIN_EXP__ (-13) 2025-05-07T19:45:05.1472991Z #define __FLT16_MIN__ 6.103515625e-5F16 2025-05-07T19:45:05.1473295Z #define __FLT_DECIMAL_DIG__ 9 2025-05-07T19:45:05.1473596Z #define __FLT_DENORM_MIN__ 1.40129846e-45F 2025-05-07T19:45:05.1473891Z #define __FLT_DIG__ 6 2025-05-07T19:45:05.1474148Z #define __FLT_EPSILON__ 1.19209290e-7F 2025-05-07T19:45:05.1474459Z #define __FLT_HAS_DENORM__ 1 2025-05-07T19:45:05.1474775Z #define __FLT_HAS_INFINITY__ 1 2025-05-07T19:45:05.1475074Z #define __FLT_HAS_QUIET_NAN__ 1 2025-05-07T19:45:05.1475395Z #define __FLT_MANT_DIG__ 24 2025-05-07T19:45:05.1475708Z #define __FLT_MAX_10_EXP__ 38 2025-05-07T19:45:05.1476004Z #define __FLT_MAX_EXP__ 128 2025-05-07T19:45:05.1476319Z #define __FLT_MAX__ 3.40282347e+38F 2025-05-07T19:45:05.1476627Z #define __FLT_MIN_10_EXP__ (-37) 2025-05-07T19:45:05.1476954Z #define __FLT_MIN_EXP__ (-125) 2025-05-07T19:45:05.1477251Z #define __FLT_MIN__ 1.17549435e-38F 2025-05-07T19:45:05.1477582Z #define __FLT_RADIX__ 2 2025-05-07T19:45:05.1477843Z #define __FXSR__ 1 2025-05-07T19:45:05.1478145Z #define __GCC_ASM_FLAG_OUTPUTS__ 1 2025-05-07T19:45:05.1478623Z #define __GCC_ATOMIC_BOOL_LOCK_FREE 2 2025-05-07T19:45:05.1479000Z #define __GCC_ATOMIC_CHAR16_T_LOCK_FREE 2 2025-05-07T19:45:05.1479380Z #define __GCC_ATOMIC_CHAR32_T_LOCK_FREE 2 2025-05-07T19:45:05.1479721Z #define __GCC_ATOMIC_CHAR_LOCK_FREE 2 2025-05-07T19:45:05.1480078Z #define __GCC_ATOMIC_INT_LOCK_FREE 2 2025-05-07T19:45:05.1480406Z #define __GCC_ATOMIC_LLONG_LOCK_FREE 2 2025-05-07T19:45:05.1480762Z #define __GCC_ATOMIC_LONG_LOCK_FREE 2 2025-05-07T19:45:05.1481069Z #define __GCC_ATOMIC_POINTER_LOCK_FREE 2 2025-05-07T19:45:05.1481395Z #define __GCC_ATOMIC_SHORT_LOCK_FREE 2 2025-05-07T19:45:05.1481719Z #define __GCC_ATOMIC_TEST_AND_SET_TRUEVAL 1 2025-05-07T19:45:05.1482059Z #define __GCC_ATOMIC_WCHAR_T_LOCK_FREE 2 2025-05-07T19:45:05.1482368Z #define __GCC_HAVE_DWARF2_CFI_ASM 1 2025-05-07T19:45:05.1482696Z #define __GCC_HAVE_SYNC_COMPARE_AND_SWAP_1 1 2025-05-07T19:45:05.1483051Z #define __GCC_HAVE_SYNC_COMPARE_AND_SWAP_2 1 2025-05-07T19:45:05.1483388Z #define __GCC_HAVE_SYNC_COMPARE_AND_SWAP_4 1 2025-05-07T19:45:05.1483740Z #define __GCC_HAVE_SYNC_COMPARE_AND_SWAP_8 1 2025-05-07T19:45:05.1484049Z #define __GNUC_MINOR__ 2 2025-05-07T19:45:05.1484320Z #define __GNUC_PATCHLEVEL__ 1 2025-05-07T19:45:05.1484593Z #define __GNUC_STDC_INLINE__ 1 2025-05-07T19:45:05.1484868Z #define __GNUC__ 4 2025-05-07T19:45:05.1485099Z #define __GXX_ABI_VERSION 1002 2025-05-07T19:45:05.1485375Z #define __INT16_C_SUFFIX__ 2025-05-07T19:45:05.1485634Z #define __INT16_FMTd__ "hd" 2025-05-07T19:45:05.1485999Z #define __INT16_FMTi__ "hi" 2025-05-07T19:45:05.1486247Z #define __INT16_MAX__ 32767 2025-05-07T19:45:05.1486560Z #define __INT16_TYPE__ short 2025-05-07T19:45:05.1486817Z #define __INT32_C_SUFFIX__ 2025-05-07T19:45:05.1487052Z #define __INT32_FMTd__ "d" 2025-05-07T19:45:05.1487301Z #define __INT32_FMTi__ "i" 2025-05-07T19:45:05.1487538Z #define __INT32_MAX__ 2147483647 2025-05-07T19:45:05.1487804Z #define __INT32_TYPE__ int 2025-05-07T19:45:05.1488041Z #define __INT64_C_SUFFIX__ L 2025-05-07T19:45:05.1488298Z #define __INT64_FMTd__ "ld" 2025-05-07T19:45:05.1488539Z #define __INT64_FMTi__ "li" 2025-05-07T19:45:05.1488799Z #define __INT64_MAX__ 9223372036854775807L 2025-05-07T19:45:05.1489091Z #define __INT64_TYPE__ long int 2025-05-07T19:45:05.1489340Z #define __INT8_C_SUFFIX__ 2025-05-07T19:45:05.1489587Z #define __INT8_FMTd__ "hhd" 2025-05-07T19:45:05.1489822Z #define __INT8_FMTi__ "hhi" 2025-05-07T19:45:05.1490068Z #define __INT8_MAX__ 127 2025-05-07T19:45:05.1490303Z #define __INT8_TYPE__ signed char 2025-05-07T19:45:05.1490577Z #define __INTMAX_C_SUFFIX__ L 2025-05-07T19:45:05.1490830Z #define __INTMAX_FMTd__ "ld" 2025-05-07T19:45:05.1491090Z #define __INTMAX_FMTi__ "li" 2025-05-07T19:45:05.1491345Z #define __INTMAX_MAX__ 9223372036854775807L 2025-05-07T19:45:05.1491644Z #define __INTMAX_TYPE__ long int 2025-05-07T19:45:05.1491912Z #define __INTMAX_WIDTH__ 64 2025-05-07T19:45:05.1492156Z #define __INTPTR_FMTd__ "ld" 2025-05-07T19:45:05.1492417Z #define __INTPTR_FMTi__ "li" 2025-05-07T19:45:05.1492675Z #define __INTPTR_MAX__ 9223372036854775807L 2025-05-07T19:45:05.1492990Z #define __INTPTR_TYPE__ long int 2025-05-07T19:45:05.1493246Z #define __INTPTR_WIDTH__ 64 2025-05-07T19:45:05.1493502Z #define __INT_FAST16_FMTd__ "hd" 2025-05-07T19:45:05.1493757Z #define __INT_FAST16_FMTi__ "hi" 2025-05-07T19:45:05.1494023Z #define __INT_FAST16_MAX__ 32767 2025-05-07T19:45:05.1494279Z #define __INT_FAST16_TYPE__ short 2025-05-07T19:45:05.1494558Z #define __INT_FAST16_WIDTH__ 16 2025-05-07T19:45:05.1494822Z #define __INT_FAST32_FMTd__ "d" 2025-05-07T19:45:05.1495073Z #define __INT_FAST32_FMTi__ "i" 2025-05-07T19:45:05.1495359Z #define __INT_FAST32_MAX__ 2147483647 2025-05-07T19:45:05.1495630Z #define __INT_FAST32_TYPE__ int 2025-05-07T19:45:05.1495892Z #define __INT_FAST32_WIDTH__ 32 2025-05-07T19:45:05.1496143Z #define __INT_FAST64_FMTd__ "ld" 2025-05-07T19:45:05.1496418Z #define __INT_FAST64_FMTi__ "li" 2025-05-07T19:45:05.1496701Z #define __INT_FAST64_MAX__ 9223372036854775807L 2025-05-07T19:45:05.1497020Z #define __INT_FAST64_TYPE__ long int 2025-05-07T19:45:05.1497375Z #define __INT_FAST64_WIDTH__ 64 2025-05-07T19:45:05.1497649Z #define __INT_FAST8_FMTd__ "hhd" 2025-05-07T19:45:05.1497923Z #define __INT_FAST8_FMTi__ "hhi" 2025-05-07T19:45:05.1498179Z #define __INT_FAST8_MAX__ 127 2025-05-07T19:45:05.1498461Z #define __INT_FAST8_TYPE__ signed char 2025-05-07T19:45:05.1498739Z #define __INT_FAST8_WIDTH__ 8 2025-05-07T19:45:05.1499012Z #define __INT_LEAST16_FMTd__ "hd" 2025-05-07T19:45:05.1499279Z #define __INT_LEAST16_FMTi__ "hi" 2025-05-07T19:45:05.1499561Z #define __INT_LEAST16_MAX__ 32767 2025-05-07T19:45:05.1499833Z #define __INT_LEAST16_TYPE__ short 2025-05-07T19:45:05.1500115Z #define __INT_LEAST16_WIDTH__ 16 2025-05-07T19:45:05.1500377Z #define __INT_LEAST32_FMTd__ "d" 2025-05-07T19:45:05.1500652Z #define __INT_LEAST32_FMTi__ "i" 2025-05-07T19:45:05.1500932Z #define __INT_LEAST32_MAX__ 2147483647 2025-05-07T19:45:05.1501209Z #define __INT_LEAST32_TYPE__ int 2025-05-07T19:45:05.1501488Z #define __INT_LEAST32_WIDTH__ 32 2025-05-07T19:45:05.1501751Z #define __INT_LEAST64_FMTd__ "ld" 2025-05-07T19:45:05.1502034Z #define __INT_LEAST64_FMTi__ "li" 2025-05-07T19:45:05.1502318Z #define __INT_LEAST64_MAX__ 9223372036854775807L 2025-05-07T19:45:05.1502639Z #define __INT_LEAST64_TYPE__ long int 2025-05-07T19:45:05.1502911Z #define __INT_LEAST64_WIDTH__ 64 2025-05-07T19:45:05.1503180Z #define __INT_LEAST8_FMTd__ "hhd" 2025-05-07T19:45:05.1503452Z #define __INT_LEAST8_FMTi__ "hhi" 2025-05-07T19:45:05.1503709Z #define __INT_LEAST8_MAX__ 127 2025-05-07T19:45:05.1503978Z #define __INT_LEAST8_TYPE__ signed char 2025-05-07T19:45:05.1504330Z #define __INT_LEAST8_WIDTH__ 8 2025-05-07T19:45:05.1504587Z #define __INT_MAX__ 2147483647 2025-05-07T19:45:05.1504825Z #define __INT_WIDTH__ 32 2025-05-07T19:45:05.1505084Z #define __LDBL_DECIMAL_DIG__ 21 2025-05-07T19:45:05.1505580Z #define __LDBL_DENORM_MIN__ 3.64519953188247460253e-4951L 2025-05-07T19:45:05.1505970Z #define __LDBL_DIG__ 18 2025-05-07T19:45:05.1506272Z #define __LDBL_EPSILON__ 1.08420217248550443401e-19L 2025-05-07T19:45:05.1506839Z #define __LDBL_HAS_DENORM__ 1 2025-05-07T19:45:05.1507174Z #define __LDBL_HAS_INFINITY__ 1 2025-05-07T19:45:05.1507479Z #define __LDBL_HAS_QUIET_NAN__ 1 2025-05-07T19:45:05.1507817Z #define __LDBL_MANT_DIG__ 64 2025-05-07T19:45:05.1508100Z #define __LDBL_MAX_10_EXP__ 4932 2025-05-07T19:45:05.1508388Z #define __LDBL_MAX_EXP__ 16384 2025-05-07T19:45:05.1508681Z #define __LDBL_MAX__ 1.18973149535723176502e+4932L 2025-05-07T19:45:05.1509023Z #define __LDBL_MIN_10_EXP__ (-4931) 2025-05-07T19:45:05.1509312Z #define __LDBL_MIN_EXP__ (-16381) 2025-05-07T19:45:05.1509633Z #define __LDBL_MIN__ 3.36210314311209350626e-4932L 2025-05-07T19:45:05.1509965Z #define __LITTLE_ENDIAN__ 1 2025-05-07T19:45:05.1510246Z #define __LLONG_WIDTH__ 64 2025-05-07T19:45:05.1510539Z #define __LONG_LONG_MAX__ 9223372036854775807LL 2025-05-07T19:45:05.1510863Z #define __LONG_MAX__ 9223372036854775807L 2025-05-07T19:45:05.1511174Z #define __LONG_WIDTH__ 64 2025-05-07T19:45:05.1511496Z #define __LP64__ 1 2025-05-07T19:45:05.1511752Z #define __MMX__ 1 2025-05-07T19:45:05.1511982Z #define __NO_INLINE__ 1 2025-05-07T19:45:05.1512268Z #define __NO_MATH_INLINES 1 2025-05-07T19:45:05.1512570Z #define __OBJC_BOOL_IS_BOOL 0 2025-05-07T19:45:05.1512889Z #define __OPENCL_MEMORY_SCOPE_ALL_SVM_DEVICES 3 2025-05-07T19:45:05.1513233Z #define __OPENCL_MEMORY_SCOPE_DEVICE 2 2025-05-07T19:45:05.1513564Z #define __OPENCL_MEMORY_SCOPE_SUB_GROUP 4 2025-05-07T19:45:05.1513905Z #define __OPENCL_MEMORY_SCOPE_WORK_GROUP 1 2025-05-07T19:45:05.1514228Z #define __OPENCL_MEMORY_SCOPE_WORK_ITEM 0 2025-05-07T19:45:05.1514557Z #define __ORDER_BIG_ENDIAN__ 4321 2025-05-07T19:45:05.1514844Z #define __ORDER_LITTLE_ENDIAN__ 1234 2025-05-07T19:45:05.1515153Z #define __ORDER_PDP_ENDIAN__ 3412 2025-05-07T19:45:05.1515422Z #define __PIC__ 2 2025-05-07T19:45:05.1515653Z #define __PIE__ 2 2025-05-07T19:45:05.1515881Z #define __POINTER_WIDTH__ 64 2025-05-07T19:45:05.1516171Z #define __PRAGMA_REDEFINE_EXTNAME 1 2025-05-07T19:45:05.1516465Z #define __PTRDIFF_FMTd__ "ld" 2025-05-07T19:45:05.1516832Z #define __PTRDIFF_FMTi__ "li" 2025-05-07T19:45:05.1517134Z #define __PTRDIFF_MAX__ 9223372036854775807L 2025-05-07T19:45:05.1517446Z #define __PTRDIFF_TYPE__ long int 2025-05-07T19:45:05.1517743Z #define __PTRDIFF_WIDTH__ 64 2025-05-07T19:45:05.1518008Z #define __REGISTER_PREFIX__ 2025-05-07T19:45:05.1518286Z #define __SCHAR_MAX__ 127 2025-05-07T19:45:05.1518527Z #define __SEG_FS 1 2025-05-07T19:45:05.1518759Z #define __SEG_GS 1 2025-05-07T19:45:05.1518981Z #define __SHRT_MAX__ 32767 2025-05-07T19:45:05.1519246Z #define __SHRT_WIDTH__ 16 2025-05-07T19:45:05.1519513Z #define __SIG_ATOMIC_MAX__ 2147483647 2025-05-07T19:45:05.1519820Z #define __SIG_ATOMIC_WIDTH__ 32 2025-05-07T19:45:05.1520102Z #define __SIZEOF_DOUBLE__ 8 2025-05-07T19:45:05.1520363Z #define __SIZEOF_FLOAT128__ 16 2025-05-07T19:45:05.1520643Z #define __SIZEOF_FLOAT__ 4 2025-05-07T19:45:05.1520897Z #define __SIZEOF_INT128__ 16 2025-05-07T19:45:05.1521171Z #define __SIZEOF_INT__ 4 2025-05-07T19:45:05.1521437Z #define __SIZEOF_LONG_DOUBLE__ 16 2025-05-07T19:45:05.1521739Z #define __SIZEOF_LONG_LONG__ 8 2025-05-07T19:45:05.1522009Z #define __SIZEOF_LONG__ 8 2025-05-07T19:45:05.1522279Z #define __SIZEOF_POINTER__ 8 2025-05-07T19:45:05.1522551Z #define __SIZEOF_PTRDIFF_T__ 8 2025-05-07T19:45:05.1522833Z #define __SIZEOF_SHORT__ 2 2025-05-07T19:45:05.1523099Z #define __SIZEOF_SIZE_T__ 8 2025-05-07T19:45:05.1523360Z #define __SIZEOF_WCHAR_T__ 4 2025-05-07T19:45:05.1523638Z #define __SIZEOF_WINT_T__ 4 2025-05-07T19:45:05.1524005Z #define __SIZE_FMTX__ "lX" 2025-05-07T19:45:05.1524353Z #define __SIZE_FMTo__ "lo" 2025-05-07T19:45:05.1524607Z #define __SIZE_FMTu__ "lu" 2025-05-07T19:45:05.1524861Z #define __SIZE_FMTx__ "lx" 2025-05-07T19:45:05.1525162Z #define __SIZE_MAX__ 18446744073709551615UL 2025-05-07T19:45:05.1525459Z #define __SIZE_TYPE__ long unsigned int 2025-05-07T19:45:05.1525754Z #define __SIZE_WIDTH__ 64 2025-05-07T19:45:05.1526005Z #define __SSE2_MATH__ 1 2025-05-07T19:45:05.1526227Z #define __SSE2__ 1 2025-05-07T19:45:05.1526454Z #define __SSE_MATH__ 1 2025-05-07T19:45:05.1526672Z #define __SSE__ 1 2025-05-07T19:45:05.1526897Z #define __STDC_HOSTED__ 1 2025-05-07T19:45:05.1527133Z #define __STDC_UTF_16__ 1 2025-05-07T19:45:05.1527387Z #define __STDC_UTF_32__ 1 2025-05-07T19:45:05.1527631Z #define __STDC_VERSION__ 201710L 2025-05-07T19:45:05.1527898Z #define __STDC__ 1 2025-05-07T19:45:05.1528113Z #define __UINT16_C_SUFFIX__ 2025-05-07T19:45:05.1528378Z #define __UINT16_FMTX__ "hX" 2025-05-07T19:45:05.1528622Z #define __UINT16_FMTo__ "ho" 2025-05-07T19:45:05.1528880Z #define __UINT16_FMTu__ "hu" 2025-05-07T19:45:05.1529145Z #define __UINT16_FMTx__ "hx" 2025-05-07T19:45:05.1529386Z #define __UINT16_MAX__ 65535 2025-05-07T19:45:05.1529669Z #define __UINT16_TYPE__ unsigned short 2025-05-07T19:45:05.1529950Z #define __UINT32_C_SUFFIX__ U 2025-05-07T19:45:05.1530217Z #define __UINT32_FMTX__ "X" 2025-05-07T19:45:05.1530460Z #define __UINT32_FMTo__ "o" 2025-05-07T19:45:05.1530723Z #define __UINT32_FMTu__ "u" 2025-05-07T19:45:05.1530967Z #define __UINT32_FMTx__ "x" 2025-05-07T19:45:05.1531225Z #define __UINT32_MAX__ 4294967295U 2025-05-07T19:45:05.1531665Z #define __UINT32_TYPE__ unsigned int 2025-05-07T19:45:05.1531962Z #define __UINT64_C_SUFFIX__ UL 2025-05-07T19:45:05.1532233Z #define __UINT64_FMTX__ "lX" 2025-05-07T19:45:05.1532488Z #define __UINT64_FMTo__ "lo" 2025-05-07T19:45:05.1532755Z #define __UINT64_FMTu__ "lu" 2025-05-07T19:45:05.1533008Z #define __UINT64_FMTx__ "lx" 2025-05-07T19:45:05.1533291Z #define __UINT64_MAX__ 18446744073709551615UL 2025-05-07T19:45:05.1533774Z #define __UINT64_TYPE__ long unsigned int 2025-05-07T19:45:05.1534098Z #define __UINT8_C_SUFFIX__ 2025-05-07T19:45:05.1534353Z #define __UINT8_FMTX__ "hhX" 2025-05-07T19:45:05.1534629Z #define __UINT8_FMTo__ "hho" 2025-05-07T19:45:05.1534887Z #define __UINT8_FMTu__ "hhu" 2025-05-07T19:45:05.1535159Z #define __UINT8_FMTx__ "hhx" 2025-05-07T19:45:05.1535437Z #define __UINT8_MAX__ 255 2025-05-07T19:45:05.1535699Z #define __UINT8_TYPE__ unsigned char 2025-05-07T19:45:05.1536078Z #define __UINTMAX_C_SUFFIX__ UL 2025-05-07T19:45:05.1536362Z #define __UINTMAX_FMTX__ "lX" 2025-05-07T19:45:05.1536647Z #define __UINTMAX_FMTo__ "lo" 2025-05-07T19:45:05.1536913Z #define __UINTMAX_FMTu__ "lu" 2025-05-07T19:45:05.1537207Z #define __UINTMAX_FMTx__ "lx" 2025-05-07T19:45:05.1537494Z #define __UINTMAX_MAX__ 18446744073709551615UL 2025-05-07T19:45:05.1537842Z #define __UINTMAX_TYPE__ long unsigned int 2025-05-07T19:45:05.1538153Z #define __UINTMAX_WIDTH__ 64 2025-05-07T19:45:05.1538439Z #define __UINTPTR_FMTX__ "lX" 2025-05-07T19:45:05.1538729Z #define __UINTPTR_FMTo__ "lo" 2025-05-07T19:45:05.1538996Z #define __UINTPTR_FMTu__ "lu" 2025-05-07T19:45:05.1539280Z #define __UINTPTR_FMTx__ "lx" 2025-05-07T19:45:05.1539566Z #define __UINTPTR_MAX__ 18446744073709551615UL 2025-05-07T19:45:05.1539922Z #define __UINTPTR_TYPE__ long unsigned int 2025-05-07T19:45:05.1540231Z #define __UINTPTR_WIDTH__ 64 2025-05-07T19:45:05.1540513Z #define __UINT_FAST16_FMTX__ "hX" 2025-05-07T19:45:05.1540803Z #define __UINT_FAST16_FMTo__ "ho" 2025-05-07T19:45:05.1541097Z #define __UINT_FAST16_FMTu__ "hu" 2025-05-07T19:45:05.1541375Z #define __UINT_FAST16_FMTx__ "hx" 2025-05-07T19:45:05.1541665Z #define __UINT_FAST16_MAX__ 65535 2025-05-07T19:45:05.1541973Z #define __UINT_FAST16_TYPE__ unsigned short 2025-05-07T19:45:05.1542285Z #define __UINT_FAST32_FMTX__ "X" 2025-05-07T19:45:05.1542575Z #define __UINT_FAST32_FMTo__ "o" 2025-05-07T19:45:05.1542848Z #define __UINT_FAST32_FMTu__ "u" 2025-05-07T19:45:05.1543135Z #define __UINT_FAST32_FMTx__ "x" 2025-05-07T19:45:05.1543482Z #define __UINT_FAST32_MAX__ 4294967295U 2025-05-07T19:45:05.1543806Z #define __UINT_FAST32_TYPE__ unsigned int 2025-05-07T19:45:05.1544116Z #define __UINT_FAST64_FMTX__ "lX" 2025-05-07T19:45:05.1544411Z #define __UINT_FAST64_FMTo__ "lo" 2025-05-07T19:45:05.1544709Z #define __UINT_FAST64_FMTu__ "lu" 2025-05-07T19:45:05.1544987Z #define __UINT_FAST64_FMTx__ "lx" 2025-05-07T19:45:05.1545313Z #define __UINT_FAST64_MAX__ 18446744073709551615UL 2025-05-07T19:45:05.1545775Z #define __UINT_FAST64_TYPE__ long unsigned int 2025-05-07T19:45:05.1546108Z #define __UINT_FAST8_FMTX__ "hhX" 2025-05-07T19:45:05.1546388Z #define __UINT_FAST8_FMTo__ "hho" 2025-05-07T19:45:05.1546673Z #define __UINT_FAST8_FMTu__ "hhu" 2025-05-07T19:45:05.1546941Z #define __UINT_FAST8_FMTx__ "hhx" 2025-05-07T19:45:05.1547229Z #define __UINT_FAST8_MAX__ 255 2025-05-07T19:45:05.1547505Z #define __UINT_FAST8_TYPE__ unsigned char 2025-05-07T19:45:05.1547819Z #define __UINT_LEAST16_FMTX__ "hX" 2025-05-07T19:45:05.1548112Z #define __UINT_LEAST16_FMTo__ "ho" 2025-05-07T19:45:05.1548392Z #define __UINT_LEAST16_FMTu__ "hu" 2025-05-07T19:45:05.1548685Z #define __UINT_LEAST16_FMTx__ "hx" 2025-05-07T19:45:05.1548958Z #define __UINT_LEAST16_MAX__ 65535 2025-05-07T19:45:05.1549265Z #define __UINT_LEAST16_TYPE__ unsigned short 2025-05-07T19:45:05.1549574Z #define __UINT_LEAST32_FMTX__ "X" 2025-05-07T19:45:05.1549864Z #define __UINT_LEAST32_FMTo__ "o" 2025-05-07T19:45:05.1550143Z #define __UINT_LEAST32_FMTu__ "u" 2025-05-07T19:45:05.1550431Z #define __UINT_LEAST32_FMTx__ "x" 2025-05-07T19:45:05.1550713Z #define __UINT_LEAST32_MAX__ 4294967295U 2025-05-07T19:45:05.1551041Z #define __UINT_LEAST32_TYPE__ unsigned int 2025-05-07T19:45:05.1551419Z #define __UINT_LEAST64_FMTX__ "lX" 2025-05-07T19:45:05.1551712Z #define __UINT_LEAST64_FMTo__ "lo" 2025-05-07T19:45:05.1552174Z #define __UINT_LEAST64_FMTu__ "lu" 2025-05-07T19:45:05.1552472Z #define __UINT_LEAST64_FMTx__ "lx" 2025-05-07T19:45:05.1552867Z #define __UINT_LEAST64_MAX__ 18446744073709551615UL 2025-05-07T19:45:05.1553232Z #define __UINT_LEAST64_TYPE__ long unsigned int 2025-05-07T19:45:05.1553575Z #define __UINT_LEAST8_FMTX__ "hhX" 2025-05-07T19:45:05.1553858Z #define __UINT_LEAST8_FMTo__ "hho" 2025-05-07T19:45:05.1554164Z #define __UINT_LEAST8_FMTu__ "hhu" 2025-05-07T19:45:05.1554458Z #define __UINT_LEAST8_FMTx__ "hhx" 2025-05-07T19:45:05.1554740Z #define __UINT_LEAST8_MAX__ 255 2025-05-07T19:45:05.1555047Z #define __UINT_LEAST8_TYPE__ unsigned char 2025-05-07T19:45:05.1555437Z #define __USER_LABEL_PREFIX__ 2025-05-07T19:45:05.1556090Z #define __VERSION__ "Clang 16.0.6 (https://github.com/conda-forge/clangdev-feedstock db6970f6bb85e49860ed8bab43ebf165b5c55cc4)" 2025-05-07T19:45:05.1556730Z #define __WCHAR_MAX__ 2147483647 2025-05-07T19:45:05.1557028Z #define __WCHAR_TYPE__ int 2025-05-07T19:45:05.1557290Z #define __WCHAR_WIDTH__ 32 2025-05-07T19:45:05.1557567Z #define __WINT_MAX__ 4294967295U 2025-05-07T19:45:05.1557864Z #define __WINT_TYPE__ unsigned int 2025-05-07T19:45:05.1558147Z #define __WINT_UNSIGNED__ 1 2025-05-07T19:45:05.1558437Z #define __WINT_WIDTH__ 32 2025-05-07T19:45:05.1558685Z #define __amd64 1 2025-05-07T19:45:05.1558926Z #define __amd64__ 1 2025-05-07T19:45:05.1559149Z #define __clang__ 1 2025-05-07T19:45:05.1559415Z #define __clang_literal_encoding__ "UTF-8" 2025-05-07T19:45:05.1559725Z #define __clang_major__ 16 2025-05-07T19:45:05.1559993Z #define __clang_minor__ 0 2025-05-07T19:45:05.1560251Z #define __clang_patchlevel__ 6 2025-05-07T19:45:05.1560885Z #define __clang_version__ "16.0.6 (https://github.com/conda-forge/clangdev-feedstock db6970f6bb85e49860ed8bab43ebf165b5c55cc4)" 2025-05-07T19:45:05.1561567Z #define __clang_wide_literal_encoding__ "UTF-32" 2025-05-07T19:45:05.1561908Z #define __code_model_small__ 1 2025-05-07T19:45:05.1562189Z #define __gnu_linux__ 1 2025-05-07T19:45:05.1562426Z #define __k8 1 2025-05-07T19:45:05.1562655Z #define __k8__ 1 2025-05-07T19:45:05.1562868Z #define __linux 1 2025-05-07T19:45:05.1563096Z #define __linux__ 1 2025-05-07T19:45:05.1563313Z #define __llvm__ 1 2025-05-07T19:45:05.1563608Z #define __pic__ 2 2025-05-07T19:45:05.1563820Z #define __pie__ 2 2025-05-07T19:45:05.1564099Z #define __seg_fs __attribute__((address_space(257))) 2025-05-07T19:45:05.1564497Z #define __seg_gs __attribute__((address_space(256))) 2025-05-07T19:45:05.1564981Z #define __tune_k8__ 1 2025-05-07T19:45:05.1565224Z #define __unix 1 2025-05-07T19:45:05.1565436Z #define __unix__ 1 2025-05-07T19:45:05.1565666Z #define __x86_64 1 2025-05-07T19:45:05.1565886Z #define __x86_64__ 1 2025-05-07T19:45:05.1566124Z #define linux 1 2025-05-07T19:45:05.1566337Z #define unix 1 2025-05-07T19:45:05.1566483Z 2025-05-07T19:45:05.2037875Z 2025-05-07T19:45:05.2038395Z [INFO] Printing out all preprocessor defines in the C++ compiler ... 2025-05-07T19:45:05.2038928Z + conda run -n build_binary c++ -dM -E -x c++ - 2025-05-07T19:45:05.2039174Z 2025-05-07T19:45:06.8112074Z #define _GNU_SOURCE 1 2025-05-07T19:45:06.8112414Z #define _LP64 1 2025-05-07T19:45:06.8112824Z #define __ATOMIC_ACQUIRE 2 2025-05-07T19:45:06.8113157Z #define __ATOMIC_ACQ_REL 4 2025-05-07T19:45:06.8113422Z #define __ATOMIC_CONSUME 1 2025-05-07T19:45:06.8113715Z #define __ATOMIC_RELAXED 0 2025-05-07T19:45:06.8113975Z #define __ATOMIC_RELEASE 3 2025-05-07T19:45:06.8114247Z #define __ATOMIC_SEQ_CST 5 2025-05-07T19:45:06.8114544Z #define __BIGGEST_ALIGNMENT__ 16 2025-05-07T19:45:06.8114843Z #define __BITINT_MAXWIDTH__ 8388608 2025-05-07T19:45:06.8115158Z #define __BOOL_WIDTH__ 8 2025-05-07T19:45:06.8115475Z #define __BYTE_ORDER__ __ORDER_LITTLE_ENDIAN__ 2025-05-07T19:45:06.8115845Z #define __CHAR16_TYPE__ unsigned short 2025-05-07T19:45:06.8116165Z #define __CHAR32_TYPE__ unsigned int 2025-05-07T19:45:06.8116473Z #define __CHAR_BIT__ 8 2025-05-07T19:45:06.8116733Z #define __CLANG_ATOMIC_BOOL_LOCK_FREE 2 2025-05-07T19:45:06.8117091Z #define __CLANG_ATOMIC_CHAR16_T_LOCK_FREE 2 2025-05-07T19:45:06.8117437Z #define __CLANG_ATOMIC_CHAR32_T_LOCK_FREE 2 2025-05-07T19:45:06.8117798Z #define __CLANG_ATOMIC_CHAR_LOCK_FREE 2 2025-05-07T19:45:06.8118153Z #define __CLANG_ATOMIC_INT_LOCK_FREE 2 2025-05-07T19:45:06.8118472Z #define __CLANG_ATOMIC_LLONG_LOCK_FREE 2 2025-05-07T19:45:06.8118827Z #define __CLANG_ATOMIC_LONG_LOCK_FREE 2 2025-05-07T19:45:06.8119153Z #define __CLANG_ATOMIC_POINTER_LOCK_FREE 2 2025-05-07T19:45:06.8119499Z #define __CLANG_ATOMIC_SHORT_LOCK_FREE 2 2025-05-07T19:45:06.8119826Z #define __CLANG_ATOMIC_WCHAR_T_LOCK_FREE 2 2025-05-07T19:45:06.8120165Z #define __CONSTANT_CFSTRINGS__ 1 2025-05-07T19:45:06.8120736Z #define __DBL_DECIMAL_DIG__ 17 2025-05-07T19:45:06.8121078Z #define __DBL_DENORM_MIN__ 4.9406564584124654e-324 2025-05-07T19:45:06.8121429Z #define __DBL_DIG__ 15 2025-05-07T19:45:06.8121701Z #define __DBL_EPSILON__ 2.2204460492503131e-16 2025-05-07T19:45:06.8122050Z #define __DBL_HAS_DENORM__ 1 2025-05-07T19:45:06.8122324Z #define __DBL_HAS_INFINITY__ 1 2025-05-07T19:45:06.8122620Z #define __DBL_HAS_QUIET_NAN__ 1 2025-05-07T19:45:06.8122896Z #define __DBL_MANT_DIG__ 53 2025-05-07T19:45:06.8123177Z #define __DBL_MAX_10_EXP__ 308 2025-05-07T19:45:06.8123460Z #define __DBL_MAX_EXP__ 1024 2025-05-07T19:45:06.8123753Z #define __DBL_MAX__ 1.7976931348623157e+308 2025-05-07T19:45:06.8124064Z #define __DBL_MIN_10_EXP__ (-307) 2025-05-07T19:45:06.8124369Z #define __DBL_MIN_EXP__ (-1021) 2025-05-07T19:45:06.8124669Z #define __DBL_MIN__ 2.2250738585072014e-308 2025-05-07T19:45:06.8124996Z #define __DECIMAL_DIG__ __LDBL_DECIMAL_DIG__ 2025-05-07T19:45:06.8125446Z #define __DEPRECATED 1 2025-05-07T19:45:06.8125693Z #define __ELF__ 1 2025-05-07T19:45:06.8125936Z #define __EXCEPTIONS 1 2025-05-07T19:45:06.8126187Z #define __FINITE_MATH_ONLY__ 0 2025-05-07T19:45:06.8126468Z #define __FLOAT128__ 1 2025-05-07T19:45:06.8126859Z #define __FLT16_DECIMAL_DIG__ 5 2025-05-07T19:45:06.8127186Z #define __FLT16_DENORM_MIN__ 5.9604644775390625e-8F16 2025-05-07T19:45:06.8127537Z #define __FLT16_DIG__ 3 2025-05-07T19:45:06.8127794Z #define __FLT16_EPSILON__ 9.765625e-4F16 2025-05-07T19:45:06.8128116Z #define __FLT16_HAS_DENORM__ 1 2025-05-07T19:45:06.8128389Z #define __FLT16_HAS_INFINITY__ 1 2025-05-07T19:45:06.8128860Z #define __FLT16_HAS_QUIET_NAN__ 1 2025-05-07T19:45:06.8129141Z #define __FLT16_MANT_DIG__ 11 2025-05-07T19:45:06.8129431Z #define __FLT16_MAX_10_EXP__ 4 2025-05-07T19:45:06.8129701Z #define __FLT16_MAX_EXP__ 16 2025-05-07T19:45:06.8129990Z #define __FLT16_MAX__ 6.5504e+4F16 2025-05-07T19:45:06.8130280Z #define __FLT16_MIN_10_EXP__ (-4) 2025-05-07T19:45:06.8130584Z #define __FLT16_MIN_EXP__ (-13) 2025-05-07T19:45:06.8130887Z #define __FLT16_MIN__ 6.103515625e-5F16 2025-05-07T19:45:06.8131183Z #define __FLT_DECIMAL_DIG__ 9 2025-05-07T19:45:06.8131486Z #define __FLT_DENORM_MIN__ 1.40129846e-45F 2025-05-07T19:45:06.8131781Z #define __FLT_DIG__ 6 2025-05-07T19:45:06.8132045Z #define __FLT_EPSILON__ 1.19209290e-7F 2025-05-07T19:45:06.8132343Z #define __FLT_HAS_DENORM__ 1 2025-05-07T19:45:06.8132628Z #define __FLT_HAS_INFINITY__ 1 2025-05-07T19:45:06.8132903Z #define __FLT_HAS_QUIET_NAN__ 1 2025-05-07T19:45:06.8133197Z #define __FLT_MANT_DIG__ 24 2025-05-07T19:45:06.8133469Z #define __FLT_MAX_10_EXP__ 38 2025-05-07T19:45:06.8133754Z #define __FLT_MAX_EXP__ 128 2025-05-07T19:45:06.8134030Z #define __FLT_MAX__ 3.40282347e+38F 2025-05-07T19:45:06.8134317Z #define __FLT_MIN_10_EXP__ (-37) 2025-05-07T19:45:06.8134611Z #define __FLT_MIN_EXP__ (-125) 2025-05-07T19:45:06.8134885Z #define __FLT_MIN__ 1.17549435e-38F 2025-05-07T19:45:06.8135181Z #define __FLT_RADIX__ 2 2025-05-07T19:45:06.8135415Z #define __FXSR__ 1 2025-05-07T19:45:06.8135678Z #define __GCC_ASM_FLAG_OUTPUTS__ 1 2025-05-07T19:45:06.8135976Z #define __GCC_ATOMIC_BOOL_LOCK_FREE 2 2025-05-07T19:45:06.8136302Z #define __GCC_ATOMIC_CHAR16_T_LOCK_FREE 2 2025-05-07T19:45:06.8136623Z #define __GCC_ATOMIC_CHAR32_T_LOCK_FREE 2 2025-05-07T19:45:06.8136964Z #define __GCC_ATOMIC_CHAR_LOCK_FREE 2 2025-05-07T19:45:06.8137277Z #define __GCC_ATOMIC_INT_LOCK_FREE 2 2025-05-07T19:45:06.8137576Z #define __GCC_ATOMIC_LLONG_LOCK_FREE 2 2025-05-07T19:45:06.8137893Z #define __GCC_ATOMIC_LONG_LOCK_FREE 2 2025-05-07T19:45:06.8138202Z #define __GCC_ATOMIC_POINTER_LOCK_FREE 2 2025-05-07T19:45:06.8138533Z #define __GCC_ATOMIC_SHORT_LOCK_FREE 2 2025-05-07T19:45:06.8138847Z #define __GCC_ATOMIC_TEST_AND_SET_TRUEVAL 1 2025-05-07T19:45:06.8139185Z #define __GCC_ATOMIC_WCHAR_T_LOCK_FREE 2 2025-05-07T19:45:06.8139492Z #define __GCC_HAVE_DWARF2_CFI_ASM 1 2025-05-07T19:45:06.8139818Z #define __GCC_HAVE_SYNC_COMPARE_AND_SWAP_1 1 2025-05-07T19:45:06.8140171Z #define __GCC_HAVE_SYNC_COMPARE_AND_SWAP_2 1 2025-05-07T19:45:06.8162196Z #define __GCC_HAVE_SYNC_COMPARE_AND_SWAP_4 1 2025-05-07T19:45:06.8162701Z #define __GCC_HAVE_SYNC_COMPARE_AND_SWAP_8 1 2025-05-07T19:45:06.8163045Z #define __GLIBCXX_BITSIZE_INT_N_0 128 2025-05-07T19:45:06.8163385Z #define __GLIBCXX_TYPE_INT_N_0 __int128 2025-05-07T19:45:06.8163717Z #define __GNUC_GNU_INLINE__ 1 2025-05-07T19:45:06.8163986Z #define __GNUC_MINOR__ 2 2025-05-07T19:45:06.8164271Z #define __GNUC_PATCHLEVEL__ 1 2025-05-07T19:45:06.8164539Z #define __GNUC__ 4 2025-05-07T19:45:06.8165210Z #define __GNUG__ 4 2025-05-07T19:45:06.8165472Z #define __GXX_ABI_VERSION 1002 2025-05-07T19:45:06.8165854Z #define __GXX_EXPERIMENTAL_CXX0X__ 1 2025-05-07T19:45:06.8166152Z #define __GXX_RTTI 1 2025-05-07T19:45:06.8166417Z #define __GXX_WEAK__ 1 2025-05-07T19:45:06.8166673Z #define __INT16_C_SUFFIX__ 2025-05-07T19:45:06.8166964Z #define __INT16_FMTd__ "hd" 2025-05-07T19:45:06.8167230Z #define __INT16_FMTi__ "hi" 2025-05-07T19:45:06.8167509Z #define __INT16_MAX__ 32767 2025-05-07T19:45:06.8167803Z #define __INT16_TYPE__ short 2025-05-07T19:45:06.8168081Z #define __INT32_C_SUFFIX__ 2025-05-07T19:45:06.8168369Z #define __INT32_FMTd__ "d" 2025-05-07T19:45:06.8168637Z #define __INT32_FMTi__ "i" 2025-05-07T19:45:06.8168927Z #define __INT32_MAX__ 2147483647 2025-05-07T19:45:06.8169209Z #define __INT32_TYPE__ int 2025-05-07T19:45:06.8169494Z #define __INT64_C_SUFFIX__ L 2025-05-07T19:45:06.8169768Z #define __INT64_FMTd__ "ld" 2025-05-07T19:45:06.8170050Z #define __INT64_FMTi__ "li" 2025-05-07T19:45:06.8170327Z #define __INT64_MAX__ 9223372036854775807L 2025-05-07T19:45:06.8170818Z #define __INT64_TYPE__ long int 2025-05-07T19:45:06.8171342Z #define __INT8_C_SUFFIX__ 2025-05-07T19:45:06.8171584Z #define __INT8_FMTd__ "hhd" 2025-05-07T19:45:06.8171844Z #define __INT8_FMTi__ "hhi" 2025-05-07T19:45:06.8172085Z #define __INT8_MAX__ 127 2025-05-07T19:45:06.8172350Z #define __INT8_TYPE__ signed char 2025-05-07T19:45:06.8172623Z #define __INTMAX_C_SUFFIX__ L 2025-05-07T19:45:06.8172904Z #define __INTMAX_FMTd__ "ld" 2025-05-07T19:45:06.8173158Z #define __INTMAX_FMTi__ "li" 2025-05-07T19:45:06.8173444Z #define __INTMAX_MAX__ 9223372036854775807L 2025-05-07T19:45:06.8173740Z #define __INTMAX_TYPE__ long int 2025-05-07T19:45:06.8174022Z #define __INTMAX_WIDTH__ 64 2025-05-07T19:45:06.8174292Z #define __INTPTR_FMTd__ "ld" 2025-05-07T19:45:06.8174551Z #define __INTPTR_FMTi__ "li" 2025-05-07T19:45:06.8174837Z #define __INTPTR_MAX__ 9223372036854775807L 2025-05-07T19:45:06.8175135Z #define __INTPTR_TYPE__ long int 2025-05-07T19:45:06.8175418Z #define __INTPTR_WIDTH__ 64 2025-05-07T19:45:06.8175674Z #define __INT_FAST16_FMTd__ "hd" 2025-05-07T19:45:06.8175962Z #define __INT_FAST16_FMTi__ "hi" 2025-05-07T19:45:06.8176222Z #define __INT_FAST16_MAX__ 32767 2025-05-07T19:45:06.8176504Z #define __INT_FAST16_TYPE__ short 2025-05-07T19:45:06.8176774Z #define __INT_FAST16_WIDTH__ 16 2025-05-07T19:45:06.8177051Z #define __INT_FAST32_FMTd__ "d" 2025-05-07T19:45:06.8177329Z #define __INT_FAST32_FMTi__ "i" 2025-05-07T19:45:06.8177600Z #define __INT_FAST32_MAX__ 2147483647 2025-05-07T19:45:06.8177897Z #define __INT_FAST32_TYPE__ int 2025-05-07T19:45:06.8178158Z #define __INT_FAST32_WIDTH__ 32 2025-05-07T19:45:06.8178436Z #define __INT_FAST64_FMTd__ "ld" 2025-05-07T19:45:06.8178701Z #define __INT_FAST64_FMTi__ "li" 2025-05-07T19:45:06.8178999Z #define __INT_FAST64_MAX__ 9223372036854775807L 2025-05-07T19:45:06.8179314Z #define __INT_FAST64_TYPE__ long int 2025-05-07T19:45:06.8179613Z #define __INT_FAST64_WIDTH__ 64 2025-05-07T19:45:06.8179877Z #define __INT_FAST8_FMTd__ "hhd" 2025-05-07T19:45:06.8180163Z #define __INT_FAST8_FMTi__ "hhi" 2025-05-07T19:45:06.8180449Z #define __INT_FAST8_MAX__ 127 2025-05-07T19:45:06.8180723Z #define __INT_FAST8_TYPE__ signed char 2025-05-07T19:45:06.8181027Z #define __INT_FAST8_WIDTH__ 8 2025-05-07T19:45:06.8181290Z #define __INT_LEAST16_FMTd__ "hd" 2025-05-07T19:45:06.8181584Z #define __INT_LEAST16_FMTi__ "hi" 2025-05-07T19:45:06.8181854Z #define __INT_LEAST16_MAX__ 32767 2025-05-07T19:45:06.8182315Z #define __INT_LEAST16_TYPE__ short 2025-05-07T19:45:06.8182599Z #define __INT_LEAST16_WIDTH__ 16 2025-05-07T19:45:06.8182901Z #define __INT_LEAST32_FMTd__ "d" 2025-05-07T19:45:06.8183170Z #define __INT_LEAST32_FMTi__ "i" 2025-05-07T19:45:06.8183459Z #define __INT_LEAST32_MAX__ 2147483647 2025-05-07T19:45:06.8183766Z #define __INT_LEAST32_TYPE__ int 2025-05-07T19:45:06.8184033Z #define __INT_LEAST32_WIDTH__ 32 2025-05-07T19:45:06.8184315Z #define __INT_LEAST64_FMTd__ "ld" 2025-05-07T19:45:06.8184586Z #define __INT_LEAST64_FMTi__ "li" 2025-05-07T19:45:06.8184898Z #define __INT_LEAST64_MAX__ 9223372036854775807L 2025-05-07T19:45:06.8185213Z #define __INT_LEAST64_TYPE__ long int 2025-05-07T19:45:06.8185512Z #define __INT_LEAST64_WIDTH__ 64 2025-05-07T19:45:06.8185780Z #define __INT_LEAST8_FMTd__ "hhd" 2025-05-07T19:45:06.8186065Z #define __INT_LEAST8_FMTi__ "hhi" 2025-05-07T19:45:06.8186335Z #define __INT_LEAST8_MAX__ 127 2025-05-07T19:45:06.8186619Z #define __INT_LEAST8_TYPE__ signed char 2025-05-07T19:45:06.8186916Z #define __INT_LEAST8_WIDTH__ 8 2025-05-07T19:45:06.8187185Z #define __INT_MAX__ 2147483647 2025-05-07T19:45:06.8187437Z #define __INT_WIDTH__ 32 2025-05-07T19:45:06.8187695Z #define __LDBL_DECIMAL_DIG__ 21 2025-05-07T19:45:06.8187998Z #define __LDBL_DENORM_MIN__ 3.64519953188247460253e-4951L 2025-05-07T19:45:06.8188338Z #define __LDBL_DIG__ 18 2025-05-07T19:45:06.8188603Z #define __LDBL_EPSILON__ 1.08420217248550443401e-19L 2025-05-07T19:45:06.8188934Z #define __LDBL_HAS_DENORM__ 1 2025-05-07T19:45:06.8189196Z #define __LDBL_HAS_INFINITY__ 1 2025-05-07T19:45:06.8189561Z #define __LDBL_HAS_QUIET_NAN__ 1 2025-05-07T19:45:06.8189846Z #define __LDBL_MANT_DIG__ 64 2025-05-07T19:45:06.8190100Z #define __LDBL_MAX_10_EXP__ 4932 2025-05-07T19:45:06.8190380Z #define __LDBL_MAX_EXP__ 16384 2025-05-07T19:45:06.8190662Z #define __LDBL_MAX__ 1.18973149535723176502e+4932L 2025-05-07T19:45:06.8190993Z #define __LDBL_MIN_10_EXP__ (-4931) 2025-05-07T19:45:06.8191268Z #define __LDBL_MIN_EXP__ (-16381) 2025-05-07T19:45:06.8191718Z #define __LDBL_MIN__ 3.36210314311209350626e-4932L 2025-05-07T19:45:06.8192231Z #define __LITTLE_ENDIAN__ 1 2025-05-07T19:45:06.8192609Z #define __LLONG_WIDTH__ 64 2025-05-07T19:45:06.8192894Z #define __LONG_LONG_MAX__ 9223372036854775807LL 2025-05-07T19:45:06.8193248Z #define __LONG_MAX__ 9223372036854775807L 2025-05-07T19:45:06.8193578Z #define __LONG_WIDTH__ 64 2025-05-07T19:45:06.8193836Z #define __LP64__ 1 2025-05-07T19:45:06.8194074Z #define __MMX__ 1 2025-05-07T19:45:06.8194303Z #define __NO_INLINE__ 1 2025-05-07T19:45:06.8194571Z #define __NO_MATH_INLINES 1 2025-05-07T19:45:06.8194846Z #define __OBJC_BOOL_IS_BOOL 0 2025-05-07T19:45:06.8195170Z #define __OPENCL_MEMORY_SCOPE_ALL_SVM_DEVICES 3 2025-05-07T19:45:06.8195519Z #define __OPENCL_MEMORY_SCOPE_DEVICE 2 2025-05-07T19:45:06.8195866Z #define __OPENCL_MEMORY_SCOPE_SUB_GROUP 4 2025-05-07T19:45:06.8196223Z #define __OPENCL_MEMORY_SCOPE_WORK_GROUP 1 2025-05-07T19:45:06.8196557Z #define __OPENCL_MEMORY_SCOPE_WORK_ITEM 0 2025-05-07T19:45:06.8196898Z #define __ORDER_BIG_ENDIAN__ 4321 2025-05-07T19:45:06.8197193Z #define __ORDER_LITTLE_ENDIAN__ 1234 2025-05-07T19:45:06.8197513Z #define __ORDER_PDP_ENDIAN__ 3412 2025-05-07T19:45:06.8197790Z #define __PIC__ 2 2025-05-07T19:45:06.8198144Z #define __PIE__ 2 2025-05-07T19:45:06.8198368Z #define __POINTER_WIDTH__ 64 2025-05-07T19:45:06.8198649Z #define __PRAGMA_REDEFINE_EXTNAME 1 2025-05-07T19:45:06.8198932Z #define __PTRDIFF_FMTd__ "ld" 2025-05-07T19:45:06.8199209Z #define __PTRDIFF_FMTi__ "li" 2025-05-07T19:45:06.8199502Z #define __PTRDIFF_MAX__ 9223372036854775807L 2025-05-07T19:45:06.8199803Z #define __PTRDIFF_TYPE__ long int 2025-05-07T19:45:06.8200092Z #define __PTRDIFF_WIDTH__ 64 2025-05-07T19:45:06.8200345Z #define __REGISTER_PREFIX__ 2025-05-07T19:45:06.8200612Z #define __SCHAR_MAX__ 127 2025-05-07T19:45:06.8200848Z #define __SEG_FS 1 2025-05-07T19:45:06.8201079Z #define __SEG_GS 1 2025-05-07T19:45:06.8201298Z #define __SHRT_MAX__ 32767 2025-05-07T19:45:06.8201562Z #define __SHRT_WIDTH__ 16 2025-05-07T19:45:06.8201918Z #define __SIG_ATOMIC_MAX__ 2147483647 2025-05-07T19:45:06.8202222Z #define __SIG_ATOMIC_WIDTH__ 32 2025-05-07T19:45:06.8202482Z #define __SIZEOF_DOUBLE__ 8 2025-05-07T19:45:06.8202755Z #define __SIZEOF_FLOAT128__ 16 2025-05-07T19:45:06.8203031Z #define __SIZEOF_FLOAT__ 4 2025-05-07T19:45:06.8203278Z #define __SIZEOF_INT128__ 16 2025-05-07T19:45:06.8203541Z #define __SIZEOF_INT__ 4 2025-05-07T19:45:06.8203787Z #define __SIZEOF_LONG_DOUBLE__ 16 2025-05-07T19:45:06.8204073Z #define __SIZEOF_LONG_LONG__ 8 2025-05-07T19:45:06.8204331Z #define __SIZEOF_LONG__ 8 2025-05-07T19:45:06.8204588Z #define __SIZEOF_POINTER__ 8 2025-05-07T19:45:06.8204845Z #define __SIZEOF_PTRDIFF_T__ 8 2025-05-07T19:45:06.8205113Z #define __SIZEOF_SHORT__ 2 2025-05-07T19:45:06.8205356Z #define __SIZEOF_SIZE_T__ 8 2025-05-07T19:45:06.8205622Z #define __SIZEOF_WCHAR_T__ 4 2025-05-07T19:45:06.8205890Z #define __SIZEOF_WINT_T__ 4 2025-05-07T19:45:06.8206137Z #define __SIZE_FMTX__ "lX" 2025-05-07T19:45:06.8206401Z #define __SIZE_FMTo__ "lo" 2025-05-07T19:45:06.8206644Z #define __SIZE_FMTu__ "lu" 2025-05-07T19:45:06.8206903Z #define __SIZE_FMTx__ "lx" 2025-05-07T19:45:06.8207156Z #define __SIZE_MAX__ 18446744073709551615UL 2025-05-07T19:45:06.8207473Z #define __SIZE_TYPE__ long unsigned int 2025-05-07T19:45:06.8207757Z #define __SIZE_WIDTH__ 64 2025-05-07T19:45:06.8208015Z #define __SSE2_MATH__ 1 2025-05-07T19:45:06.8208241Z #define __SSE2__ 1 2025-05-07T19:45:06.8208471Z #define __SSE_MATH__ 1 2025-05-07T19:45:06.8208710Z #define __SSE__ 1 2025-05-07T19:45:06.8210581Z #define __STDCPP_DEFAULT_NEW_ALIGNMENT__ 16UL 2025-05-07T19:45:06.8210915Z #define __STDCPP_THREADS__ 1 2025-05-07T19:45:06.8211171Z #define __STDC_HOSTED__ 1 2025-05-07T19:45:06.8211432Z #define __STDC_UTF_16__ 1 2025-05-07T19:45:06.8211674Z #define __STDC_UTF_32__ 1 2025-05-07T19:45:06.8211933Z #define __STDC__ 1 2025-05-07T19:45:06.8212158Z #define __UINT16_C_SUFFIX__ 2025-05-07T19:45:06.8212426Z #define __UINT16_FMTX__ "hX" 2025-05-07T19:45:06.8212693Z #define __UINT16_FMTo__ "ho" 2025-05-07T19:45:06.8213147Z #define __UINT16_FMTu__ "hu" 2025-05-07T19:45:06.8213416Z #define __UINT16_FMTx__ "hx" 2025-05-07T19:45:06.8213704Z #define __UINT16_MAX__ 65535 2025-05-07T19:45:06.8214000Z #define __UINT16_TYPE__ unsigned short 2025-05-07T19:45:06.8214304Z #define __UINT32_C_SUFFIX__ U 2025-05-07T19:45:06.8214586Z #define __UINT32_FMTX__ "X" 2025-05-07T19:45:06.8214840Z #define __UINT32_FMTo__ "o" 2025-05-07T19:45:06.8215111Z #define __UINT32_FMTu__ "u" 2025-05-07T19:45:06.8215368Z #define __UINT32_FMTx__ "x" 2025-05-07T19:45:06.8215655Z #define __UINT32_MAX__ 4294967295U 2025-05-07T19:45:06.8215942Z #define __UINT32_TYPE__ unsigned int 2025-05-07T19:45:06.8216255Z #define __UINT64_C_SUFFIX__ UL 2025-05-07T19:45:06.8216526Z #define __UINT64_FMTX__ "lX" 2025-05-07T19:45:06.8216805Z #define __UINT64_FMTo__ "lo" 2025-05-07T19:45:06.8217086Z #define __UINT64_FMTu__ "lu" 2025-05-07T19:45:06.8217346Z #define __UINT64_FMTx__ "lx" 2025-05-07T19:45:06.8217644Z #define __UINT64_MAX__ 18446744073709551615UL 2025-05-07T19:45:06.8217964Z #define __UINT64_TYPE__ long unsigned int 2025-05-07T19:45:06.8218287Z #define __UINT8_C_SUFFIX__ 2025-05-07T19:45:06.8218548Z #define __UINT8_FMTX__ "hhX" 2025-05-07T19:45:06.8218955Z #define __UINT8_FMTo__ "hho" 2025-05-07T19:45:06.8219201Z #define __UINT8_FMTu__ "hhu" 2025-05-07T19:45:06.8219459Z #define __UINT8_FMTx__ "hhx" 2025-05-07T19:45:06.8219704Z #define __UINT8_MAX__ 255 2025-05-07T19:45:06.8219970Z #define __UINT8_TYPE__ unsigned char 2025-05-07T19:45:06.8220263Z #define __UINTMAX_C_SUFFIX__ UL 2025-05-07T19:45:06.8220528Z #define __UINTMAX_FMTX__ "lX" 2025-05-07T19:45:06.8220800Z #define __UINTMAX_FMTo__ "lo" 2025-05-07T19:45:06.8221058Z #define __UINTMAX_FMTu__ "lu" 2025-05-07T19:45:06.8221328Z #define __UINTMAX_FMTx__ "lx" 2025-05-07T19:45:06.8221604Z #define __UINTMAX_MAX__ 18446744073709551615UL 2025-05-07T19:45:06.8221937Z #define __UINTMAX_TYPE__ long unsigned int 2025-05-07T19:45:06.8222233Z #define __UINTMAX_WIDTH__ 64 2025-05-07T19:45:06.8222601Z #define __UINTPTR_FMTX__ "lX" 2025-05-07T19:45:06.8222870Z #define __UINTPTR_FMTo__ "lo" 2025-05-07T19:45:06.8223150Z #define __UINTPTR_FMTu__ "lu" 2025-05-07T19:45:06.8223427Z #define __UINTPTR_FMTx__ "lx" 2025-05-07T19:45:06.8223705Z #define __UINTPTR_MAX__ 18446744073709551615UL 2025-05-07T19:45:06.8224042Z #define __UINTPTR_TYPE__ long unsigned int 2025-05-07T19:45:06.8224340Z #define __UINTPTR_WIDTH__ 64 2025-05-07T19:45:06.8224614Z #define __UINT_FAST16_FMTX__ "hX" 2025-05-07T19:45:06.8224886Z #define __UINT_FAST16_FMTo__ "ho" 2025-05-07T19:45:06.8225176Z #define __UINT_FAST16_FMTu__ "hu" 2025-05-07T19:45:06.8225443Z #define __UINT_FAST16_FMTx__ "hx" 2025-05-07T19:45:06.8225728Z #define __UINT_FAST16_MAX__ 65535 2025-05-07T19:45:06.8226027Z #define __UINT_FAST16_TYPE__ unsigned short 2025-05-07T19:45:06.8226328Z #define __UINT_FAST32_FMTX__ "X" 2025-05-07T19:45:06.8226609Z #define __UINT_FAST32_FMTo__ "o" 2025-05-07T19:45:06.8226875Z #define __UINT_FAST32_FMTu__ "u" 2025-05-07T19:45:06.8227157Z #define __UINT_FAST32_FMTx__ "x" 2025-05-07T19:45:06.8227424Z #define __UINT_FAST32_MAX__ 4294967295U 2025-05-07T19:45:06.8227733Z #define __UINT_FAST32_TYPE__ unsigned int 2025-05-07T19:45:06.8228030Z #define __UINT_FAST64_FMTX__ "lX" 2025-05-07T19:45:06.8228315Z #define __UINT_FAST64_FMTo__ "lo" 2025-05-07T19:45:06.8228582Z #define __UINT_FAST64_FMTu__ "lu" 2025-05-07T19:45:06.8228863Z #define __UINT_FAST64_FMTx__ "lx" 2025-05-07T19:45:06.8229172Z #define __UINT_FAST64_MAX__ 18446744073709551615UL 2025-05-07T19:45:06.8229503Z #define __UINT_FAST64_TYPE__ long unsigned int 2025-05-07T19:45:06.8229922Z #define __UINT_FAST8_FMTX__ "hhX" 2025-05-07T19:45:06.8230190Z #define __UINT_FAST8_FMTo__ "hho" 2025-05-07T19:45:06.8230474Z #define __UINT_FAST8_FMTu__ "hhu" 2025-05-07T19:45:06.8230742Z #define __UINT_FAST8_FMTx__ "hhx" 2025-05-07T19:45:06.8231031Z #define __UINT_FAST8_MAX__ 255 2025-05-07T19:45:06.8231303Z #define __UINT_FAST8_TYPE__ unsigned char 2025-05-07T19:45:06.8231733Z #define __UINT_LEAST16_FMTX__ "hX" 2025-05-07T19:45:06.8232213Z #define __UINT_LEAST16_FMTo__ "ho" 2025-05-07T19:45:06.8232528Z #define __UINT_LEAST16_FMTu__ "hu" 2025-05-07T19:45:06.8232842Z #define __UINT_LEAST16_FMTx__ "hx" 2025-05-07T19:45:06.8233155Z #define __UINT_LEAST16_MAX__ 65535 2025-05-07T19:45:06.8233463Z #define __UINT_LEAST16_TYPE__ unsigned short 2025-05-07T19:45:06.8233808Z #define __UINT_LEAST32_FMTX__ "X" 2025-05-07T19:45:06.8234115Z #define __UINT_LEAST32_FMTo__ "o" 2025-05-07T19:45:06.8234406Z #define __UINT_LEAST32_FMTu__ "u" 2025-05-07T19:45:06.8234721Z #define __UINT_LEAST32_FMTx__ "x" 2025-05-07T19:45:06.8235014Z #define __UINT_LEAST32_MAX__ 4294967295U 2025-05-07T19:45:06.8235350Z #define __UINT_LEAST32_TYPE__ unsigned int 2025-05-07T19:45:06.8235668Z #define __UINT_LEAST64_FMTX__ "lX" 2025-05-07T19:45:06.8235977Z #define __UINT_LEAST64_FMTo__ "lo" 2025-05-07T19:45:06.8236264Z #define __UINT_LEAST64_FMTu__ "lu" 2025-05-07T19:45:06.8236570Z #define __UINT_LEAST64_FMTx__ "lx" 2025-05-07T19:45:06.8236889Z #define __UINT_LEAST64_MAX__ 18446744073709551615UL 2025-05-07T19:45:06.8237268Z #define __UINT_LEAST64_TYPE__ long unsigned int 2025-05-07T19:45:06.8237623Z #define __UINT_LEAST8_FMTX__ "hhX" 2025-05-07T19:45:06.8237917Z #define __UINT_LEAST8_FMTo__ "hho" 2025-05-07T19:45:06.8238225Z #define __UINT_LEAST8_FMTu__ "hhu" 2025-05-07T19:45:06.8238518Z #define __UINT_LEAST8_FMTx__ "hhx" 2025-05-07T19:45:06.8238823Z #define __UINT_LEAST8_MAX__ 255 2025-05-07T19:45:06.8239122Z #define __UINT_LEAST8_TYPE__ unsigned char 2025-05-07T19:45:06.8239457Z #define __USER_LABEL_PREFIX__ 2025-05-07T19:45:06.8240102Z #define __VERSION__ "Clang 16.0.6 (https://github.com/conda-forge/clangdev-feedstock db6970f6bb85e49860ed8bab43ebf165b5c55cc4)" 2025-05-07T19:45:06.8240766Z #define __WCHAR_MAX__ 2147483647 2025-05-07T19:45:06.8241067Z #define __WCHAR_TYPE__ int 2025-05-07T19:45:06.8241332Z #define __WCHAR_WIDTH__ 32 2025-05-07T19:45:06.8241797Z #define __WINT_MAX__ 4294967295U 2025-05-07T19:45:06.8242177Z #define __WINT_TYPE__ unsigned int 2025-05-07T19:45:06.8242493Z #define __WINT_UNSIGNED__ 1 2025-05-07T19:45:06.8242760Z #define __WINT_WIDTH__ 32 2025-05-07T19:45:06.8243029Z #define __amd64 1 2025-05-07T19:45:06.8243260Z #define __amd64__ 1 2025-05-07T19:45:06.8243507Z #define __clang__ 1 2025-05-07T19:45:06.8243768Z #define __clang_literal_encoding__ "UTF-8" 2025-05-07T19:45:06.8244100Z #define __clang_major__ 16 2025-05-07T19:45:06.8244477Z #define __clang_minor__ 0 2025-05-07T19:45:06.8244724Z #define __clang_patchlevel__ 6 2025-05-07T19:45:06.8245318Z #define __clang_version__ "16.0.6 (https://github.com/conda-forge/clangdev-feedstock db6970f6bb85e49860ed8bab43ebf165b5c55cc4)" 2025-05-07T19:45:06.8245947Z #define __clang_wide_literal_encoding__ "UTF-32" 2025-05-07T19:45:06.8246286Z #define __code_model_small__ 1 2025-05-07T19:45:06.8246547Z #define __cplusplus 201703L 2025-05-07T19:45:06.8246831Z #define __cpp_aggregate_bases 201603L 2025-05-07T19:45:06.8247121Z #define __cpp_aggregate_nsdmi 201304L 2025-05-07T19:45:06.8247424Z #define __cpp_alias_templates 200704L 2025-05-07T19:45:06.8247720Z #define __cpp_aligned_new 201606L 2025-05-07T19:45:06.8247994Z #define __cpp_attributes 200809L 2025-05-07T19:45:06.8248281Z #define __cpp_binary_literals 201304L 2025-05-07T19:45:06.8248567Z #define __cpp_capture_star_this 201603L 2025-05-07T19:45:06.8248870Z #define __cpp_constexpr 201603L 2025-05-07T19:45:06.8249152Z #define __cpp_constexpr_in_decltype 201711L 2025-05-07T19:45:06.8249468Z #define __cpp_decltype 200707L 2025-05-07T19:45:06.8249735Z #define __cpp_decltype_auto 201304L 2025-05-07T19:45:06.8250123Z #define __cpp_deduction_guides 201703L 2025-05-07T19:45:06.8250437Z #define __cpp_delegating_constructors 200604L 2025-05-07T19:45:06.8250775Z #define __cpp_digit_separators 201309L 2025-05-07T19:45:06.8251098Z #define __cpp_enumerator_attributes 201411L 2025-05-07T19:45:06.8251406Z #define __cpp_exceptions 199711L 2025-05-07T19:45:06.8251706Z #define __cpp_fold_expressions 201603L 2025-05-07T19:45:06.8252000Z #define __cpp_generic_lambdas 201304L 2025-05-07T19:45:06.8252320Z #define __cpp_guaranteed_copy_elision 201606L 2025-05-07T19:45:06.8252626Z #define __cpp_hex_float 201603L 2025-05-07T19:45:06.8252908Z #define __cpp_if_constexpr 201606L 2025-05-07T19:45:06.8253204Z #define __cpp_impl_destroying_delete 201806L 2025-05-07T19:45:06.8253545Z #define __cpp_inheriting_constructors 201511L 2025-05-07T19:45:06.8253876Z #define __cpp_init_captures 201304L 2025-05-07T19:45:06.8254164Z #define __cpp_initializer_lists 200806L 2025-05-07T19:45:06.8254480Z #define __cpp_inline_variables 201606L 2025-05-07T19:45:06.8254771Z #define __cpp_lambdas 200907L 2025-05-07T19:45:06.8255069Z #define __cpp_named_character_escapes 202207L 2025-05-07T19:45:06.8255391Z #define __cpp_namespace_attributes 201411L 2025-05-07T19:45:06.8255745Z #define __cpp_nested_namespace_definitions 201411L 2025-05-07T19:45:06.8256093Z #define __cpp_noexcept_function_type 201510L 2025-05-07T19:45:06.8256427Z #define __cpp_nontype_template_args 201411L 2025-05-07T19:45:06.8256792Z #define __cpp_nontype_template_parameter_auto 201606L 2025-05-07T19:45:06.8257123Z #define __cpp_nsdmi 200809L 2025-05-07T19:45:06.8257402Z #define __cpp_range_based_for 201603L 2025-05-07T19:45:06.8257685Z #define __cpp_raw_strings 200710L 2025-05-07T19:45:06.8257978Z #define __cpp_ref_qualifiers 200710L 2025-05-07T19:45:06.8258277Z #define __cpp_return_type_deduction 201304L 2025-05-07T19:45:06.8258589Z #define __cpp_rtti 199711L 2025-05-07T19:45:06.8258848Z #define __cpp_rvalue_references 200610L 2025-05-07T19:45:06.8259157Z #define __cpp_static_assert 201411L 2025-05-07T19:45:06.8259453Z #define __cpp_static_call_operator 202207L 2025-05-07T19:45:06.8259777Z #define __cpp_structured_bindings 201606L 2025-05-07T19:45:06.8260096Z #define __cpp_template_auto 201606L 2025-05-07T19:45:06.8260391Z #define __cpp_threadsafe_static_init 200806L 2025-05-07T19:45:06.8260719Z #define __cpp_unicode_characters 200704L 2025-05-07T19:45:06.8261019Z #define __cpp_unicode_literals 200710L 2025-05-07T19:45:06.8261452Z #define __cpp_user_defined_literals 200809L 2025-05-07T19:45:06.8261770Z #define __cpp_variable_templates 201304L 2025-05-07T19:45:06.8262087Z #define __cpp_variadic_templates 200704L 2025-05-07T19:45:06.8262384Z #define __cpp_variadic_using 201611L 2025-05-07T19:45:06.8262675Z #define __gnu_linux__ 1 2025-05-07T19:45:06.8262914Z #define __k8 1 2025-05-07T19:45:06.8263113Z #define __k8__ 1 2025-05-07T19:45:06.8263334Z #define __linux 1 2025-05-07T19:45:06.8263540Z #define __linux__ 1 2025-05-07T19:45:06.8263765Z #define __llvm__ 1 2025-05-07T19:45:06.8263976Z #define __pic__ 2 2025-05-07T19:45:06.8264192Z #define __pie__ 2 2025-05-07T19:45:06.8264411Z #define __private_extern__ extern 2025-05-07T19:45:06.8264926Z #define __seg_fs __attribute__((address_space(257))) 2025-05-07T19:45:06.8265575Z #define __seg_gs __attribute__((address_space(256))) 2025-05-07T19:45:06.8265931Z #define __tune_k8__ 1 2025-05-07T19:45:06.8266171Z #define __unix 1 2025-05-07T19:45:06.8266412Z #define __unix__ 1 2025-05-07T19:45:06.8266658Z #define __x86_64 1 2025-05-07T19:45:06.8266880Z #define __x86_64__ 1 2025-05-07T19:45:06.8267127Z #define linux 1 2025-05-07T19:45:06.8267341Z #define unix 1 2025-05-07T19:45:06.8267474Z 2025-05-07T19:45:06.8695255Z 2025-05-07T19:45:06.8695598Z + conda run -n build_binary c++ --version 2025-05-07T19:45:06.8695854Z 2025-05-07T19:45:08.4607931Z clang version 16.0.6 (https://github.com/conda-forge/clangdev-feedstock db6970f6bb85e49860ed8bab43ebf165b5c55cc4) 2025-05-07T19:45:08.4609804Z Target: x86_64-conda-linux-gnu 2025-05-07T19:45:08.4610615Z Thread model: posix 2025-05-07T19:45:08.4612093Z InstalledDir: /github/home/miniconda/envs/build_binary/bin 2025-05-07T19:45:08.4613998Z Configuration file: /github/home/miniconda/envs/build_binary/bin/x86_64-conda-linux-gnu-clang++.cfg 2025-05-07T19:45:08.4614834Z 2025-05-07T19:45:08.5188920Z 2025-05-07T19:45:08.5189365Z [INFO] Printing the default version of the C standard used by the compiler ... 2025-05-07T19:45:08.5190032Z + conda run -n build_binary cc -dM -E - < /dev/null | grep __STDC_VERSION__ 2025-05-07T19:45:08.5190367Z 2025-05-07T19:45:10.1676509Z #define __STDC_VERSION__ 201710L 2025-05-07T19:45:10.1680778Z 2025-05-07T19:45:10.1681552Z [INFO] Printing the default version of the C++ standard used by the compiler ... 2025-05-07T19:45:10.1683277Z + conda run -n build_binary c++ -dM -E -x c++ - < /dev/null | grep __cplusplus 2025-05-07T19:45:10.1684284Z 2025-05-07T19:45:11.8173372Z #define __cplusplus 201703L 2025-05-07T19:45:11.8175599Z 2025-05-07T19:45:11.8175785Z [INSTALL] Successfully installed C/C++ compilers 2025-05-07T19:45:11.8266748Z ##[group]Run . $PRELUDE; install_build_tools $BUILD_ENV 2025-05-07T19:45:11.8267230Z . $PRELUDE; install_build_tools $BUILD_ENV 2025-05-07T19:45:11.8268763Z shell: bash --noprofile --norc -e -o pipefail {0} 2025-05-07T19:45:11.8269111Z env: 2025-05-07T19:45:11.8269354Z PRELUDE: .github/scripts/setup_env.bash 2025-05-07T19:45:11.8269690Z BUILD_ENV: build_binary 2025-05-07T19:45:11.8269956Z BUILD_TARGET: default 2025-05-07T19:45:11.8270186Z BUILD_VARIANT: cuda 2025-05-07T19:45:11.8270443Z BUILD_CUDA_VERSION: 12.6.3 2025-05-07T19:45:11.8270697Z ##[endgroup] 2025-05-07T19:45:12.2623881Z ################################################################################ 2025-05-07T19:45:12.2624341Z # Install Build Tools 2025-05-07T19:45:12.2624598Z # 2025-05-07T19:45:12.2637760Z # [2025-05-07T19:45:12.263Z] + install_build_tools build_binary 2025-05-07T19:45:12.2638173Z ################################################################################ 2025-05-07T19:45:12.2638488Z 2025-05-07T19:45:12.2660790Z [EXEC] [ATTEMPT 0/3] + wget -q --timeout 1 pypi.org -O /dev/null 2025-05-07T19:45:12.3492647Z [CHECK] Network does not appear to be blocked. 2025-05-07T19:45:12.3505233Z [INSTALL] Installing build tools ... 2025-05-07T19:45:12.3532690Z [EXEC] [ATTEMPT 0/3] + conda install -n build_binary -c conda-forge --override-channels -y auditwheel bazel cmake>=3.30 hypothesis jinja2 make ncurses ninja openblas patchelf rhash scikit-build wheel pyyaml 2025-05-07T19:45:13.0664367Z Channels: 2025-05-07T19:45:13.0665493Z - conda-forge 2025-05-07T19:45:13.0665856Z Platform: linux-64 2025-05-07T19:45:16.0185869Z Collecting package metadata (repodata.json): - \ | / done 2025-05-07T19:45:19.6688370Z Solving environment: \ | / - done 2025-05-07T19:45:19.7311970Z 2025-05-07T19:45:19.7312846Z ## Package Plan ## 2025-05-07T19:45:19.7313368Z 2025-05-07T19:45:19.7314032Z environment location: /github/home/miniconda/envs/build_binary 2025-05-07T19:45:19.7315020Z 2025-05-07T19:45:19.7315331Z added / updated specs: 2025-05-07T19:45:19.7316094Z - auditwheel 2025-05-07T19:45:19.7316711Z - bazel 2025-05-07T19:45:19.7317328Z - cmake[version='>=3.30'] 2025-05-07T19:45:19.7318067Z - hypothesis 2025-05-07T19:45:19.7318679Z - jinja2 2025-05-07T19:45:19.7319232Z - make 2025-05-07T19:45:19.7319783Z - ncurses 2025-05-07T19:45:19.7320336Z - ninja 2025-05-07T19:45:19.7320902Z - openblas 2025-05-07T19:45:19.7321496Z - patchelf 2025-05-07T19:45:19.7322057Z - pyyaml 2025-05-07T19:45:19.7322635Z - rhash 2025-05-07T19:45:19.7323190Z - scikit-build 2025-05-07T19:45:19.7323814Z - wheel 2025-05-07T19:45:19.7324138Z 2025-05-07T19:45:19.7324150Z 2025-05-07T19:45:19.7324494Z The following packages will be downloaded: 2025-05-07T19:45:19.7325413Z 2025-05-07T19:45:19.7325746Z package | build 2025-05-07T19:45:19.7326268Z ---------------------------|----------------- 2025-05-07T19:45:19.7327065Z alsa-lib-1.2.14 | hb9d3cd8_0 553 KB conda-forge 2025-05-07T19:45:19.7327510Z attrs-25.3.0 | pyh71513ae_0 56 KB conda-forge 2025-05-07T19:45:19.7327989Z auditwheel-6.2.0 | pyha804496_1 40 KB conda-forge 2025-05-07T19:45:19.7328430Z bazel-7.5.0 | h96810dc_2 47.4 MB conda-forge 2025-05-07T19:45:19.7328869Z c-ares-1.34.5 | hb9d3cd8_0 202 KB conda-forge 2025-05-07T19:45:19.7329292Z cairo-1.18.0 | hbb29018_2 961 KB conda-forge 2025-05-07T19:45:19.7329726Z click-8.1.8 | pyh707e725_0 83 KB conda-forge 2025-05-07T19:45:19.7330154Z cmake-4.0.2 | h74e3db0_0 19.4 MB conda-forge 2025-05-07T19:45:19.7330577Z distro-1.9.0 | pyhd8ed1ab_1 41 KB conda-forge 2025-05-07T19:45:19.7331387Z exceptiongroup-1.2.2 | pyhd8ed1ab_1 20 KB conda-forge 2025-05-07T19:45:19.7331925Z font-ttf-dejavu-sans-mono-2.37| hab24e00_0 388 KB conda-forge 2025-05-07T19:45:19.7332485Z font-ttf-inconsolata-3.000 | h77eed37_0 94 KB conda-forge 2025-05-07T19:45:19.7333037Z font-ttf-source-code-pro-2.038| h77eed37_0 684 KB conda-forge 2025-05-07T19:45:19.7333546Z font-ttf-ubuntu-0.83 | h77eed37_3 1.5 MB conda-forge 2025-05-07T19:45:19.7334030Z fontconfig-2.15.0 | h7e30c49_1 259 KB conda-forge 2025-05-07T19:45:19.7334508Z fonts-conda-ecosystem-1 | 0 4 KB conda-forge 2025-05-07T19:45:19.7335017Z fonts-conda-forge-1 | 0 4 KB conda-forge 2025-05-07T19:45:19.7335588Z freetype-2.13.3 | ha770c72_1 168 KB conda-forge 2025-05-07T19:45:19.7336008Z giflib-5.2.2 | hd590300_0 75 KB conda-forge 2025-05-07T19:45:19.7336441Z graphite2-1.3.13 | h59595ed_1003 95 KB conda-forge 2025-05-07T19:45:19.7336861Z harfbuzz-9.0.0 | hfac3d4d_0 1.5 MB conda-forge 2025-05-07T19:45:19.7337309Z hypothesis-6.131.14 | pyha770c72_0 348 KB conda-forge 2025-05-07T19:45:19.7337727Z ijar-7.5.0 | h5888daf_0 114 KB conda-forge 2025-05-07T19:45:19.7338136Z jinja2-3.1.6 | pyhd8ed1ab_0 110 KB conda-forge 2025-05-07T19:45:19.7338550Z keyutils-1.6.1 | h166bdaf_0 115 KB conda-forge 2025-05-07T19:45:19.7338967Z krb5-1.21.3 | h659f571_0 1.3 MB conda-forge 2025-05-07T19:45:19.7339358Z lcms2-2.17 | h717163a_0 242 KB conda-forge 2025-05-07T19:45:19.7339738Z lerc-4.0.0 | h0aef613_1 258 KB conda-forge 2025-05-07T19:45:19.7340177Z libabseil-20250127.1 | cxx17_hbbce691_0 1.3 MB conda-forge 2025-05-07T19:45:19.7340620Z libcups-2.3.3 | h4637d8d_4 4.3 MB conda-forge 2025-05-07T19:45:19.7341035Z libcurl-8.13.0 | h332b0f4_0 428 KB conda-forge 2025-05-07T19:45:19.7341462Z libdeflate-1.23 | h86f0d12_0 71 KB conda-forge 2025-05-07T19:45:19.7341908Z libedit-3.1.20250104 | pl5321h7949ede_0 132 KB conda-forge 2025-05-07T19:45:19.7342350Z libev-4.33 | hd590300_2 110 KB conda-forge 2025-05-07T19:45:19.7342761Z libfreetype-2.13.3 | ha770c72_1 8 KB conda-forge 2025-05-07T19:45:19.7343224Z libfreetype6-2.13.3 | h48d6fc4_1 371 KB conda-forge 2025-05-07T19:45:19.7343670Z libgfortran-15.1.0 | h69a702a_2 34 KB conda-forge 2025-05-07T19:45:19.7344162Z libgfortran5-15.1.0 | hcea5267_2 1.5 MB conda-forge 2025-05-07T19:45:19.7344591Z libglib-2.84.0 | h2ff4ddf_0 3.8 MB conda-forge 2025-05-07T19:45:19.7345097Z libgrpc-1.71.0 | h8e591d7_1 7.6 MB conda-forge 2025-05-07T19:45:19.7345530Z libjpeg-turbo-3.1.0 | hb9d3cd8_0 614 KB conda-forge 2025-05-07T19:45:19.7345979Z liblzma-5.8.1 | hb9d3cd8_1 110 KB conda-forge 2025-05-07T19:45:19.7346409Z liblzma-devel-5.8.1 | hb9d3cd8_1 431 KB conda-forge 2025-05-07T19:45:19.7346871Z libnghttp2-1.64.0 | h161d5f1_0 632 KB conda-forge 2025-05-07T19:45:19.7347348Z libopenblas-0.3.29 |pthreads_h94d23a6_0 5.6 MB conda-forge 2025-05-07T19:45:19.7347784Z libpng-1.6.47 | h943b412_0 282 KB conda-forge 2025-05-07T19:45:19.7348223Z libprotobuf-5.29.3 | h501fc15_1 3.2 MB conda-forge 2025-05-07T19:45:19.7348663Z libre2-11-2024.07.02 | hba17884_3 205 KB conda-forge 2025-05-07T19:45:19.7349231Z libssh2-1.11.1 | hcf80075_0 298 KB conda-forge 2025-05-07T19:45:19.7349656Z libtiff-4.7.0 | hd9ff511_4 419 KB conda-forge 2025-05-07T19:45:19.7350060Z libuv-1.50.0 | hb9d3cd8_0 870 KB conda-forge 2025-05-07T19:45:19.7350502Z libwebp-base-1.5.0 | h851e524_0 420 KB conda-forge 2025-05-07T19:45:19.7350920Z libxcb-1.17.0 | h8a09558_0 387 KB conda-forge 2025-05-07T19:45:19.7351669Z libzlib-1.3.1 | hb9d3cd8_2 60 KB conda-forge 2025-05-07T19:45:19.7352271Z make-4.4.1 | hb9d3cd8_2 501 KB conda-forge 2025-05-07T19:45:19.7352796Z markupsafe-3.0.2 | py312h178313f_1 24 KB conda-forge 2025-05-07T19:45:19.7353274Z ncurses-6.5 | h2d0b736_3 871 KB conda-forge 2025-05-07T19:45:19.7353704Z ninja-1.12.1 | hff21bea_1 158 KB conda-forge 2025-05-07T19:45:19.7354193Z openblas-0.3.29 |pthreads_h6ec200e_0 5.8 MB conda-forge 2025-05-07T19:45:19.7354663Z openjdk-23.0.1 | h4c11d01_0 181.3 MB conda-forge 2025-05-07T19:45:19.7355140Z packaging-25.0 | pyh29332c3_1 61 KB conda-forge 2025-05-07T19:45:19.7355602Z patchelf-0.18.0 | h3f2d84a_2 133 KB conda-forge 2025-05-07T19:45:19.7356057Z pcre2-10.44 | hc749103_2 934 KB conda-forge 2025-05-07T19:45:19.7356493Z pixman-0.46.0 | h29eaf8c_0 389 KB conda-forge 2025-05-07T19:45:19.7356946Z pthread-stubs-0.4 | hb9d3cd8_1002 8 KB conda-forge 2025-05-07T19:45:19.7357432Z pyelftools-0.32 | pyh707e725_1 146 KB conda-forge 2025-05-07T19:45:19.7357884Z pyyaml-6.0.2 | py312h178313f_2 202 KB conda-forge 2025-05-07T19:45:19.7358434Z re2-2024.07.02 | h9925aae_3 26 KB conda-forge 2025-05-07T19:45:19.7358839Z rhash-1.4.5 | hb9d3cd8_0 183 KB conda-forge 2025-05-07T19:45:19.7359257Z scikit-build-0.18.1 | pyhae55e72_2 114 KB conda-forge 2025-05-07T19:45:19.7359702Z singlejar-7.5.0 | h0e684df_1 122 KB conda-forge 2025-05-07T19:45:19.7360153Z sortedcontainers-2.4.0 | pyhd8ed1ab_1 28 KB conda-forge 2025-05-07T19:45:19.7360626Z sqlite-3.46.0 | h6d4b2fc_0 840 KB conda-forge 2025-05-07T19:45:19.7361030Z tk-8.6.13 |noxft_h4845f30_101 3.2 MB conda-forge 2025-05-07T19:45:19.7361461Z tomli-2.2.1 | pyhd8ed1ab_1 19 KB conda-forge 2025-05-07T19:45:19.7361897Z wheel-0.45.1 | pyhd8ed1ab_1 61 KB conda-forge 2025-05-07T19:45:19.7362335Z xorg-libice-1.1.2 | hb9d3cd8_0 57 KB conda-forge 2025-05-07T19:45:19.7362890Z xorg-libsm-1.2.6 | he73a12e_0 27 KB conda-forge 2025-05-07T19:45:19.7363335Z xorg-libx11-1.8.12 | h4f16b4b_0 816 KB conda-forge 2025-05-07T19:45:19.7363805Z xorg-libxau-1.0.12 | hb9d3cd8_0 14 KB conda-forge 2025-05-07T19:45:19.7364245Z xorg-libxdmcp-1.1.5 | hb9d3cd8_0 19 KB conda-forge 2025-05-07T19:45:19.7364906Z xorg-libxext-1.3.6 | hb9d3cd8_0 49 KB conda-forge 2025-05-07T19:45:19.7365636Z xorg-libxfixes-6.0.1 | hb9d3cd8_0 19 KB conda-forge 2025-05-07T19:45:19.7366131Z xorg-libxi-1.8.2 | hb9d3cd8_0 46 KB conda-forge 2025-05-07T19:45:19.7366618Z xorg-libxrandr-1.5.4 | hb9d3cd8_0 29 KB conda-forge 2025-05-07T19:45:19.7367115Z xorg-libxrender-0.9.12 | hb9d3cd8_0 32 KB conda-forge 2025-05-07T19:45:19.7367757Z xorg-libxt-1.3.1 | hb9d3cd8_0 371 KB conda-forge 2025-05-07T19:45:19.7368247Z xorg-libxtst-1.2.5 | hb9d3cd8_3 32 KB conda-forge 2025-05-07T19:45:19.7368681Z xz-5.8.1 | hbcc6ac9_1 23 KB conda-forge 2025-05-07T19:45:19.7369132Z xz-gpl-tools-5.8.1 | hbcc6ac9_1 33 KB conda-forge 2025-05-07T19:45:19.7369588Z xz-tools-5.8.1 | hb9d3cd8_1 94 KB conda-forge 2025-05-07T19:45:19.7370029Z yaml-0.2.5 | h7f98852_2 87 KB conda-forge 2025-05-07T19:45:19.7370442Z zlib-1.3.1 | hb9d3cd8_2 90 KB conda-forge 2025-05-07T19:45:19.7370867Z zstd-1.5.7 | hb8e6e7a_2 554 KB conda-forge 2025-05-07T19:45:19.7371301Z ------------------------------------------------------------ 2025-05-07T19:45:19.7371773Z Total: 306.3 MB 2025-05-07T19:45:19.7372003Z 2025-05-07T19:45:19.7372129Z The following NEW packages will be INSTALLED: 2025-05-07T19:45:19.7372352Z 2025-05-07T19:45:19.7372547Z alsa-lib conda-forge/linux-64::alsa-lib-1.2.14-hb9d3cd8_0 2025-05-07T19:45:19.7372988Z attrs conda-forge/noarch::attrs-25.3.0-pyh71513ae_0 2025-05-07T19:45:19.7373452Z auditwheel conda-forge/noarch::auditwheel-6.2.0-pyha804496_1 2025-05-07T19:45:19.7373889Z bazel conda-forge/linux-64::bazel-7.5.0-h96810dc_2 2025-05-07T19:45:19.7374307Z c-ares conda-forge/linux-64::c-ares-1.34.5-hb9d3cd8_0 2025-05-07T19:45:19.7374713Z cairo conda-forge/linux-64::cairo-1.18.0-hbb29018_2 2025-05-07T19:45:19.7375131Z click conda-forge/noarch::click-8.1.8-pyh707e725_0 2025-05-07T19:45:19.7375547Z cmake conda-forge/linux-64::cmake-4.0.2-h74e3db0_0 2025-05-07T19:45:19.7375954Z distro conda-forge/noarch::distro-1.9.0-pyhd8ed1ab_1 2025-05-07T19:45:19.7376465Z exceptiongroup conda-forge/noarch::exceptiongroup-1.2.2-pyhd8ed1ab_1 2025-05-07T19:45:19.7377052Z font-ttf-dejavu-s~ conda-forge/noarch::font-ttf-dejavu-sans-mono-2.37-hab24e00_0 2025-05-07T19:45:19.7377672Z font-ttf-inconsol~ conda-forge/noarch::font-ttf-inconsolata-3.000-h77eed37_0 2025-05-07T19:45:19.7378288Z font-ttf-source-c~ conda-forge/noarch::font-ttf-source-code-pro-2.038-h77eed37_0 2025-05-07T19:45:19.7378851Z font-ttf-ubuntu conda-forge/noarch::font-ttf-ubuntu-0.83-h77eed37_3 2025-05-07T19:45:19.7379361Z fontconfig conda-forge/linux-64::fontconfig-2.15.0-h7e30c49_1 2025-05-07T19:45:19.7379850Z fonts-conda-ecosy~ conda-forge/noarch::fonts-conda-ecosystem-1-0 2025-05-07T19:45:19.7380346Z fonts-conda-forge conda-forge/noarch::fonts-conda-forge-1-0 2025-05-07T19:45:19.7380814Z freetype conda-forge/linux-64::freetype-2.13.3-ha770c72_1 2025-05-07T19:45:19.7381241Z giflib conda-forge/linux-64::giflib-5.2.2-hd590300_0 2025-05-07T19:45:19.7381808Z graphite2 conda-forge/linux-64::graphite2-1.3.13-h59595ed_1003 2025-05-07T19:45:19.7382266Z harfbuzz conda-forge/linux-64::harfbuzz-9.0.0-hfac3d4d_0 2025-05-07T19:45:19.7382753Z hypothesis conda-forge/noarch::hypothesis-6.131.14-pyha770c72_0 2025-05-07T19:45:19.7383193Z ijar conda-forge/linux-64::ijar-7.5.0-h5888daf_0 2025-05-07T19:45:19.7383610Z jinja2 conda-forge/noarch::jinja2-3.1.6-pyhd8ed1ab_0 2025-05-07T19:45:19.7384052Z keyutils conda-forge/linux-64::keyutils-1.6.1-h166bdaf_0 2025-05-07T19:45:19.7384464Z krb5 conda-forge/linux-64::krb5-1.21.3-h659f571_0 2025-05-07T19:45:19.7384867Z lcms2 conda-forge/linux-64::lcms2-2.17-h717163a_0 2025-05-07T19:45:19.7385256Z lerc conda-forge/linux-64::lerc-4.0.0-h0aef613_1 2025-05-07T19:45:19.7385729Z libabseil conda-forge/linux-64::libabseil-20250127.1-cxx17_hbbce691_0 2025-05-07T19:45:19.7386322Z libcups conda-forge/linux-64::libcups-2.3.3-h4637d8d_4 2025-05-07T19:45:19.7386756Z libcurl conda-forge/linux-64::libcurl-8.13.0-h332b0f4_0 2025-05-07T19:45:19.7387227Z libdeflate conda-forge/linux-64::libdeflate-1.23-h86f0d12_0 2025-05-07T19:45:19.7387711Z libedit conda-forge/linux-64::libedit-3.1.20250104-pl5321h7949ede_0 2025-05-07T19:45:19.7388179Z libev conda-forge/linux-64::libev-4.33-hd590300_2 2025-05-07T19:45:19.7388826Z libfreetype conda-forge/linux-64::libfreetype-2.13.3-ha770c72_1 2025-05-07T19:45:19.7389347Z libfreetype6 conda-forge/linux-64::libfreetype6-2.13.3-h48d6fc4_1 2025-05-07T19:45:19.7389881Z libgfortran conda-forge/linux-64::libgfortran-15.1.0-h69a702a_2 2025-05-07T19:45:19.7390591Z libgfortran5 conda-forge/linux-64::libgfortran5-15.1.0-hcea5267_2 2025-05-07T19:45:19.7391115Z libglib conda-forge/linux-64::libglib-2.84.0-h2ff4ddf_0 2025-05-07T19:45:19.7391701Z libgrpc conda-forge/linux-64::libgrpc-1.71.0-h8e591d7_1 2025-05-07T19:45:19.7392218Z libjpeg-turbo conda-forge/linux-64::libjpeg-turbo-3.1.0-hb9d3cd8_0 2025-05-07T19:45:19.7392750Z liblzma conda-forge/linux-64::liblzma-5.8.1-hb9d3cd8_1 2025-05-07T19:45:19.7393254Z liblzma-devel conda-forge/linux-64::liblzma-devel-5.8.1-hb9d3cd8_1 2025-05-07T19:45:19.7393811Z libnghttp2 conda-forge/linux-64::libnghttp2-1.64.0-h161d5f1_0 2025-05-07T19:45:19.7394398Z libopenblas conda-forge/linux-64::libopenblas-0.3.29-pthreads_h94d23a6_0 2025-05-07T19:45:19.7394930Z libpng conda-forge/linux-64::libpng-1.6.47-h943b412_0 2025-05-07T19:45:19.7395439Z libprotobuf conda-forge/linux-64::libprotobuf-5.29.3-h501fc15_1 2025-05-07T19:45:19.7395959Z libre2-11 conda-forge/linux-64::libre2-11-2024.07.02-hba17884_3 2025-05-07T19:45:19.7396476Z libssh2 conda-forge/linux-64::libssh2-1.11.1-hcf80075_0 2025-05-07T19:45:19.7396962Z libtiff conda-forge/linux-64::libtiff-4.7.0-hd9ff511_4 2025-05-07T19:45:19.7397416Z libuv conda-forge/linux-64::libuv-1.50.0-hb9d3cd8_0 2025-05-07T19:45:19.7398047Z libwebp-base conda-forge/linux-64::libwebp-base-1.5.0-h851e524_0 2025-05-07T19:45:19.7398530Z libxcb conda-forge/linux-64::libxcb-1.17.0-h8a09558_0 2025-05-07T19:45:19.7398978Z make conda-forge/linux-64::make-4.4.1-hb9d3cd8_2 2025-05-07T19:45:19.7399466Z markupsafe conda-forge/linux-64::markupsafe-3.0.2-py312h178313f_1 2025-05-07T19:45:19.7399949Z ninja conda-forge/linux-64::ninja-1.12.1-hff21bea_1 2025-05-07T19:45:19.7400455Z openblas conda-forge/linux-64::openblas-0.3.29-pthreads_h6ec200e_0 2025-05-07T19:45:19.7401608Z openjdk conda-forge/linux-64::openjdk-23.0.1-h4c11d01_0 2025-05-07T19:45:19.7402129Z packaging conda-forge/noarch::packaging-25.0-pyh29332c3_1 2025-05-07T19:45:19.7402634Z patchelf conda-forge/linux-64::patchelf-0.18.0-h3f2d84a_2 2025-05-07T19:45:19.7403199Z pcre2 conda-forge/linux-64::pcre2-10.44-hc749103_2 2025-05-07T19:45:19.7403669Z pixman conda-forge/linux-64::pixman-0.46.0-h29eaf8c_0 2025-05-07T19:45:19.7404181Z pthread-stubs conda-forge/linux-64::pthread-stubs-0.4-hb9d3cd8_1002 2025-05-07T19:45:19.7404744Z pyelftools conda-forge/noarch::pyelftools-0.32-pyh707e725_1 2025-05-07T19:45:19.7405236Z pyyaml conda-forge/linux-64::pyyaml-6.0.2-py312h178313f_2 2025-05-07T19:45:19.7405699Z re2 conda-forge/linux-64::re2-2024.07.02-h9925aae_3 2025-05-07T19:45:19.7406132Z rhash conda-forge/linux-64::rhash-1.4.5-hb9d3cd8_0 2025-05-07T19:45:19.7406617Z scikit-build conda-forge/noarch::scikit-build-0.18.1-pyhae55e72_2 2025-05-07T19:45:19.7407147Z singlejar conda-forge/linux-64::singlejar-7.5.0-h0e684df_1 2025-05-07T19:45:19.7407696Z sortedcontainers conda-forge/noarch::sortedcontainers-2.4.0-pyhd8ed1ab_1 2025-05-07T19:45:19.7408316Z tomli conda-forge/noarch::tomli-2.2.1-pyhd8ed1ab_1 2025-05-07T19:45:19.7408804Z xorg-libice conda-forge/linux-64::xorg-libice-1.1.2-hb9d3cd8_0 2025-05-07T19:45:19.7409306Z xorg-libsm conda-forge/linux-64::xorg-libsm-1.2.6-he73a12e_0 2025-05-07T19:45:19.7409826Z xorg-libx11 conda-forge/linux-64::xorg-libx11-1.8.12-h4f16b4b_0 2025-05-07T19:45:19.7410457Z xorg-libxau conda-forge/linux-64::xorg-libxau-1.0.12-hb9d3cd8_0 2025-05-07T19:45:19.7411187Z xorg-libxdmcp conda-forge/linux-64::xorg-libxdmcp-1.1.5-hb9d3cd8_0 2025-05-07T19:45:19.7411746Z xorg-libxext conda-forge/linux-64::xorg-libxext-1.3.6-hb9d3cd8_0 2025-05-07T19:45:19.7412296Z xorg-libxfixes conda-forge/linux-64::xorg-libxfixes-6.0.1-hb9d3cd8_0 2025-05-07T19:45:19.7412848Z xorg-libxi conda-forge/linux-64::xorg-libxi-1.8.2-hb9d3cd8_0 2025-05-07T19:45:19.7413385Z xorg-libxrandr conda-forge/linux-64::xorg-libxrandr-1.5.4-hb9d3cd8_0 2025-05-07T19:45:19.7413993Z xorg-libxrender conda-forge/linux-64::xorg-libxrender-0.9.12-hb9d3cd8_0 2025-05-07T19:45:19.7414556Z xorg-libxt conda-forge/linux-64::xorg-libxt-1.3.1-hb9d3cd8_0 2025-05-07T19:45:19.7415070Z xorg-libxtst conda-forge/linux-64::xorg-libxtst-1.2.5-hb9d3cd8_3 2025-05-07T19:45:19.7415607Z xz-gpl-tools conda-forge/linux-64::xz-gpl-tools-5.8.1-hbcc6ac9_1 2025-05-07T19:45:19.7416112Z xz-tools conda-forge/linux-64::xz-tools-5.8.1-hb9d3cd8_1 2025-05-07T19:45:19.7416568Z yaml conda-forge/linux-64::yaml-0.2.5-h7f98852_2 2025-05-07T19:45:19.7416831Z 2025-05-07T19:45:19.7416972Z The following packages will be UPDATED: 2025-05-07T19:45:19.7417191Z 2025-05-07T19:45:19.7417348Z libzlib 1.2.13-h4ab18f5_6 --> 1.3.1-hb9d3cd8_2 2025-05-07T19:45:19.7417905Z ncurses pkgs/main::ncurses-6.4-h6a678d5_0 --> conda-forge::ncurses-6.5-h2d0b736_3 2025-05-07T19:45:19.7418567Z sqlite pkgs/main::sqlite-3.45.3-h5eee18b_0 --> conda-forge::sqlite-3.46.0-h6d4b2fc_0 2025-05-07T19:45:19.7419275Z wheel pkgs/main/linux-64::wheel-0.45.1-py31~ --> conda-forge/noarch::wheel-0.45.1-pyhd8ed1ab_1 2025-05-07T19:45:19.7419925Z xz pkgs/main::xz-5.6.4-h5eee18b_1 --> conda-forge::xz-5.8.1-hbcc6ac9_1 2025-05-07T19:45:19.7420395Z zlib 1.2.13-h4ab18f5_6 --> 1.3.1-hb9d3cd8_2 2025-05-07T19:45:19.7420814Z zstd 1.5.6-ha6fb4c9_0 --> 1.5.7-hb8e6e7a_2 2025-05-07T19:45:19.7421066Z 2025-05-07T19:45:19.7421299Z The following packages will be SUPERSEDED by a higher-priority channel: 2025-05-07T19:45:19.7421660Z 2025-05-07T19:45:19.7421905Z tk pkgs/main::tk-8.6.14-h39e8969_0 --> conda-forge::tk-8.6.13-noxft_h4845f30_101 2025-05-07T19:45:19.7422256Z 2025-05-07T19:45:19.7422306Z 2025-05-07T19:45:19.7422310Z 2025-05-07T19:45:19.7422465Z Downloading and Extracting Packages: ...working... 2025-05-07T19:45:19.7422942Z openjdk-23.0.1 | 181.3 MB | | 0% 2025-05-07T19:45:19.7423201Z 2025-05-07T19:45:19.7423506Z bazel-7.5.0 | 47.4 MB | | 0%  2025-05-07T19:45:19.7423748Z 2025-05-07T19:45:19.7423753Z 2025-05-07T19:45:19.7423977Z cmake-4.0.2 | 19.4 MB | | 0%  2025-05-07T19:45:19.7424225Z 2025-05-07T19:45:19.7424229Z 2025-05-07T19:45:19.7424233Z 2025-05-07T19:45:19.7429898Z libgrpc-1.71.0 | 7.6 MB | | 0%  2025-05-07T19:45:19.7430289Z 2025-05-07T19:45:19.7430515Z 2025-05-07T19:45:19.7430520Z 2025-05-07T19:45:19.7433436Z 2025-05-07T19:45:19.7452369Z openblas-0.3.29 | 5.8 MB | | 0%  2025-05-07T19:45:19.7453311Z 2025-05-07T19:45:19.7453326Z 2025-05-07T19:45:19.7453337Z 2025-05-07T19:45:19.7453347Z 2025-05-07T19:45:19.7453358Z 2025-05-07T19:45:19.7453997Z libopenblas-0.3.29 | 5.6 MB | | 0%  2025-05-07T19:45:19.7454522Z 2025-05-07T19:45:19.7454527Z 2025-05-07T19:45:19.7454531Z 2025-05-07T19:45:19.7454534Z 2025-05-07T19:45:19.7454538Z 2025-05-07T19:45:19.7454562Z 2025-05-07T19:45:19.7454819Z libcups-2.3.3 | 4.3 MB | | 0%  2025-05-07T19:45:19.7455101Z 2025-05-07T19:45:19.7455104Z 2025-05-07T19:45:19.7455108Z 2025-05-07T19:45:19.7455112Z 2025-05-07T19:45:19.7455115Z 2025-05-07T19:45:19.7455119Z 2025-05-07T19:45:19.7455123Z 2025-05-07T19:45:19.7455402Z libglib-2.84.0 | 3.8 MB | | 0%  2025-05-07T19:45:19.7455686Z 2025-05-07T19:45:19.7455690Z 2025-05-07T19:45:19.7455693Z 2025-05-07T19:45:19.7455697Z 2025-05-07T19:45:19.7455701Z 2025-05-07T19:45:19.7455704Z 2025-05-07T19:45:19.7455708Z 2025-05-07T19:45:19.7455711Z 2025-05-07T19:45:19.7455991Z libprotobuf-5.29.3 | 3.2 MB | | 0%  2025-05-07T19:45:19.7456293Z 2025-05-07T19:45:19.7456297Z 2025-05-07T19:45:19.7456306Z 2025-05-07T19:45:19.7456315Z 2025-05-07T19:45:19.7456318Z 2025-05-07T19:45:19.7456322Z 2025-05-07T19:45:19.7456325Z 2025-05-07T19:45:19.7456329Z 2025-05-07T19:45:19.7456347Z 2025-05-07T19:45:19.7456588Z tk-8.6.13 | 3.2 MB | | 0%  2025-05-07T19:45:19.7456845Z 2025-05-07T19:45:19.7456848Z 2025-05-07T19:45:19.7456852Z 2025-05-07T19:45:19.7456855Z 2025-05-07T19:45:19.7456858Z 2025-05-07T19:45:19.7456862Z 2025-05-07T19:45:19.7456865Z 2025-05-07T19:45:19.7456869Z 2025-05-07T19:45:19.7456873Z 2025-05-07T19:45:19.7456883Z 2025-05-07T19:45:19.7457632Z font-ttf-ubuntu-0.83 | 1.5 MB | | 0%  2025-05-07T19:45:19.7457953Z 2025-05-07T19:45:19.7457956Z 2025-05-07T19:45:19.7457960Z 2025-05-07T19:45:19.7457963Z 2025-05-07T19:45:19.7457968Z 2025-05-07T19:45:19.7457983Z 2025-05-07T19:45:19.7457987Z 2025-05-07T19:45:19.7457991Z 2025-05-07T19:45:19.7458010Z 2025-05-07T19:45:19.7458013Z 2025-05-07T19:45:19.7458027Z 2025-05-07T19:45:19.7458799Z harfbuzz-9.0.0 | 1.5 MB | | 0%  2025-05-07T19:45:19.7459097Z 2025-05-07T19:45:19.7459101Z 2025-05-07T19:45:19.7459104Z 2025-05-07T19:45:19.7459108Z 2025-05-07T19:45:19.7459113Z 2025-05-07T19:45:19.7459116Z 2025-05-07T19:45:19.7459141Z 2025-05-07T19:45:19.7459151Z 2025-05-07T19:45:19.7459155Z 2025-05-07T19:45:19.7459159Z 2025-05-07T19:45:19.7459162Z 2025-05-07T19:45:19.7459165Z 2025-05-07T19:45:19.7459934Z libgfortran5-15.1.0 | 1.5 MB | | 0%  2025-05-07T19:45:19.7460254Z 2025-05-07T19:45:19.7460273Z 2025-05-07T19:45:19.7460277Z 2025-05-07T19:45:19.7460280Z 2025-05-07T19:45:19.7460284Z 2025-05-07T19:45:19.7460287Z 2025-05-07T19:45:19.7460291Z 2025-05-07T19:45:19.7460294Z 2025-05-07T19:45:19.7460298Z 2025-05-07T19:45:19.7460310Z 2025-05-07T19:45:19.7460314Z 2025-05-07T19:45:19.7460317Z 2025-05-07T19:45:19.7460320Z 2025-05-07T19:45:19.7460741Z krb5-1.21.3 | 1.3 MB | | 0%  2025-05-07T19:45:19.7461047Z 2025-05-07T19:45:19.7461051Z 2025-05-07T19:45:19.7461055Z 2025-05-07T19:45:19.7461059Z 2025-05-07T19:45:19.7461074Z 2025-05-07T19:45:19.7461077Z 2025-05-07T19:45:19.7461080Z 2025-05-07T19:45:19.7461084Z 2025-05-07T19:45:19.7461087Z 2025-05-07T19:45:19.7461091Z 2025-05-07T19:45:19.7461094Z 2025-05-07T19:45:19.7461098Z 2025-05-07T19:45:19.7461102Z 2025-05-07T19:45:19.7461105Z 2025-05-07T19:45:19.7461810Z libabseil-20250127.1 | 1.3 MB | | 0%  2025-05-07T19:45:19.7462142Z 2025-05-07T19:45:19.7462146Z 2025-05-07T19:45:19.7462149Z 2025-05-07T19:45:19.7462153Z 2025-05-07T19:45:19.7462156Z 2025-05-07T19:45:19.7462172Z 2025-05-07T19:45:19.7462176Z 2025-05-07T19:45:19.7462179Z 2025-05-07T19:45:19.7462183Z 2025-05-07T19:45:19.7462186Z 2025-05-07T19:45:19.7462190Z 2025-05-07T19:45:19.7462209Z 2025-05-07T19:45:19.7462217Z 2025-05-07T19:45:19.7462295Z 2025-05-07T19:45:19.7462299Z 2025-05-07T19:45:19.7462775Z cairo-1.18.0 | 961 KB | | 0%  2025-05-07T19:45:19.7463076Z 2025-05-07T19:45:19.7463079Z 2025-05-07T19:45:19.7463083Z 2025-05-07T19:45:19.7463086Z 2025-05-07T19:45:19.7463090Z 2025-05-07T19:45:19.7463110Z 2025-05-07T19:45:19.7463114Z 2025-05-07T19:45:19.7463128Z 2025-05-07T19:45:19.7463132Z 2025-05-07T19:45:19.7463136Z 2025-05-07T19:45:19.7463139Z 2025-05-07T19:45:19.7463143Z 2025-05-07T19:45:19.7463146Z 2025-05-07T19:45:19.7463150Z 2025-05-07T19:45:19.7463153Z 2025-05-07T19:45:19.7463157Z 2025-05-07T19:45:19.7463825Z pcre2-10.44 | 934 KB | | 0%  2025-05-07T19:45:19.7464155Z 2025-05-07T19:45:19.7464174Z 2025-05-07T19:45:19.7464178Z 2025-05-07T19:45:19.7464181Z 2025-05-07T19:45:19.7464185Z 2025-05-07T19:45:19.7464188Z 2025-05-07T19:45:19.7464196Z 2025-05-07T19:45:19.7464203Z 2025-05-07T19:45:19.7464207Z 2025-05-07T19:45:19.7464210Z 2025-05-07T19:45:19.7464214Z 2025-05-07T19:45:19.7464217Z 2025-05-07T19:45:19.7464221Z 2025-05-07T19:45:19.7464225Z 2025-05-07T19:45:19.7464228Z 2025-05-07T19:45:19.7464232Z 2025-05-07T19:45:19.7464235Z 2025-05-07T19:45:19.7465149Z ncurses-6.5 | 871 KB | | 0%  2025-05-07T19:45:19.7465526Z 2025-05-07T19:45:19.7465530Z 2025-05-07T19:45:19.7465533Z 2025-05-07T19:45:19.7465538Z 2025-05-07T19:45:19.7465541Z 2025-05-07T19:45:19.7465545Z 2025-05-07T19:45:19.7465564Z 2025-05-07T19:45:19.7465567Z 2025-05-07T19:45:19.7465589Z 2025-05-07T19:45:19.7465592Z 2025-05-07T19:45:19.7465595Z 2025-05-07T19:45:19.7465599Z 2025-05-07T19:45:19.7465602Z 2025-05-07T19:45:19.7465606Z 2025-05-07T19:45:19.7465609Z 2025-05-07T19:45:19.7465613Z 2025-05-07T19:45:19.7465616Z 2025-05-07T19:45:19.7465620Z 2025-05-07T19:45:19.7466281Z libuv-1.50.0 | 870 KB | | 0%  2025-05-07T19:45:19.7466609Z 2025-05-07T19:45:19.7466612Z 2025-05-07T19:45:19.7466616Z 2025-05-07T19:45:19.7466633Z 2025-05-07T19:45:19.7466636Z 2025-05-07T19:45:19.7466639Z 2025-05-07T19:45:19.7466643Z 2025-05-07T19:45:19.7466646Z 2025-05-07T19:45:19.7466650Z 2025-05-07T19:45:19.7466653Z 2025-05-07T19:45:19.7466657Z 2025-05-07T19:45:19.7466660Z 2025-05-07T19:45:19.7466664Z 2025-05-07T19:45:19.7466668Z 2025-05-07T19:45:19.7466671Z 2025-05-07T19:45:19.7466675Z 2025-05-07T19:45:19.7466678Z 2025-05-07T19:45:19.7466682Z 2025-05-07T19:45:19.7466685Z 2025-05-07T19:45:20.1697305Z ... (more hidden) ... 2025-05-07T19:45:20.1698248Z 2025-05-07T19:45:20.1698262Z 2025-05-07T19:45:20.1698273Z 2025-05-07T19:45:20.1845705Z libgrpc-1.71.0 | 7.6 MB | | 0%  2025-05-07T19:45:20.1846587Z 2025-05-07T19:45:20.1863935Z bazel-7.5.0 | 47.4 MB | | 0%  2025-05-07T19:45:20.1865633Z 2025-05-07T19:45:20.1865646Z 2025-05-07T19:45:20.1958604Z cmake-4.0.2 | 19.4 MB | | 0%  2025-05-07T19:45:20.1959421Z 2025-05-07T19:45:20.1959436Z 2025-05-07T19:45:20.1959446Z 2025-05-07T19:45:20.1959458Z 2025-05-07T19:45:20.2249844Z openblas-0.3.29 | 5.8 MB | | 0%  2025-05-07T19:45:20.2786183Z openjdk-23.0.1 | 181.3 MB | | 0% 2025-05-07T19:45:20.2787004Z 2025-05-07T19:45:20.2787021Z 2025-05-07T19:45:20.2787033Z 2025-05-07T19:45:20.2844336Z libgrpc-1.71.0 | 7.6 MB | ######9 | 70%  2025-05-07T19:45:20.2845187Z 2025-05-07T19:45:20.2863643Z bazel-7.5.0 | 47.4 MB | ##3 | 24%  2025-05-07T19:45:20.2863934Z 2025-05-07T19:45:20.2863938Z 2025-05-07T19:45:20.2960472Z cmake-4.0.2 | 19.4 MB | ####9 | 50%  2025-05-07T19:45:20.2961276Z 2025-05-07T19:45:20.2961289Z 2025-05-07T19:45:20.2961332Z 2025-05-07T19:45:20.2961743Z 2025-05-07T19:45:20.3251571Z openblas-0.3.29 | 5.8 MB | ########9 | 89%  2025-05-07T19:45:20.3497441Z openjdk-23.0.1 | 181.3 MB | 4 | 4% 2025-05-07T19:45:20.3498233Z 2025-05-07T19:45:20.3498247Z 2025-05-07T19:45:20.3498259Z 2025-05-07T19:45:20.3498270Z 2025-05-07T19:45:20.3848292Z openblas-0.3.29 | 5.8 MB | ########## | 100%  2025-05-07T19:45:20.3849193Z 2025-05-07T19:45:20.4214961Z bazel-7.5.0 | 47.4 MB | ####7 | 47%  2025-05-07T19:45:20.4215249Z 2025-05-07T19:45:20.4215268Z 2025-05-07T19:45:20.4215272Z 2025-05-07T19:45:20.4215275Z 2025-05-07T19:45:20.4215279Z 2025-05-07T19:45:20.4251572Z libopenblas-0.3.29 | 5.6 MB | | 0%  2025-05-07T19:45:20.4481731Z openjdk-23.0.1 | 181.3 MB | 9 | 10% 2025-05-07T19:45:20.4482519Z 2025-05-07T19:45:20.4482534Z 2025-05-07T19:45:20.4482546Z 2025-05-07T19:45:20.4837779Z libgrpc-1.71.0 | 7.6 MB | ########## | 100%  2025-05-07T19:45:20.4838684Z 2025-05-07T19:45:20.4838699Z 2025-05-07T19:45:20.4838710Z 2025-05-07T19:45:20.4838720Z 2025-05-07T19:45:20.4838731Z 2025-05-07T19:45:20.4838741Z 2025-05-07T19:45:20.4845530Z libcups-2.3.3 | 4.3 MB | | 0%  2025-05-07T19:45:20.4846613Z 2025-05-07T19:45:20.5537919Z bazel-7.5.0 | 47.4 MB | ######6 | 67%  2025-05-07T19:45:20.6119297Z openjdk-23.0.1 | 181.3 MB | #3 | 13% 2025-05-07T19:45:20.6119572Z 2025-05-07T19:45:20.6168174Z bazel-7.5.0 | 47.4 MB | ########5 | 85%  2025-05-07T19:45:20.6168442Z 2025-05-07T19:45:20.6168446Z 2025-05-07T19:45:20.6168452Z 2025-05-07T19:45:20.6168457Z 2025-05-07T19:45:20.6168469Z 2025-05-07T19:45:20.6168747Z libopenblas-0.3.29 | 5.6 MB | ########## | 100%  2025-05-07T19:45:20.6169050Z 2025-05-07T19:45:20.6169054Z 2025-05-07T19:45:20.6169058Z 2025-05-07T19:45:20.6169062Z 2025-05-07T19:45:20.6169066Z 2025-05-07T19:45:20.6395845Z libopenblas-0.3.29 | 5.6 MB | ########## | 100%  2025-05-07T19:45:20.6396195Z 2025-05-07T19:45:20.6396200Z 2025-05-07T19:45:20.6396205Z 2025-05-07T19:45:20.6396208Z 2025-05-07T19:45:20.6396212Z 2025-05-07T19:45:20.6396954Z 2025-05-07T19:45:20.6398779Z libcups-2.3.3 | 4.3 MB | ########## | 100%  2025-05-07T19:45:20.6399086Z 2025-05-07T19:45:20.6399090Z 2025-05-07T19:45:20.6399094Z 2025-05-07T19:45:20.6399097Z 2025-05-07T19:45:20.6399101Z 2025-05-07T19:45:20.6399104Z 2025-05-07T19:45:20.6536883Z libcups-2.3.3 | 4.3 MB | ########## | 100%  2025-05-07T19:45:20.6723111Z openjdk-23.0.1 | 181.3 MB | #6 | 17% 2025-05-07T19:45:20.6723390Z 2025-05-07T19:45:20.6723395Z 2025-05-07T19:45:20.6723681Z cmake-4.0.2 | 19.4 MB | ########## | 100%  2025-05-07T19:45:20.6723934Z 2025-05-07T19:45:20.6723938Z 2025-05-07T19:45:20.6780404Z cmake-4.0.2 | 19.4 MB | ########## | 100%  2025-05-07T19:45:20.6780961Z 2025-05-07T19:45:20.6780967Z 2025-05-07T19:45:20.6780970Z 2025-05-07T19:45:20.6780974Z 2025-05-07T19:45:20.6780977Z 2025-05-07T19:45:20.6780981Z 2025-05-07T19:45:20.6780985Z 2025-05-07T19:45:20.6825948Z libglib-2.84.0 | 3.8 MB | | 0%  2025-05-07T19:45:20.6826867Z 2025-05-07T19:45:20.6826880Z 2025-05-07T19:45:20.6826891Z 2025-05-07T19:45:20.6826902Z 2025-05-07T19:45:20.6826913Z 2025-05-07T19:45:20.6826924Z 2025-05-07T19:45:20.6826934Z 2025-05-07T19:45:20.6826945Z 2025-05-07T19:45:20.7199922Z libprotobuf-5.29.3 | 3.2 MB | | 0%  2025-05-07T19:45:20.7200265Z 2025-05-07T19:45:20.7200282Z 2025-05-07T19:45:20.7200285Z 2025-05-07T19:45:20.7200289Z 2025-05-07T19:45:20.7200293Z 2025-05-07T19:45:20.7200297Z 2025-05-07T19:45:20.7200300Z 2025-05-07T19:45:20.7200304Z 2025-05-07T19:45:20.7205123Z 2025-05-07T19:45:20.7799715Z tk-8.6.13 | 3.2 MB | | 0%  2025-05-07T19:45:20.7864101Z openjdk-23.0.1 | 181.3 MB | ## | 20% 2025-05-07T19:45:20.7864384Z 2025-05-07T19:45:20.7864495Z 2025-05-07T19:45:20.7864501Z 2025-05-07T19:45:20.7864506Z 2025-05-07T19:45:20.7864512Z 2025-05-07T19:45:20.7864523Z 2025-05-07T19:45:20.7864549Z 2025-05-07T19:45:20.7865589Z libglib-2.84.0 | 3.8 MB | ########## | 100%  2025-05-07T19:45:20.7865875Z 2025-05-07T19:45:20.7865878Z 2025-05-07T19:45:20.7865886Z 2025-05-07T19:45:20.7865890Z 2025-05-07T19:45:20.7865893Z 2025-05-07T19:45:20.7865896Z 2025-05-07T19:45:20.7866587Z 2025-05-07T19:45:20.7894046Z libglib-2.84.0 | 3.8 MB | ########## | 100%  2025-05-07T19:45:20.7894385Z 2025-05-07T19:45:20.7894390Z 2025-05-07T19:45:20.7894393Z 2025-05-07T19:45:20.7894397Z 2025-05-07T19:45:20.7894401Z 2025-05-07T19:45:20.7894404Z 2025-05-07T19:45:20.7894408Z 2025-05-07T19:45:20.7894423Z 2025-05-07T19:45:20.7894704Z libprotobuf-5.29.3 | 3.2 MB | ########## | 100%  2025-05-07T19:45:20.7895007Z 2025-05-07T19:45:20.7895011Z 2025-05-07T19:45:20.7895015Z 2025-05-07T19:45:20.7895018Z 2025-05-07T19:45:20.7895022Z 2025-05-07T19:45:20.7895025Z 2025-05-07T19:45:20.7895029Z 2025-05-07T19:45:20.7895032Z 2025-05-07T19:45:20.8110100Z libprotobuf-5.29.3 | 3.2 MB | ########## | 100%  2025-05-07T19:45:20.8110496Z 2025-05-07T19:45:20.8110646Z 2025-05-07T19:45:20.8110649Z 2025-05-07T19:45:20.8110662Z 2025-05-07T19:45:20.8110666Z 2025-05-07T19:45:20.8110686Z 2025-05-07T19:45:20.8110704Z 2025-05-07T19:45:20.8110708Z 2025-05-07T19:45:20.8110723Z 2025-05-07T19:45:20.8257281Z tk-8.6.13 | 3.2 MB | ########## | 100%  2025-05-07T19:45:20.8257726Z 2025-05-07T19:45:20.8257739Z 2025-05-07T19:45:20.8257743Z 2025-05-07T19:45:20.8257771Z 2025-05-07T19:45:20.8257798Z 2025-05-07T19:45:20.8257892Z 2025-05-07T19:45:20.8257897Z 2025-05-07T19:45:20.8257920Z 2025-05-07T19:45:20.8257938Z 2025-05-07T19:45:20.8257942Z 2025-05-07T19:45:20.8257947Z 2025-05-07T19:45:20.8468877Z harfbuzz-9.0.0 | 1.5 MB | 1 | 1%  2025-05-07T19:45:20.8469204Z 2025-05-07T19:45:20.8469208Z 2025-05-07T19:45:20.8469212Z 2025-05-07T19:45:20.8469216Z 2025-05-07T19:45:20.8469219Z 2025-05-07T19:45:20.8469223Z 2025-05-07T19:45:20.8469227Z 2025-05-07T19:45:20.8469230Z 2025-05-07T19:45:20.8469234Z 2025-05-07T19:45:20.8469237Z 2025-05-07T19:45:20.8509014Z font-ttf-ubuntu-0.83 | 1.5 MB | 1 | 1%  2025-05-07T19:45:20.8509405Z 2025-05-07T19:45:20.8509410Z 2025-05-07T19:45:20.8509415Z 2025-05-07T19:45:20.8509419Z 2025-05-07T19:45:20.8509424Z 2025-05-07T19:45:20.8509428Z 2025-05-07T19:45:20.8509433Z 2025-05-07T19:45:20.8509438Z 2025-05-07T19:45:20.8509442Z 2025-05-07T19:45:20.8509447Z 2025-05-07T19:45:20.8509451Z 2025-05-07T19:45:20.8509468Z 2025-05-07T19:45:20.8799839Z libgfortran5-15.1.0 | 1.5 MB | 1 | 1%  2025-05-07T19:45:20.8832851Z openjdk-23.0.1 | 181.3 MB | ##5 | 25% 2025-05-07T19:45:20.8833179Z 2025-05-07T19:45:20.8833184Z 2025-05-07T19:45:20.8833188Z 2025-05-07T19:45:20.8833191Z 2025-05-07T19:45:20.8833195Z 2025-05-07T19:45:20.8833199Z 2025-05-07T19:45:20.8833202Z 2025-05-07T19:45:20.8833206Z 2025-05-07T19:45:20.8833210Z 2025-05-07T19:45:20.8833213Z 2025-05-07T19:45:20.8833217Z 2025-05-07T19:45:20.8912426Z harfbuzz-9.0.0 | 1.5 MB | ########## | 100%  2025-05-07T19:45:20.8912784Z 2025-05-07T19:45:20.8912789Z 2025-05-07T19:45:20.8912794Z 2025-05-07T19:45:20.8912798Z 2025-05-07T19:45:20.8912803Z 2025-05-07T19:45:20.8912808Z 2025-05-07T19:45:20.8912813Z 2025-05-07T19:45:20.8912818Z 2025-05-07T19:45:20.8912822Z 2025-05-07T19:45:20.8912827Z 2025-05-07T19:45:20.8912832Z 2025-05-07T19:45:20.8912836Z 2025-05-07T19:45:20.8945517Z libgfortran5-15.1.0 | 1.5 MB | ########## | 100%  2025-05-07T19:45:20.8945898Z 2025-05-07T19:45:20.8945903Z 2025-05-07T19:45:20.8945907Z 2025-05-07T19:45:20.8945910Z 2025-05-07T19:45:20.8945914Z 2025-05-07T19:45:20.8945917Z 2025-05-07T19:45:20.8945921Z 2025-05-07T19:45:20.8945924Z 2025-05-07T19:45:20.8945941Z 2025-05-07T19:45:20.8945944Z 2025-05-07T19:45:20.9246876Z font-ttf-ubuntu-0.83 | 1.5 MB | ########## | 100%  2025-05-07T19:45:20.9247382Z 2025-05-07T19:45:20.9247546Z 2025-05-07T19:45:20.9247554Z 2025-05-07T19:45:20.9247558Z 2025-05-07T19:45:20.9247563Z 2025-05-07T19:45:20.9247568Z 2025-05-07T19:45:20.9247572Z 2025-05-07T19:45:20.9247577Z 2025-05-07T19:45:20.9247582Z 2025-05-07T19:45:20.9247587Z 2025-05-07T19:45:20.9247591Z 2025-05-07T19:45:20.9247596Z 2025-05-07T19:45:20.9247600Z 2025-05-07T19:45:20.9263642Z krb5-1.21.3 | 1.3 MB | 1 | 1%  2025-05-07T19:45:20.9263981Z 2025-05-07T19:45:20.9264005Z 2025-05-07T19:45:20.9264009Z 2025-05-07T19:45:20.9264013Z 2025-05-07T19:45:20.9264017Z 2025-05-07T19:45:20.9264020Z 2025-05-07T19:45:20.9264024Z 2025-05-07T19:45:20.9264028Z 2025-05-07T19:45:20.9264031Z 2025-05-07T19:45:20.9264035Z 2025-05-07T19:45:20.9264038Z 2025-05-07T19:45:20.9264042Z 2025-05-07T19:45:20.9264045Z 2025-05-07T19:45:20.9264049Z 2025-05-07T19:45:20.9345458Z libabseil-20250127.1 | 1.3 MB | 1 | 1%  2025-05-07T19:45:20.9345830Z 2025-05-07T19:45:20.9345835Z 2025-05-07T19:45:20.9345839Z 2025-05-07T19:45:20.9345842Z 2025-05-07T19:45:20.9600234Z openblas-0.3.29 | 5.8 MB | ########## | 100%  2025-05-07T19:45:20.9601109Z 2025-05-07T19:45:20.9601124Z 2025-05-07T19:45:20.9601134Z 2025-05-07T19:45:20.9601145Z 2025-05-07T19:45:20.9601156Z 2025-05-07T19:45:20.9601167Z 2025-05-07T19:45:20.9601178Z 2025-05-07T19:45:20.9601189Z 2025-05-07T19:45:20.9601200Z 2025-05-07T19:45:20.9601258Z 2025-05-07T19:45:20.9601282Z 2025-05-07T19:45:20.9601293Z 2025-05-07T19:45:20.9601304Z 2025-05-07T19:45:20.9601314Z 2025-05-07T19:45:20.9601324Z 2025-05-07T19:45:20.9712144Z cairo-1.18.0 | 961 KB | 1 | 2%  2025-05-07T19:45:20.9712507Z 2025-05-07T19:45:20.9712512Z 2025-05-07T19:45:20.9712516Z 2025-05-07T19:45:20.9712519Z 2025-05-07T19:45:20.9712523Z 2025-05-07T19:45:20.9712526Z 2025-05-07T19:45:20.9712530Z 2025-05-07T19:45:20.9712533Z 2025-05-07T19:45:20.9712550Z 2025-05-07T19:45:20.9712554Z 2025-05-07T19:45:20.9712557Z 2025-05-07T19:45:20.9712560Z 2025-05-07T19:45:20.9712564Z 2025-05-07T19:45:20.9712567Z 2025-05-07T19:45:20.9745632Z libabseil-20250127.1 | 1.3 MB | ########## | 100%  2025-05-07T19:45:20.9746163Z 2025-05-07T19:45:20.9746302Z 2025-05-07T19:45:20.9746309Z 2025-05-07T19:45:20.9746313Z 2025-05-07T19:45:20.9746318Z 2025-05-07T19:45:20.9746322Z 2025-05-07T19:45:20.9746523Z 2025-05-07T19:45:20.9746539Z 2025-05-07T19:45:20.9746543Z 2025-05-07T19:45:20.9746548Z 2025-05-07T19:45:20.9746552Z 2025-05-07T19:45:20.9746557Z 2025-05-07T19:45:20.9746561Z 2025-05-07T19:45:20.9846528Z krb5-1.21.3 | 1.3 MB | ########## | 100%  2025-05-07T19:45:20.9846848Z 2025-05-07T19:45:20.9846853Z 2025-05-07T19:45:20.9846857Z 2025-05-07T19:45:20.9846860Z 2025-05-07T19:45:20.9846864Z 2025-05-07T19:45:20.9846868Z 2025-05-07T19:45:20.9846871Z 2025-05-07T19:45:20.9846875Z 2025-05-07T19:45:20.9846893Z 2025-05-07T19:45:20.9846897Z 2025-05-07T19:45:20.9846900Z 2025-05-07T19:45:20.9846904Z 2025-05-07T19:45:20.9846908Z 2025-05-07T19:45:20.9846911Z 2025-05-07T19:45:20.9846915Z 2025-05-07T19:45:20.9867082Z cairo-1.18.0 | 961 KB | ########## | 100%  2025-05-07T19:45:20.9868059Z 2025-05-07T19:45:20.9868075Z 2025-05-07T19:45:20.9868087Z 2025-05-07T19:45:20.9868098Z 2025-05-07T19:45:20.9868550Z 2025-05-07T19:45:21.0126805Z libopenblas-0.3.29 | 5.6 MB | ########## | 100%  2025-05-07T19:45:21.0127766Z 2025-05-07T19:45:21.0127800Z 2025-05-07T19:45:21.0127812Z 2025-05-07T19:45:21.0127823Z 2025-05-07T19:45:21.0127834Z 2025-05-07T19:45:21.0127845Z 2025-05-07T19:45:21.0127855Z 2025-05-07T19:45:21.0127866Z 2025-05-07T19:45:21.0127877Z 2025-05-07T19:45:21.0127887Z 2025-05-07T19:45:21.0128017Z 2025-05-07T19:45:21.0128020Z 2025-05-07T19:45:21.0128024Z 2025-05-07T19:45:21.0128027Z 2025-05-07T19:45:21.0128031Z 2025-05-07T19:45:21.0128034Z 2025-05-07T19:45:21.0128038Z 2025-05-07T19:45:21.0128342Z ncurses-6.5 | 871 KB | 1 | 2%  2025-05-07T19:45:21.0128665Z 2025-05-07T19:45:21.0128669Z 2025-05-07T19:45:21.0128672Z 2025-05-07T19:45:21.0128675Z 2025-05-07T19:45:21.0128679Z 2025-05-07T19:45:21.0128682Z 2025-05-07T19:45:21.0128686Z 2025-05-07T19:45:21.0128690Z 2025-05-07T19:45:21.0128705Z 2025-05-07T19:45:21.0128713Z 2025-05-07T19:45:21.0128717Z 2025-05-07T19:45:21.0128720Z 2025-05-07T19:45:21.0128724Z 2025-05-07T19:45:21.0128727Z 2025-05-07T19:45:21.0128731Z 2025-05-07T19:45:21.0128734Z 2025-05-07T19:45:21.0352475Z pcre2-10.44 | 934 KB | 1 | 2%  2025-05-07T19:45:21.0411435Z openjdk-23.0.1 | 181.3 MB | ##8 | 29% 2025-05-07T19:45:21.0412275Z 2025-05-07T19:45:21.0412291Z 2025-05-07T19:45:21.0412303Z 2025-05-07T19:45:21.0412314Z 2025-05-07T19:45:21.0412325Z 2025-05-07T19:45:21.0412335Z 2025-05-07T19:45:21.0412346Z 2025-05-07T19:45:21.0412356Z 2025-05-07T19:45:21.0412366Z 2025-05-07T19:45:21.0412377Z 2025-05-07T19:45:21.0412387Z 2025-05-07T19:45:21.0412397Z 2025-05-07T19:45:21.0412408Z 2025-05-07T19:45:21.0412418Z 2025-05-07T19:45:21.0412428Z 2025-05-07T19:45:21.0412438Z 2025-05-07T19:45:21.0412449Z 2025-05-07T19:45:21.0412479Z 2025-05-07T19:45:21.0490687Z libuv-1.50.0 | 870 KB | 1 | 2%  2025-05-07T19:45:21.0491072Z 2025-05-07T19:45:21.0491077Z 2025-05-07T19:45:21.0491081Z 2025-05-07T19:45:21.0491084Z 2025-05-07T19:45:21.0491088Z 2025-05-07T19:45:21.0491091Z 2025-05-07T19:45:21.0491110Z 2025-05-07T19:45:21.0491113Z 2025-05-07T19:45:21.0491117Z 2025-05-07T19:45:21.0491120Z 2025-05-07T19:45:21.0491124Z 2025-05-07T19:45:21.0491128Z 2025-05-07T19:45:21.0491131Z 2025-05-07T19:45:21.0491135Z 2025-05-07T19:45:21.0491138Z 2025-05-07T19:45:21.0491142Z 2025-05-07T19:45:21.0492283Z pcre2-10.44 | 934 KB | ########## | 100%  2025-05-07T19:45:21.0492606Z 2025-05-07T19:45:21.0492618Z 2025-05-07T19:45:21.0492622Z 2025-05-07T19:45:21.0492626Z 2025-05-07T19:45:21.0492629Z 2025-05-07T19:45:21.0492632Z 2025-05-07T19:45:21.0492636Z 2025-05-07T19:45:21.0492639Z 2025-05-07T19:45:21.0492643Z 2025-05-07T19:45:21.0492646Z 2025-05-07T19:45:21.0492794Z 2025-05-07T19:45:21.0492803Z 2025-05-07T19:45:21.0492806Z 2025-05-07T19:45:21.0492810Z 2025-05-07T19:45:21.0492813Z 2025-05-07T19:45:21.0492817Z 2025-05-07T19:45:21.0492820Z 2025-05-07T19:45:21.0701130Z ncurses-6.5 | 871 KB | ########## | 100%  2025-05-07T19:45:21.0701473Z 2025-05-07T19:45:21.0701598Z 2025-05-07T19:45:21.0701607Z 2025-05-07T19:45:21.0701612Z 2025-05-07T19:45:21.0701617Z 2025-05-07T19:45:21.0701623Z 2025-05-07T19:45:21.0701629Z 2025-05-07T19:45:21.0701654Z 2025-05-07T19:45:21.0701658Z 2025-05-07T19:45:21.0701663Z 2025-05-07T19:45:21.0701667Z 2025-05-07T19:45:21.0701672Z 2025-05-07T19:45:21.0701676Z 2025-05-07T19:45:21.0701681Z 2025-05-07T19:45:21.0701713Z 2025-05-07T19:45:21.0701718Z 2025-05-07T19:45:21.0701722Z 2025-05-07T19:45:21.0701733Z 2025-05-07T19:45:21.1376193Z libuv-1.50.0 | 870 KB | ########## | 100%  2025-05-07T19:45:21.1377156Z 2025-05-07T19:45:21.1377602Z 2025-05-07T19:45:21.1377618Z 2025-05-07T19:45:21.1377629Z 2025-05-07T19:45:21.1377640Z 2025-05-07T19:45:21.1377650Z 2025-05-07T19:45:21.1396097Z libcups-2.3.3 | 4.3 MB | ########## | 100%  2025-05-07T19:45:21.1396387Z 2025-05-07T19:45:21.1396391Z 2025-05-07T19:45:21.1396395Z 2025-05-07T19:45:21.1396398Z 2025-05-07T19:45:21.1396402Z 2025-05-07T19:45:21.1396406Z 2025-05-07T19:45:21.1396409Z 2025-05-07T19:45:21.1396413Z 2025-05-07T19:45:21.1396416Z 2025-05-07T19:45:21.1396433Z 2025-05-07T19:45:21.1396436Z 2025-05-07T19:45:21.1396440Z 2025-05-07T19:45:21.1396444Z 2025-05-07T19:45:21.1396447Z 2025-05-07T19:45:21.1396451Z 2025-05-07T19:45:21.1396454Z 2025-05-07T19:45:21.1396458Z 2025-05-07T19:45:21.1396461Z 2025-05-07T19:45:21.1396465Z 2025-05-07T19:45:21.1641721Z ... (more hidden) ... 2025-05-07T19:45:21.1737034Z openjdk-23.0.1 | 181.3 MB | ###2 | 32% 2025-05-07T19:45:21.1737527Z 2025-05-07T19:45:21.1737577Z 2025-05-07T19:45:21.1737585Z 2025-05-07T19:45:21.1737590Z 2025-05-07T19:45:21.1737626Z 2025-05-07T19:45:21.1737630Z 2025-05-07T19:45:21.1737716Z 2025-05-07T19:45:21.1737724Z 2025-05-07T19:45:21.1737728Z 2025-05-07T19:45:21.1737732Z 2025-05-07T19:45:21.1737737Z 2025-05-07T19:45:21.1737750Z 2025-05-07T19:45:21.1737756Z 2025-05-07T19:45:21.1737760Z 2025-05-07T19:45:21.1737803Z 2025-05-07T19:45:21.1737814Z 2025-05-07T19:45:21.1737818Z 2025-05-07T19:45:21.1737821Z 2025-05-07T19:45:21.1737831Z 2025-05-07T19:45:21.1865336Z ... (more hidden) ... 2025-05-07T19:45:21.1865655Z 2025-05-07T19:45:21.1865660Z 2025-05-07T19:45:21.1865663Z 2025-05-07T19:45:21.2642116Z libgrpc-1.71.0 | 7.6 MB | ########## | 100%  2025-05-07T19:45:21.3147493Z openjdk-23.0.1 | 181.3 MB | ###5 | 35% 2025-05-07T19:45:21.3147998Z 2025-05-07T19:45:21.3644108Z bazel-7.5.0 | 47.4 MB | ########## | 100%  2025-05-07T19:45:21.4729468Z openjdk-23.0.1 | 181.3 MB | ###9 | 40% 2025-05-07T19:45:21.5471922Z openjdk-23.0.1 | 181.3 MB | ####2 | 43% 2025-05-07T19:45:21.5472581Z 2025-05-07T19:45:21.5472693Z 2025-05-07T19:45:21.5472700Z 2025-05-07T19:45:21.5472705Z 2025-05-07T19:45:21.5472711Z 2025-05-07T19:45:21.5472715Z 2025-05-07T19:45:21.5472720Z 2025-05-07T19:45:21.6146834Z libglib-2.84.0 | 3.8 MB | ########## | 100%  2025-05-07T19:45:21.7043531Z openjdk-23.0.1 | 181.3 MB | ####6 | 46% 2025-05-07T19:45:21.7043850Z 2025-05-07T19:45:21.7043855Z 2025-05-07T19:45:21.7043860Z 2025-05-07T19:45:21.7043865Z 2025-05-07T19:45:21.7043869Z 2025-05-07T19:45:21.7043874Z 2025-05-07T19:45:21.7043893Z 2025-05-07T19:45:21.7045292Z 2025-05-07T19:45:21.7148228Z libprotobuf-5.29.3 | 3.2 MB | ########## | 100%  2025-05-07T19:45:21.8413610Z openjdk-23.0.1 | 181.3 MB | ####9 | 49% 2025-05-07T19:45:21.8634393Z openjdk-23.0.1 | 181.3 MB | #####2 | 53% 2025-05-07T19:45:21.8634687Z 2025-05-07T19:45:21.8634692Z 2025-05-07T19:45:21.8634696Z 2025-05-07T19:45:21.8634701Z 2025-05-07T19:45:21.8634704Z 2025-05-07T19:45:21.8634708Z 2025-05-07T19:45:21.8634711Z 2025-05-07T19:45:21.8634715Z 2025-05-07T19:45:21.8634718Z 2025-05-07T19:45:21.8634722Z 2025-05-07T19:45:21.8634725Z 2025-05-07T19:45:21.8635561Z harfbuzz-9.0.0 | 1.5 MB | ########## | 100%  2025-05-07T19:45:21.8635860Z 2025-05-07T19:45:21.8635872Z 2025-05-07T19:45:21.8635876Z 2025-05-07T19:45:21.8635879Z 2025-05-07T19:45:21.8635883Z 2025-05-07T19:45:21.8635886Z 2025-05-07T19:45:21.8635890Z 2025-05-07T19:45:21.8635893Z 2025-05-07T19:45:21.8635897Z 2025-05-07T19:45:21.8635900Z 2025-05-07T19:45:21.8635904Z 2025-05-07T19:45:21.9517423Z harfbuzz-9.0.0 | 1.5 MB | ########## | 100%  2025-05-07T19:45:21.9783406Z openjdk-23.0.1 | 181.3 MB | #####5 | 55% 2025-05-07T19:45:21.9784279Z 2025-05-07T19:45:21.9784294Z 2025-05-07T19:45:21.9784305Z 2025-05-07T19:45:21.9784316Z 2025-05-07T19:45:21.9784327Z 2025-05-07T19:45:21.9784338Z 2025-05-07T19:45:21.9784348Z 2025-05-07T19:45:21.9784359Z 2025-05-07T19:45:21.9784369Z 2025-05-07T19:45:21.9784380Z 2025-05-07T19:45:21.9784390Z 2025-05-07T19:45:21.9784401Z 2025-05-07T19:45:21.9786944Z libgfortran5-15.1.0 | 1.5 MB | ########## | 100%  2025-05-07T19:45:21.9787884Z 2025-05-07T19:45:21.9787895Z 2025-05-07T19:45:21.9787905Z 2025-05-07T19:45:21.9787916Z 2025-05-07T19:45:21.9787926Z 2025-05-07T19:45:21.9787936Z 2025-05-07T19:45:21.9787947Z 2025-05-07T19:45:21.9787957Z 2025-05-07T19:45:21.9787967Z 2025-05-07T19:45:21.9787989Z 2025-05-07T19:45:21.9788000Z 2025-05-07T19:45:21.9788010Z 2025-05-07T19:45:22.0560794Z libgfortran5-15.1.0 | 1.5 MB | ########## | 100%  2025-05-07T19:45:22.0561154Z 2025-05-07T19:45:22.0561180Z 2025-05-07T19:45:22.0561184Z 2025-05-07T19:45:22.0561188Z 2025-05-07T19:45:22.0561192Z 2025-05-07T19:45:22.0561195Z 2025-05-07T19:45:22.0561199Z 2025-05-07T19:45:22.0561216Z 2025-05-07T19:45:22.0561220Z 2025-05-07T19:45:22.0561224Z 2025-05-07T19:45:22.0561956Z font-ttf-ubuntu-0.83 | 1.5 MB | ########## | 100%  2025-05-07T19:45:22.0562276Z 2025-05-07T19:45:22.0562279Z 2025-05-07T19:45:22.0562283Z 2025-05-07T19:45:22.0562286Z 2025-05-07T19:45:22.0562290Z 2025-05-07T19:45:22.0562294Z 2025-05-07T19:45:22.0562312Z 2025-05-07T19:45:22.0562315Z 2025-05-07T19:45:22.0562327Z 2025-05-07T19:45:22.0562331Z 2025-05-07T19:45:22.0627718Z font-ttf-ubuntu-0.83 | 1.5 MB | ########## | 100%  2025-05-07T19:45:22.0794475Z openjdk-23.0.1 | 181.3 MB | #####8 | 58% 2025-05-07T19:45:22.0794773Z 2025-05-07T19:45:22.0794778Z 2025-05-07T19:45:22.0794781Z 2025-05-07T19:45:22.0794785Z 2025-05-07T19:45:22.0794788Z 2025-05-07T19:45:22.0794812Z 2025-05-07T19:45:22.0794816Z 2025-05-07T19:45:22.0794820Z 2025-05-07T19:45:22.0794823Z 2025-05-07T19:45:22.0797691Z tk-8.6.13 | 3.2 MB | ########## | 100%  2025-05-07T19:45:22.0797962Z 2025-05-07T19:45:22.0797966Z 2025-05-07T19:45:22.0797970Z 2025-05-07T19:45:22.0797973Z 2025-05-07T19:45:22.0797977Z 2025-05-07T19:45:22.0797980Z 2025-05-07T19:45:22.0797984Z 2025-05-07T19:45:22.0797996Z 2025-05-07T19:45:22.0798000Z 2025-05-07T19:45:22.1629031Z tk-8.6.13 | 3.2 MB | ########## | 100%  2025-05-07T19:45:22.2539943Z openjdk-23.0.1 | 181.3 MB | ######2 | 62% 2025-05-07T19:45:22.2540270Z 2025-05-07T19:45:22.2540488Z 2025-05-07T19:45:22.2540498Z 2025-05-07T19:45:22.2540504Z 2025-05-07T19:45:22.2540509Z 2025-05-07T19:45:22.2540514Z 2025-05-07T19:45:22.2540522Z 2025-05-07T19:45:22.2540527Z 2025-05-07T19:45:22.2540532Z 2025-05-07T19:45:22.2540536Z 2025-05-07T19:45:22.2540553Z 2025-05-07T19:45:22.2540778Z 2025-05-07T19:45:22.2540783Z 2025-05-07T19:45:22.2541304Z krb5-1.21.3 | 1.3 MB | ########## | 100%  2025-05-07T19:45:22.2541606Z 2025-05-07T19:45:22.2541610Z 2025-05-07T19:45:22.2541613Z 2025-05-07T19:45:22.2541617Z 2025-05-07T19:45:22.2541620Z 2025-05-07T19:45:22.2541624Z 2025-05-07T19:45:22.2541627Z 2025-05-07T19:45:22.2541631Z 2025-05-07T19:45:22.2541634Z 2025-05-07T19:45:22.2541651Z 2025-05-07T19:45:22.2541654Z 2025-05-07T19:45:22.2541658Z 2025-05-07T19:45:22.2541661Z 2025-05-07T19:45:22.3077988Z krb5-1.21.3 | 1.3 MB | ########## | 100%  2025-05-07T19:45:22.3078914Z 2025-05-07T19:45:22.3078929Z 2025-05-07T19:45:22.3078940Z 2025-05-07T19:45:22.3078974Z 2025-05-07T19:45:22.3078985Z 2025-05-07T19:45:22.3078995Z 2025-05-07T19:45:22.3079005Z 2025-05-07T19:45:22.3079016Z 2025-05-07T19:45:22.3079026Z 2025-05-07T19:45:22.3079037Z 2025-05-07T19:45:22.3079047Z 2025-05-07T19:45:22.3079482Z 2025-05-07T19:45:22.3079498Z 2025-05-07T19:45:22.3079508Z 2025-05-07T19:45:22.3079518Z 2025-05-07T19:45:22.3080373Z cairo-1.18.0 | 961 KB | ########## | 100%  2025-05-07T19:45:22.3081273Z 2025-05-07T19:45:22.3081284Z 2025-05-07T19:45:22.3081294Z 2025-05-07T19:45:22.3081305Z 2025-05-07T19:45:22.3081315Z 2025-05-07T19:45:22.3081325Z 2025-05-07T19:45:22.3081336Z 2025-05-07T19:45:22.3081346Z 2025-05-07T19:45:22.3081357Z 2025-05-07T19:45:22.3081367Z 2025-05-07T19:45:22.3081378Z 2025-05-07T19:45:22.3081388Z 2025-05-07T19:45:22.3081399Z 2025-05-07T19:45:22.3081409Z 2025-05-07T19:45:22.3081420Z 2025-05-07T19:45:22.3310126Z cairo-1.18.0 | 961 KB | ########## | 100%  2025-05-07T19:45:22.4835986Z openjdk-23.0.1 | 181.3 MB | ######5 | 66% 2025-05-07T19:45:22.5353855Z openjdk-23.0.1 | 181.3 MB | ######8 | 69% 2025-05-07T19:45:22.5354642Z 2025-05-07T19:45:22.5354689Z 2025-05-07T19:45:22.5354694Z 2025-05-07T19:45:22.5354698Z 2025-05-07T19:45:22.5354701Z 2025-05-07T19:45:22.5354705Z 2025-05-07T19:45:22.5354708Z 2025-05-07T19:45:22.5354712Z 2025-05-07T19:45:22.5354728Z 2025-05-07T19:45:22.5354732Z 2025-05-07T19:45:22.5354736Z 2025-05-07T19:45:22.5354739Z 2025-05-07T19:45:22.5354743Z 2025-05-07T19:45:22.5354746Z 2025-05-07T19:45:22.5355298Z libabseil-20250127.1 | 1.3 MB | ########## | 100%  2025-05-07T19:45:22.5355644Z 2025-05-07T19:45:22.5355647Z 2025-05-07T19:45:22.5355684Z 2025-05-07T19:45:22.5355687Z 2025-05-07T19:45:22.5355691Z 2025-05-07T19:45:22.5355694Z 2025-05-07T19:45:22.5355698Z 2025-05-07T19:45:22.5355701Z 2025-05-07T19:45:22.5355705Z 2025-05-07T19:45:22.5355708Z 2025-05-07T19:45:22.5355712Z 2025-05-07T19:45:22.5355715Z 2025-05-07T19:45:22.5355719Z 2025-05-07T19:45:22.5356655Z 2025-05-07T19:45:22.6000723Z libabseil-20250127.1 | 1.3 MB | ########## | 100%  2025-05-07T19:45:22.7114107Z openjdk-23.0.1 | 181.3 MB | #######1 | 71% 2025-05-07T19:45:22.7634266Z openjdk-23.0.1 | 181.3 MB | #######3 | 74% 2025-05-07T19:45:22.7634585Z 2025-05-07T19:45:22.7634589Z 2025-05-07T19:45:22.7634593Z 2025-05-07T19:45:22.7634597Z 2025-05-07T19:45:22.7634600Z 2025-05-07T19:45:22.7634604Z 2025-05-07T19:45:22.7634607Z 2025-05-07T19:45:22.7634611Z 2025-05-07T19:45:22.7634614Z 2025-05-07T19:45:22.7634618Z 2025-05-07T19:45:22.7634621Z 2025-05-07T19:45:22.7634625Z 2025-05-07T19:45:22.7634629Z 2025-05-07T19:45:22.7634632Z 2025-05-07T19:45:22.7634636Z 2025-05-07T19:45:22.7634639Z 2025-05-07T19:45:22.7637527Z pcre2-10.44 | 934 KB | ########## | 100%  2025-05-07T19:45:22.7637844Z 2025-05-07T19:45:22.7637858Z 2025-05-07T19:45:22.7637862Z 2025-05-07T19:45:22.7637866Z 2025-05-07T19:45:22.7637870Z 2025-05-07T19:45:22.7637873Z 2025-05-07T19:45:22.7637877Z 2025-05-07T19:45:22.7638073Z 2025-05-07T19:45:22.7638077Z 2025-05-07T19:45:22.7638081Z 2025-05-07T19:45:22.7638084Z 2025-05-07T19:45:22.7638087Z 2025-05-07T19:45:22.7638091Z 2025-05-07T19:45:22.7638094Z 2025-05-07T19:45:22.7638098Z 2025-05-07T19:45:22.7638124Z 2025-05-07T19:45:22.7980947Z pcre2-10.44 | 934 KB | ########## | 100%  2025-05-07T19:45:22.7981303Z 2025-05-07T19:45:22.7981307Z 2025-05-07T19:45:22.7981310Z 2025-05-07T19:45:22.7981314Z 2025-05-07T19:45:22.7981317Z 2025-05-07T19:45:22.7981344Z 2025-05-07T19:45:22.7981348Z 2025-05-07T19:45:22.7981352Z 2025-05-07T19:45:22.7981356Z 2025-05-07T19:45:22.7981359Z 2025-05-07T19:45:22.7981363Z 2025-05-07T19:45:22.7981366Z 2025-05-07T19:45:22.7981370Z 2025-05-07T19:45:22.7981373Z 2025-05-07T19:45:22.7981377Z 2025-05-07T19:45:22.7981380Z 2025-05-07T19:45:22.7981384Z 2025-05-07T19:45:22.7981387Z 2025-05-07T19:45:22.7981956Z libuv-1.50.0 | 870 KB | ########## | 100%  2025-05-07T19:45:22.7982323Z 2025-05-07T19:45:22.7982327Z 2025-05-07T19:45:22.7982330Z 2025-05-07T19:45:22.7982334Z 2025-05-07T19:45:22.7982337Z 2025-05-07T19:45:22.7982341Z 2025-05-07T19:45:22.7982344Z 2025-05-07T19:45:22.7982348Z 2025-05-07T19:45:22.7982351Z 2025-05-07T19:45:22.7982355Z 2025-05-07T19:45:22.7982358Z 2025-05-07T19:45:22.7982362Z 2025-05-07T19:45:22.7982365Z 2025-05-07T19:45:22.7982378Z 2025-05-07T19:45:22.7982381Z 2025-05-07T19:45:22.7982384Z 2025-05-07T19:45:22.7982388Z 2025-05-07T19:45:22.7982414Z 2025-05-07T19:45:22.8360069Z libuv-1.50.0 | 870 KB | ########## | 100%  2025-05-07T19:45:22.9362865Z openjdk-23.0.1 | 181.3 MB | #######6 | 76% 2025-05-07T19:45:23.0362311Z openjdk-23.0.1 | 181.3 MB | #######9 | 80% 2025-05-07T19:45:23.1363447Z openjdk-23.0.1 | 181.3 MB | ########2 | 83% 2025-05-07T19:45:23.1913001Z openjdk-23.0.1 | 181.3 MB | ########7 | 88% 2025-05-07T19:45:23.1913313Z 2025-05-07T19:45:23.1913318Z 2025-05-07T19:45:23.1913322Z 2025-05-07T19:45:23.1913325Z 2025-05-07T19:45:23.1913329Z 2025-05-07T19:45:23.1913347Z 2025-05-07T19:45:23.1913350Z 2025-05-07T19:45:23.1913354Z 2025-05-07T19:45:23.1913357Z 2025-05-07T19:45:23.1913361Z 2025-05-07T19:45:23.1913364Z 2025-05-07T19:45:23.1913368Z 2025-05-07T19:45:23.1913371Z 2025-05-07T19:45:23.1913375Z 2025-05-07T19:45:23.1913378Z 2025-05-07T19:45:23.1913382Z 2025-05-07T19:45:23.1913385Z 2025-05-07T19:45:23.1913389Z 2025-05-07T19:45:23.1913392Z 2025-05-07T19:45:23.1913737Z ... (more hidden) ... 2025-05-07T19:45:23.1914048Z 2025-05-07T19:45:23.1914052Z 2025-05-07T19:45:23.1914055Z 2025-05-07T19:45:23.1914059Z 2025-05-07T19:45:23.1914062Z 2025-05-07T19:45:23.1914066Z 2025-05-07T19:45:23.1914069Z 2025-05-07T19:45:23.1914073Z 2025-05-07T19:45:23.1914076Z 2025-05-07T19:45:23.1914080Z 2025-05-07T19:45:23.1914089Z 2025-05-07T19:45:23.1914097Z 2025-05-07T19:45:23.1914101Z 2025-05-07T19:45:23.1914104Z 2025-05-07T19:45:23.1914108Z 2025-05-07T19:45:23.1914112Z 2025-05-07T19:45:23.1914115Z 2025-05-07T19:45:23.1914119Z 2025-05-07T19:45:23.1914122Z 2025-05-07T19:45:23.2365531Z ... (more hidden) ... 2025-05-07T19:45:23.3365455Z openjdk-23.0.1 | 181.3 MB | #########1 | 92% 2025-05-07T19:45:23.5121920Z openjdk-23.0.1 | 181.3 MB | #########5 | 96% 2025-05-07T19:45:23.9690157Z openjdk-23.0.1 | 181.3 MB | #########9 | 99% 2025-05-07T19:45:23.9690690Z 2025-05-07T19:45:23.9690767Z 2025-05-07T19:45:23.9690778Z 2025-05-07T19:45:23.9690782Z 2025-05-07T19:45:23.9690787Z 2025-05-07T19:45:23.9690791Z 2025-05-07T19:45:23.9690795Z 2025-05-07T19:45:23.9690825Z 2025-05-07T19:45:23.9690830Z 2025-05-07T19:45:23.9690848Z 2025-05-07T19:45:23.9690851Z 2025-05-07T19:45:23.9690855Z 2025-05-07T19:45:23.9690860Z 2025-05-07T19:45:23.9691078Z 2025-05-07T19:45:23.9691093Z 2025-05-07T19:45:23.9691097Z 2025-05-07T19:45:23.9691101Z 2025-05-07T19:45:23.9691534Z ncurses-6.5 | 871 KB | ########## | 100%  2025-05-07T19:45:23.9691884Z 2025-05-07T19:45:23.9691888Z 2025-05-07T19:45:23.9691891Z 2025-05-07T19:45:23.9691895Z 2025-05-07T19:45:23.9691898Z 2025-05-07T19:45:23.9691902Z 2025-05-07T19:45:23.9691905Z 2025-05-07T19:45:23.9691909Z 2025-05-07T19:45:23.9691912Z 2025-05-07T19:45:23.9691916Z 2025-05-07T19:45:23.9691919Z 2025-05-07T19:45:23.9691923Z 2025-05-07T19:45:23.9691926Z 2025-05-07T19:45:23.9691930Z 2025-05-07T19:45:23.9691933Z 2025-05-07T19:45:23.9691937Z 2025-05-07T19:45:23.9691946Z 2025-05-07T19:45:24.2877529Z ncurses-6.5 | 871 KB | ########## | 100%  2025-05-07T19:45:24.2877883Z 2025-05-07T19:45:24.2877888Z 2025-05-07T19:45:25.0729575Z cmake-4.0.2 | 19.4 MB | ########## | 100%  2025-05-07T19:45:25.0730157Z 2025-05-07T19:45:25.3251409Z bazel-7.5.0 | 47.4 MB | ########## | 100%  2025-05-07T19:45:26.0773997Z openjdk-23.0.1 | 181.3 MB | ########## | 100% 2025-05-07T19:45:26.0777425Z openjdk-23.0.1 | 181.3 MB | ########## | 100% 2025-05-07T19:45:26.0777689Z 2025-05-07T19:45:26.0777715Z 2025-05-07T19:45:26.0777719Z 2025-05-07T19:45:26.0777723Z 2025-05-07T19:45:26.0777755Z 2025-05-07T19:45:26.0777759Z 2025-05-07T19:45:26.0777763Z 2025-05-07T19:45:26.0777766Z 2025-05-07T19:45:26.0777770Z 2025-05-07T19:45:26.0777774Z 2025-05-07T19:45:26.0777777Z 2025-05-07T19:45:26.0777781Z 2025-05-07T19:45:26.0777784Z 2025-05-07T19:45:26.0777789Z 2025-05-07T19:45:26.0777796Z 2025-05-07T19:45:26.0777800Z 2025-05-07T19:45:26.0777803Z 2025-05-07T19:45:26.0777807Z 2025-05-07T19:45:26.0777811Z 2025-05-07T19:45:26.0777908Z 2025-05-07T19:45:26.0778356Z  2025-05-07T19:45:26.0797040Z 2025-05-07T19:45:26.0797385Z 2025-05-07T19:45:26.0797656Z  2025-05-07T19:45:26.0797886Z 2025-05-07T19:45:26.0797890Z 2025-05-07T19:45:26.0798099Z  2025-05-07T19:45:26.0798323Z 2025-05-07T19:45:26.0798327Z 2025-05-07T19:45:26.0798331Z 2025-05-07T19:45:26.0798600Z  2025-05-07T19:45:26.0798825Z 2025-05-07T19:45:26.0798828Z 2025-05-07T19:45:26.0798832Z 2025-05-07T19:45:26.0798835Z 2025-05-07T19:45:26.0799039Z  2025-05-07T19:45:26.0799264Z 2025-05-07T19:45:26.0799267Z 2025-05-07T19:45:26.0799271Z 2025-05-07T19:45:26.0799274Z 2025-05-07T19:45:26.0799278Z 2025-05-07T19:45:26.0799476Z  2025-05-07T19:45:26.0799724Z 2025-05-07T19:45:26.0799728Z 2025-05-07T19:45:26.0799743Z 2025-05-07T19:45:26.0799747Z 2025-05-07T19:45:26.0799751Z 2025-05-07T19:45:26.0799754Z 2025-05-07T19:45:26.0799945Z  2025-05-07T19:45:26.0800200Z 2025-05-07T19:45:26.0800203Z 2025-05-07T19:45:26.0800207Z 2025-05-07T19:45:26.0800210Z 2025-05-07T19:45:26.0800214Z 2025-05-07T19:45:26.0800217Z 2025-05-07T19:45:26.0800221Z 2025-05-07T19:45:26.0800421Z  2025-05-07T19:45:26.0800658Z 2025-05-07T19:45:26.0800662Z 2025-05-07T19:45:26.0800684Z 2025-05-07T19:45:26.0800688Z 2025-05-07T19:45:26.0800692Z 2025-05-07T19:45:26.0800695Z 2025-05-07T19:45:26.0800699Z 2025-05-07T19:45:26.0800702Z 2025-05-07T19:45:26.0800898Z  2025-05-07T19:45:26.0801143Z 2025-05-07T19:45:26.0801147Z 2025-05-07T19:45:26.0801150Z 2025-05-07T19:45:26.0801154Z 2025-05-07T19:45:26.0801407Z 2025-05-07T19:45:26.0801415Z 2025-05-07T19:45:26.0801419Z 2025-05-07T19:45:26.0801422Z 2025-05-07T19:45:26.0801426Z 2025-05-07T19:45:26.0801638Z  2025-05-07T19:45:26.0801872Z 2025-05-07T19:45:26.0801876Z 2025-05-07T19:45:26.0801879Z 2025-05-07T19:45:26.0801883Z 2025-05-07T19:45:26.0801887Z 2025-05-07T19:45:26.0801911Z 2025-05-07T19:45:26.0801914Z 2025-05-07T19:45:26.0801918Z 2025-05-07T19:45:26.0801921Z 2025-05-07T19:45:26.0801924Z 2025-05-07T19:45:26.0802123Z  2025-05-07T19:45:26.0802364Z 2025-05-07T19:45:26.0802367Z 2025-05-07T19:45:26.0802371Z 2025-05-07T19:45:26.0802375Z 2025-05-07T19:45:26.0802378Z 2025-05-07T19:45:26.0802411Z 2025-05-07T19:45:26.0802415Z 2025-05-07T19:45:26.0802418Z 2025-05-07T19:45:26.0802422Z 2025-05-07T19:45:26.0802425Z 2025-05-07T19:45:26.0802429Z 2025-05-07T19:45:26.0802780Z  2025-05-07T19:45:26.0803030Z 2025-05-07T19:45:26.0803034Z 2025-05-07T19:45:26.0803037Z 2025-05-07T19:45:26.0803041Z 2025-05-07T19:45:26.0803062Z 2025-05-07T19:45:26.0803066Z 2025-05-07T19:45:26.0803069Z 2025-05-07T19:45:26.0803073Z 2025-05-07T19:45:26.0803077Z 2025-05-07T19:45:26.0803081Z 2025-05-07T19:45:26.0803085Z 2025-05-07T19:45:26.0803089Z 2025-05-07T19:45:26.0803300Z  2025-05-07T19:45:26.0803540Z 2025-05-07T19:45:26.0803543Z 2025-05-07T19:45:26.0803567Z 2025-05-07T19:45:26.0803571Z 2025-05-07T19:45:26.0803574Z 2025-05-07T19:45:26.0803577Z 2025-05-07T19:45:26.0803581Z 2025-05-07T19:45:26.0803584Z 2025-05-07T19:45:26.0803588Z 2025-05-07T19:45:26.0803591Z 2025-05-07T19:45:26.0803595Z 2025-05-07T19:45:26.0803598Z 2025-05-07T19:45:26.0803602Z 2025-05-07T19:45:26.0803819Z  2025-05-07T19:45:26.0804100Z 2025-05-07T19:45:26.0804104Z 2025-05-07T19:45:26.0804107Z 2025-05-07T19:45:26.0804111Z 2025-05-07T19:45:26.0804114Z 2025-05-07T19:45:26.0804118Z 2025-05-07T19:45:26.0804121Z 2025-05-07T19:45:26.0804125Z 2025-05-07T19:45:26.0804128Z 2025-05-07T19:45:26.0804132Z 2025-05-07T19:45:26.0804135Z 2025-05-07T19:45:26.0804139Z 2025-05-07T19:45:26.0804142Z 2025-05-07T19:45:26.0804146Z 2025-05-07T19:45:26.0804369Z  2025-05-07T19:45:26.0804640Z 2025-05-07T19:45:26.0804643Z 2025-05-07T19:45:26.0804647Z 2025-05-07T19:45:26.0804651Z 2025-05-07T19:45:26.0804654Z 2025-05-07T19:45:26.0804658Z 2025-05-07T19:45:26.0804661Z 2025-05-07T19:45:26.0804664Z 2025-05-07T19:45:26.0804668Z 2025-05-07T19:45:26.0804671Z 2025-05-07T19:45:26.0804675Z 2025-05-07T19:45:26.0804678Z 2025-05-07T19:45:26.0804682Z 2025-05-07T19:45:26.0804686Z 2025-05-07T19:45:26.0804694Z 2025-05-07T19:45:26.0804941Z  2025-05-07T19:45:26.0805187Z 2025-05-07T19:45:26.0805191Z 2025-05-07T19:45:26.0805194Z 2025-05-07T19:45:26.0805197Z 2025-05-07T19:45:26.0805201Z 2025-05-07T19:45:26.0805204Z 2025-05-07T19:45:26.0805460Z 2025-05-07T19:45:26.0805464Z 2025-05-07T19:45:26.0805468Z 2025-05-07T19:45:26.0805471Z 2025-05-07T19:45:26.0805474Z 2025-05-07T19:45:26.0805510Z 2025-05-07T19:45:26.0805513Z 2025-05-07T19:45:26.0805517Z 2025-05-07T19:45:26.0805520Z 2025-05-07T19:45:26.0805523Z 2025-05-07T19:45:26.0805752Z  2025-05-07T19:45:26.0806005Z 2025-05-07T19:45:26.0806009Z 2025-05-07T19:45:26.0806013Z 2025-05-07T19:45:26.0806016Z 2025-05-07T19:45:26.0806020Z 2025-05-07T19:45:26.0806041Z 2025-05-07T19:45:26.0806044Z 2025-05-07T19:45:26.0806048Z 2025-05-07T19:45:26.0806051Z 2025-05-07T19:45:26.0806124Z 2025-05-07T19:45:26.0806128Z 2025-05-07T19:45:26.0806132Z 2025-05-07T19:45:26.0806135Z 2025-05-07T19:45:26.0806139Z 2025-05-07T19:45:26.0806142Z 2025-05-07T19:45:26.0806146Z 2025-05-07T19:45:26.0806149Z 2025-05-07T19:45:26.0806382Z  2025-05-07T19:45:26.0806657Z 2025-05-07T19:45:26.0806661Z 2025-05-07T19:45:26.0806664Z 2025-05-07T19:45:26.0806669Z 2025-05-07T19:45:26.0806672Z 2025-05-07T19:45:26.0806676Z 2025-05-07T19:45:26.0806679Z 2025-05-07T19:45:26.0806683Z 2025-05-07T19:45:26.0806686Z 2025-05-07T19:45:26.0806690Z 2025-05-07T19:45:26.0806693Z 2025-05-07T19:45:26.0806697Z 2025-05-07T19:45:26.0806700Z 2025-05-07T19:45:26.0806704Z 2025-05-07T19:45:26.0806707Z 2025-05-07T19:45:26.0806712Z 2025-05-07T19:45:26.0806715Z 2025-05-07T19:45:26.0806719Z 2025-05-07T19:45:26.0807041Z  2025-05-07T19:45:26.0807308Z 2025-05-07T19:45:26.0807311Z 2025-05-07T19:45:26.0807416Z  2025-05-07T19:45:26.0807545Z 2025-05-07T19:45:26.0807548Z 2025-05-07T19:45:26.0807650Z  2025-05-07T19:45:26.0807769Z 2025-05-07T19:45:26.0807773Z 2025-05-07T19:45:26.0807777Z 2025-05-07T19:45:26.0807908Z  2025-05-07T19:45:26.0808031Z 2025-05-07T19:45:26.0808035Z 2025-05-07T19:45:26.0808039Z 2025-05-07T19:45:26.0808042Z 2025-05-07T19:45:26.0808170Z  2025-05-07T19:45:26.0808321Z 2025-05-07T19:45:26.0808324Z 2025-05-07T19:45:26.0808328Z 2025-05-07T19:45:26.0808331Z 2025-05-07T19:45:26.0808335Z 2025-05-07T19:45:26.0808449Z  2025-05-07T19:45:26.0808582Z 2025-05-07T19:45:26.0808585Z 2025-05-07T19:45:26.0808589Z 2025-05-07T19:45:26.0808592Z 2025-05-07T19:45:26.0808596Z 2025-05-07T19:45:26.0808616Z 2025-05-07T19:45:26.0808730Z  2025-05-07T19:45:26.0808868Z 2025-05-07T19:45:26.0808872Z 2025-05-07T19:45:26.0808875Z 2025-05-07T19:45:26.0808886Z 2025-05-07T19:45:26.0808890Z 2025-05-07T19:45:26.0808894Z 2025-05-07T19:45:26.0808897Z 2025-05-07T19:45:26.0809035Z  2025-05-07T19:45:26.0809186Z 2025-05-07T19:45:26.0809190Z 2025-05-07T19:45:26.0809194Z 2025-05-07T19:45:26.0809197Z 2025-05-07T19:45:26.0809200Z 2025-05-07T19:45:26.0809204Z 2025-05-07T19:45:26.0809207Z 2025-05-07T19:45:26.0809211Z 2025-05-07T19:45:26.0809335Z  2025-05-07T19:45:26.0809513Z 2025-05-07T19:45:26.0809517Z 2025-05-07T19:45:26.0809520Z 2025-05-07T19:45:26.0809524Z 2025-05-07T19:45:26.0809527Z 2025-05-07T19:45:26.0809531Z 2025-05-07T19:45:26.0809534Z 2025-05-07T19:45:26.0809538Z 2025-05-07T19:45:26.0809541Z 2025-05-07T19:45:26.0809668Z  2025-05-07T19:45:26.0809852Z 2025-05-07T19:45:26.0809856Z 2025-05-07T19:45:26.0809860Z 2025-05-07T19:45:26.0809863Z 2025-05-07T19:45:26.0809867Z 2025-05-07T19:45:26.0809870Z 2025-05-07T19:45:26.0809874Z 2025-05-07T19:45:26.0809877Z 2025-05-07T19:45:26.0809888Z 2025-05-07T19:45:26.0809891Z 2025-05-07T19:45:26.0810026Z  2025-05-07T19:45:26.0810225Z 2025-05-07T19:45:26.0810228Z 2025-05-07T19:45:26.0810232Z 2025-05-07T19:45:26.0810235Z 2025-05-07T19:45:26.0810238Z 2025-05-07T19:45:26.0810242Z 2025-05-07T19:45:26.0810245Z 2025-05-07T19:45:26.0810248Z 2025-05-07T19:45:26.0810252Z 2025-05-07T19:45:26.0810255Z 2025-05-07T19:45:26.0810259Z 2025-05-07T19:45:26.0810392Z  2025-05-07T19:45:26.0810596Z 2025-05-07T19:45:26.0810599Z 2025-05-07T19:45:26.0810602Z 2025-05-07T19:45:26.0810606Z 2025-05-07T19:45:26.0810609Z 2025-05-07T19:45:26.0810613Z 2025-05-07T19:45:26.0810616Z 2025-05-07T19:45:26.0810620Z 2025-05-07T19:45:26.0810624Z 2025-05-07T19:45:26.0810627Z 2025-05-07T19:45:26.0810631Z 2025-05-07T19:45:26.0810635Z 2025-05-07T19:45:26.0810775Z  2025-05-07T19:45:26.0810989Z 2025-05-07T19:45:26.0810993Z 2025-05-07T19:45:26.0811081Z 2025-05-07T19:45:26.0811091Z 2025-05-07T19:45:26.0811094Z 2025-05-07T19:45:26.0811097Z 2025-05-07T19:45:26.0811101Z 2025-05-07T19:45:26.0811104Z 2025-05-07T19:45:26.0811108Z 2025-05-07T19:45:26.0811111Z 2025-05-07T19:45:26.0811115Z 2025-05-07T19:45:26.0811118Z 2025-05-07T19:45:26.0811122Z 2025-05-07T19:45:26.0811287Z  2025-05-07T19:45:26.0811487Z 2025-05-07T19:45:26.0811490Z 2025-05-07T19:45:26.0811494Z 2025-05-07T19:45:26.0811498Z 2025-05-07T19:45:26.0811501Z 2025-05-07T19:45:26.0811505Z 2025-05-07T19:45:26.0811509Z 2025-05-07T19:45:26.0811512Z 2025-05-07T19:45:26.0811515Z 2025-05-07T19:45:26.0811519Z 2025-05-07T19:45:26.0811522Z 2025-05-07T19:45:26.0811526Z 2025-05-07T19:45:26.0811529Z 2025-05-07T19:45:26.0811533Z 2025-05-07T19:45:26.0811698Z  2025-05-07T19:45:26.0811907Z 2025-05-07T19:45:26.0811910Z 2025-05-07T19:45:26.0811914Z 2025-05-07T19:45:26.0811918Z 2025-05-07T19:45:26.0811921Z 2025-05-07T19:45:26.0811989Z 2025-05-07T19:45:26.0811994Z 2025-05-07T19:45:26.0811997Z 2025-05-07T19:45:26.0812001Z 2025-05-07T19:45:26.0812004Z 2025-05-07T19:45:26.0812008Z 2025-05-07T19:45:26.0812011Z 2025-05-07T19:45:26.0812015Z 2025-05-07T19:45:26.0812037Z 2025-05-07T19:45:26.0812041Z 2025-05-07T19:45:26.0812200Z  2025-05-07T19:45:26.0812415Z 2025-05-07T19:45:26.0812419Z 2025-05-07T19:45:26.0812423Z 2025-05-07T19:45:26.0812426Z 2025-05-07T19:45:26.0812430Z 2025-05-07T19:45:26.0812433Z 2025-05-07T19:45:26.0812437Z 2025-05-07T19:45:26.0812440Z 2025-05-07T19:45:26.0812463Z 2025-05-07T19:45:26.0812466Z 2025-05-07T19:45:26.0812470Z 2025-05-07T19:45:26.0812474Z 2025-05-07T19:45:26.0812477Z 2025-05-07T19:45:26.0812481Z 2025-05-07T19:45:26.0812484Z 2025-05-07T19:45:26.0812488Z 2025-05-07T19:45:26.0812643Z  2025-05-07T19:45:26.0812864Z 2025-05-07T19:45:26.0812868Z 2025-05-07T19:45:26.0812871Z 2025-05-07T19:45:26.0812900Z 2025-05-07T19:45:26.0812904Z 2025-05-07T19:45:26.0812907Z 2025-05-07T19:45:26.0812911Z 2025-05-07T19:45:26.0812914Z 2025-05-07T19:45:26.0812917Z 2025-05-07T19:45:26.0812921Z 2025-05-07T19:45:26.0812924Z 2025-05-07T19:45:26.0812928Z 2025-05-07T19:45:26.0812931Z 2025-05-07T19:45:26.0812935Z 2025-05-07T19:45:26.0812938Z 2025-05-07T19:45:26.0812942Z 2025-05-07T19:45:26.0812945Z 2025-05-07T19:45:26.0813108Z  2025-05-07T19:45:26.0813351Z 2025-05-07T19:45:26.0813355Z 2025-05-07T19:45:26.0813359Z 2025-05-07T19:45:26.0813362Z 2025-05-07T19:45:26.0813366Z 2025-05-07T19:45:26.0813369Z 2025-05-07T19:45:26.0813373Z 2025-05-07T19:45:26.0813376Z 2025-05-07T19:45:26.0813380Z 2025-05-07T19:45:26.0813384Z 2025-05-07T19:45:26.0813388Z 2025-05-07T19:45:26.0813391Z 2025-05-07T19:45:26.0813395Z 2025-05-07T19:45:26.0813398Z 2025-05-07T19:45:26.0813402Z 2025-05-07T19:45:26.0813405Z 2025-05-07T19:45:26.0813409Z 2025-05-07T19:45:26.0813416Z 2025-05-07T19:45:26.0813609Z  2025-05-07T19:45:26.0813838Z 2025-05-07T19:45:26.0813842Z 2025-05-07T19:45:26.0813945Z  2025-05-07T19:45:26.0814073Z 2025-05-07T19:45:26.0814077Z 2025-05-07T19:45:26.0814179Z  2025-05-07T19:45:26.0814292Z 2025-05-07T19:45:26.0814295Z 2025-05-07T19:45:26.0814299Z 2025-05-07T19:45:26.0814420Z  2025-05-07T19:45:26.0814537Z 2025-05-07T19:45:26.0814541Z 2025-05-07T19:45:26.0814544Z 2025-05-07T19:45:26.0814548Z 2025-05-07T19:45:26.0814657Z  2025-05-07T19:45:26.0814798Z 2025-05-07T19:45:26.0814802Z 2025-05-07T19:45:26.0814805Z 2025-05-07T19:45:26.0814809Z 2025-05-07T19:45:26.0814812Z 2025-05-07T19:45:26.0814924Z  2025-05-07T19:45:26.0815056Z 2025-05-07T19:45:26.0815059Z 2025-05-07T19:45:26.0815063Z 2025-05-07T19:45:26.0815067Z 2025-05-07T19:45:26.0815070Z 2025-05-07T19:45:26.0815091Z 2025-05-07T19:45:26.0815198Z  2025-05-07T19:45:26.0815401Z 2025-05-07T19:45:26.0815409Z 2025-05-07T19:45:26.0815413Z 2025-05-07T19:45:26.0815416Z 2025-05-07T19:45:26.0815420Z 2025-05-07T19:45:26.0815423Z 2025-05-07T19:45:26.0815427Z 2025-05-07T19:45:26.0815569Z  2025-05-07T19:45:26.0815715Z 2025-05-07T19:45:26.0815718Z 2025-05-07T19:45:26.0815722Z 2025-05-07T19:45:26.0815726Z 2025-05-07T19:45:26.0815729Z 2025-05-07T19:45:26.0815733Z 2025-05-07T19:45:26.0815736Z 2025-05-07T19:45:26.0815740Z 2025-05-07T19:45:26.0815864Z  2025-05-07T19:45:26.0816040Z 2025-05-07T19:45:26.0816043Z 2025-05-07T19:45:26.0816047Z 2025-05-07T19:45:26.0816050Z 2025-05-07T19:45:26.0816054Z 2025-05-07T19:45:26.0816058Z 2025-05-07T19:45:26.0816061Z 2025-05-07T19:45:26.0816065Z 2025-05-07T19:45:26.0816068Z 2025-05-07T19:45:26.0816193Z  2025-05-07T19:45:26.0816381Z 2025-05-07T19:45:26.0816385Z 2025-05-07T19:45:26.0816388Z 2025-05-07T19:45:26.0816391Z 2025-05-07T19:45:26.0816395Z 2025-05-07T19:45:26.0816403Z 2025-05-07T19:45:26.0816465Z 2025-05-07T19:45:26.0816469Z 2025-05-07T19:45:26.0816473Z 2025-05-07T19:45:26.0816476Z 2025-05-07T19:45:26.0816611Z  2025-05-07T19:45:26.0816812Z 2025-05-07T19:45:26.0816815Z 2025-05-07T19:45:26.0816820Z 2025-05-07T19:45:26.0816823Z 2025-05-07T19:45:26.0816826Z 2025-05-07T19:45:26.0816830Z 2025-05-07T19:45:26.0816833Z 2025-05-07T19:45:26.0816837Z 2025-05-07T19:45:26.0816841Z 2025-05-07T19:45:26.0816844Z 2025-05-07T19:45:26.0816848Z 2025-05-07T19:45:26.0816984Z  2025-05-07T19:45:26.0817194Z 2025-05-07T19:45:26.0817197Z 2025-05-07T19:45:26.0817201Z 2025-05-07T19:45:26.0817204Z 2025-05-07T19:45:26.0817208Z 2025-05-07T19:45:26.0817211Z 2025-05-07T19:45:26.0817215Z 2025-05-07T19:45:26.0817219Z 2025-05-07T19:45:26.0817222Z 2025-05-07T19:45:26.0817226Z 2025-05-07T19:45:26.0817229Z 2025-05-07T19:45:26.0817233Z 2025-05-07T19:45:26.0817373Z  2025-05-07T19:45:26.0817586Z 2025-05-07T19:45:26.0817593Z 2025-05-07T19:45:26.0817597Z 2025-05-07T19:45:26.0817600Z 2025-05-07T19:45:26.0817604Z 2025-05-07T19:45:26.0817607Z 2025-05-07T19:45:26.0817611Z 2025-05-07T19:45:26.0817614Z 2025-05-07T19:45:26.0817618Z 2025-05-07T19:45:26.0817622Z 2025-05-07T19:45:26.0817625Z 2025-05-07T19:45:26.0817629Z 2025-05-07T19:45:26.0817632Z 2025-05-07T19:45:26.0817794Z  2025-05-07T19:45:26.0817996Z 2025-05-07T19:45:26.0818000Z 2025-05-07T19:45:26.0818004Z 2025-05-07T19:45:26.0818007Z 2025-05-07T19:45:26.0818011Z 2025-05-07T19:45:26.0818015Z 2025-05-07T19:45:26.0818018Z 2025-05-07T19:45:26.0818022Z 2025-05-07T19:45:26.0818025Z 2025-05-07T19:45:26.0818029Z 2025-05-07T19:45:26.0818032Z 2025-05-07T19:45:26.0818036Z 2025-05-07T19:45:26.0818039Z 2025-05-07T19:45:26.0818043Z 2025-05-07T19:45:26.0818210Z  2025-05-07T19:45:26.0818419Z 2025-05-07T19:45:26.0818422Z 2025-05-07T19:45:26.0818429Z 2025-05-07T19:45:26.0818436Z 2025-05-07T19:45:26.0818439Z 2025-05-07T19:45:26.0818443Z 2025-05-07T19:45:26.0818446Z 2025-05-07T19:45:26.0818449Z 2025-05-07T19:45:26.0818453Z 2025-05-07T19:45:26.0818456Z 2025-05-07T19:45:26.0818460Z 2025-05-07T19:45:26.0818464Z 2025-05-07T19:45:26.0818467Z 2025-05-07T19:45:26.0818488Z 2025-05-07T19:45:26.0818491Z 2025-05-07T19:45:26.0818648Z  2025-05-07T19:45:26.0818852Z 2025-05-07T19:45:26.0818855Z 2025-05-07T19:45:26.0818859Z 2025-05-07T19:45:26.0818863Z 2025-05-07T19:45:26.0818866Z 2025-05-07T19:45:26.0818870Z 2025-05-07T19:45:26.0818873Z 2025-05-07T19:45:26.0818877Z 2025-05-07T19:45:26.0818895Z 2025-05-07T19:45:26.0818898Z 2025-05-07T19:45:26.0818902Z 2025-05-07T19:45:26.0818905Z 2025-05-07T19:45:26.0818909Z 2025-05-07T19:45:26.0818912Z 2025-05-07T19:45:26.0818916Z 2025-05-07T19:45:26.0818919Z 2025-05-07T19:45:26.0819079Z  2025-05-07T19:45:26.0819298Z 2025-05-07T19:45:26.0819371Z 2025-05-07T19:45:26.0819375Z 2025-05-07T19:45:26.0819397Z 2025-05-07T19:45:26.0819400Z 2025-05-07T19:45:26.0819403Z 2025-05-07T19:45:26.0819407Z 2025-05-07T19:45:26.0819410Z 2025-05-07T19:45:26.0819414Z 2025-05-07T19:45:26.0819417Z 2025-05-07T19:45:26.0819421Z 2025-05-07T19:45:26.0819424Z 2025-05-07T19:45:26.0819428Z 2025-05-07T19:45:26.0819432Z 2025-05-07T19:45:26.0819435Z 2025-05-07T19:45:26.0819439Z 2025-05-07T19:45:26.0819442Z 2025-05-07T19:45:26.0819608Z  2025-05-07T19:45:26.0819853Z 2025-05-07T19:45:26.0819857Z 2025-05-07T19:45:26.0819860Z 2025-05-07T19:45:26.0819863Z 2025-05-07T19:45:26.0819867Z 2025-05-07T19:45:26.0819871Z 2025-05-07T19:45:26.0819874Z 2025-05-07T19:45:26.0819877Z 2025-05-07T19:45:26.0819881Z 2025-05-07T19:45:26.0819884Z 2025-05-07T19:45:26.0819888Z 2025-05-07T19:45:26.0819891Z 2025-05-07T19:45:26.0819895Z 2025-05-07T19:45:26.0819898Z 2025-05-07T19:45:26.0819902Z 2025-05-07T19:45:26.0819908Z 2025-05-07T19:45:26.0821077Z 2025-05-07T19:45:26.0821081Z 2025-05-07T19:45:26.0821285Z  2025-05-07T19:45:26.0821514Z 2025-05-07T19:45:26.0821518Z 2025-05-07T19:45:26.0821623Z  2025-05-07T19:45:26.0821755Z 2025-05-07T19:45:26.0821759Z 2025-05-07T19:45:26.0821864Z  2025-05-07T19:45:26.0821981Z 2025-05-07T19:45:26.0821985Z 2025-05-07T19:45:26.0821989Z 2025-05-07T19:45:26.0822115Z  2025-05-07T19:45:26.0822233Z 2025-05-07T19:45:26.0822238Z 2025-05-07T19:45:26.0822241Z 2025-05-07T19:45:26.0822245Z 2025-05-07T19:45:26.0822353Z  2025-05-07T19:45:26.0822503Z 2025-05-07T19:45:26.0822506Z 2025-05-07T19:45:26.0822510Z 2025-05-07T19:45:26.0822513Z 2025-05-07T19:45:26.0822517Z 2025-05-07T19:45:26.0822627Z  2025-05-07T19:45:26.0822761Z 2025-05-07T19:45:26.0822764Z 2025-05-07T19:45:26.0822768Z 2025-05-07T19:45:26.0822772Z 2025-05-07T19:45:26.0822775Z 2025-05-07T19:45:26.0822796Z 2025-05-07T19:45:26.0822920Z  2025-05-07T19:45:26.0823057Z 2025-05-07T19:45:26.0823061Z 2025-05-07T19:45:26.0823064Z 2025-05-07T19:45:26.0823068Z 2025-05-07T19:45:26.0823071Z 2025-05-07T19:45:26.0823075Z 2025-05-07T19:45:26.0823079Z 2025-05-07T19:45:26.0823213Z  2025-05-07T19:45:26.0823359Z 2025-05-07T19:45:26.0823363Z 2025-05-07T19:45:26.0823367Z 2025-05-07T19:45:26.0823370Z 2025-05-07T19:45:26.0823374Z 2025-05-07T19:45:26.0823377Z 2025-05-07T19:45:26.0823381Z 2025-05-07T19:45:26.0823384Z 2025-05-07T19:45:26.0823526Z  2025-05-07T19:45:26.0823686Z 2025-05-07T19:45:26.0823690Z 2025-05-07T19:45:26.0823693Z 2025-05-07T19:45:26.0823697Z 2025-05-07T19:45:26.0823700Z 2025-05-07T19:45:26.0823704Z 2025-05-07T19:45:26.0823708Z 2025-05-07T19:45:26.0823711Z 2025-05-07T19:45:26.0823714Z 2025-05-07T19:45:26.0823838Z  2025-05-07T19:45:26.0824020Z 2025-05-07T19:45:26.0824026Z 2025-05-07T19:45:26.0824030Z 2025-05-07T19:45:26.0824037Z 2025-05-07T19:45:26.0824044Z 2025-05-07T19:45:26.0824047Z 2025-05-07T19:45:26.0824051Z 2025-05-07T19:45:26.0824054Z 2025-05-07T19:45:26.0824058Z 2025-05-07T19:45:26.0824061Z 2025-05-07T19:45:26.0824191Z  2025-05-07T19:45:26.0824456Z 2025-05-07T19:45:26.0824460Z 2025-05-07T19:45:26.0824464Z 2025-05-07T19:45:26.0824467Z 2025-05-07T19:45:26.0824471Z 2025-05-07T19:45:26.0824474Z 2025-05-07T19:45:26.0824478Z 2025-05-07T19:45:26.0824481Z 2025-05-07T19:45:26.0824485Z 2025-05-07T19:45:26.0824488Z 2025-05-07T19:45:26.0824491Z 2025-05-07T19:45:26.0824625Z  2025-05-07T19:45:26.0824825Z 2025-05-07T19:45:26.0824828Z 2025-05-07T19:45:26.0824832Z 2025-05-07T19:45:26.0824835Z 2025-05-07T19:45:26.0824839Z 2025-05-07T19:45:26.0824842Z 2025-05-07T19:45:26.0824846Z 2025-05-07T19:45:26.0824849Z 2025-05-07T19:45:26.0824853Z 2025-05-07T19:45:26.0824856Z 2025-05-07T19:45:26.0824859Z 2025-05-07T19:45:26.0824863Z 2025-05-07T19:45:26.0825070Z  2025-05-07T19:45:26.0825273Z 2025-05-07T19:45:26.0825277Z 2025-05-07T19:45:26.0825280Z 2025-05-07T19:45:26.0825284Z 2025-05-07T19:45:26.0825288Z 2025-05-07T19:45:26.0825291Z 2025-05-07T19:45:26.0825295Z 2025-05-07T19:45:26.0825298Z 2025-05-07T19:45:26.0825302Z 2025-05-07T19:45:26.0825305Z 2025-05-07T19:45:26.0825309Z 2025-05-07T19:45:26.0825313Z 2025-05-07T19:45:26.0825316Z 2025-05-07T19:45:26.0825467Z  2025-05-07T19:45:26.0825665Z 2025-05-07T19:45:26.0825669Z 2025-05-07T19:45:26.0825673Z 2025-05-07T19:45:26.0825676Z 2025-05-07T19:45:26.0825680Z 2025-05-07T19:45:26.0825684Z 2025-05-07T19:45:26.0825687Z 2025-05-07T19:45:26.0825691Z 2025-05-07T19:45:26.0825694Z 2025-05-07T19:45:26.0825698Z 2025-05-07T19:45:26.0825702Z 2025-05-07T19:45:26.0825705Z 2025-05-07T19:45:26.0825709Z 2025-05-07T19:45:26.0825713Z 2025-05-07T19:45:26.0825870Z  2025-05-07T19:45:26.0826081Z 2025-05-07T19:45:26.0826158Z 2025-05-07T19:45:26.0826162Z 2025-05-07T19:45:26.0826166Z 2025-05-07T19:45:26.0826169Z 2025-05-07T19:45:26.0826173Z 2025-05-07T19:45:26.0826176Z 2025-05-07T19:45:26.0826180Z 2025-05-07T19:45:26.0826183Z 2025-05-07T19:45:26.0826187Z 2025-05-07T19:45:26.0826190Z 2025-05-07T19:45:26.0826194Z 2025-05-07T19:45:26.0826215Z 2025-05-07T19:45:26.0826218Z 2025-05-07T19:45:26.0826221Z 2025-05-07T19:45:26.0826378Z  2025-05-07T19:45:26.0826592Z 2025-05-07T19:45:26.0826595Z 2025-05-07T19:45:26.0826599Z 2025-05-07T19:45:26.0826602Z 2025-05-07T19:45:26.0826606Z 2025-05-07T19:45:26.0826610Z 2025-05-07T19:45:26.0826613Z 2025-05-07T19:45:26.0826635Z 2025-05-07T19:45:26.0826638Z 2025-05-07T19:45:26.0826642Z 2025-05-07T19:45:26.0826645Z 2025-05-07T19:45:26.0826649Z 2025-05-07T19:45:26.0826653Z 2025-05-07T19:45:26.0826656Z 2025-05-07T19:45:26.0826660Z 2025-05-07T19:45:26.0826663Z 2025-05-07T19:45:26.0826827Z  2025-05-07T19:45:26.0827051Z 2025-05-07T19:45:26.0827055Z 2025-05-07T19:45:26.0827077Z 2025-05-07T19:45:26.0827080Z 2025-05-07T19:45:26.0827084Z 2025-05-07T19:45:26.0827087Z 2025-05-07T19:45:26.0827091Z 2025-05-07T19:45:26.0827095Z 2025-05-07T19:45:26.0827098Z 2025-05-07T19:45:26.0827102Z 2025-05-07T19:45:26.0827105Z 2025-05-07T19:45:26.0827109Z 2025-05-07T19:45:26.0827112Z 2025-05-07T19:45:26.0827115Z 2025-05-07T19:45:26.0827119Z 2025-05-07T19:45:26.0827122Z 2025-05-07T19:45:26.0827126Z 2025-05-07T19:45:26.0827289Z  2025-05-07T19:45:26.0827530Z 2025-05-07T19:45:26.0827533Z 2025-05-07T19:45:26.0827537Z 2025-05-07T19:45:26.0827540Z 2025-05-07T19:45:26.0827543Z 2025-05-07T19:45:26.0827547Z 2025-05-07T19:45:26.0827551Z 2025-05-07T19:45:26.0827554Z 2025-05-07T19:45:26.0827558Z 2025-05-07T19:45:26.0827561Z 2025-05-07T19:45:26.0827565Z 2025-05-07T19:45:26.0827568Z 2025-05-07T19:45:26.0827572Z 2025-05-07T19:45:26.0827582Z 2025-05-07T19:45:26.0827585Z 2025-05-07T19:45:26.0827589Z 2025-05-07T19:45:26.0827592Z 2025-05-07T19:45:26.0827596Z 2025-05-07T19:45:26.0827786Z  2025-05-07T19:45:26.0828015Z 2025-05-07T19:45:26.0828018Z 2025-05-07T19:45:26.0828121Z  2025-05-07T19:45:26.0828254Z 2025-05-07T19:45:26.0828257Z 2025-05-07T19:45:26.0828353Z  2025-05-07T19:45:26.0828462Z 2025-05-07T19:45:26.0828465Z 2025-05-07T19:45:26.0828469Z 2025-05-07T19:45:26.0828586Z  2025-05-07T19:45:26.0828700Z 2025-05-07T19:45:26.0828704Z 2025-05-07T19:45:26.0828707Z 2025-05-07T19:45:26.0828711Z 2025-05-07T19:45:26.0828826Z  2025-05-07T19:45:26.0828951Z 2025-05-07T19:45:26.0828954Z 2025-05-07T19:45:26.0828958Z 2025-05-07T19:45:26.0828962Z 2025-05-07T19:45:26.0828966Z 2025-05-07T19:45:26.0829095Z  2025-05-07T19:45:26.0829224Z 2025-05-07T19:45:26.0829228Z 2025-05-07T19:45:26.0829232Z 2025-05-07T19:45:26.0829295Z 2025-05-07T19:45:26.0829302Z 2025-05-07T19:45:26.0829306Z 2025-05-07T19:45:26.0829437Z  2025-05-07T19:45:26.0829572Z 2025-05-07T19:45:26.0829576Z 2025-05-07T19:45:26.0829579Z 2025-05-07T19:45:26.0829583Z 2025-05-07T19:45:26.0829586Z 2025-05-07T19:45:26.0829590Z 2025-05-07T19:45:26.0829594Z 2025-05-07T19:45:26.0829711Z  2025-05-07T19:45:26.0829874Z 2025-05-07T19:45:26.0829878Z 2025-05-07T19:45:26.0829881Z 2025-05-07T19:45:26.0829885Z 2025-05-07T19:45:26.0829888Z 2025-05-07T19:45:26.0829891Z 2025-05-07T19:45:26.0829895Z 2025-05-07T19:45:26.0829899Z 2025-05-07T19:45:26.0830018Z  2025-05-07T19:45:26.0830192Z 2025-05-07T19:45:26.0830195Z 2025-05-07T19:45:26.0830199Z 2025-05-07T19:45:26.0830202Z 2025-05-07T19:45:26.0830206Z 2025-05-07T19:45:26.0830209Z 2025-05-07T19:45:26.0830213Z 2025-05-07T19:45:26.0830216Z 2025-05-07T19:45:26.0830220Z 2025-05-07T19:45:26.0830345Z  2025-05-07T19:45:26.0830510Z 2025-05-07T19:45:26.0830575Z 2025-05-07T19:45:26.0830595Z 2025-05-07T19:45:26.0830599Z 2025-05-07T19:45:26.0830603Z 2025-05-07T19:45:26.0830606Z 2025-05-07T19:45:26.0830610Z 2025-05-07T19:45:26.0830613Z 2025-05-07T19:45:26.0830616Z 2025-05-07T19:45:26.0830621Z 2025-05-07T19:45:26.0830753Z  2025-05-07T19:45:26.0830925Z 2025-05-07T19:45:26.0830929Z 2025-05-07T19:45:26.0830933Z 2025-05-07T19:45:26.0830952Z 2025-05-07T19:45:26.0830956Z 2025-05-07T19:45:26.0830959Z 2025-05-07T19:45:26.0830963Z 2025-05-07T19:45:26.0830966Z 2025-05-07T19:45:26.0830970Z 2025-05-07T19:45:26.0830973Z 2025-05-07T19:45:26.0830977Z 2025-05-07T19:45:26.0831111Z  2025-05-07T19:45:26.0831294Z 2025-05-07T19:45:26.0831297Z 2025-05-07T19:45:26.0831302Z 2025-05-07T19:45:26.0831322Z 2025-05-07T19:45:26.0831325Z 2025-05-07T19:45:26.0831329Z 2025-05-07T19:45:26.0831332Z 2025-05-07T19:45:26.0831336Z 2025-05-07T19:45:26.0831339Z 2025-05-07T19:45:26.0831343Z 2025-05-07T19:45:26.0831469Z 2025-05-07T19:45:26.0831474Z 2025-05-07T19:45:26.0831638Z  2025-05-07T19:45:26.0831831Z 2025-05-07T19:45:26.0831835Z 2025-05-07T19:45:26.0831856Z 2025-05-07T19:45:26.0831860Z 2025-05-07T19:45:26.0831863Z 2025-05-07T19:45:26.0831867Z 2025-05-07T19:45:26.0831870Z 2025-05-07T19:45:26.0831874Z 2025-05-07T19:45:26.0831878Z 2025-05-07T19:45:26.0831881Z 2025-05-07T19:45:26.0831885Z 2025-05-07T19:45:26.0831888Z 2025-05-07T19:45:26.0831892Z 2025-05-07T19:45:26.0832035Z  2025-05-07T19:45:26.0832256Z 2025-05-07T19:45:26.0832260Z 2025-05-07T19:45:26.0832263Z 2025-05-07T19:45:26.0832267Z 2025-05-07T19:45:26.0832270Z 2025-05-07T19:45:26.0832274Z 2025-05-07T19:45:26.0832277Z 2025-05-07T19:45:26.0832281Z 2025-05-07T19:45:26.0832284Z 2025-05-07T19:45:26.0832288Z 2025-05-07T19:45:26.0832291Z 2025-05-07T19:45:26.0832295Z 2025-05-07T19:45:26.0832298Z 2025-05-07T19:45:26.0832302Z 2025-05-07T19:45:26.0832457Z  2025-05-07T19:45:26.0832680Z 2025-05-07T19:45:26.0832684Z 2025-05-07T19:45:26.0832687Z 2025-05-07T19:45:26.0832691Z 2025-05-07T19:45:26.0832695Z 2025-05-07T19:45:26.0832698Z 2025-05-07T19:45:26.0832702Z 2025-05-07T19:45:26.0832706Z 2025-05-07T19:45:26.0832709Z 2025-05-07T19:45:26.0832713Z 2025-05-07T19:45:26.0832717Z 2025-05-07T19:45:26.0832721Z 2025-05-07T19:45:26.0832725Z 2025-05-07T19:45:26.0832728Z 2025-05-07T19:45:26.0832732Z 2025-05-07T19:45:26.0832902Z  2025-05-07T19:45:26.0833116Z 2025-05-07T19:45:26.0833119Z 2025-05-07T19:45:26.0833124Z 2025-05-07T19:45:26.0833128Z 2025-05-07T19:45:26.0833131Z 2025-05-07T19:45:26.0833135Z 2025-05-07T19:45:26.0833138Z 2025-05-07T19:45:26.0833142Z 2025-05-07T19:45:26.0833145Z 2025-05-07T19:45:26.0833149Z 2025-05-07T19:45:26.0833152Z 2025-05-07T19:45:26.0833156Z 2025-05-07T19:45:26.0833159Z 2025-05-07T19:45:26.0833163Z 2025-05-07T19:45:26.0833166Z 2025-05-07T19:45:26.0833255Z 2025-05-07T19:45:26.0833411Z  2025-05-07T19:45:26.0833629Z 2025-05-07T19:45:26.0833633Z 2025-05-07T19:45:26.0833637Z 2025-05-07T19:45:26.0833640Z 2025-05-07T19:45:26.0833644Z 2025-05-07T19:45:26.0833648Z 2025-05-07T19:45:26.0833652Z 2025-05-07T19:45:26.0833656Z 2025-05-07T19:45:26.0833659Z 2025-05-07T19:45:26.0833663Z 2025-05-07T19:45:26.0833683Z 2025-05-07T19:45:26.0833687Z 2025-05-07T19:45:26.0833690Z 2025-05-07T19:45:26.0833694Z 2025-05-07T19:45:26.0833697Z 2025-05-07T19:45:26.0833701Z 2025-05-07T19:45:26.0833704Z 2025-05-07T19:45:26.0833878Z  done 2025-05-07T19:45:26.3918317Z Preparing transaction: | / - done 2025-05-07T19:45:29.5104884Z Verifying transaction: | / - \ | / - \ | / - \ | / - \ | / - \ | / - \ | done 2025-05-07T19:45:31.9330488Z Executing transaction: - \ | / - \ | / - \ | / - \ | / - \ | / - \ | / done 2025-05-07T19:45:32.2434266Z [INSTALL] Adding symlink librhash.so.0, which is needed by CMake ... 2025-05-07T19:45:33.8786636Z + ln -s /github/home/miniconda/envs/build_binary/lib/librhash.so /github/home/miniconda/envs/build_binary/lib/librhash.so.0 2025-05-07T19:45:33.8787546Z 2025-05-07T19:45:33.8808846Z 2025-05-07T19:45:33.8834852Z [EXEC] [ATTEMPT 0/3] + conda run -n build_binary pip install build 2025-05-07T19:45:36.0169494Z WARNING: Running pip as the 'root' user can result in broken permissions and conflicting behaviour with the system package manager, possibly rendering your system unusable. It is recommended to use a virtual environment instead: https://pip.pypa.io/warnings/venv. Use the --root-user-action option if you know what you are doing and want to suppress this warning. 2025-05-07T19:45:36.0171134Z 2025-05-07T19:45:36.0171233Z Collecting build 2025-05-07T19:45:36.0171602Z Downloading build-1.2.2.post1-py3-none-any.whl.metadata (6.5 kB) 2025-05-07T19:45:36.0172482Z Requirement already satisfied: packaging>=19.1 in /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages (from build) (25.0) 2025-05-07T19:45:36.0173230Z Collecting pyproject_hooks (from build) 2025-05-07T19:45:36.0173689Z Downloading pyproject_hooks-1.2.0-py3-none-any.whl.metadata (1.3 kB) 2025-05-07T19:45:36.0174195Z Downloading build-1.2.2.post1-py3-none-any.whl (22 kB) 2025-05-07T19:45:36.0174656Z Downloading pyproject_hooks-1.2.0-py3-none-any.whl (10 kB) 2025-05-07T19:45:36.0175099Z Installing collected packages: pyproject_hooks, build 2025-05-07T19:45:36.0175380Z 2025-05-07T19:45:36.0175578Z Successfully installed build-1.2.2.post1 pyproject_hooks-1.2.0 2025-05-07T19:45:36.0175883Z 2025-05-07T19:45:37.6588086Z /github/home/miniconda/envs/build_binary/bin/make 2025-05-07T19:45:37.6588431Z 2025-05-07T19:45:37.7157769Z [CHECK] Binary make found in PATH 2025-05-07T19:45:39.2936952Z /github/home/miniconda/envs/build_binary/bin/cmake 2025-05-07T19:45:39.2937564Z 2025-05-07T19:45:39.3516926Z [CHECK] Binary cmake found in PATH 2025-05-07T19:45:40.9301899Z /github/home/miniconda/envs/build_binary/bin/ninja 2025-05-07T19:45:40.9302727Z 2025-05-07T19:45:40.9862673Z [CHECK] Binary ninja found in PATH 2025-05-07T19:45:42.6603190Z [CHECK] Python (sub-)package 'click' found ... 2025-05-07T19:45:44.4604141Z [CHECK] Python (sub-)package 'hypothesis' found ... 2025-05-07T19:45:46.1713343Z [CHECK] Python (sub-)package 'jinja2' found ... 2025-05-07T19:45:47.9538074Z [CHECK] Python (sub-)package 'skbuild' found ... 2025-05-07T19:45:49.6004044Z [CHECK] Python (sub-)package 'wheel' found ... 2025-05-07T19:45:49.6004904Z [INSTALL] Successfully installed all the build tools 2025-05-07T19:45:49.6078147Z ##[group]Run . $PRELUDE; install_cuda $BUILD_ENV 12.6.3 2025-05-07T19:45:49.6078672Z . $PRELUDE; install_cuda $BUILD_ENV 12.6.3 2025-05-07T19:45:49.6079269Z shell: bash --noprofile --norc -e -o pipefail {0} 2025-05-07T19:45:49.6079650Z env: 2025-05-07T19:45:49.6080076Z PRELUDE: .github/scripts/setup_env.bash 2025-05-07T19:45:49.6080416Z BUILD_ENV: build_binary 2025-05-07T19:45:49.6080679Z BUILD_TARGET: default 2025-05-07T19:45:49.6080945Z BUILD_VARIANT: cuda 2025-05-07T19:45:49.6081209Z BUILD_CUDA_VERSION: 12.6.3 2025-05-07T19:45:49.6081468Z ##[endgroup] 2025-05-07T19:45:50.0195005Z ################################################################################ 2025-05-07T19:45:50.0195404Z # Install CUDA 2025-05-07T19:45:50.0195627Z # 2025-05-07T19:45:50.0214554Z # [2025-05-07T19:45:50.020Z] + install_cuda build_binary 12.6.3 2025-05-07T19:45:50.0215100Z ################################################################################ 2025-05-07T19:45:50.0215405Z 2025-05-07T19:45:50.0235473Z [EXEC] [ATTEMPT 0/3] + wget -q --timeout 1 pypi.org -O /dev/null 2025-05-07T19:45:50.1089938Z [CHECK] Network does not appear to be blocked. 2025-05-07T19:45:50.1091024Z [SETUP] Cleaning up Conda packages ... 2025-05-07T19:45:50.1092771Z + conda clean --packages --tarball -y 2025-05-07T19:45:50.1093427Z 2025-05-07T19:45:50.6560478Z Will remove 148 (631.5 MB) tarball(s). 2025-05-07T19:45:50.6561448Z Will remove 22 (115.3 MB) package(s). 2025-05-07T19:45:50.7136082Z 2025-05-07T19:45:50.7138964Z + conda clean --all -y 2025-05-07T19:45:50.7139343Z 2025-05-07T19:45:51.3086805Z There are no unused tarball(s) to remove. 2025-05-07T19:45:51.3087841Z Will remove 1 index cache(s). 2025-05-07T19:45:51.3088697Z There are no unused package(s) to remove. 2025-05-07T19:45:51.3089612Z There are no tempfile(s) to remove. 2025-05-07T19:45:51.3090483Z There are no logfile(s) to remove. 2025-05-07T19:45:51.3658705Z 2025-05-07T19:45:51.3666193Z [INSTALL] Installing CUDA 12.6.3 ... 2025-05-07T19:45:51.3694325Z [EXEC] [ATTEMPT 0/3] + conda install --force-reinstall -n build_binary -c conda-forge --override-channels -y cuda=12.6.3 2025-05-07T19:45:52.1933013Z Channels: 2025-05-07T19:45:52.1933705Z - conda-forge 2025-05-07T19:45:52.1934354Z Platform: linux-64 2025-05-07T19:46:01.7714908Z Collecting package metadata (repodata.json): - \ | / - \ | / - \ | / - \ | / - done 2025-05-07T19:46:05.5577804Z Solving environment: | / - \ | / done 2025-05-07T19:46:05.6955085Z 2025-05-07T19:46:05.6956300Z ## Package Plan ## 2025-05-07T19:46:05.6956915Z 2025-05-07T19:46:05.6957128Z environment location: /github/home/miniconda/envs/build_binary 2025-05-07T19:46:05.6957470Z 2025-05-07T19:46:05.6957576Z added / updated specs: 2025-05-07T19:46:05.6957835Z - cuda=12.6.3 2025-05-07T19:46:05.6958129Z 2025-05-07T19:46:05.6958135Z 2025-05-07T19:46:05.6958270Z The following packages will be downloaded: 2025-05-07T19:46:05.6958510Z 2025-05-07T19:46:05.6958861Z package | build 2025-05-07T19:46:05.6959353Z ---------------------------|----------------- 2025-05-07T19:46:05.6959731Z attr-2.5.1 | h166bdaf_1 69 KB conda-forge 2025-05-07T19:46:05.6960193Z binutils-2.40 | h4852527_7 31 KB conda-forge 2025-05-07T19:46:05.6960692Z c-compiler-1.5.2 | h0b41bf4_0 6 KB conda-forge 2025-05-07T19:46:05.6961127Z cuda-12.6.3 | ha804496_0 26 KB conda-forge 2025-05-07T19:46:05.6961598Z cuda-cccl_linux-64-12.6.77 | ha770c72_0 1.0 MB conda-forge 2025-05-07T19:46:05.6962129Z cuda-command-line-tools-12.6.3| ha770c72_0 20 KB conda-forge 2025-05-07T19:46:05.6962682Z cuda-compiler-12.6.3 | hbad6d8a_0 20 KB conda-forge 2025-05-07T19:46:05.6963191Z cuda-crt-dev_linux-64-12.6.85| ha770c72_0 87 KB conda-forge 2025-05-07T19:46:05.6964107Z cuda-crt-tools-12.6.85 | ha770c72_0 26 KB conda-forge 2025-05-07T19:46:05.6964637Z cuda-cudart-12.6.77 | h5888daf_0 22 KB conda-forge 2025-05-07T19:46:05.6965355Z cuda-cudart-dev-12.6.77 | h5888daf_0 22 KB conda-forge 2025-05-07T19:46:05.6966070Z cuda-cudart-dev_linux-64-12.6.77| h3f2d84a_0 357 KB conda-forge 2025-05-07T19:46:05.6966619Z cuda-cudart-static-12.6.77 | h5888daf_0 22 KB conda-forge 2025-05-07T19:46:05.6967204Z cuda-cudart-static_linux-64-12.6.77| h3f2d84a_0 744 KB conda-forge 2025-05-07T19:46:05.6967775Z cuda-cudart_linux-64-12.6.77| h3f2d84a_0 184 KB conda-forge 2025-05-07T19:46:05.6968291Z cuda-cuobjdump-12.6.77 | hbd13f7d_1 241 KB conda-forge 2025-05-07T19:46:05.6968793Z cuda-cupti-12.6.80 | hbd13f7d_0 1.9 MB conda-forge 2025-05-07T19:46:05.6969276Z cuda-cupti-dev-12.6.80 | h5888daf_0 3.4 MB conda-forge 2025-05-07T19:46:05.6969781Z cuda-cuxxfilt-12.6.77 | hbd13f7d_1 211 KB conda-forge 2025-05-07T19:46:05.6970276Z cuda-driver-dev-12.6.77 | h5888daf_0 22 KB conda-forge 2025-05-07T19:46:05.6971058Z cuda-driver-dev_linux-64-12.6.77| h3f2d84a_0 35 KB conda-forge 2025-05-07T19:46:05.6971545Z cuda-gdb-12.6.77 | h50b4baa_1 370 KB conda-forge 2025-05-07T19:46:05.6971978Z cuda-libraries-12.6.3 | ha770c72_0 20 KB conda-forge 2025-05-07T19:46:05.6972463Z cuda-libraries-dev-12.6.3 | ha770c72_0 20 KB conda-forge 2025-05-07T19:46:05.6972924Z cuda-nsight-12.6.77 | h7938cbb_0 113.2 MB conda-forge 2025-05-07T19:46:05.6973370Z cuda-nvcc-12.6.85 | hcdd1206_0 23 KB conda-forge 2025-05-07T19:46:05.6973827Z cuda-nvcc-dev_linux-64-12.6.85| he91c749_0 10.8 MB conda-forge 2025-05-07T19:46:05.6974320Z cuda-nvcc-impl-12.6.85 | h85509e4_0 25 KB conda-forge 2025-05-07T19:46:05.6974787Z cuda-nvcc-tools-12.6.85 | he02047a_0 23.0 MB conda-forge 2025-05-07T19:46:05.6975250Z cuda-nvcc_linux-64-12.6.85 | h04802cd_0 25 KB conda-forge 2025-05-07T19:46:05.6975727Z cuda-nvdisasm-12.6.77 | hbd13f7d_1 47.6 MB conda-forge 2025-05-07T19:46:05.6976175Z cuda-nvml-dev-12.6.77 | hbd13f7d_1 159 KB conda-forge 2025-05-07T19:46:05.6976629Z cuda-nvprof-12.6.80 | hbd13f7d_0 2.6 MB conda-forge 2025-05-07T19:46:05.6977088Z cuda-nvprune-12.6.77 | hbd13f7d_1 66 KB conda-forge 2025-05-07T19:46:05.6977527Z cuda-nvrtc-12.6.85 | hbd13f7d_0 17.3 MB conda-forge 2025-05-07T19:46:05.6977987Z cuda-nvrtc-dev-12.6.85 | h5888daf_0 31 KB conda-forge 2025-05-07T19:46:05.6978439Z cuda-nvtx-12.6.77 | hbd13f7d_0 31 KB conda-forge 2025-05-07T19:46:05.6978916Z cuda-nvvm-dev_linux-64-12.6.85| ha770c72_0 25 KB conda-forge 2025-05-07T19:46:05.6979392Z cuda-nvvm-impl-12.6.85 | he02047a_0 7.7 MB conda-forge 2025-05-07T19:46:05.6979870Z cuda-nvvm-tools-12.6.85 | he02047a_0 10.4 MB conda-forge 2025-05-07T19:46:05.6980335Z cuda-nvvp-12.6.80 | hbd13f7d_1 109.3 MB conda-forge 2025-05-07T19:46:05.6980768Z cuda-opencl-12.6.77 | hbd13f7d_0 29 KB conda-forge 2025-05-07T19:46:05.6981234Z cuda-opencl-dev-12.6.77 | h5888daf_0 93 KB conda-forge 2025-05-07T19:46:05.6981706Z cuda-profiler-api-12.6.77 | h7938cbb_0 22 KB conda-forge 2025-05-07T19:46:05.6982207Z cuda-runtime-12.6.3 | ha804496_0 19 KB conda-forge 2025-05-07T19:46:05.6982692Z cuda-sanitizer-api-12.6.77 | hbd13f7d_1 8.9 MB conda-forge 2025-05-07T19:46:05.6983328Z cuda-toolkit-12.6.3 | ha804496_0 19 KB conda-forge 2025-05-07T19:46:05.6983768Z cuda-tools-12.6.3 | ha770c72_0 19 KB conda-forge 2025-05-07T19:46:05.6984222Z cuda-version-12.6 | h7480c83_3 20 KB conda-forge 2025-05-07T19:46:05.6985507Z cuda-visual-tools-12.6.3 | ha770c72_0 19 KB conda-forge 2025-05-07T19:46:05.6985989Z cxx-compiler-1.5.2 | hf52228f_0 6 KB conda-forge 2025-05-07T19:46:05.6986402Z dbus-1.13.6 | h5008d03_3 604 KB conda-forge 2025-05-07T19:46:05.6986805Z gcc-11.4.0 | h602e360_13 49 KB conda-forge 2025-05-07T19:46:05.6987222Z gds-tools-1.11.1.6 | h5888daf_4 37.8 MB conda-forge 2025-05-07T19:46:05.6987624Z gmp-6.3.0 | hac33072_2 449 KB conda-forge 2025-05-07T19:46:05.6988029Z gxx-11.4.0 | h602e360_13 49 KB conda-forge 2025-05-07T19:46:05.6988416Z libcap-2.75 | h39aace5_0 118 KB conda-forge 2025-05-07T19:46:05.6988858Z libcublas-12.6.4.1 | h5888daf_1 256.2 MB conda-forge 2025-05-07T19:46:05.6989327Z libcublas-dev-12.6.4.1 | h5888daf_1 88 KB conda-forge 2025-05-07T19:46:05.6989775Z libcufft-11.3.0.4 | hbd13f7d_0 156.2 MB conda-forge 2025-05-07T19:46:05.6990238Z libcufft-dev-11.3.0.4 | h5888daf_0 33 KB conda-forge 2025-05-07T19:46:05.6990691Z libcufile-1.11.1.6 | h12f29b5_4 900 KB conda-forge 2025-05-07T19:46:05.6991160Z libcufile-dev-1.11.1.6 | h5888daf_4 35 KB conda-forge 2025-05-07T19:46:05.6991718Z libcurand-10.3.7.77 | hbd13f7d_0 39.9 MB conda-forge 2025-05-07T19:46:05.6992402Z libcurand-dev-10.3.7.77 | h5888daf_0 262 KB conda-forge 2025-05-07T19:46:05.6992983Z libcusolver-11.7.1.2 | h5888daf_1 95.8 MB conda-forge 2025-05-07T19:46:05.6993483Z libcusolver-dev-11.7.1.2 | h5888daf_1 59 KB conda-forge 2025-05-07T19:46:05.6994012Z libcusparse-12.5.4.2 | hbd13f7d_0 118.6 MB conda-forge 2025-05-07T19:46:05.6994511Z libcusparse-dev-12.5.4.2 | h5888daf_0 51 KB conda-forge 2025-05-07T19:46:05.6995032Z libgcrypt-lib-1.11.0 | hb9d3cd8_2 572 KB conda-forge 2025-05-07T19:46:05.6995512Z libgpg-error-1.55 | h3f2d84a_0 305 KB conda-forge 2025-05-07T19:46:05.6995966Z libnl-3.11.0 | hb9d3cd8_0 724 KB conda-forge 2025-05-07T19:46:05.6996410Z libnpp-12.3.1.54 | h5888daf_0 93.4 MB conda-forge 2025-05-07T19:46:05.6996863Z libnpp-dev-12.3.1.54 | h5888daf_0 441 KB conda-forge 2025-05-07T19:46:05.6997333Z libnuma-2.0.18 | h4ab18f5_2 42 KB conda-forge 2025-05-07T19:46:05.6997799Z libnvfatbin-12.6.77 | hbd13f7d_0 783 KB conda-forge 2025-05-07T19:46:05.6998408Z libnvfatbin-dev-12.6.77 | h5888daf_0 26 KB conda-forge 2025-05-07T19:46:05.6998886Z libnvjitlink-12.6.85 | hbd13f7d_0 14.9 MB conda-forge 2025-05-07T19:46:05.6999355Z libnvjitlink-dev-12.6.85 | h5888daf_0 25 KB conda-forge 2025-05-07T19:46:05.6999824Z libnvjpeg-12.3.3.54 | h5888daf_0 2.4 MB conda-forge 2025-05-07T19:46:05.7000272Z libnvjpeg-dev-12.3.3.54 | ha770c72_0 31 KB conda-forge 2025-05-07T19:46:05.7000730Z libsqlite-3.49.2 | hee588c1_0 895 KB conda-forge 2025-05-07T19:46:05.7001159Z libsystemd0-257.4 | h4e0b6ca_1 477 KB conda-forge 2025-05-07T19:46:05.7001604Z libudev1-257.4 | hbe16f8c_1 141 KB conda-forge 2025-05-07T19:46:05.7002163Z libxkbcommon-1.7.0 | h2c5496b_1 579 KB conda-forge 2025-05-07T19:46:05.7002603Z libxkbfile-1.1.0 | h166bdaf_1 111 KB conda-forge 2025-05-07T19:46:05.7003030Z lz4-c-1.10.0 | h5888daf_1 163 KB conda-forge 2025-05-07T19:46:05.7003466Z nsight-compute-2024.3.2.3 | hb5ebaad_0 443.1 MB conda-forge 2025-05-07T19:46:05.7003994Z nspr-4.36 | h5888daf_0 225 KB conda-forge 2025-05-07T19:46:05.7004371Z nss-3.111 | h159eef7_0 1.9 MB conda-forge 2025-05-07T19:46:05.7004771Z ocl-icd-2.3.3 | hb9d3cd8_0 104 KB conda-forge 2025-05-07T19:46:05.7005231Z opencl-headers-2024.10.24 | h5888daf_0 53 KB conda-forge 2025-05-07T19:46:05.7005684Z rdma-core-57.0 | h5888daf_0 1.2 MB conda-forge 2025-05-07T19:46:05.7006107Z sqlite-3.49.2 | h9eae976_0 840 KB conda-forge 2025-05-07T19:46:05.7006515Z wayland-1.23.1 | h3e06ad9_0 314 KB conda-forge 2025-05-07T19:46:05.7006948Z xcb-util-0.4.1 | hb711507_2 19 KB conda-forge 2025-05-07T19:46:05.7007399Z xcb-util-cursor-0.1.5 | hb9d3cd8_0 20 KB conda-forge 2025-05-07T19:46:05.7007856Z xcb-util-image-0.4.0 | hb711507_2 24 KB conda-forge 2025-05-07T19:46:05.7008330Z xcb-util-keysyms-0.4.1 | hb711507_0 14 KB conda-forge 2025-05-07T19:46:05.7008804Z xcb-util-renderutil-0.3.10 | hb711507_0 17 KB conda-forge 2025-05-07T19:46:05.7009272Z xcb-util-wm-0.4.2 | hb711507_0 50 KB conda-forge 2025-05-07T19:46:05.7009722Z xkeyboard-config-2.44 | hb9d3cd8_0 384 KB conda-forge 2025-05-07T19:46:05.7010225Z xorg-libxcomposite-0.4.6 | hb9d3cd8_2 13 KB conda-forge 2025-05-07T19:46:05.7010724Z xorg-libxdamage-1.1.6 | hb9d3cd8_0 13 KB conda-forge 2025-05-07T19:46:05.7011140Z ------------------------------------------------------------ 2025-05-07T19:46:05.7011499Z Total: 1.59 GB 2025-05-07T19:46:05.7011711Z 2025-05-07T19:46:05.7011841Z The following NEW packages will be INSTALLED: 2025-05-07T19:46:05.7012087Z 2025-05-07T19:46:05.7012264Z attr conda-forge/linux-64::attr-2.5.1-h166bdaf_1 2025-05-07T19:46:05.7012695Z binutils conda-forge/linux-64::binutils-2.40-h4852527_7 2025-05-07T19:46:05.7013150Z c-compiler conda-forge/linux-64::c-compiler-1.5.2-h0b41bf4_0 2025-05-07T19:46:05.7013599Z cuda conda-forge/noarch::cuda-12.6.3-ha804496_0 2025-05-07T19:46:05.7014072Z cuda-cccl_linux-64 conda-forge/noarch::cuda-cccl_linux-64-12.6.77-ha770c72_0 2025-05-07T19:46:05.7014684Z cuda-command-line~ conda-forge/linux-64::cuda-command-line-tools-12.6.3-ha770c72_0 2025-05-07T19:46:05.7015285Z cuda-compiler conda-forge/noarch::cuda-compiler-12.6.3-hbad6d8a_0 2025-05-07T19:46:05.7015834Z cuda-crt-dev_linu~ conda-forge/noarch::cuda-crt-dev_linux-64-12.6.85-ha770c72_0 2025-05-07T19:46:05.7016408Z cuda-crt-tools conda-forge/linux-64::cuda-crt-tools-12.6.85-ha770c72_0 2025-05-07T19:46:05.7016921Z cuda-cudart conda-forge/linux-64::cuda-cudart-12.6.77-h5888daf_0 2025-05-07T19:46:05.7017459Z cuda-cudart-dev conda-forge/linux-64::cuda-cudart-dev-12.6.77-h5888daf_0 2025-05-07T19:46:05.7018053Z cuda-cudart-dev_l~ conda-forge/noarch::cuda-cudart-dev_linux-64-12.6.77-h3f2d84a_0 2025-05-07T19:46:05.7018653Z cuda-cudart-static conda-forge/linux-64::cuda-cudart-static-12.6.77-h5888daf_0 2025-05-07T19:46:05.7019287Z cuda-cudart-stati~ conda-forge/noarch::cuda-cudart-static_linux-64-12.6.77-h3f2d84a_0 2025-05-07T19:46:05.7019890Z cuda-cudart_linux~ conda-forge/noarch::cuda-cudart_linux-64-12.6.77-h3f2d84a_0 2025-05-07T19:46:05.7020545Z cuda-cuobjdump conda-forge/linux-64::cuda-cuobjdump-12.6.77-hbd13f7d_1 2025-05-07T19:46:05.7021081Z cuda-cupti conda-forge/linux-64::cuda-cupti-12.6.80-hbd13f7d_0 2025-05-07T19:46:05.7021584Z cuda-cupti-dev conda-forge/linux-64::cuda-cupti-dev-12.6.80-h5888daf_0 2025-05-07T19:46:05.7022131Z cuda-cuxxfilt conda-forge/linux-64::cuda-cuxxfilt-12.6.77-hbd13f7d_1 2025-05-07T19:46:05.7022764Z cuda-driver-dev conda-forge/linux-64::cuda-driver-dev-12.6.77-h5888daf_0 2025-05-07T19:46:05.7023352Z cuda-driver-dev_l~ conda-forge/noarch::cuda-driver-dev_linux-64-12.6.77-h3f2d84a_0 2025-05-07T19:46:05.7023892Z cuda-gdb conda-forge/linux-64::cuda-gdb-12.6.77-h50b4baa_1 2025-05-07T19:46:05.7024381Z cuda-libraries conda-forge/linux-64::cuda-libraries-12.6.3-ha770c72_0 2025-05-07T19:46:05.7024962Z cuda-libraries-dev conda-forge/linux-64::cuda-libraries-dev-12.6.3-ha770c72_0 2025-05-07T19:46:05.7025505Z cuda-nsight conda-forge/linux-64::cuda-nsight-12.6.77-h7938cbb_0 2025-05-07T19:46:05.7026001Z cuda-nvcc conda-forge/linux-64::cuda-nvcc-12.6.85-hcdd1206_0 2025-05-07T19:46:05.7026543Z cuda-nvcc-dev_lin~ conda-forge/noarch::cuda-nvcc-dev_linux-64-12.6.85-he91c749_0 2025-05-07T19:46:05.7027109Z cuda-nvcc-impl conda-forge/linux-64::cuda-nvcc-impl-12.6.85-h85509e4_0 2025-05-07T19:46:05.7027665Z cuda-nvcc-tools conda-forge/linux-64::cuda-nvcc-tools-12.6.85-he02047a_0 2025-05-07T19:46:05.7028223Z cuda-nvcc_linux-64 conda-forge/linux-64::cuda-nvcc_linux-64-12.6.85-h04802cd_0 2025-05-07T19:46:05.7028784Z cuda-nvdisasm conda-forge/linux-64::cuda-nvdisasm-12.6.77-hbd13f7d_1 2025-05-07T19:46:05.7029324Z cuda-nvml-dev conda-forge/linux-64::cuda-nvml-dev-12.6.77-hbd13f7d_1 2025-05-07T19:46:05.7029825Z cuda-nvprof conda-forge/linux-64::cuda-nvprof-12.6.80-hbd13f7d_0 2025-05-07T19:46:05.7030342Z cuda-nvprune conda-forge/linux-64::cuda-nvprune-12.6.77-hbd13f7d_1 2025-05-07T19:46:05.7030838Z cuda-nvrtc conda-forge/linux-64::cuda-nvrtc-12.6.85-hbd13f7d_0 2025-05-07T19:46:05.7031356Z cuda-nvrtc-dev conda-forge/linux-64::cuda-nvrtc-dev-12.6.85-h5888daf_0 2025-05-07T19:46:05.7032173Z cuda-nvtx conda-forge/linux-64::cuda-nvtx-12.6.77-hbd13f7d_0 2025-05-07T19:46:05.7032831Z cuda-nvvm-dev_lin~ conda-forge/noarch::cuda-nvvm-dev_linux-64-12.6.85-ha770c72_0 2025-05-07T19:46:05.7033471Z cuda-nvvm-impl conda-forge/linux-64::cuda-nvvm-impl-12.6.85-he02047a_0 2025-05-07T19:46:05.7034069Z cuda-nvvm-tools conda-forge/linux-64::cuda-nvvm-tools-12.6.85-he02047a_0 2025-05-07T19:46:05.7034663Z cuda-nvvp conda-forge/linux-64::cuda-nvvp-12.6.80-hbd13f7d_1 2025-05-07T19:46:05.7035226Z cuda-opencl conda-forge/linux-64::cuda-opencl-12.6.77-hbd13f7d_0 2025-05-07T19:46:05.7035805Z cuda-opencl-dev conda-forge/linux-64::cuda-opencl-dev-12.6.77-h5888daf_0 2025-05-07T19:46:05.7036473Z cuda-profiler-api conda-forge/linux-64::cuda-profiler-api-12.6.77-h7938cbb_0 2025-05-07T19:46:05.7037081Z cuda-runtime conda-forge/noarch::cuda-runtime-12.6.3-ha804496_0 2025-05-07T19:46:05.7037727Z cuda-sanitizer-api conda-forge/linux-64::cuda-sanitizer-api-12.6.77-hbd13f7d_1 2025-05-07T19:46:05.7038465Z cuda-toolkit conda-forge/noarch::cuda-toolkit-12.6.3-ha804496_0 2025-05-07T19:46:05.7038968Z cuda-tools conda-forge/linux-64::cuda-tools-12.6.3-ha770c72_0 2025-05-07T19:46:05.7039503Z cuda-version conda-forge/noarch::cuda-version-12.6-h7480c83_3 2025-05-07T19:46:05.7040058Z cuda-visual-tools conda-forge/linux-64::cuda-visual-tools-12.6.3-ha770c72_0 2025-05-07T19:46:05.7040657Z cxx-compiler conda-forge/linux-64::cxx-compiler-1.5.2-hf52228f_0 2025-05-07T19:46:05.7041149Z dbus conda-forge/linux-64::dbus-1.13.6-h5008d03_3 2025-05-07T19:46:05.7041539Z gcc conda-forge/linux-64::gcc-11.4.0-h602e360_13 2025-05-07T19:46:05.7041987Z gds-tools conda-forge/linux-64::gds-tools-1.11.1.6-h5888daf_4 2025-05-07T19:46:05.7042504Z gmp conda-forge/linux-64::gmp-6.3.0-hac33072_2 2025-05-07T19:46:05.7042902Z gxx conda-forge/linux-64::gxx-11.4.0-h602e360_13 2025-05-07T19:46:05.7043322Z libcap conda-forge/linux-64::libcap-2.75-h39aace5_0 2025-05-07T19:46:05.7043766Z libcublas conda-forge/linux-64::libcublas-12.6.4.1-h5888daf_1 2025-05-07T19:46:05.7044373Z libcublas-dev conda-forge/linux-64::libcublas-dev-12.6.4.1-h5888daf_1 2025-05-07T19:46:05.7044878Z libcufft conda-forge/linux-64::libcufft-11.3.0.4-hbd13f7d_0 2025-05-07T19:46:05.7045388Z libcufft-dev conda-forge/linux-64::libcufft-dev-11.3.0.4-h5888daf_0 2025-05-07T19:46:05.7045898Z libcufile conda-forge/linux-64::libcufile-1.11.1.6-h12f29b5_4 2025-05-07T19:46:05.7046391Z libcufile-dev conda-forge/linux-64::libcufile-dev-1.11.1.6-h5888daf_4 2025-05-07T19:46:05.7046926Z libcurand conda-forge/linux-64::libcurand-10.3.7.77-hbd13f7d_0 2025-05-07T19:46:05.7047444Z libcurand-dev conda-forge/linux-64::libcurand-dev-10.3.7.77-h5888daf_0 2025-05-07T19:46:05.7048009Z libcusolver conda-forge/linux-64::libcusolver-11.7.1.2-h5888daf_1 2025-05-07T19:46:05.7048586Z libcusolver-dev conda-forge/linux-64::libcusolver-dev-11.7.1.2-h5888daf_1 2025-05-07T19:46:05.7049327Z libcusparse conda-forge/linux-64::libcusparse-12.5.4.2-hbd13f7d_0 2025-05-07T19:46:05.7049930Z libcusparse-dev conda-forge/linux-64::libcusparse-dev-12.5.4.2-h5888daf_0 2025-05-07T19:46:05.7050516Z libgcrypt-lib conda-forge/linux-64::libgcrypt-lib-1.11.0-hb9d3cd8_2 2025-05-07T19:46:05.7051269Z libgpg-error conda-forge/linux-64::libgpg-error-1.55-h3f2d84a_0 2025-05-07T19:46:05.7051772Z libnl conda-forge/linux-64::libnl-3.11.0-hb9d3cd8_0 2025-05-07T19:46:05.7052266Z libnpp conda-forge/linux-64::libnpp-12.3.1.54-h5888daf_0 2025-05-07T19:46:05.7052809Z libnpp-dev conda-forge/linux-64::libnpp-dev-12.3.1.54-h5888daf_0 2025-05-07T19:46:05.7053339Z libnuma conda-forge/linux-64::libnuma-2.0.18-h4ab18f5_2 2025-05-07T19:46:05.7053889Z libnvfatbin conda-forge/linux-64::libnvfatbin-12.6.77-hbd13f7d_0 2025-05-07T19:46:05.7054478Z libnvfatbin-dev conda-forge/linux-64::libnvfatbin-dev-12.6.77-h5888daf_0 2025-05-07T19:46:05.7055108Z libnvjitlink conda-forge/linux-64::libnvjitlink-12.6.85-hbd13f7d_0 2025-05-07T19:46:05.7055732Z libnvjitlink-dev conda-forge/linux-64::libnvjitlink-dev-12.6.85-h5888daf_0 2025-05-07T19:46:05.7056315Z libnvjpeg conda-forge/linux-64::libnvjpeg-12.3.3.54-h5888daf_0 2025-05-07T19:46:05.7056901Z libnvjpeg-dev conda-forge/linux-64::libnvjpeg-dev-12.3.3.54-ha770c72_0 2025-05-07T19:46:05.7057465Z libsystemd0 conda-forge/linux-64::libsystemd0-257.4-h4e0b6ca_1 2025-05-07T19:46:05.7057991Z libudev1 conda-forge/linux-64::libudev1-257.4-hbe16f8c_1 2025-05-07T19:46:05.7058512Z libxkbcommon conda-forge/linux-64::libxkbcommon-1.7.0-h2c5496b_1 2025-05-07T19:46:05.7059037Z libxkbfile conda-forge/linux-64::libxkbfile-1.1.0-h166bdaf_1 2025-05-07T19:46:05.7059517Z lz4-c conda-forge/linux-64::lz4-c-1.10.0-h5888daf_1 2025-05-07T19:46:05.7060034Z nsight-compute conda-forge/linux-64::nsight-compute-2024.3.2.3-hb5ebaad_0 2025-05-07T19:46:05.7060571Z nspr conda-forge/linux-64::nspr-4.36-h5888daf_0 2025-05-07T19:46:05.7060992Z nss conda-forge/linux-64::nss-3.111-h159eef7_0 2025-05-07T19:46:05.7061418Z ocl-icd conda-forge/linux-64::ocl-icd-2.3.3-hb9d3cd8_0 2025-05-07T19:46:05.7061968Z opencl-headers conda-forge/linux-64::opencl-headers-2024.10.24-h5888daf_0 2025-05-07T19:46:05.7062515Z rdma-core conda-forge/linux-64::rdma-core-57.0-h5888daf_0 2025-05-07T19:46:05.7063001Z wayland conda-forge/linux-64::wayland-1.23.1-h3e06ad9_0 2025-05-07T19:46:05.7063477Z xcb-util conda-forge/linux-64::xcb-util-0.4.1-hb711507_2 2025-05-07T19:46:05.7064083Z xcb-util-cursor conda-forge/linux-64::xcb-util-cursor-0.1.5-hb9d3cd8_0 2025-05-07T19:46:05.7064677Z xcb-util-image conda-forge/linux-64::xcb-util-image-0.4.0-hb711507_2 2025-05-07T19:46:05.7065448Z xcb-util-keysyms conda-forge/linux-64::xcb-util-keysyms-0.4.1-hb711507_0 2025-05-07T19:46:05.7066085Z xcb-util-renderut~ conda-forge/linux-64::xcb-util-renderutil-0.3.10-hb711507_0 2025-05-07T19:46:05.7066796Z xcb-util-wm conda-forge/linux-64::xcb-util-wm-0.4.2-hb711507_0 2025-05-07T19:46:05.7067349Z xkeyboard-config conda-forge/linux-64::xkeyboard-config-2.44-hb9d3cd8_0 2025-05-07T19:46:05.7067992Z xorg-libxcomposite conda-forge/linux-64::xorg-libxcomposite-0.4.6-hb9d3cd8_2 2025-05-07T19:46:05.7068610Z xorg-libxdamage conda-forge/linux-64::xorg-libxdamage-1.1.6-hb9d3cd8_0 2025-05-07T19:46:05.7068980Z 2025-05-07T19:46:05.7069105Z The following packages will be UPDATED: 2025-05-07T19:46:05.7069326Z 2025-05-07T19:46:05.7069519Z libsqlite 3.46.0-hde9e2c9_0 --> 3.49.2-hee588c1_0 2025-05-07T19:46:05.7069952Z sqlite 3.46.0-h6d4b2fc_0 --> 3.49.2-h9eae976_0 2025-05-07T19:46:05.7070239Z 2025-05-07T19:46:05.7070268Z 2025-05-07T19:46:05.7070271Z 2025-05-07T19:46:05.7070422Z Downloading and Extracting Packages: ...working... 2025-05-07T19:46:05.7070833Z nsight-compute-2024. | 443.1 MB | | 0% 2025-05-07T19:46:05.7071101Z 2025-05-07T19:46:05.7071519Z libcublas-12.6.4.1 | 256.2 MB | | 0%  2025-05-07T19:46:05.7071785Z 2025-05-07T19:46:05.7071789Z 2025-05-07T19:46:05.7083374Z libcufft-11.3.0.4 | 156.2 MB | | 0%  2025-05-07T19:46:05.7083751Z 2025-05-07T19:46:05.7083908Z 2025-05-07T19:46:05.7083914Z 2025-05-07T19:46:05.7094795Z libcusparse-12.5.4.2 | 118.6 MB | | 0%  2025-05-07T19:46:05.7095674Z 2025-05-07T19:46:05.7095680Z 2025-05-07T19:46:05.7095684Z 2025-05-07T19:46:05.7095688Z 2025-05-07T19:46:05.7114255Z cuda-nsight-12.6.77 | 113.2 MB | | 0%  2025-05-07T19:46:05.7115145Z 2025-05-07T19:46:05.7115156Z 2025-05-07T19:46:05.7115167Z 2025-05-07T19:46:05.7115178Z 2025-05-07T19:46:05.7115188Z 2025-05-07T19:46:05.7115897Z cuda-nvvp-12.6.80 | 109.3 MB | | 0%  2025-05-07T19:46:05.7116744Z 2025-05-07T19:46:05.7116774Z 2025-05-07T19:46:05.7116784Z 2025-05-07T19:46:05.7116796Z 2025-05-07T19:46:05.7116806Z 2025-05-07T19:46:05.7116817Z 2025-05-07T19:46:05.7117579Z libcusolver-11.7.1.2 | 95.8 MB | | 0%  2025-05-07T19:46:05.7118463Z 2025-05-07T19:46:05.7118466Z 2025-05-07T19:46:05.7118487Z 2025-05-07T19:46:05.7118490Z 2025-05-07T19:46:05.7118494Z 2025-05-07T19:46:05.7118497Z 2025-05-07T19:46:05.7118517Z 2025-05-07T19:46:05.7118758Z libnpp-12.3.1.54 | 93.4 MB | | 0%  2025-05-07T19:46:05.7119039Z 2025-05-07T19:46:05.7119042Z 2025-05-07T19:46:05.7119046Z 2025-05-07T19:46:05.7119065Z 2025-05-07T19:46:05.7119068Z 2025-05-07T19:46:05.7119076Z 2025-05-07T19:46:05.7119079Z 2025-05-07T19:46:05.7119083Z 2025-05-07T19:46:05.7119350Z cuda-nvdisasm-12.6.7 | 47.6 MB | | 0%  2025-05-07T19:46:05.7119654Z 2025-05-07T19:46:05.7119658Z 2025-05-07T19:46:05.7119661Z 2025-05-07T19:46:05.7119665Z 2025-05-07T19:46:05.7119674Z 2025-05-07T19:46:05.7119693Z 2025-05-07T19:46:05.7119697Z 2025-05-07T19:46:05.7119700Z 2025-05-07T19:46:05.7119704Z 2025-05-07T19:46:05.7122649Z libcurand-10.3.7.77 | 39.9 MB | | 0%  2025-05-07T19:46:05.7123016Z 2025-05-07T19:46:05.7123021Z 2025-05-07T19:46:05.7123026Z 2025-05-07T19:46:05.7123030Z 2025-05-07T19:46:05.7123051Z 2025-05-07T19:46:05.7123056Z 2025-05-07T19:46:05.7123060Z 2025-05-07T19:46:05.7123064Z 2025-05-07T19:46:05.7123068Z 2025-05-07T19:46:05.7123072Z 2025-05-07T19:46:05.7126065Z gds-tools-1.11.1.6 | 37.8 MB | | 0%  2025-05-07T19:46:05.7126380Z 2025-05-07T19:46:05.7126385Z 2025-05-07T19:46:05.7126673Z 2025-05-07T19:46:05.7126682Z 2025-05-07T19:46:05.7126685Z 2025-05-07T19:46:05.7126689Z 2025-05-07T19:46:05.7126693Z 2025-05-07T19:46:05.7126697Z 2025-05-07T19:46:05.7126701Z 2025-05-07T19:46:05.7126706Z 2025-05-07T19:46:05.7126710Z 2025-05-07T19:46:05.7130114Z cuda-nvcc-tools-12.6 | 23.0 MB | | 0%  2025-05-07T19:46:05.7130580Z 2025-05-07T19:46:05.7130584Z 2025-05-07T19:46:05.7130587Z 2025-05-07T19:46:05.7130591Z 2025-05-07T19:46:05.7130594Z 2025-05-07T19:46:05.7130598Z 2025-05-07T19:46:05.7130602Z 2025-05-07T19:46:05.7130605Z 2025-05-07T19:46:05.7130609Z 2025-05-07T19:46:05.7130612Z 2025-05-07T19:46:05.7130616Z 2025-05-07T19:46:05.7130619Z 2025-05-07T19:46:05.7130899Z cuda-nvrtc-12.6.85 | 17.3 MB | | 0%  2025-05-07T19:46:05.7131228Z 2025-05-07T19:46:05.7131232Z 2025-05-07T19:46:05.7131236Z 2025-05-07T19:46:05.7131239Z 2025-05-07T19:46:05.7131243Z 2025-05-07T19:46:05.7131252Z 2025-05-07T19:46:05.7131255Z 2025-05-07T19:46:05.7131259Z 2025-05-07T19:46:05.7131262Z 2025-05-07T19:46:05.7131266Z 2025-05-07T19:46:05.7131270Z 2025-05-07T19:46:05.7131273Z 2025-05-07T19:46:05.7131277Z 2025-05-07T19:46:05.7131590Z libnvjitlink-12.6.85 | 14.9 MB | | 0%  2025-05-07T19:46:05.7131924Z 2025-05-07T19:46:05.7131928Z 2025-05-07T19:46:05.7131931Z 2025-05-07T19:46:05.7131934Z 2025-05-07T19:46:05.7131938Z 2025-05-07T19:46:05.7131941Z 2025-05-07T19:46:05.7131946Z 2025-05-07T19:46:05.7131949Z 2025-05-07T19:46:05.7131953Z 2025-05-07T19:46:05.7131956Z 2025-05-07T19:46:05.7131960Z 2025-05-07T19:46:05.7131963Z 2025-05-07T19:46:05.7131967Z 2025-05-07T19:46:05.7131971Z 2025-05-07T19:46:05.7132297Z cuda-nvcc-dev_linux- | 10.8 MB | | 0%  2025-05-07T19:46:05.7132626Z 2025-05-07T19:46:05.7132629Z 2025-05-07T19:46:05.7132633Z 2025-05-07T19:46:05.7132636Z 2025-05-07T19:46:05.7132646Z 2025-05-07T19:46:05.7132649Z 2025-05-07T19:46:05.7132653Z 2025-05-07T19:46:05.7132657Z 2025-05-07T19:46:05.7132660Z 2025-05-07T19:46:05.7132664Z 2025-05-07T19:46:05.7132685Z 2025-05-07T19:46:05.7132689Z 2025-05-07T19:46:05.7132692Z 2025-05-07T19:46:05.7132696Z 2025-05-07T19:46:05.7132704Z 2025-05-07T19:46:05.7133035Z cuda-nvvm-tools-12.6 | 10.4 MB | | 0%  2025-05-07T19:46:05.7133368Z 2025-05-07T19:46:05.7133390Z 2025-05-07T19:46:05.7133394Z 2025-05-07T19:46:05.7133398Z 2025-05-07T19:46:05.7133401Z 2025-05-07T19:46:05.7133405Z 2025-05-07T19:46:05.7133408Z 2025-05-07T19:46:05.7133411Z 2025-05-07T19:46:05.7133415Z 2025-05-07T19:46:05.7133418Z 2025-05-07T19:46:05.7133422Z 2025-05-07T19:46:05.7133426Z 2025-05-07T19:46:05.7133429Z 2025-05-07T19:46:05.7133433Z 2025-05-07T19:46:05.7133436Z 2025-05-07T19:46:05.7133440Z 2025-05-07T19:46:05.7133769Z cuda-sanitizer-api-1 | 8.9 MB | | 0%  2025-05-07T19:46:05.7134133Z 2025-05-07T19:46:05.7134136Z 2025-05-07T19:46:05.7134140Z 2025-05-07T19:46:05.7134143Z 2025-05-07T19:46:05.7134147Z 2025-05-07T19:46:05.7134151Z 2025-05-07T19:46:05.7134154Z 2025-05-07T19:46:05.7134158Z 2025-05-07T19:46:05.7134165Z 2025-05-07T19:46:05.7134169Z 2025-05-07T19:46:05.7134172Z 2025-05-07T19:46:05.7134176Z 2025-05-07T19:46:05.7134179Z 2025-05-07T19:46:05.7134184Z 2025-05-07T19:46:05.7134187Z 2025-05-07T19:46:05.7134191Z 2025-05-07T19:46:05.7134194Z 2025-05-07T19:46:05.7136028Z cuda-nvvm-impl-12.6. | 7.7 MB | | 0%  2025-05-07T19:46:05.7136381Z 2025-05-07T19:46:05.7136385Z 2025-05-07T19:46:05.7136388Z 2025-05-07T19:46:05.7136392Z 2025-05-07T19:46:05.7136395Z 2025-05-07T19:46:05.7136398Z 2025-05-07T19:46:05.7136402Z 2025-05-07T19:46:05.7136405Z 2025-05-07T19:46:05.7136425Z 2025-05-07T19:46:05.7136428Z 2025-05-07T19:46:05.7136432Z 2025-05-07T19:46:05.7136511Z 2025-05-07T19:46:05.7136516Z 2025-05-07T19:46:05.7136519Z 2025-05-07T19:46:05.7136523Z 2025-05-07T19:46:05.7136526Z 2025-05-07T19:46:05.7136529Z 2025-05-07T19:46:05.7136533Z 2025-05-07T19:46:05.7137217Z cuda-cupti-dev-12.6. | 3.4 MB | | 0%  2025-05-07T19:46:05.7137656Z 2025-05-07T19:46:05.7137660Z 2025-05-07T19:46:05.7137663Z 2025-05-07T19:46:05.7137667Z 2025-05-07T19:46:05.7137670Z 2025-05-07T19:46:05.7137674Z 2025-05-07T19:46:05.7137677Z 2025-05-07T19:46:05.7137681Z 2025-05-07T19:46:05.7137685Z 2025-05-07T19:46:05.7137689Z 2025-05-07T19:46:05.7137692Z 2025-05-07T19:46:05.7137696Z 2025-05-07T19:46:05.7137700Z 2025-05-07T19:46:05.7137703Z 2025-05-07T19:46:05.7137707Z 2025-05-07T19:46:05.7137711Z 2025-05-07T19:46:05.7137714Z 2025-05-07T19:46:05.7137718Z 2025-05-07T19:46:05.7137721Z 2025-05-07T19:46:05.8055319Z ... (more hidden) ... 2025-05-07T19:46:05.8059825Z nsight-compute-2024. | 443.1 MB | | 0% 2025-05-07T19:46:05.8060113Z 2025-05-07T19:46:05.8067197Z libcublas-12.6.4.1 | 256.2 MB | | 0%  2025-05-07T19:46:05.8067973Z 2025-05-07T19:46:05.8067986Z 2025-05-07T19:46:05.8088630Z libcufft-11.3.0.4 | 156.2 MB | 2 | 2%  2025-05-07T19:46:05.8089137Z 2025-05-07T19:46:05.8089192Z 2025-05-07T19:46:05.8089197Z 2025-05-07T19:46:05.8096826Z libcusparse-12.5.4.2 | 118.6 MB | 1 | 1%  2025-05-07T19:46:05.8097724Z 2025-05-07T19:46:05.8097735Z 2025-05-07T19:46:05.8097746Z 2025-05-07T19:46:05.8097757Z 2025-05-07T19:46:05.9055304Z cuda-nsight-12.6.77 | 113.2 MB | | 1%  2025-05-07T19:46:05.9062259Z nsight-compute-2024. | 443.1 MB | 1 | 1% 2025-05-07T19:46:05.9062521Z 2025-05-07T19:46:05.9067232Z libcublas-12.6.4.1 | 256.2 MB | 2 | 3%  2025-05-07T19:46:05.9067493Z 2025-05-07T19:46:05.9067504Z 2025-05-07T19:46:05.9089524Z libcufft-11.3.0.4 | 156.2 MB | 6 | 6%  2025-05-07T19:46:05.9089927Z 2025-05-07T19:46:05.9090130Z 2025-05-07T19:46:05.9090135Z 2025-05-07T19:46:05.9097576Z libcusparse-12.5.4.2 | 118.6 MB | 6 | 6%  2025-05-07T19:46:05.9097898Z 2025-05-07T19:46:05.9097903Z 2025-05-07T19:46:05.9097920Z 2025-05-07T19:46:05.9097934Z 2025-05-07T19:46:06.0058378Z cuda-nsight-12.6.77 | 113.2 MB | 5 | 6%  2025-05-07T19:46:06.0064499Z nsight-compute-2024. | 443.1 MB | 2 | 3% 2025-05-07T19:46:06.0065572Z 2025-05-07T19:46:06.0069480Z libcublas-12.6.4.1 | 256.2 MB | 5 | 5%  2025-05-07T19:46:06.0070267Z 2025-05-07T19:46:06.0070281Z 2025-05-07T19:46:06.0089459Z libcufft-11.3.0.4 | 156.2 MB | # | 10%  2025-05-07T19:46:06.0089883Z 2025-05-07T19:46:06.0089982Z 2025-05-07T19:46:06.0089986Z 2025-05-07T19:46:06.0099812Z libcusparse-12.5.4.2 | 118.6 MB | #1 | 12%  2025-05-07T19:46:06.0100147Z 2025-05-07T19:46:06.0100190Z 2025-05-07T19:46:06.0100194Z 2025-05-07T19:46:06.0100198Z 2025-05-07T19:46:06.1056705Z cuda-nsight-12.6.77 | 113.2 MB | #1 | 11%  2025-05-07T19:46:06.1074230Z nsight-compute-2024. | 443.1 MB | 3 | 4% 2025-05-07T19:46:06.1075007Z 2025-05-07T19:46:06.1075056Z 2025-05-07T19:46:06.1102285Z libcufft-11.3.0.4 | 156.2 MB | #4 | 15%  2025-05-07T19:46:06.1103135Z 2025-05-07T19:46:06.1103150Z 2025-05-07T19:46:06.1103162Z 2025-05-07T19:46:06.1103173Z 2025-05-07T19:46:06.1106603Z cuda-nsight-12.6.77 | 113.2 MB | #7 | 17%  2025-05-07T19:46:06.1107465Z 2025-05-07T19:46:06.1326159Z libcublas-12.6.4.1 | 256.2 MB | 7 | 7%  2025-05-07T19:46:06.1326518Z 2025-05-07T19:46:06.1326585Z 2025-05-07T19:46:06.1326589Z 2025-05-07T19:46:06.2114486Z libcusparse-12.5.4.2 | 118.6 MB | #5 | 16%  2025-05-07T19:46:06.2114853Z 2025-05-07T19:46:06.2115540Z libcublas-12.6.4.1 | 256.2 MB | 9 | 9%  2025-05-07T19:46:06.2115810Z 2025-05-07T19:46:06.2115822Z 2025-05-07T19:46:06.2115826Z 2025-05-07T19:46:06.2115830Z 2025-05-07T19:46:06.2118440Z cuda-nsight-12.6.77 | 113.2 MB | ##2 | 22%  2025-05-07T19:46:06.2252078Z nsight-compute-2024. | 443.1 MB | 5 | 5% 2025-05-07T19:46:06.2252921Z 2025-05-07T19:46:06.2253005Z 2025-05-07T19:46:06.2421230Z libcufft-11.3.0.4 | 156.2 MB | #8 | 19%  2025-05-07T19:46:06.2422067Z 2025-05-07T19:46:06.2422099Z 2025-05-07T19:46:06.2422110Z 2025-05-07T19:46:06.3120092Z libcusparse-12.5.4.2 | 118.6 MB | #9 | 20%  2025-05-07T19:46:06.3160513Z nsight-compute-2024. | 443.1 MB | 6 | 6% 2025-05-07T19:46:06.3161325Z 2025-05-07T19:46:06.3233333Z libcublas-12.6.4.1 | 256.2 MB | #1 | 11%  2025-05-07T19:46:06.3234195Z 2025-05-07T19:46:06.3234209Z 2025-05-07T19:46:06.3234220Z 2025-05-07T19:46:06.3234231Z 2025-05-07T19:46:06.3438128Z cuda-nsight-12.6.77 | 113.2 MB | ##7 | 27%  2025-05-07T19:46:06.3438476Z 2025-05-07T19:46:06.3438481Z 2025-05-07T19:46:06.3841413Z libcufft-11.3.0.4 | 156.2 MB | ##2 | 23%  2025-05-07T19:46:06.3841722Z 2025-05-07T19:46:06.3841727Z 2025-05-07T19:46:06.3841731Z 2025-05-07T19:46:06.4120342Z libcusparse-12.5.4.2 | 118.6 MB | ##3 | 24%  2025-05-07T19:46:06.4161679Z nsight-compute-2024. | 443.1 MB | 7 | 7% 2025-05-07T19:46:06.4161980Z 2025-05-07T19:46:06.4440447Z libcublas-12.6.4.1 | 256.2 MB | #3 | 14%  2025-05-07T19:46:06.4440757Z 2025-05-07T19:46:06.4440766Z 2025-05-07T19:46:06.4484177Z libcufft-11.3.0.4 | 156.2 MB | ##6 | 27%  2025-05-07T19:46:06.4485040Z 2025-05-07T19:46:06.4485053Z 2025-05-07T19:46:06.4485064Z 2025-05-07T19:46:06.4485090Z 2025-05-07T19:46:06.4844075Z cuda-nsight-12.6.77 | 113.2 MB | ###1 | 32%  2025-05-07T19:46:06.4844407Z 2025-05-07T19:46:06.4844413Z 2025-05-07T19:46:06.4844601Z 2025-05-07T19:46:06.5287887Z libcusparse-12.5.4.2 | 118.6 MB | ##7 | 27%  2025-05-07T19:46:06.5288777Z 2025-05-07T19:46:06.5292011Z libcublas-12.6.4.1 | 256.2 MB | #5 | 16%  2025-05-07T19:46:06.5487123Z nsight-compute-2024. | 443.1 MB | 8 | 9% 2025-05-07T19:46:06.5487468Z 2025-05-07T19:46:06.5487473Z 2025-05-07T19:46:06.5487478Z 2025-05-07T19:46:06.5487483Z 2025-05-07T19:46:06.5579760Z cuda-nsight-12.6.77 | 113.2 MB | ###6 | 36%  2025-05-07T19:46:06.5580128Z 2025-05-07T19:46:06.5580133Z 2025-05-07T19:46:06.5848961Z libcufft-11.3.0.4 | 156.2 MB | ### | 31%  2025-05-07T19:46:06.5849344Z 2025-05-07T19:46:06.5849413Z 2025-05-07T19:46:06.5849486Z 2025-05-07T19:46:06.6330732Z libcusparse-12.5.4.2 | 118.6 MB | ###1 | 31%  2025-05-07T19:46:06.6331638Z 2025-05-07T19:46:06.6411275Z libcublas-12.6.4.1 | 256.2 MB | #7 | 18%  2025-05-07T19:46:06.6488518Z nsight-compute-2024. | 443.1 MB | 9 | 10% 2025-05-07T19:46:06.6488905Z 2025-05-07T19:46:06.6488972Z 2025-05-07T19:46:06.6489020Z 2025-05-07T19:46:06.6489031Z 2025-05-07T19:46:06.6673208Z cuda-nsight-12.6.77 | 113.2 MB | #### | 41%  2025-05-07T19:46:06.6673525Z 2025-05-07T19:46:06.6673530Z 2025-05-07T19:46:06.6850248Z libcufft-11.3.0.4 | 156.2 MB | ###4 | 34%  2025-05-07T19:46:06.6850547Z 2025-05-07T19:46:06.6850552Z 2025-05-07T19:46:06.6850556Z 2025-05-07T19:46:06.7329636Z libcusparse-12.5.4.2 | 118.6 MB | ###5 | 35%  2025-05-07T19:46:06.7330056Z 2025-05-07T19:46:06.7412171Z libcublas-12.6.4.1 | 256.2 MB | ## | 20%  2025-05-07T19:46:06.7682331Z nsight-compute-2024. | 443.1 MB | #1 | 11% 2025-05-07T19:46:06.7682633Z 2025-05-07T19:46:06.7682638Z 2025-05-07T19:46:06.7849346Z libcufft-11.3.0.4 | 156.2 MB | ###7 | 38%  2025-05-07T19:46:06.7849807Z 2025-05-07T19:46:06.7849912Z 2025-05-07T19:46:06.7849949Z 2025-05-07T19:46:06.7984230Z libcusparse-12.5.4.2 | 118.6 MB | ###9 | 40%  2025-05-07T19:46:06.7985150Z 2025-05-07T19:46:06.7985164Z 2025-05-07T19:46:06.7985175Z 2025-05-07T19:46:06.7985185Z 2025-05-07T19:46:06.8422123Z cuda-nsight-12.6.77 | 113.2 MB | ####5 | 45%  2025-05-07T19:46:06.8422457Z 2025-05-07T19:46:06.8459212Z libcublas-12.6.4.1 | 256.2 MB | ##2 | 22%  2025-05-07T19:46:06.8713654Z nsight-compute-2024. | 443.1 MB | #2 | 12% 2025-05-07T19:46:06.8713951Z 2025-05-07T19:46:06.8713956Z 2025-05-07T19:46:06.8908504Z libcufft-11.3.0.4 | 156.2 MB | ####1 | 41%  2025-05-07T19:46:06.8908900Z 2025-05-07T19:46:06.8908979Z 2025-05-07T19:46:06.8908983Z 2025-05-07T19:46:06.8982355Z libcusparse-12.5.4.2 | 118.6 MB | ####3 | 44%  2025-05-07T19:46:06.8982747Z 2025-05-07T19:46:06.8982899Z 2025-05-07T19:46:06.8982903Z 2025-05-07T19:46:06.8982916Z 2025-05-07T19:46:06.9423540Z cuda-nsight-12.6.77 | 113.2 MB | ####9 | 50%  2025-05-07T19:46:06.9423925Z 2025-05-07T19:46:06.9486901Z libcublas-12.6.4.1 | 256.2 MB | ##4 | 24%  2025-05-07T19:46:06.9718900Z nsight-compute-2024. | 443.1 MB | #3 | 13% 2025-05-07T19:46:06.9719195Z 2025-05-07T19:46:06.9721106Z 2025-05-07T19:46:06.9908453Z libcufft-11.3.0.4 | 156.2 MB | ####4 | 45%  2025-05-07T19:46:06.9908841Z 2025-05-07T19:46:06.9908922Z 2025-05-07T19:46:06.9908926Z 2025-05-07T19:46:06.9987441Z libcusparse-12.5.4.2 | 118.6 MB | ####7 | 48%  2025-05-07T19:46:06.9988276Z 2025-05-07T19:46:06.9988281Z 2025-05-07T19:46:06.9988284Z 2025-05-07T19:46:06.9988288Z 2025-05-07T19:46:07.0490607Z cuda-nsight-12.6.77 | 113.2 MB | #####4 | 54%  2025-05-07T19:46:07.0490940Z 2025-05-07T19:46:07.0548616Z libcublas-12.6.4.1 | 256.2 MB | ##6 | 26%  2025-05-07T19:46:07.0816804Z nsight-compute-2024. | 443.1 MB | #4 | 14% 2025-05-07T19:46:07.0817316Z 2025-05-07T19:46:07.0817350Z 2025-05-07T19:46:07.0910136Z libcufft-11.3.0.4 | 156.2 MB | ####8 | 48%  2025-05-07T19:46:07.0910573Z 2025-05-07T19:46:07.0910602Z 2025-05-07T19:46:07.0910608Z 2025-05-07T19:46:07.0987776Z libcusparse-12.5.4.2 | 118.6 MB | #####1 | 52%  2025-05-07T19:46:07.0988184Z 2025-05-07T19:46:07.0988312Z 2025-05-07T19:46:07.0988316Z 2025-05-07T19:46:07.1492118Z 2025-05-07T19:46:07.1492556Z cuda-nsight-12.6.77 | 113.2 MB | #####9 | 59%  2025-05-07T19:46:07.1493186Z 2025-05-07T19:46:07.1820639Z libcublas-12.6.4.1 | 256.2 MB | ##9 | 29%  2025-05-07T19:46:07.1821082Z 2025-05-07T19:46:07.1821172Z 2025-05-07T19:46:07.1913731Z libcufft-11.3.0.4 | 156.2 MB | #####3 | 53%  2025-05-07T19:46:07.1914563Z 2025-05-07T19:46:07.1914579Z 2025-05-07T19:46:07.1914590Z 2025-05-07T19:46:07.1987743Z libcusparse-12.5.4.2 | 118.6 MB | #####7 | 57%  2025-05-07T19:46:07.1988160Z 2025-05-07T19:46:07.1988363Z 2025-05-07T19:46:07.1988367Z 2025-05-07T19:46:07.1988386Z 2025-05-07T19:46:07.2252274Z cuda-nsight-12.6.77 | 113.2 MB | ######4 | 64%  2025-05-07T19:46:07.2517397Z nsight-compute-2024. | 443.1 MB | #5 | 15% 2025-05-07T19:46:07.2519053Z 2025-05-07T19:46:07.3032622Z libcublas-12.6.4.1 | 256.2 MB | ###1 | 31%  2025-05-07T19:46:07.3032967Z 2025-05-07T19:46:07.3032987Z 2025-05-07T19:46:07.3070164Z libcufft-11.3.0.4 | 156.2 MB | #####6 | 57%  2025-05-07T19:46:07.3070433Z 2025-05-07T19:46:07.3070447Z 2025-05-07T19:46:07.3070451Z 2025-05-07T19:46:07.3070454Z 2025-05-07T19:46:07.3075564Z cuda-nsight-12.6.77 | 113.2 MB | ######8 | 69%  2025-05-07T19:46:07.3075851Z 2025-05-07T19:46:07.3076763Z 2025-05-07T19:46:07.3076767Z 2025-05-07T19:46:07.3255480Z libcusparse-12.5.4.2 | 118.6 MB | ######1 | 62%  2025-05-07T19:46:07.3661903Z nsight-compute-2024. | 443.1 MB | #6 | 16% 2025-05-07T19:46:07.3662221Z 2025-05-07T19:46:07.4082395Z libcublas-12.6.4.1 | 256.2 MB | ###3 | 33%  2025-05-07T19:46:07.4082705Z 2025-05-07T19:46:07.4082709Z 2025-05-07T19:46:07.4082713Z 2025-05-07T19:46:07.4082716Z 2025-05-07T19:46:07.4122466Z cuda-nsight-12.6.77 | 113.2 MB | #######3 | 73%  2025-05-07T19:46:07.4122789Z 2025-05-07T19:46:07.4123283Z 2025-05-07T19:46:07.4210464Z libcufft-11.3.0.4 | 156.2 MB | ###### | 60%  2025-05-07T19:46:07.4210811Z 2025-05-07T19:46:07.4210817Z 2025-05-07T19:46:07.4210822Z 2025-05-07T19:46:07.4262150Z libcusparse-12.5.4.2 | 118.6 MB | ######6 | 66%  2025-05-07T19:46:07.4749714Z nsight-compute-2024. | 443.1 MB | #7 | 17% 2025-05-07T19:46:07.4750236Z 2025-05-07T19:46:07.5105528Z libcublas-12.6.4.1 | 256.2 MB | ###5 | 36%  2025-05-07T19:46:07.5106362Z 2025-05-07T19:46:07.5106378Z 2025-05-07T19:46:07.5106389Z 2025-05-07T19:46:07.5106399Z 2025-05-07T19:46:07.5224686Z cuda-nsight-12.6.77 | 113.2 MB | #######7 | 78%  2025-05-07T19:46:07.5225008Z 2025-05-07T19:46:07.5225077Z 2025-05-07T19:46:07.5265214Z libcufft-11.3.0.4 | 156.2 MB | ######3 | 64%  2025-05-07T19:46:07.5325900Z nsight-compute-2024. | 443.1 MB | #8 | 18% 2025-05-07T19:46:07.5326316Z 2025-05-07T19:46:07.5326432Z 2025-05-07T19:46:07.5751253Z 2025-05-07T19:46:07.5752570Z libcusparse-12.5.4.2 | 118.6 MB | ####### | 70%  2025-05-07T19:46:07.5753489Z 2025-05-07T19:46:07.6224654Z libcublas-12.6.4.1 | 256.2 MB | ###7 | 38%  2025-05-07T19:46:07.6225049Z 2025-05-07T19:46:07.6225265Z 2025-05-07T19:46:07.6313325Z libcufft-11.3.0.4 | 156.2 MB | ######7 | 68%  2025-05-07T19:46:07.6330029Z nsight-compute-2024. | 443.1 MB | #9 | 19% 2025-05-07T19:46:07.6330445Z 2025-05-07T19:46:07.6330564Z 2025-05-07T19:46:07.6330568Z 2025-05-07T19:46:07.6374788Z libcusparse-12.5.4.2 | 118.6 MB | #######4 | 75%  2025-05-07T19:46:07.6375114Z 2025-05-07T19:46:07.6375119Z 2025-05-07T19:46:07.6375123Z 2025-05-07T19:46:07.6375127Z 2025-05-07T19:46:07.6754863Z cuda-nsight-12.6.77 | 113.2 MB | ########2 | 82%  2025-05-07T19:46:07.6755199Z 2025-05-07T19:46:07.7238075Z libcublas-12.6.4.1 | 256.2 MB | ###9 | 40%  2025-05-07T19:46:07.7238897Z 2025-05-07T19:46:07.7239012Z 2025-05-07T19:46:07.7335970Z libcufft-11.3.0.4 | 156.2 MB | #######1 | 71%  2025-05-07T19:46:07.7336392Z 2025-05-07T19:46:07.7336652Z 2025-05-07T19:46:07.7336660Z 2025-05-07T19:46:07.7374393Z libcusparse-12.5.4.2 | 118.6 MB | #######8 | 79%  2025-05-07T19:46:07.7374701Z 2025-05-07T19:46:07.7374706Z 2025-05-07T19:46:07.7374725Z 2025-05-07T19:46:07.7374743Z 2025-05-07T19:46:07.7413558Z cuda-nsight-12.6.77 | 113.2 MB | ########6 | 87%  2025-05-07T19:46:07.7878671Z nsight-compute-2024. | 443.1 MB | ## | 20% 2025-05-07T19:46:07.7879571Z 2025-05-07T19:46:07.8332402Z libcublas-12.6.4.1 | 256.2 MB | ####2 | 42%  2025-05-07T19:46:07.8333244Z 2025-05-07T19:46:07.8333258Z 2025-05-07T19:46:07.8377265Z libcufft-11.3.0.4 | 156.2 MB | #######5 | 75%  2025-05-07T19:46:07.8378111Z 2025-05-07T19:46:07.8378126Z 2025-05-07T19:46:07.8378138Z 2025-05-07T19:46:07.8378148Z 2025-05-07T19:46:07.8386355Z cuda-nsight-12.6.77 | 113.2 MB | #########1 | 91%  2025-05-07T19:46:07.8386661Z 2025-05-07T19:46:07.8386665Z 2025-05-07T19:46:07.8386669Z 2025-05-07T19:46:07.8416171Z libcusparse-12.5.4.2 | 118.6 MB | ########2 | 83%  2025-05-07T19:46:07.8923095Z nsight-compute-2024. | 443.1 MB | ##1 | 21% 2025-05-07T19:46:07.8923511Z 2025-05-07T19:46:07.9332403Z libcublas-12.6.4.1 | 256.2 MB | ####4 | 44%  2025-05-07T19:46:07.9333219Z 2025-05-07T19:46:07.9333248Z 2025-05-07T19:46:07.9378387Z libcufft-11.3.0.4 | 156.2 MB | #######8 | 79%  2025-05-07T19:46:07.9378783Z 2025-05-07T19:46:07.9378986Z 2025-05-07T19:46:07.9378990Z 2025-05-07T19:46:07.9386449Z 2025-05-07T19:46:07.9387199Z cuda-nsight-12.6.77 | 113.2 MB | #########6 | 96%  2025-05-07T19:46:07.9387523Z 2025-05-07T19:46:07.9387527Z 2025-05-07T19:46:07.9387847Z 2025-05-07T19:46:07.9701037Z libcusparse-12.5.4.2 | 118.6 MB | ########7 | 87%  2025-05-07T19:46:07.9929384Z nsight-compute-2024. | 443.1 MB | ##2 | 22% 2025-05-07T19:46:07.9929932Z 2025-05-07T19:46:08.0333380Z libcublas-12.6.4.1 | 256.2 MB | ####6 | 47%  2025-05-07T19:46:08.0333870Z 2025-05-07T19:46:08.0334017Z 2025-05-07T19:46:08.0393179Z libcufft-11.3.0.4 | 156.2 MB | ########2 | 82%  2025-05-07T19:46:08.0393480Z 2025-05-07T19:46:08.0393485Z 2025-05-07T19:46:08.0393489Z 2025-05-07T19:46:08.0702942Z libcusparse-12.5.4.2 | 118.6 MB | #########1 | 92%  2025-05-07T19:46:08.0930032Z nsight-compute-2024. | 443.1 MB | ##3 | 24% 2025-05-07T19:46:08.0930469Z 2025-05-07T19:46:08.1338314Z libcublas-12.6.4.1 | 256.2 MB | ####8 | 49%  2025-05-07T19:46:08.1339138Z 2025-05-07T19:46:08.1339153Z 2025-05-07T19:46:08.1393256Z libcufft-11.3.0.4 | 156.2 MB | ########6 | 86%  2025-05-07T19:46:08.1393550Z 2025-05-07T19:46:08.1393555Z 2025-05-07T19:46:08.1393559Z 2025-05-07T19:46:08.1705845Z libcusparse-12.5.4.2 | 118.6 MB | #########6 | 96%  2025-05-07T19:46:08.1931678Z nsight-compute-2024. | 443.1 MB | ##4 | 25% 2025-05-07T19:46:08.1932124Z 2025-05-07T19:46:08.2336960Z libcublas-12.6.4.1 | 256.2 MB | #####1 | 51%  2025-05-07T19:46:08.2337369Z 2025-05-07T19:46:08.2337582Z 2025-05-07T19:46:08.2703758Z libcufft-11.3.0.4 | 156.2 MB | ######### | 90%  2025-05-07T19:46:08.2929411Z nsight-compute-2024. | 443.1 MB | ##6 | 27% 2025-05-07T19:46:08.2929796Z 2025-05-07T19:46:08.3335363Z libcublas-12.6.4.1 | 256.2 MB | #####4 | 54%  2025-05-07T19:46:08.3335733Z 2025-05-07T19:46:08.3335934Z 2025-05-07T19:46:08.3705179Z libcufft-11.3.0.4 | 156.2 MB | #########5 | 95%  2025-05-07T19:46:08.3932046Z nsight-compute-2024. | 443.1 MB | ##8 | 28% 2025-05-07T19:46:08.3932531Z 2025-05-07T19:46:08.4706024Z libcublas-12.6.4.1 | 256.2 MB | #####7 | 57%  2025-05-07T19:46:08.4933246Z nsight-compute-2024. | 443.1 MB | ### | 30% 2025-05-07T19:46:08.4933638Z 2025-05-07T19:46:08.5707738Z libcublas-12.6.4.1 | 256.2 MB | ######1 | 61%  2025-05-07T19:46:08.5933936Z nsight-compute-2024. | 443.1 MB | ###2 | 32% 2025-05-07T19:46:08.5934333Z 2025-05-07T19:46:08.6709231Z libcublas-12.6.4.1 | 256.2 MB | ######6 | 66%  2025-05-07T19:46:08.6937675Z nsight-compute-2024. | 443.1 MB | ###4 | 34% 2025-05-07T19:46:08.6938060Z 2025-05-07T19:46:08.8010917Z libcublas-12.6.4.1 | 256.2 MB | ####### | 71%  2025-05-07T19:46:08.8011303Z 2025-05-07T19:46:08.8747045Z libcublas-12.6.4.1 | 256.2 MB | #######6 | 76%  2025-05-07T19:46:08.9010332Z nsight-compute-2024. | 443.1 MB | ###6 | 36% 2025-05-07T19:46:08.9010736Z 2025-05-07T19:46:08.9423543Z libcublas-12.6.4.1 | 256.2 MB | ######## | 81%  2025-05-07T19:46:08.9423948Z 2025-05-07T19:46:08.9424304Z 2025-05-07T19:46:08.9424332Z 2025-05-07T19:46:08.9424337Z 2025-05-07T19:46:08.9748668Z cuda-nsight-12.6.77 | 113.2 MB | ########## | 100%  2025-05-07T19:46:08.9805416Z nsight-compute-2024. | 443.1 MB | ###8 | 38% 2025-05-07T19:46:08.9805716Z 2025-05-07T19:46:08.9805721Z 2025-05-07T19:46:08.9805725Z 2025-05-07T19:46:08.9805728Z 2025-05-07T19:46:08.9805741Z 2025-05-07T19:46:09.0010742Z cuda-nvvp-12.6.80 | 109.3 MB | | 0%  2025-05-07T19:46:09.0011071Z 2025-05-07T19:46:09.0749012Z libcublas-12.6.4.1 | 256.2 MB | ########5 | 85%  2025-05-07T19:46:09.0807402Z nsight-compute-2024. | 443.1 MB | #### | 40% 2025-05-07T19:46:09.0807740Z 2025-05-07T19:46:09.0807999Z 2025-05-07T19:46:09.0808007Z 2025-05-07T19:46:09.0808014Z 2025-05-07T19:46:09.0808024Z 2025-05-07T19:46:09.1649801Z cuda-nvvp-12.6.80 | 109.3 MB | 5 | 6%  2025-05-07T19:46:09.1650386Z 2025-05-07T19:46:09.1802080Z libcublas-12.6.4.1 | 256.2 MB | ########9 | 90%  2025-05-07T19:46:09.1802404Z 2025-05-07T19:46:09.1802409Z 2025-05-07T19:46:09.1802413Z 2025-05-07T19:46:09.1805345Z libcusparse-12.5.4.2 | 118.6 MB | ########## | 100%  2025-05-07T19:46:09.1805630Z 2025-05-07T19:46:09.1805758Z 2025-05-07T19:46:09.1805762Z 2025-05-07T19:46:09.1805765Z 2025-05-07T19:46:09.1805787Z 2025-05-07T19:46:09.1854680Z cuda-nvvp-12.6.80 | 109.3 MB | #2 | 12%  2025-05-07T19:46:09.2242164Z nsight-compute-2024. | 443.1 MB | ####1 | 42% 2025-05-07T19:46:09.2242466Z 2025-05-07T19:46:09.2242483Z 2025-05-07T19:46:09.2242487Z 2025-05-07T19:46:09.2242491Z 2025-05-07T19:46:09.2242495Z 2025-05-07T19:46:09.2242863Z 2025-05-07T19:46:09.2865534Z libcusolver-11.7.1.2 | 95.8 MB | | 0%  2025-05-07T19:46:09.2866063Z 2025-05-07T19:46:09.2866102Z 2025-05-07T19:46:09.2866122Z 2025-05-07T19:46:09.2866127Z 2025-05-07T19:46:09.2866135Z 2025-05-07T19:46:09.3186918Z cuda-nvvp-12.6.80 | 109.3 MB | #7 | 17%  2025-05-07T19:46:09.3247520Z nsight-compute-2024. | 443.1 MB | ####3 | 43% 2025-05-07T19:46:09.3247844Z 2025-05-07T19:46:09.3247849Z 2025-05-07T19:46:09.3247854Z 2025-05-07T19:46:09.3247858Z 2025-05-07T19:46:09.3247880Z 2025-05-07T19:46:09.3247884Z 2025-05-07T19:46:09.3382118Z libcusolver-11.7.1.2 | 95.8 MB | 6 | 6%  2025-05-07T19:46:09.3382575Z 2025-05-07T19:46:09.4070515Z libcublas-12.6.4.1 | 256.2 MB | #########3 | 94%  2025-05-07T19:46:09.4070914Z 2025-05-07T19:46:09.4071056Z 2025-05-07T19:46:09.4071064Z 2025-05-07T19:46:09.4071069Z 2025-05-07T19:46:09.4071091Z 2025-05-07T19:46:09.4250098Z cuda-nvvp-12.6.80 | 109.3 MB | ##1 | 22%  2025-05-07T19:46:09.4250423Z 2025-05-07T19:46:09.4250557Z 2025-05-07T19:46:09.4250564Z 2025-05-07T19:46:09.4250569Z 2025-05-07T19:46:09.4250574Z 2025-05-07T19:46:09.4250579Z 2025-05-07T19:46:09.4529146Z libcusolver-11.7.1.2 | 95.8 MB | #1 | 12%  2025-05-07T19:46:09.5070795Z nsight-compute-2024. | 443.1 MB | ####5 | 45% 2025-05-07T19:46:09.5071278Z 2025-05-07T19:46:09.5071325Z 2025-05-07T19:46:09.5071357Z 2025-05-07T19:46:09.5071486Z 2025-05-07T19:46:09.5071520Z 2025-05-07T19:46:09.5075171Z cuda-nvvp-12.6.80 | 109.3 MB | ##6 | 27%  2025-05-07T19:46:09.5075485Z 2025-05-07T19:46:09.5250139Z libcublas-12.6.4.1 | 256.2 MB | #########7 | 97%  2025-05-07T19:46:09.5250744Z 2025-05-07T19:46:09.5250785Z 2025-05-07T19:46:09.5250790Z 2025-05-07T19:46:09.5250794Z 2025-05-07T19:46:09.5250798Z 2025-05-07T19:46:09.5250816Z 2025-05-07T19:46:09.5717639Z libcusolver-11.7.1.2 | 95.8 MB | #7 | 17%  2025-05-07T19:46:09.6072045Z nsight-compute-2024. | 443.1 MB | ####6 | 46% 2025-05-07T19:46:09.6072381Z 2025-05-07T19:46:09.6072386Z 2025-05-07T19:46:09.6072391Z 2025-05-07T19:46:09.6072396Z 2025-05-07T19:46:09.6073954Z 2025-05-07T19:46:09.6255244Z cuda-nvvp-12.6.80 | 109.3 MB | ###1 | 31%  2025-05-07T19:46:09.6255627Z 2025-05-07T19:46:09.6255846Z 2025-05-07T19:46:09.6255853Z 2025-05-07T19:46:09.6255858Z 2025-05-07T19:46:09.6257178Z 2025-05-07T19:46:09.6257183Z 2025-05-07T19:46:09.6793827Z libcusolver-11.7.1.2 | 95.8 MB | ##2 | 23%  2025-05-07T19:46:09.7073418Z nsight-compute-2024. | 443.1 MB | ####7 | 48% 2025-05-07T19:46:09.7073743Z 2025-05-07T19:46:09.7073748Z 2025-05-07T19:46:09.7073753Z 2025-05-07T19:46:09.7073773Z 2025-05-07T19:46:09.7073778Z 2025-05-07T19:46:09.7255518Z cuda-nvvp-12.6.80 | 109.3 MB | ###7 | 37%  2025-05-07T19:46:09.7255890Z 2025-05-07T19:46:09.7256118Z 2025-05-07T19:46:09.7256128Z 2025-05-07T19:46:09.7256161Z 2025-05-07T19:46:09.7256166Z 2025-05-07T19:46:09.7256171Z 2025-05-07T19:46:09.7795156Z libcusolver-11.7.1.2 | 95.8 MB | ##9 | 29%  2025-05-07T19:46:09.8073767Z nsight-compute-2024. | 443.1 MB | ####9 | 49% 2025-05-07T19:46:09.8074073Z 2025-05-07T19:46:09.8074078Z 2025-05-07T19:46:09.8074082Z 2025-05-07T19:46:09.8074085Z 2025-05-07T19:46:09.8074672Z 2025-05-07T19:46:09.8256179Z cuda-nvvp-12.6.80 | 109.3 MB | ####2 | 43%  2025-05-07T19:46:09.8256814Z 2025-05-07T19:46:09.8256819Z 2025-05-07T19:46:09.8256823Z 2025-05-07T19:46:09.8256826Z 2025-05-07T19:46:09.8256843Z 2025-05-07T19:46:09.8256847Z 2025-05-07T19:46:09.8798576Z libcusolver-11.7.1.2 | 95.8 MB | ###5 | 35%  2025-05-07T19:46:09.9193609Z nsight-compute-2024. | 443.1 MB | ##### | 51% 2025-05-07T19:46:09.9193927Z 2025-05-07T19:46:09.9193933Z 2025-05-07T19:46:09.9193937Z 2025-05-07T19:46:09.9193941Z 2025-05-07T19:46:09.9193945Z 2025-05-07T19:46:09.9260582Z cuda-nvvp-12.6.80 | 109.3 MB | ####8 | 48%  2025-05-07T19:46:09.9260917Z 2025-05-07T19:46:09.9260922Z 2025-05-07T19:46:09.9260926Z 2025-05-07T19:46:09.9260951Z 2025-05-07T19:46:09.9260955Z 2025-05-07T19:46:09.9262411Z 2025-05-07T19:46:09.9800144Z libcusolver-11.7.1.2 | 95.8 MB | ####2 | 43%  2025-05-07T19:46:10.0196118Z nsight-compute-2024. | 443.1 MB | #####2 | 52% 2025-05-07T19:46:10.0196930Z 2025-05-07T19:46:10.0196979Z 2025-05-07T19:46:10.0196991Z 2025-05-07T19:46:10.0197002Z 2025-05-07T19:46:10.0197013Z 2025-05-07T19:46:10.0262185Z cuda-nvvp-12.6.80 | 109.3 MB | #####4 | 54%  2025-05-07T19:46:10.0262509Z 2025-05-07T19:46:10.0262571Z 2025-05-07T19:46:10.0262575Z 2025-05-07T19:46:10.0262579Z 2025-05-07T19:46:10.0262585Z 2025-05-07T19:46:10.0262621Z 2025-05-07T19:46:10.0878493Z libcusolver-11.7.1.2 | 95.8 MB | ####9 | 49%  2025-05-07T19:46:10.1194823Z nsight-compute-2024. | 443.1 MB | #####3 | 54% 2025-05-07T19:46:10.1195142Z 2025-05-07T19:46:10.1195147Z 2025-05-07T19:46:10.1195152Z 2025-05-07T19:46:10.1195156Z 2025-05-07T19:46:10.1195160Z 2025-05-07T19:46:10.1263030Z cuda-nvvp-12.6.80 | 109.3 MB | ###### | 60%  2025-05-07T19:46:10.1263353Z 2025-05-07T19:46:10.1263509Z 2025-05-07T19:46:10.1263516Z 2025-05-07T19:46:10.1263521Z 2025-05-07T19:46:10.1263526Z 2025-05-07T19:46:10.1263535Z 2025-05-07T19:46:10.1904382Z libcusolver-11.7.1.2 | 95.8 MB | #####5 | 56%  2025-05-07T19:46:10.2212626Z nsight-compute-2024. | 443.1 MB | #####5 | 55% 2025-05-07T19:46:10.2212956Z 2025-05-07T19:46:10.2212961Z 2025-05-07T19:46:10.2212966Z 2025-05-07T19:46:10.2212971Z 2025-05-07T19:46:10.2212976Z 2025-05-07T19:46:10.2264866Z cuda-nvvp-12.6.80 | 109.3 MB | ######5 | 66%  2025-05-07T19:46:10.2265200Z 2025-05-07T19:46:10.2265205Z 2025-05-07T19:46:10.2265208Z 2025-05-07T19:46:10.2265212Z 2025-05-07T19:46:10.2265215Z 2025-05-07T19:46:10.2265219Z 2025-05-07T19:46:10.2606693Z libcusolver-11.7.1.2 | 95.8 MB | ######2 | 62%  2025-05-07T19:46:10.2607652Z 2025-05-07T19:46:10.2607700Z 2025-05-07T19:46:10.2608379Z libcufft-11.3.0.4 | 156.2 MB | ########## | 100%  2025-05-07T19:46:10.2609067Z 2025-05-07T19:46:10.2609071Z 2025-05-07T19:46:10.2911607Z libcufft-11.3.0.4 | 156.2 MB | ########## | 100%  2025-05-07T19:46:10.2945421Z nsight-compute-2024. | 443.1 MB | #####6 | 56% 2025-05-07T19:46:10.2945841Z 2025-05-07T19:46:10.2945970Z 2025-05-07T19:46:10.2945978Z 2025-05-07T19:46:10.2946002Z 2025-05-07T19:46:10.2946018Z 2025-05-07T19:46:10.2946039Z 2025-05-07T19:46:10.2946076Z 2025-05-07T19:46:10.3399265Z libnpp-12.3.1.54 | 93.4 MB | | 0%  2025-05-07T19:46:10.3399589Z 2025-05-07T19:46:10.3399593Z 2025-05-07T19:46:10.3399597Z 2025-05-07T19:46:10.3399600Z 2025-05-07T19:46:10.3399604Z 2025-05-07T19:46:10.3399607Z 2025-05-07T19:46:10.3942753Z libcusolver-11.7.1.2 | 95.8 MB | ######8 | 69%  2025-05-07T19:46:10.3943713Z 2025-05-07T19:46:10.3943728Z 2025-05-07T19:46:10.3944166Z 2025-05-07T19:46:10.3944181Z 2025-05-07T19:46:10.3944192Z 2025-05-07T19:46:10.3947921Z cuda-nvvp-12.6.80 | 109.3 MB | #######1 | 71%  2025-05-07T19:46:10.3948212Z 2025-05-07T19:46:10.3948223Z 2025-05-07T19:46:10.3948226Z 2025-05-07T19:46:10.3948230Z 2025-05-07T19:46:10.3948363Z 2025-05-07T19:46:10.3948368Z 2025-05-07T19:46:10.3949292Z 2025-05-07T19:46:10.4181663Z libnpp-12.3.1.54 | 93.4 MB | 5 | 6%  2025-05-07T19:46:10.4801392Z nsight-compute-2024. | 443.1 MB | #####7 | 58% 2025-05-07T19:46:10.4801731Z 2025-05-07T19:46:10.4801737Z 2025-05-07T19:46:10.4801742Z 2025-05-07T19:46:10.4801747Z 2025-05-07T19:46:10.4801751Z 2025-05-07T19:46:10.4801756Z 2025-05-07T19:46:10.4952614Z libcusolver-11.7.1.2 | 95.8 MB | #######5 | 75%  2025-05-07T19:46:10.4953632Z 2025-05-07T19:46:10.4953655Z 2025-05-07T19:46:10.4953674Z 2025-05-07T19:46:10.4953713Z 2025-05-07T19:46:10.4953730Z 2025-05-07T19:46:10.4953786Z 2025-05-07T19:46:10.4953830Z 2025-05-07T19:46:10.5189207Z libnpp-12.3.1.54 | 93.4 MB | # | 10%  2025-05-07T19:46:10.5190098Z 2025-05-07T19:46:10.5190112Z 2025-05-07T19:46:10.5190124Z 2025-05-07T19:46:10.5190136Z 2025-05-07T19:46:10.5190165Z 2025-05-07T19:46:10.5558021Z cuda-nvvp-12.6.80 | 109.3 MB | #######6 | 76%  2025-05-07T19:46:10.5951850Z nsight-compute-2024. | 443.1 MB | #####9 | 59% 2025-05-07T19:46:10.5952260Z 2025-05-07T19:46:10.5952331Z 2025-05-07T19:46:10.5952336Z 2025-05-07T19:46:10.5952351Z 2025-05-07T19:46:10.5952356Z 2025-05-07T19:46:10.5952362Z 2025-05-07T19:46:10.5952557Z 2025-05-07T19:46:10.6070024Z libnpp-12.3.1.54 | 93.4 MB | #4 | 15%  2025-05-07T19:46:10.6070356Z 2025-05-07T19:46:10.6070360Z 2025-05-07T19:46:10.6070364Z 2025-05-07T19:46:10.6070368Z 2025-05-07T19:46:10.6070371Z 2025-05-07T19:46:10.6070375Z 2025-05-07T19:46:10.6108655Z libcusolver-11.7.1.2 | 95.8 MB | ######## | 81%  2025-05-07T19:46:10.6108986Z 2025-05-07T19:46:10.6108991Z 2025-05-07T19:46:10.6108995Z 2025-05-07T19:46:10.6108998Z 2025-05-07T19:46:10.6300724Z cuda-nsight-12.6.77 | 113.2 MB | ########## | 100%  2025-05-07T19:46:10.6301106Z 2025-05-07T19:46:10.6301205Z 2025-05-07T19:46:10.6301208Z 2025-05-07T19:46:10.6301238Z 2025-05-07T19:46:10.6301241Z 2025-05-07T19:46:10.6826512Z cuda-nvvp-12.6.80 | 109.3 MB | ######## | 81%  2025-05-07T19:46:10.6953749Z nsight-compute-2024. | 443.1 MB | ###### | 60% 2025-05-07T19:46:10.6954049Z 2025-05-07T19:46:10.6954054Z 2025-05-07T19:46:10.6954058Z 2025-05-07T19:46:10.6954075Z 2025-05-07T19:46:10.6954078Z 2025-05-07T19:46:10.6954082Z 2025-05-07T19:46:10.6954085Z 2025-05-07T19:46:10.7221373Z libnpp-12.3.1.54 | 93.4 MB | #9 | 20%  2025-05-07T19:46:10.7221759Z 2025-05-07T19:46:10.7221829Z 2025-05-07T19:46:10.7221834Z 2025-05-07T19:46:10.7221871Z 2025-05-07T19:46:10.7221893Z 2025-05-07T19:46:10.7221907Z 2025-05-07T19:46:10.7306387Z libcusolver-11.7.1.2 | 95.8 MB | ########6 | 86%  2025-05-07T19:46:10.7307368Z 2025-05-07T19:46:10.7307382Z 2025-05-07T19:46:10.7307394Z 2025-05-07T19:46:10.7307405Z 2025-05-07T19:46:10.7307445Z 2025-05-07T19:46:10.7879912Z cuda-nvvp-12.6.80 | 109.3 MB | ########5 | 85%  2025-05-07T19:46:10.7956371Z nsight-compute-2024. | 443.1 MB | ######1 | 61% 2025-05-07T19:46:10.7957180Z 2025-05-07T19:46:10.7957196Z 2025-05-07T19:46:10.7957208Z 2025-05-07T19:46:10.7957219Z 2025-05-07T19:46:10.7957230Z 2025-05-07T19:46:10.7957256Z 2025-05-07T19:46:10.7957267Z 2025-05-07T19:46:10.8307507Z libnpp-12.3.1.54 | 93.4 MB | ##4 | 25%  2025-05-07T19:46:10.8307884Z 2025-05-07T19:46:10.8308038Z 2025-05-07T19:46:10.8308042Z 2025-05-07T19:46:10.8308045Z 2025-05-07T19:46:10.8308050Z 2025-05-07T19:46:10.8330691Z cuda-nvvp-12.6.80 | 109.3 MB | ########9 | 90%  2025-05-07T19:46:10.8331045Z 2025-05-07T19:46:10.8331120Z 2025-05-07T19:46:10.8331125Z 2025-05-07T19:46:10.8331128Z 2025-05-07T19:46:10.8331132Z 2025-05-07T19:46:10.8331152Z 2025-05-07T19:46:10.8951165Z libcusolver-11.7.1.2 | 95.8 MB | #########1 | 91%  2025-05-07T19:46:10.8957512Z nsight-compute-2024. | 443.1 MB | ######2 | 63% 2025-05-07T19:46:10.8958271Z 2025-05-07T19:46:10.8958304Z 2025-05-07T19:46:10.8958316Z 2025-05-07T19:46:10.8958326Z 2025-05-07T19:46:10.8958337Z 2025-05-07T19:46:10.8958347Z 2025-05-07T19:46:10.8960337Z 2025-05-07T19:46:10.9388436Z libnpp-12.3.1.54 | 93.4 MB | ##9 | 30%  2025-05-07T19:46:10.9388751Z 2025-05-07T19:46:10.9388804Z 2025-05-07T19:46:10.9388808Z 2025-05-07T19:46:10.9388882Z 2025-05-07T19:46:10.9388891Z 2025-05-07T19:46:10.9411712Z cuda-nvvp-12.6.80 | 109.3 MB | #########4 | 94%  2025-05-07T19:46:10.9412037Z 2025-05-07T19:46:10.9412042Z 2025-05-07T19:46:10.9412060Z 2025-05-07T19:46:10.9412064Z 2025-05-07T19:46:10.9412067Z 2025-05-07T19:46:10.9412084Z 2025-05-07T19:46:10.9959208Z libcusolver-11.7.1.2 | 95.8 MB | #########6 | 96%  2025-05-07T19:46:10.9959547Z 2025-05-07T19:46:10.9959551Z 2025-05-07T19:46:10.9959556Z 2025-05-07T19:46:10.9959574Z 2025-05-07T19:46:10.9959577Z 2025-05-07T19:46:10.9959581Z 2025-05-07T19:46:10.9959585Z 2025-05-07T19:46:10.9971190Z libnpp-12.3.1.54 | 93.4 MB | ###4 | 35%  2025-05-07T19:46:11.0961064Z nsight-compute-2024. | 443.1 MB | ######3 | 64% 2025-05-07T19:46:11.0961379Z 2025-05-07T19:46:11.0961385Z 2025-05-07T19:46:11.0961388Z 2025-05-07T19:46:11.0961392Z 2025-05-07T19:46:11.0961395Z 2025-05-07T19:46:11.0961399Z 2025-05-07T19:46:11.0961403Z 2025-05-07T19:46:11.0970674Z libnpp-12.3.1.54 | 93.4 MB | ####2 | 42%  2025-05-07T19:46:11.1574596Z nsight-compute-2024. | 443.1 MB | ######5 | 65% 2025-05-07T19:46:11.1574997Z 2025-05-07T19:46:11.1575079Z 2025-05-07T19:46:11.1575092Z 2025-05-07T19:46:11.1575116Z 2025-05-07T19:46:11.1575331Z 2025-05-07T19:46:11.1962316Z cuda-nvvp-12.6.80 | 109.3 MB | #########8 | 99%  2025-05-07T19:46:11.1962641Z 2025-05-07T19:46:11.1962646Z 2025-05-07T19:46:11.1962668Z 2025-05-07T19:46:11.1962672Z 2025-05-07T19:46:11.1962675Z 2025-05-07T19:46:11.1962679Z 2025-05-07T19:46:11.1962695Z 2025-05-07T19:46:11.1970863Z libnpp-12.3.1.54 | 93.4 MB | ##### | 50%  2025-05-07T19:46:11.2963358Z nsight-compute-2024. | 443.1 MB | ######6 | 67% 2025-05-07T19:46:11.2963662Z 2025-05-07T19:46:11.2963667Z 2025-05-07T19:46:11.2963671Z 2025-05-07T19:46:11.2963675Z 2025-05-07T19:46:11.2963679Z 2025-05-07T19:46:11.2963684Z 2025-05-07T19:46:11.2963694Z 2025-05-07T19:46:11.2970850Z libnpp-12.3.1.54 | 93.4 MB | #####8 | 58%  2025-05-07T19:46:11.3967546Z nsight-compute-2024. | 443.1 MB | ######8 | 69% 2025-05-07T19:46:11.3967956Z 2025-05-07T19:46:11.3968023Z 2025-05-07T19:46:11.3968041Z 2025-05-07T19:46:11.3968045Z 2025-05-07T19:46:11.3968049Z 2025-05-07T19:46:11.3968067Z 2025-05-07T19:46:11.3968082Z 2025-05-07T19:46:11.3972717Z libnpp-12.3.1.54 | 93.4 MB | ######6 | 66%  2025-05-07T19:46:11.4967433Z nsight-compute-2024. | 443.1 MB | ####### | 70% 2025-05-07T19:46:11.4967836Z 2025-05-07T19:46:11.4967929Z 2025-05-07T19:46:11.4967933Z 2025-05-07T19:46:11.4967937Z 2025-05-07T19:46:11.4967955Z 2025-05-07T19:46:11.4967959Z 2025-05-07T19:46:11.4967970Z 2025-05-07T19:46:11.4973398Z libnpp-12.3.1.54 | 93.4 MB | #######4 | 74%  2025-05-07T19:46:11.5968507Z nsight-compute-2024. | 443.1 MB | #######2 | 72% 2025-05-07T19:46:11.5968914Z 2025-05-07T19:46:11.5969063Z 2025-05-07T19:46:11.5969068Z 2025-05-07T19:46:11.5969091Z 2025-05-07T19:46:11.5969095Z 2025-05-07T19:46:11.5969099Z 2025-05-07T19:46:11.5969103Z 2025-05-07T19:46:11.5974176Z libnpp-12.3.1.54 | 93.4 MB | ########3 | 84%  2025-05-07T19:46:11.6968501Z nsight-compute-2024. | 443.1 MB | #######3 | 74% 2025-05-07T19:46:11.6968886Z 2025-05-07T19:46:11.6968974Z 2025-05-07T19:46:11.6968978Z 2025-05-07T19:46:11.6968999Z 2025-05-07T19:46:11.6969220Z 2025-05-07T19:46:11.6969257Z 2025-05-07T19:46:11.6969334Z 2025-05-07T19:46:11.6974977Z libnpp-12.3.1.54 | 93.4 MB | #########2 | 93%  2025-05-07T19:46:11.7978247Z nsight-compute-2024. | 443.1 MB | #######5 | 76% 2025-05-07T19:46:11.8978752Z nsight-compute-2024. | 443.1 MB | #######7 | 78% 2025-05-07T19:46:11.9980288Z nsight-compute-2024. | 443.1 MB | ######## | 81% 2025-05-07T19:46:12.1017621Z nsight-compute-2024. | 443.1 MB | ########3 | 83% 2025-05-07T19:46:12.2019552Z nsight-compute-2024. | 443.1 MB | ########5 | 86% 2025-05-07T19:46:12.2393137Z nsight-compute-2024. | 443.1 MB | ########8 | 88% 2025-05-07T19:46:12.2393488Z 2025-05-07T19:46:12.2393523Z 2025-05-07T19:46:12.2393528Z 2025-05-07T19:46:12.2393534Z 2025-05-07T19:46:12.2393538Z 2025-05-07T19:46:12.2393542Z 2025-05-07T19:46:12.2900376Z libcusolver-11.7.1.2 | 95.8 MB | ########## | 100%  2025-05-07T19:46:12.2900893Z 2025-05-07T19:46:12.2900938Z 2025-05-07T19:46:12.2900964Z 2025-05-07T19:46:12.2900968Z 2025-05-07T19:46:12.2900972Z 2025-05-07T19:46:12.2900975Z 2025-05-07T19:46:12.2900978Z 2025-05-07T19:46:12.2900982Z 2025-05-07T19:46:12.3303218Z cuda-nvdisasm-12.6.7 | 47.6 MB | | 0%  2025-05-07T19:46:12.3901046Z nsight-compute-2024. | 443.1 MB | ######### | 91% 2025-05-07T19:46:12.3901435Z 2025-05-07T19:46:12.3901519Z 2025-05-07T19:46:12.3901524Z 2025-05-07T19:46:12.3901544Z 2025-05-07T19:46:12.3901562Z 2025-05-07T19:46:12.3901567Z 2025-05-07T19:46:12.3901582Z 2025-05-07T19:46:12.3901587Z 2025-05-07T19:46:12.4417889Z cuda-nvdisasm-12.6.7 | 47.6 MB | #1 | 12%  2025-05-07T19:46:12.4944820Z nsight-compute-2024. | 443.1 MB | #########3 | 93% 2025-05-07T19:46:12.4945198Z 2025-05-07T19:46:12.4945269Z 2025-05-07T19:46:12.4945273Z 2025-05-07T19:46:12.4945288Z 2025-05-07T19:46:12.4945304Z 2025-05-07T19:46:12.4945309Z 2025-05-07T19:46:12.4945321Z 2025-05-07T19:46:12.4945405Z 2025-05-07T19:46:12.5262341Z cuda-nvdisasm-12.6.7 | 47.6 MB | #8 | 19%  2025-05-07T19:46:12.5262777Z 2025-05-07T19:46:12.5262956Z 2025-05-07T19:46:12.5262963Z 2025-05-07T19:46:12.5262968Z 2025-05-07T19:46:12.5262973Z 2025-05-07T19:46:12.5456745Z cuda-nvvp-12.6.80 | 109.3 MB | ########## | 100%  2025-05-07T19:46:12.5735393Z nsight-compute-2024. | 443.1 MB | #########5 | 95% 2025-05-07T19:46:12.5735807Z 2025-05-07T19:46:12.5735878Z 2025-05-07T19:46:12.5735883Z 2025-05-07T19:46:12.5735898Z 2025-05-07T19:46:12.5735901Z 2025-05-07T19:46:12.5735915Z 2025-05-07T19:46:12.5735927Z 2025-05-07T19:46:12.5735937Z 2025-05-07T19:46:12.5735949Z 2025-05-07T19:46:12.5946517Z libcurand-10.3.7.77 | 39.9 MB | | 0%  2025-05-07T19:46:12.5946928Z 2025-05-07T19:46:12.5947080Z 2025-05-07T19:46:12.5947084Z 2025-05-07T19:46:12.5947212Z 2025-05-07T19:46:12.5947247Z 2025-05-07T19:46:12.5947252Z 2025-05-07T19:46:12.5947278Z 2025-05-07T19:46:12.5947283Z 2025-05-07T19:46:12.6735457Z cuda-nvdisasm-12.6.7 | 47.6 MB | ###2 | 33%  2025-05-07T19:46:12.6736031Z 2025-05-07T19:46:12.6736077Z 2025-05-07T19:46:12.6736081Z 2025-05-07T19:46:12.6736098Z 2025-05-07T19:46:12.6736103Z 2025-05-07T19:46:12.6736114Z 2025-05-07T19:46:12.6736118Z 2025-05-07T19:46:12.6736131Z 2025-05-07T19:46:12.6736147Z 2025-05-07T19:46:12.7001943Z libcurand-10.3.7.77 | 39.9 MB | #6 | 17%  2025-05-07T19:46:12.7737322Z nsight-compute-2024. | 443.1 MB | #########7 | 97% 2025-05-07T19:46:12.7737706Z 2025-05-07T19:46:12.7737770Z 2025-05-07T19:46:12.7737774Z 2025-05-07T19:46:12.7738045Z 2025-05-07T19:46:12.7738050Z 2025-05-07T19:46:12.7738053Z 2025-05-07T19:46:12.7738058Z 2025-05-07T19:46:12.7738062Z 2025-05-07T19:46:12.7738066Z 2025-05-07T19:46:12.8195054Z libcurand-10.3.7.77 | 39.9 MB | ####2 | 42%  2025-05-07T19:46:12.8195381Z 2025-05-07T19:46:12.8195614Z 2025-05-07T19:46:12.8195618Z 2025-05-07T19:46:12.8195621Z 2025-05-07T19:46:12.8195625Z 2025-05-07T19:46:12.8195628Z 2025-05-07T19:46:12.8195632Z 2025-05-07T19:46:12.8195643Z 2025-05-07T19:46:12.8789588Z cuda-nvdisasm-12.6.7 | 47.6 MB | ####2 | 42%  2025-05-07T19:46:12.8869616Z nsight-compute-2024. | 443.1 MB | #########9 | 99% 2025-05-07T19:46:12.8869955Z 2025-05-07T19:46:12.8869961Z 2025-05-07T19:46:12.8869966Z 2025-05-07T19:46:12.8869970Z 2025-05-07T19:46:12.8869975Z 2025-05-07T19:46:12.8869980Z 2025-05-07T19:46:12.8869984Z 2025-05-07T19:46:12.8869989Z 2025-05-07T19:46:12.8870034Z 2025-05-07T19:46:12.9198771Z libcurand-10.3.7.77 | 39.9 MB | #####9 | 59%  2025-05-07T19:46:12.9199111Z 2025-05-07T19:46:12.9199129Z 2025-05-07T19:46:12.9199133Z 2025-05-07T19:46:12.9199137Z 2025-05-07T19:46:12.9199140Z 2025-05-07T19:46:12.9199144Z 2025-05-07T19:46:12.9199148Z 2025-05-07T19:46:12.9199152Z 2025-05-07T19:46:12.9263844Z cuda-nvdisasm-12.6.7 | 47.6 MB | #####1 | 51%  2025-05-07T19:46:12.9265220Z 2025-05-07T19:46:12.9265259Z 2025-05-07T19:46:12.9265271Z 2025-05-07T19:46:12.9906158Z libcusparse-12.5.4.2 | 118.6 MB | ########## | 100%  2025-05-07T19:46:12.9907096Z 2025-05-07T19:46:12.9907111Z 2025-05-07T19:46:12.9907122Z 2025-05-07T19:46:12.9907133Z 2025-05-07T19:46:12.9907144Z 2025-05-07T19:46:12.9907155Z 2025-05-07T19:46:12.9907166Z 2025-05-07T19:46:12.9907176Z 2025-05-07T19:46:12.9907186Z 2025-05-07T19:46:13.0198793Z libcurand-10.3.7.77 | 39.9 MB | #######5 | 75%  2025-05-07T19:46:13.0199150Z 2025-05-07T19:46:13.0199155Z 2025-05-07T19:46:13.0199176Z 2025-05-07T19:46:13.0199180Z 2025-05-07T19:46:13.0199183Z 2025-05-07T19:46:13.0199187Z 2025-05-07T19:46:13.0199191Z 2025-05-07T19:46:13.0199194Z 2025-05-07T19:46:13.0337184Z cuda-nvdisasm-12.6.7 | 47.6 MB | ######3 | 64%  2025-05-07T19:46:13.0337597Z 2025-05-07T19:46:13.0337603Z 2025-05-07T19:46:13.0337606Z 2025-05-07T19:46:13.0337610Z 2025-05-07T19:46:13.0337614Z 2025-05-07T19:46:13.0337617Z 2025-05-07T19:46:13.0337621Z 2025-05-07T19:46:13.0695049Z libnpp-12.3.1.54 | 93.4 MB | ########## | 100%  2025-05-07T19:46:13.0695458Z 2025-05-07T19:46:13.0695609Z 2025-05-07T19:46:13.0695613Z 2025-05-07T19:46:13.0695713Z 2025-05-07T19:46:13.0695806Z 2025-05-07T19:46:13.0695813Z 2025-05-07T19:46:13.0695818Z 2025-05-07T19:46:13.0695823Z 2025-05-07T19:46:13.0695828Z 2025-05-07T19:46:13.0695832Z 2025-05-07T19:46:13.0907621Z gds-tools-1.11.1.6 | 37.8 MB | | 0%  2025-05-07T19:46:13.0907984Z 2025-05-07T19:46:13.0908083Z 2025-05-07T19:46:13.0908087Z 2025-05-07T19:46:13.0908091Z 2025-05-07T19:46:13.0908099Z 2025-05-07T19:46:13.0908103Z 2025-05-07T19:46:13.0908106Z 2025-05-07T19:46:13.0908110Z 2025-05-07T19:46:13.0908129Z 2025-05-07T19:46:13.1199503Z libcurand-10.3.7.77 | 39.9 MB | #########1 | 92%  2025-05-07T19:46:13.1199855Z 2025-05-07T19:46:13.1199859Z 2025-05-07T19:46:13.1199863Z 2025-05-07T19:46:13.1199866Z 2025-05-07T19:46:13.1199870Z 2025-05-07T19:46:13.1199873Z 2025-05-07T19:46:13.1199890Z 2025-05-07T19:46:13.1199893Z 2025-05-07T19:46:13.1698486Z cuda-nvdisasm-12.6.7 | 47.6 MB | #######7 | 78%  2025-05-07T19:46:13.1698903Z 2025-05-07T19:46:13.1699040Z 2025-05-07T19:46:13.1699044Z 2025-05-07T19:46:13.1699047Z 2025-05-07T19:46:13.1699051Z 2025-05-07T19:46:13.1699055Z 2025-05-07T19:46:13.1699058Z 2025-05-07T19:46:13.1699062Z 2025-05-07T19:46:13.1699066Z 2025-05-07T19:46:13.1699087Z 2025-05-07T19:46:13.2328804Z gds-tools-1.11.1.6 | 37.8 MB | #4 | 15%  2025-05-07T19:46:13.2329180Z 2025-05-07T19:46:13.2329283Z 2025-05-07T19:46:13.2329287Z 2025-05-07T19:46:13.2329292Z 2025-05-07T19:46:13.2329309Z 2025-05-07T19:46:13.2329313Z 2025-05-07T19:46:13.2329438Z 2025-05-07T19:46:13.2329442Z 2025-05-07T19:46:13.2700780Z cuda-nvdisasm-12.6.7 | 47.6 MB | ######### | 90%  2025-05-07T19:46:13.2701442Z 2025-05-07T19:46:13.2701455Z 2025-05-07T19:46:13.2701461Z 2025-05-07T19:46:13.2701487Z 2025-05-07T19:46:13.2701511Z 2025-05-07T19:46:13.2701515Z 2025-05-07T19:46:13.2701519Z 2025-05-07T19:46:13.2701526Z 2025-05-07T19:46:13.2701531Z 2025-05-07T19:46:13.2701536Z 2025-05-07T19:46:13.3701721Z gds-tools-1.11.1.6 | 37.8 MB | ###4 | 34%  2025-05-07T19:46:13.3702148Z 2025-05-07T19:46:13.3702297Z 2025-05-07T19:46:13.3702301Z 2025-05-07T19:46:13.3702305Z 2025-05-07T19:46:13.3702309Z 2025-05-07T19:46:13.3702327Z 2025-05-07T19:46:13.3702331Z 2025-05-07T19:46:13.3702335Z 2025-05-07T19:46:13.3702338Z 2025-05-07T19:46:13.3702342Z 2025-05-07T19:46:13.3907602Z gds-tools-1.11.1.6 | 37.8 MB | #####5 | 56%  2025-05-07T19:46:13.3908015Z 2025-05-07T19:46:13.3908074Z 2025-05-07T19:46:13.4704861Z libcufft-11.3.0.4 | 156.2 MB | ########## | 100%  2025-05-07T19:46:13.4705229Z 2025-05-07T19:46:13.4705400Z 2025-05-07T19:46:13.4705404Z 2025-05-07T19:46:13.4705418Z 2025-05-07T19:46:13.4705422Z 2025-05-07T19:46:13.4705471Z 2025-05-07T19:46:13.4705475Z 2025-05-07T19:46:13.4705490Z 2025-05-07T19:46:13.4705493Z 2025-05-07T19:46:13.4705563Z 2025-05-07T19:46:13.6066154Z gds-tools-1.11.1.6 | 37.8 MB | #######8 | 79%  2025-05-07T19:46:13.6066482Z 2025-05-07T19:46:13.6547913Z libcublas-12.6.4.1 | 256.2 MB | ########## | 100%  2025-05-07T19:46:13.6548260Z 2025-05-07T19:46:13.6548265Z 2025-05-07T19:46:13.6548269Z 2025-05-07T19:46:13.6548288Z 2025-05-07T19:46:13.6548292Z 2025-05-07T19:46:13.6548295Z 2025-05-07T19:46:13.6548299Z 2025-05-07T19:46:13.6548302Z 2025-05-07T19:46:13.6548305Z 2025-05-07T19:46:13.6548309Z 2025-05-07T19:46:13.6548312Z 2025-05-07T19:46:13.7133499Z cuda-nvcc-tools-12.6 | 23.0 MB | | 0%  2025-05-07T19:46:13.7133865Z 2025-05-07T19:46:13.7133870Z 2025-05-07T19:46:13.7133873Z 2025-05-07T19:46:13.7133889Z 2025-05-07T19:46:13.7133892Z 2025-05-07T19:46:13.7133896Z 2025-05-07T19:46:13.7133899Z 2025-05-07T19:46:13.7133903Z 2025-05-07T19:46:13.7133906Z 2025-05-07T19:46:13.7547356Z libcurand-10.3.7.77 | 39.9 MB | ########## | 100%  2025-05-07T19:46:13.7547686Z 2025-05-07T19:46:13.7547691Z 2025-05-07T19:46:13.7547695Z 2025-05-07T19:46:13.7547710Z 2025-05-07T19:46:13.7547714Z 2025-05-07T19:46:13.7547717Z 2025-05-07T19:46:13.7547721Z 2025-05-07T19:46:13.7547724Z 2025-05-07T19:46:13.7547728Z 2025-05-07T19:46:13.7547747Z 2025-05-07T19:46:13.7547751Z 2025-05-07T19:46:13.7692007Z cuda-nvcc-tools-12.6 | 23.0 MB | ###8 | 39%  2025-05-07T19:46:13.7692485Z 2025-05-07T19:46:13.7692504Z 2025-05-07T19:46:13.7692509Z 2025-05-07T19:46:13.7692514Z 2025-05-07T19:46:13.7692535Z 2025-05-07T19:46:13.7692539Z 2025-05-07T19:46:13.7692544Z 2025-05-07T19:46:13.7692551Z 2025-05-07T19:46:13.7692584Z 2025-05-07T19:46:13.7692588Z 2025-05-07T19:46:13.7692596Z 2025-05-07T19:46:13.7692600Z 2025-05-07T19:46:13.8551167Z cuda-nvrtc-12.6.85 | 17.3 MB | | 0%  2025-05-07T19:46:13.8551693Z 2025-05-07T19:46:13.8551698Z 2025-05-07T19:46:13.8551701Z 2025-05-07T19:46:13.8551705Z 2025-05-07T19:46:13.8551709Z 2025-05-07T19:46:13.8551712Z 2025-05-07T19:46:13.8551716Z 2025-05-07T19:46:13.8551719Z 2025-05-07T19:46:13.8551723Z 2025-05-07T19:46:13.8551726Z 2025-05-07T19:46:13.8551730Z 2025-05-07T19:46:13.8715440Z cuda-nvcc-tools-12.6 | 23.0 MB | #######3 | 74%  2025-05-07T19:46:13.8715805Z 2025-05-07T19:46:13.8715826Z 2025-05-07T19:46:13.8715830Z 2025-05-07T19:46:13.8715834Z 2025-05-07T19:46:13.8715851Z 2025-05-07T19:46:13.8715855Z 2025-05-07T19:46:13.8715858Z 2025-05-07T19:46:13.8715861Z 2025-05-07T19:46:13.8715978Z 2025-05-07T19:46:13.8715982Z 2025-05-07T19:46:13.8715986Z 2025-05-07T19:46:13.8715992Z 2025-05-07T19:46:13.9715827Z cuda-nvrtc-12.6.85 | 17.3 MB | ##6 | 26%  2025-05-07T19:46:13.9716184Z 2025-05-07T19:46:13.9716190Z 2025-05-07T19:46:13.9716193Z 2025-05-07T19:46:13.9716197Z 2025-05-07T19:46:13.9716201Z 2025-05-07T19:46:13.9716204Z 2025-05-07T19:46:13.9716208Z 2025-05-07T19:46:13.9716211Z 2025-05-07T19:46:13.9716215Z 2025-05-07T19:46:13.9716219Z 2025-05-07T19:46:13.9716222Z 2025-05-07T19:46:13.9716989Z 2025-05-07T19:46:14.0068734Z cuda-nvrtc-12.6.85 | 17.3 MB | ######9 | 69%  2025-05-07T19:46:14.0069108Z 2025-05-07T19:46:14.0069114Z 2025-05-07T19:46:14.0069118Z 2025-05-07T19:46:14.0069122Z 2025-05-07T19:46:14.0069125Z 2025-05-07T19:46:14.0069129Z 2025-05-07T19:46:14.0069132Z 2025-05-07T19:46:14.0069136Z 2025-05-07T19:46:14.0489562Z cuda-nvdisasm-12.6.7 | 47.6 MB | ########## | 100%  2025-05-07T19:46:14.0489922Z 2025-05-07T19:46:14.0489927Z 2025-05-07T19:46:14.0489947Z 2025-05-07T19:46:14.0489951Z 2025-05-07T19:46:14.0489955Z 2025-05-07T19:46:14.0489958Z 2025-05-07T19:46:14.0489962Z 2025-05-07T19:46:14.0489965Z 2025-05-07T19:46:14.0489969Z 2025-05-07T19:46:14.0489972Z 2025-05-07T19:46:14.0489976Z 2025-05-07T19:46:14.0489979Z 2025-05-07T19:46:14.0489983Z 2025-05-07T19:46:14.1114618Z libnvjitlink-12.6.85 | 14.9 MB | | 0%  2025-05-07T19:46:14.1114967Z 2025-05-07T19:46:14.1114972Z 2025-05-07T19:46:14.1114975Z 2025-05-07T19:46:14.1114979Z 2025-05-07T19:46:14.1114983Z 2025-05-07T19:46:14.1115000Z 2025-05-07T19:46:14.1115003Z 2025-05-07T19:46:14.1115007Z 2025-05-07T19:46:14.1115010Z 2025-05-07T19:46:14.1116307Z 2025-05-07T19:46:14.1118798Z gds-tools-1.11.1.6 | 37.8 MB | ########## | 100%  2025-05-07T19:46:14.1119099Z 2025-05-07T19:46:14.1119111Z 2025-05-07T19:46:14.1119123Z 2025-05-07T19:46:14.1119127Z 2025-05-07T19:46:14.1119131Z 2025-05-07T19:46:14.1119134Z 2025-05-07T19:46:14.1119138Z 2025-05-07T19:46:14.1119141Z 2025-05-07T19:46:14.1119144Z 2025-05-07T19:46:14.1120310Z 2025-05-07T19:46:14.1555758Z gds-tools-1.11.1.6 | 37.8 MB | ########## | 100%  2025-05-07T19:46:14.1556098Z 2025-05-07T19:46:14.1556103Z 2025-05-07T19:46:14.1556107Z 2025-05-07T19:46:14.1556110Z 2025-05-07T19:46:14.1556114Z 2025-05-07T19:46:14.1556117Z 2025-05-07T19:46:14.1556121Z 2025-05-07T19:46:14.1556124Z 2025-05-07T19:46:14.1556128Z 2025-05-07T19:46:14.1556145Z 2025-05-07T19:46:14.1556149Z 2025-05-07T19:46:14.1556152Z 2025-05-07T19:46:14.1556169Z 2025-05-07T19:46:14.1556173Z 2025-05-07T19:46:14.1599556Z cuda-nvcc-dev_linux- | 10.8 MB | | 0%  2025-05-07T19:46:14.1599923Z 2025-05-07T19:46:14.1599928Z 2025-05-07T19:46:14.1599946Z 2025-05-07T19:46:14.1599950Z 2025-05-07T19:46:14.1599966Z 2025-05-07T19:46:14.1599970Z 2025-05-07T19:46:14.1599973Z 2025-05-07T19:46:14.1599976Z 2025-05-07T19:46:14.1599980Z 2025-05-07T19:46:14.1599984Z 2025-05-07T19:46:14.1599987Z 2025-05-07T19:46:14.1599991Z 2025-05-07T19:46:14.1599994Z 2025-05-07T19:46:14.2168570Z libnvjitlink-12.6.85 | 14.9 MB | ###7 | 38%  2025-05-07T19:46:14.2168952Z 2025-05-07T19:46:14.2168958Z 2025-05-07T19:46:14.2168962Z 2025-05-07T19:46:14.2168966Z 2025-05-07T19:46:14.2168969Z 2025-05-07T19:46:14.2168973Z 2025-05-07T19:46:14.2168976Z 2025-05-07T19:46:14.2168980Z 2025-05-07T19:46:14.2168983Z 2025-05-07T19:46:14.2168987Z 2025-05-07T19:46:14.2168990Z 2025-05-07T19:46:14.2345793Z cuda-nvcc-tools-12.6 | 23.0 MB | ########## | 100%  2025-05-07T19:46:14.2346178Z 2025-05-07T19:46:14.2346183Z 2025-05-07T19:46:14.2346187Z 2025-05-07T19:46:14.2346190Z 2025-05-07T19:46:14.2346194Z 2025-05-07T19:46:14.2346197Z 2025-05-07T19:46:14.2346317Z 2025-05-07T19:46:14.2346320Z 2025-05-07T19:46:14.2346324Z 2025-05-07T19:46:14.2346327Z 2025-05-07T19:46:14.2346331Z 2025-05-07T19:46:14.2346334Z 2025-05-07T19:46:14.2346637Z cuda-nvrtc-12.6.85 | 17.3 MB | ########## | 100%  2025-05-07T19:46:14.2346960Z 2025-05-07T19:46:14.2346964Z 2025-05-07T19:46:14.2346967Z 2025-05-07T19:46:14.2346971Z 2025-05-07T19:46:14.2346974Z 2025-05-07T19:46:14.2346978Z 2025-05-07T19:46:14.2346981Z 2025-05-07T19:46:14.2346985Z 2025-05-07T19:46:14.2346988Z 2025-05-07T19:46:14.2346992Z 2025-05-07T19:46:14.2346995Z 2025-05-07T19:46:14.2346999Z 2025-05-07T19:46:14.2556147Z cuda-nvrtc-12.6.85 | 17.3 MB | ########## | 100%  2025-05-07T19:46:14.2556510Z 2025-05-07T19:46:14.2556515Z 2025-05-07T19:46:14.2556519Z 2025-05-07T19:46:14.2556523Z 2025-05-07T19:46:14.2556526Z 2025-05-07T19:46:14.2556530Z 2025-05-07T19:46:14.2556533Z 2025-05-07T19:46:14.2556537Z 2025-05-07T19:46:14.2556546Z 2025-05-07T19:46:14.2556550Z 2025-05-07T19:46:14.2556553Z 2025-05-07T19:46:14.2556557Z 2025-05-07T19:46:14.2556560Z 2025-05-07T19:46:14.2556564Z 2025-05-07T19:46:14.2611917Z cuda-nvcc-dev_linux- | 10.8 MB | #######6 | 77%  2025-05-07T19:46:14.2612287Z 2025-05-07T19:46:14.2612292Z 2025-05-07T19:46:14.2612295Z 2025-05-07T19:46:14.2612299Z 2025-05-07T19:46:14.2612302Z 2025-05-07T19:46:14.2612306Z 2025-05-07T19:46:14.2612309Z 2025-05-07T19:46:14.2612313Z 2025-05-07T19:46:14.2612317Z 2025-05-07T19:46:14.2612320Z 2025-05-07T19:46:14.2612337Z 2025-05-07T19:46:14.2612340Z 2025-05-07T19:46:14.2612344Z 2025-05-07T19:46:14.2612347Z 2025-05-07T19:46:14.2612363Z 2025-05-07T19:46:14.2736134Z cuda-nvvm-tools-12.6 | 10.4 MB | | 0%  2025-05-07T19:46:14.2736502Z 2025-05-07T19:46:14.2736507Z 2025-05-07T19:46:14.2736526Z 2025-05-07T19:46:14.2736529Z 2025-05-07T19:46:14.2736533Z 2025-05-07T19:46:14.2736549Z 2025-05-07T19:46:14.2736552Z 2025-05-07T19:46:14.2736556Z 2025-05-07T19:46:14.2736560Z 2025-05-07T19:46:14.2736563Z 2025-05-07T19:46:14.2736567Z 2025-05-07T19:46:14.2736570Z 2025-05-07T19:46:14.2736574Z 2025-05-07T19:46:14.2736578Z 2025-05-07T19:46:14.2736581Z 2025-05-07T19:46:14.2736585Z 2025-05-07T19:46:14.2758838Z cuda-sanitizer-api-1 | 8.9 MB | | 0%  2025-05-07T19:46:14.2759222Z 2025-05-07T19:46:14.2759225Z 2025-05-07T19:46:14.2759229Z 2025-05-07T19:46:14.2759232Z 2025-05-07T19:46:14.2759236Z 2025-05-07T19:46:14.2759239Z 2025-05-07T19:46:14.2759242Z 2025-05-07T19:46:14.2759246Z 2025-05-07T19:46:14.2759249Z 2025-05-07T19:46:14.2759261Z 2025-05-07T19:46:14.2759265Z 2025-05-07T19:46:14.2759268Z 2025-05-07T19:46:14.2759276Z 2025-05-07T19:46:14.3615266Z libnvjitlink-12.6.85 | 14.9 MB | #####7 | 58%  2025-05-07T19:46:14.3615637Z 2025-05-07T19:46:14.3615642Z 2025-05-07T19:46:14.3615660Z 2025-05-07T19:46:14.3615664Z 2025-05-07T19:46:14.3615668Z 2025-05-07T19:46:14.3615672Z 2025-05-07T19:46:14.3615675Z 2025-05-07T19:46:14.3615679Z 2025-05-07T19:46:14.3615683Z 2025-05-07T19:46:14.3615687Z 2025-05-07T19:46:14.3615691Z 2025-05-07T19:46:14.3615708Z 2025-05-07T19:46:14.3615712Z 2025-05-07T19:46:14.3615716Z 2025-05-07T19:46:14.3615719Z 2025-05-07T19:46:14.3739918Z cuda-nvvm-tools-12.6 | 10.4 MB | #####5 | 56%  2025-05-07T19:46:14.3740289Z 2025-05-07T19:46:14.3740294Z 2025-05-07T19:46:14.3740298Z 2025-05-07T19:46:14.3740314Z 2025-05-07T19:46:14.3740318Z 2025-05-07T19:46:14.3740322Z 2025-05-07T19:46:14.3740529Z 2025-05-07T19:46:14.3740534Z 2025-05-07T19:46:14.3740538Z 2025-05-07T19:46:14.3740541Z 2025-05-07T19:46:14.3740545Z 2025-05-07T19:46:14.3740548Z 2025-05-07T19:46:14.3740552Z 2025-05-07T19:46:14.3740555Z 2025-05-07T19:46:14.3740559Z 2025-05-07T19:46:14.3740563Z 2025-05-07T19:46:14.3809202Z cuda-sanitizer-api-1 | 8.9 MB | #######3 | 74%  2025-05-07T19:46:14.3809812Z 2025-05-07T19:46:14.3809818Z 2025-05-07T19:46:14.3809822Z 2025-05-07T19:46:14.3809826Z 2025-05-07T19:46:14.3809829Z 2025-05-07T19:46:14.3809833Z 2025-05-07T19:46:14.3809836Z 2025-05-07T19:46:14.3809840Z 2025-05-07T19:46:14.3809843Z 2025-05-07T19:46:14.3809847Z 2025-05-07T19:46:14.3809850Z 2025-05-07T19:46:14.3809854Z 2025-05-07T19:46:14.3809857Z 2025-05-07T19:46:14.5241804Z libnvjitlink-12.6.85 | 14.9 MB | #######8 | 78%  2025-05-07T19:46:14.5242165Z 2025-05-07T19:46:14.5242170Z 2025-05-07T19:46:14.5242174Z 2025-05-07T19:46:14.5242195Z 2025-05-07T19:46:14.5242198Z 2025-05-07T19:46:14.5242202Z 2025-05-07T19:46:14.5242205Z 2025-05-07T19:46:14.5242209Z 2025-05-07T19:46:14.5242212Z 2025-05-07T19:46:14.5242216Z 2025-05-07T19:46:14.5242220Z 2025-05-07T19:46:14.5242224Z 2025-05-07T19:46:14.5242228Z 2025-05-07T19:46:14.5242257Z 2025-05-07T19:46:14.5242260Z 2025-05-07T19:46:14.5242403Z 2025-05-07T19:46:14.5442260Z cuda-sanitizer-api-1 | 8.9 MB | ########## | 100%  2025-05-07T19:46:14.5442648Z 2025-05-07T19:46:14.5442654Z 2025-05-07T19:46:14.5442657Z 2025-05-07T19:46:14.5442675Z 2025-05-07T19:46:14.5442678Z 2025-05-07T19:46:14.5442682Z 2025-05-07T19:46:14.5442685Z 2025-05-07T19:46:14.5442689Z 2025-05-07T19:46:14.5442692Z 2025-05-07T19:46:14.5442696Z 2025-05-07T19:46:14.5442700Z 2025-05-07T19:46:14.5442703Z 2025-05-07T19:46:14.5442707Z 2025-05-07T19:46:14.5442710Z 2025-05-07T19:46:14.5554369Z cuda-nvcc-dev_linux- | 10.8 MB | ########## | 100%  2025-05-07T19:46:14.5554761Z 2025-05-07T19:46:14.5554765Z 2025-05-07T19:46:14.5554769Z 2025-05-07T19:46:14.5554772Z 2025-05-07T19:46:14.5554776Z 2025-05-07T19:46:14.5554779Z 2025-05-07T19:46:14.5554783Z 2025-05-07T19:46:14.5554787Z 2025-05-07T19:46:14.5554790Z 2025-05-07T19:46:14.5554800Z 2025-05-07T19:46:14.5554803Z 2025-05-07T19:46:14.5554807Z 2025-05-07T19:46:14.5554810Z 2025-05-07T19:46:14.5554814Z 2025-05-07T19:46:14.5554817Z 2025-05-07T19:46:14.5571790Z cuda-nvvm-tools-12.6 | 10.4 MB | ########## | 100%  2025-05-07T19:46:14.5572209Z 2025-05-07T19:46:14.5572214Z 2025-05-07T19:46:14.5572230Z 2025-05-07T19:46:14.5572234Z 2025-05-07T19:46:14.5572237Z 2025-05-07T19:46:14.5572241Z 2025-05-07T19:46:14.5572244Z 2025-05-07T19:46:14.5572248Z 2025-05-07T19:46:14.5572251Z 2025-05-07T19:46:14.5572255Z 2025-05-07T19:46:14.5572258Z 2025-05-07T19:46:14.5572262Z 2025-05-07T19:46:14.5572265Z 2025-05-07T19:46:14.5572269Z 2025-05-07T19:46:14.5572285Z 2025-05-07T19:46:14.5756192Z cuda-nvvm-tools-12.6 | 10.4 MB | ########## | 100%  2025-05-07T19:46:14.5756614Z 2025-05-07T19:46:14.5756620Z 2025-05-07T19:46:14.5756623Z 2025-05-07T19:46:14.5756627Z 2025-05-07T19:46:14.5756645Z 2025-05-07T19:46:14.5756648Z 2025-05-07T19:46:14.5756652Z 2025-05-07T19:46:14.5756656Z 2025-05-07T19:46:14.5756659Z 2025-05-07T19:46:14.5756663Z 2025-05-07T19:46:14.5756666Z 2025-05-07T19:46:14.5756670Z 2025-05-07T19:46:14.5756673Z 2025-05-07T19:46:14.5756677Z 2025-05-07T19:46:14.5756680Z 2025-05-07T19:46:14.5756684Z 2025-05-07T19:46:14.5756687Z 2025-05-07T19:46:14.5756708Z 2025-05-07T19:46:14.5891509Z cuda-cupti-dev-12.6. | 3.4 MB | | 0%  2025-05-07T19:46:14.5891878Z 2025-05-07T19:46:14.5892044Z 2025-05-07T19:46:14.5892052Z 2025-05-07T19:46:14.5892057Z 2025-05-07T19:46:14.5892062Z 2025-05-07T19:46:14.5892067Z 2025-05-07T19:46:14.5892267Z 2025-05-07T19:46:14.5892272Z 2025-05-07T19:46:14.5892276Z 2025-05-07T19:46:14.5892281Z 2025-05-07T19:46:14.5892285Z 2025-05-07T19:46:14.5892290Z 2025-05-07T19:46:14.5892294Z 2025-05-07T19:46:14.5892299Z 2025-05-07T19:46:14.5892304Z 2025-05-07T19:46:14.5892308Z 2025-05-07T19:46:14.5892414Z 2025-05-07T19:46:14.5996529Z cuda-nvvm-impl-12.6. | 7.7 MB | | 0%  2025-05-07T19:46:14.5996895Z 2025-05-07T19:46:14.5996899Z 2025-05-07T19:46:14.5996903Z 2025-05-07T19:46:14.5996907Z 2025-05-07T19:46:14.5996910Z 2025-05-07T19:46:14.5996914Z 2025-05-07T19:46:14.5996917Z 2025-05-07T19:46:14.5996921Z 2025-05-07T19:46:14.5996925Z 2025-05-07T19:46:14.5996928Z 2025-05-07T19:46:14.5996932Z 2025-05-07T19:46:14.5996935Z 2025-05-07T19:46:14.5996939Z 2025-05-07T19:46:14.5996942Z 2025-05-07T19:46:14.5996945Z 2025-05-07T19:46:14.5996949Z 2025-05-07T19:46:14.5996952Z 2025-05-07T19:46:14.5996956Z 2025-05-07T19:46:14.5996959Z 2025-05-07T19:46:14.6302845Z ... (more hidden) ... 2025-05-07T19:46:14.6303167Z 2025-05-07T19:46:14.6303172Z 2025-05-07T19:46:14.6303176Z 2025-05-07T19:46:14.6303180Z 2025-05-07T19:46:14.6303184Z 2025-05-07T19:46:14.6303188Z 2025-05-07T19:46:14.6303202Z 2025-05-07T19:46:14.6303206Z 2025-05-07T19:46:14.6303223Z 2025-05-07T19:46:14.6303227Z 2025-05-07T19:46:14.6303230Z 2025-05-07T19:46:14.6303234Z 2025-05-07T19:46:14.6303354Z 2025-05-07T19:46:14.6303931Z libnvjitlink-12.6.85 | 14.9 MB | ########## | 100%  2025-05-07T19:46:14.6304367Z 2025-05-07T19:46:14.6304372Z 2025-05-07T19:46:14.6304375Z 2025-05-07T19:46:14.6304379Z 2025-05-07T19:46:14.6304383Z 2025-05-07T19:46:14.6304387Z 2025-05-07T19:46:14.6304391Z 2025-05-07T19:46:14.6304395Z 2025-05-07T19:46:14.6304399Z 2025-05-07T19:46:14.6304402Z 2025-05-07T19:46:14.6304406Z 2025-05-07T19:46:14.6304410Z 2025-05-07T19:46:14.6304413Z 2025-05-07T19:46:14.6630435Z libnvjitlink-12.6.85 | 14.9 MB | ########## | 100%  2025-05-07T19:46:14.6630803Z 2025-05-07T19:46:14.6630808Z 2025-05-07T19:46:14.6630812Z 2025-05-07T19:46:14.6630815Z 2025-05-07T19:46:14.6630819Z 2025-05-07T19:46:14.6630823Z 2025-05-07T19:46:14.6630846Z 2025-05-07T19:46:14.6630850Z 2025-05-07T19:46:14.6630854Z 2025-05-07T19:46:14.6630857Z 2025-05-07T19:46:14.6630861Z 2025-05-07T19:46:14.6630864Z 2025-05-07T19:46:14.6630867Z 2025-05-07T19:46:14.6630871Z 2025-05-07T19:46:14.6630874Z 2025-05-07T19:46:14.6630878Z 2025-05-07T19:46:14.6630881Z 2025-05-07T19:46:14.6630885Z 2025-05-07T19:46:14.6675786Z cuda-cupti-dev-12.6. | 3.4 MB | ########## | 100%  2025-05-07T19:46:14.6676160Z 2025-05-07T19:46:14.6676165Z 2025-05-07T19:46:14.6676168Z 2025-05-07T19:46:14.6676172Z 2025-05-07T19:46:14.6676175Z 2025-05-07T19:46:14.6676179Z 2025-05-07T19:46:14.6676182Z 2025-05-07T19:46:14.6676199Z 2025-05-07T19:46:14.6676214Z 2025-05-07T19:46:14.6676218Z 2025-05-07T19:46:14.6676221Z 2025-05-07T19:46:14.6676225Z 2025-05-07T19:46:14.6676228Z 2025-05-07T19:46:14.6676232Z 2025-05-07T19:46:14.6676235Z 2025-05-07T19:46:14.6676239Z 2025-05-07T19:46:14.6676242Z 2025-05-07T19:46:14.6676246Z 2025-05-07T19:46:14.6676257Z 2025-05-07T19:46:14.6894219Z ... (more hidden) ... 2025-05-07T19:46:14.6894742Z 2025-05-07T19:46:14.6894751Z 2025-05-07T19:46:14.6894772Z 2025-05-07T19:46:14.6894803Z 2025-05-07T19:46:14.6894807Z 2025-05-07T19:46:14.6894812Z 2025-05-07T19:46:14.6894817Z 2025-05-07T19:46:14.6894822Z 2025-05-07T19:46:14.6894826Z 2025-05-07T19:46:14.6894831Z 2025-05-07T19:46:14.6894836Z 2025-05-07T19:46:14.6894840Z 2025-05-07T19:46:14.6894845Z 2025-05-07T19:46:14.6894849Z 2025-05-07T19:46:14.6894854Z 2025-05-07T19:46:14.6894859Z 2025-05-07T19:46:14.6894866Z 2025-05-07T19:46:14.8019849Z cuda-nvvm-impl-12.6. | 7.7 MB | ######## | 80%  2025-05-07T19:46:14.8020254Z 2025-05-07T19:46:14.8020259Z 2025-05-07T19:46:14.8020262Z 2025-05-07T19:46:14.8020266Z 2025-05-07T19:46:14.8020270Z 2025-05-07T19:46:14.8020273Z 2025-05-07T19:46:14.8020277Z 2025-05-07T19:46:14.8020280Z 2025-05-07T19:46:14.8020405Z 2025-05-07T19:46:14.8020409Z 2025-05-07T19:46:14.8020412Z 2025-05-07T19:46:14.8020416Z 2025-05-07T19:46:14.8020419Z 2025-05-07T19:46:14.8020423Z 2025-05-07T19:46:14.8020426Z 2025-05-07T19:46:14.8020447Z 2025-05-07T19:46:14.8020450Z 2025-05-07T19:46:14.8233274Z cuda-nvvm-impl-12.6. | 7.7 MB | ########## | 100%  2025-05-07T19:46:14.8233650Z 2025-05-07T19:46:14.8233656Z 2025-05-07T19:46:14.8233660Z 2025-05-07T19:46:14.8233664Z 2025-05-07T19:46:14.8233667Z 2025-05-07T19:46:14.8233671Z 2025-05-07T19:46:15.1569369Z libcusolver-11.7.1.2 | 95.8 MB | ########## | 100%  2025-05-07T19:46:15.1569735Z 2025-05-07T19:46:15.1569758Z 2025-05-07T19:46:15.1569762Z 2025-05-07T19:46:15.1569766Z 2025-05-07T19:46:15.1569770Z 2025-05-07T19:46:15.4465682Z cuda-nvvp-12.6.80 | 109.3 MB | ########## | 100%  2025-05-07T19:46:15.4466130Z 2025-05-07T19:46:15.4466202Z 2025-05-07T19:46:15.4466212Z 2025-05-07T19:46:15.4466257Z 2025-05-07T19:46:15.4466261Z 2025-05-07T19:46:15.4466282Z 2025-05-07T19:46:15.4466288Z 2025-05-07T19:46:15.5073288Z libnpp-12.3.1.54 | 93.4 MB | ########## | 100%  2025-05-07T19:46:15.5073600Z 2025-05-07T19:46:15.5073605Z 2025-05-07T19:46:15.5073608Z 2025-05-07T19:46:15.5073612Z 2025-05-07T19:46:15.5073616Z 2025-05-07T19:46:15.5073619Z 2025-05-07T19:46:15.5073623Z 2025-05-07T19:46:15.5073627Z 2025-05-07T19:46:15.5073630Z 2025-05-07T19:46:15.6768770Z libcurand-10.3.7.77 | 39.9 MB | ########## | 100%  2025-05-07T19:46:15.6769107Z 2025-05-07T19:46:15.6769112Z 2025-05-07T19:46:15.6769116Z 2025-05-07T19:46:15.6769119Z 2025-05-07T19:46:15.6769138Z 2025-05-07T19:46:15.6769142Z 2025-05-07T19:46:15.6769145Z 2025-05-07T19:46:15.6769165Z 2025-05-07T19:46:15.7836759Z cuda-nvdisasm-12.6.7 | 47.6 MB | ########## | 100%  2025-05-07T19:46:15.7837108Z 2025-05-07T19:46:15.7837113Z 2025-05-07T19:46:15.7837131Z 2025-05-07T19:46:15.7837135Z 2025-05-07T19:46:15.7837139Z 2025-05-07T19:46:15.7837143Z 2025-05-07T19:46:15.7837160Z 2025-05-07T19:46:15.7837164Z 2025-05-07T19:46:15.7837167Z 2025-05-07T19:46:15.7837171Z 2025-05-07T19:46:16.0197152Z gds-tools-1.11.1.6 | 37.8 MB | ########## | 100%  2025-05-07T19:46:16.0197507Z 2025-05-07T19:46:16.0197512Z 2025-05-07T19:46:16.0197515Z 2025-05-07T19:46:16.0197519Z 2025-05-07T19:46:16.0197522Z 2025-05-07T19:46:16.0197526Z 2025-05-07T19:46:16.0197530Z 2025-05-07T19:46:16.0197548Z 2025-05-07T19:46:16.0197551Z 2025-05-07T19:46:16.0197554Z 2025-05-07T19:46:16.0197558Z 2025-05-07T19:46:16.0197562Z 2025-05-07T19:46:16.0339155Z cuda-nvrtc-12.6.85 | 17.3 MB | ########## | 100%  2025-05-07T19:46:16.0339506Z 2025-05-07T19:46:16.0339511Z 2025-05-07T19:46:16.0339528Z 2025-05-07T19:46:16.0339532Z 2025-05-07T19:46:16.0339536Z 2025-05-07T19:46:16.0339539Z 2025-05-07T19:46:16.0339543Z 2025-05-07T19:46:16.0339554Z 2025-05-07T19:46:16.0339558Z 2025-05-07T19:46:16.0339562Z 2025-05-07T19:46:16.0339565Z 2025-05-07T19:46:16.2056390Z cuda-nvcc-tools-12.6 | 23.0 MB | ########## | 100%  2025-05-07T19:46:16.2056752Z 2025-05-07T19:46:16.2056758Z 2025-05-07T19:46:16.2056762Z 2025-05-07T19:46:16.2056766Z 2025-05-07T19:46:16.2056769Z 2025-05-07T19:46:16.2056773Z 2025-05-07T19:46:16.2056777Z 2025-05-07T19:46:16.2056780Z 2025-05-07T19:46:16.2056784Z 2025-05-07T19:46:16.2056787Z 2025-05-07T19:46:16.2056791Z 2025-05-07T19:46:16.2056794Z 2025-05-07T19:46:16.2056798Z 2025-05-07T19:46:16.2056814Z 2025-05-07T19:46:16.2056818Z 2025-05-07T19:46:16.2056821Z 2025-05-07T19:46:16.2639166Z cuda-sanitizer-api-1 | 8.9 MB | ########## | 100%  2025-05-07T19:46:16.2639573Z 2025-05-07T19:46:16.2639578Z 2025-05-07T19:46:16.2639582Z 2025-05-07T19:46:16.2639586Z 2025-05-07T19:46:16.2639603Z 2025-05-07T19:46:16.2639721Z 2025-05-07T19:46:16.2639725Z 2025-05-07T19:46:16.2639728Z 2025-05-07T19:46:16.2639732Z 2025-05-07T19:46:16.2639735Z 2025-05-07T19:46:16.2639739Z 2025-05-07T19:46:16.2639742Z 2025-05-07T19:46:16.2639746Z 2025-05-07T19:46:16.2639749Z 2025-05-07T19:46:16.3633734Z cuda-nvcc-dev_linux- | 10.8 MB | ########## | 100%  2025-05-07T19:46:16.3634125Z 2025-05-07T19:46:16.3634129Z 2025-05-07T19:46:16.3634133Z 2025-05-07T19:46:16.3634137Z 2025-05-07T19:46:16.3634141Z 2025-05-07T19:46:16.3634144Z 2025-05-07T19:46:16.3634148Z 2025-05-07T19:46:16.3634151Z 2025-05-07T19:46:16.3634155Z 2025-05-07T19:46:16.3634159Z 2025-05-07T19:46:16.3634162Z 2025-05-07T19:46:16.3634178Z 2025-05-07T19:46:16.3634182Z 2025-05-07T19:46:16.3634185Z 2025-05-07T19:46:16.3634188Z 2025-05-07T19:46:16.4671027Z cuda-nvvm-tools-12.6 | 10.4 MB | ########## | 100%  2025-05-07T19:46:16.4671508Z 2025-05-07T19:46:16.4671512Z 2025-05-07T19:46:16.4671530Z 2025-05-07T19:46:16.4671534Z 2025-05-07T19:46:16.4671537Z 2025-05-07T19:46:16.4671541Z 2025-05-07T19:46:16.4671544Z 2025-05-07T19:46:16.4671548Z 2025-05-07T19:46:16.4671551Z 2025-05-07T19:46:16.4671555Z 2025-05-07T19:46:16.4671558Z 2025-05-07T19:46:16.4671562Z 2025-05-07T19:46:16.4671566Z 2025-05-07T19:46:16.4671569Z 2025-05-07T19:46:16.4671573Z 2025-05-07T19:46:16.4671576Z 2025-05-07T19:46:16.4671580Z 2025-05-07T19:46:16.4671583Z 2025-05-07T19:46:16.4671948Z cuda-cupti-dev-12.6. | 3.4 MB | ########## | 100%  2025-05-07T19:46:16.4672430Z 2025-05-07T19:46:16.4672434Z 2025-05-07T19:46:16.4672437Z 2025-05-07T19:46:16.4672441Z 2025-05-07T19:46:16.4672449Z 2025-05-07T19:46:16.4672453Z 2025-05-07T19:46:16.4672457Z 2025-05-07T19:46:16.4672460Z 2025-05-07T19:46:16.4672463Z 2025-05-07T19:46:16.4672467Z 2025-05-07T19:46:16.4672470Z 2025-05-07T19:46:16.4672474Z 2025-05-07T19:46:16.4672478Z 2025-05-07T19:46:16.4672481Z 2025-05-07T19:46:16.4672505Z 2025-05-07T19:46:16.4672508Z 2025-05-07T19:46:16.4672512Z 2025-05-07T19:46:16.4672515Z 2025-05-07T19:46:16.4799509Z cuda-cupti-dev-12.6. | 3.4 MB | ########## | 100%  2025-05-07T19:46:16.4799893Z 2025-05-07T19:46:16.4799898Z 2025-05-07T19:46:16.4799901Z 2025-05-07T19:46:16.4799921Z 2025-05-07T19:46:16.4799925Z 2025-05-07T19:46:16.4799928Z 2025-05-07T19:46:16.4799932Z 2025-05-07T19:46:16.4799935Z 2025-05-07T19:46:16.4799939Z 2025-05-07T19:46:16.4799943Z 2025-05-07T19:46:16.4799946Z 2025-05-07T19:46:16.4799950Z 2025-05-07T19:46:16.4799954Z 2025-05-07T19:46:16.5282975Z libnvjitlink-12.6.85 | 14.9 MB | ########## | 100%  2025-05-07T19:46:16.5283362Z 2025-05-07T19:46:16.5283367Z 2025-05-07T19:46:16.5283371Z 2025-05-07T19:46:16.5283374Z 2025-05-07T19:46:16.5283378Z 2025-05-07T19:46:16.5283381Z 2025-05-07T19:46:16.5283385Z 2025-05-07T19:46:16.5283388Z 2025-05-07T19:46:16.5283397Z 2025-05-07T19:46:16.5283401Z 2025-05-07T19:46:16.5283404Z 2025-05-07T19:46:16.5283408Z 2025-05-07T19:46:16.5283411Z 2025-05-07T19:46:16.5283415Z 2025-05-07T19:46:16.5283418Z 2025-05-07T19:46:16.5283422Z 2025-05-07T19:46:16.5283425Z 2025-05-07T19:46:16.5283429Z 2025-05-07T19:46:16.5283432Z 2025-05-07T19:46:16.5283701Z ... (more hidden) ... 2025-05-07T19:46:16.5283995Z 2025-05-07T19:46:16.5283999Z 2025-05-07T19:46:16.5284002Z 2025-05-07T19:46:16.5284005Z 2025-05-07T19:46:16.5284009Z 2025-05-07T19:46:16.5284012Z 2025-05-07T19:46:16.5284016Z 2025-05-07T19:46:16.5284019Z 2025-05-07T19:46:16.5284023Z 2025-05-07T19:46:16.5284026Z 2025-05-07T19:46:16.5284211Z 2025-05-07T19:46:16.5284215Z 2025-05-07T19:46:16.5284219Z 2025-05-07T19:46:16.5284222Z 2025-05-07T19:46:16.5284226Z 2025-05-07T19:46:16.5284230Z 2025-05-07T19:46:16.5284233Z 2025-05-07T19:46:16.5284236Z 2025-05-07T19:46:16.5284240Z 2025-05-07T19:46:16.7147445Z ... (more hidden) ... 2025-05-07T19:46:16.8367919Z nsight-compute-2024. | 443.1 MB | ########## | 100% 2025-05-07T19:46:16.8368768Z 2025-05-07T19:46:16.8368783Z 2025-05-07T19:46:16.8368794Z 2025-05-07T19:46:16.8368804Z 2025-05-07T19:46:16.8368815Z 2025-05-07T19:46:16.8368826Z 2025-05-07T19:46:16.8368837Z 2025-05-07T19:46:16.8368848Z 2025-05-07T19:46:16.8368858Z 2025-05-07T19:46:16.8368869Z 2025-05-07T19:46:16.8368880Z 2025-05-07T19:46:16.8368914Z 2025-05-07T19:46:16.8368925Z 2025-05-07T19:46:16.8368935Z 2025-05-07T19:46:16.8368945Z 2025-05-07T19:46:16.8368956Z 2025-05-07T19:46:16.8368966Z 2025-05-07T19:46:18.0434921Z cuda-nvvm-impl-12.6. | 7.7 MB | ########## | 100%  2025-05-07T19:46:18.0436002Z 2025-05-07T19:46:20.6621053Z libcublas-12.6.4.1 | 256.2 MB | ########## | 100%  2025-05-07T19:46:20.6627743Z nsight-compute-2024. | 443.1 MB | ########## | 100% 2025-05-07T19:46:20.6628622Z 2025-05-07T19:46:20.6628638Z 2025-05-07T19:46:20.6628650Z 2025-05-07T19:46:20.6628661Z 2025-05-07T19:46:20.6628672Z 2025-05-07T19:46:20.6628683Z 2025-05-07T19:46:20.6628694Z 2025-05-07T19:46:20.6628705Z 2025-05-07T19:46:20.6628715Z 2025-05-07T19:46:20.6628726Z 2025-05-07T19:46:20.6628736Z 2025-05-07T19:46:20.6628747Z 2025-05-07T19:46:20.6628757Z 2025-05-07T19:46:20.6628768Z 2025-05-07T19:46:20.6628778Z 2025-05-07T19:46:20.6628789Z 2025-05-07T19:46:20.6628799Z 2025-05-07T19:46:20.6628810Z 2025-05-07T19:46:20.6628820Z 2025-05-07T19:46:20.6629129Z 2025-05-07T19:46:20.6630204Z  2025-05-07T19:46:20.6631091Z 2025-05-07T19:46:20.6631305Z 2025-05-07T19:46:20.6631674Z  2025-05-07T19:46:20.6631895Z 2025-05-07T19:46:20.6631899Z 2025-05-07T19:46:20.6632084Z  2025-05-07T19:46:20.6632342Z 2025-05-07T19:46:20.6632346Z 2025-05-07T19:46:20.6632350Z 2025-05-07T19:46:20.6632531Z  2025-05-07T19:46:20.6632770Z 2025-05-07T19:46:20.6632774Z 2025-05-07T19:46:20.6632777Z 2025-05-07T19:46:20.6632780Z 2025-05-07T19:46:20.6632979Z  2025-05-07T19:46:20.6633202Z 2025-05-07T19:46:20.6633206Z 2025-05-07T19:46:20.6633210Z 2025-05-07T19:46:20.6633214Z 2025-05-07T19:46:20.6633217Z 2025-05-07T19:46:20.6633416Z  2025-05-07T19:46:20.6633644Z 2025-05-07T19:46:20.6633648Z 2025-05-07T19:46:20.6633656Z 2025-05-07T19:46:20.6633660Z 2025-05-07T19:46:20.6633664Z 2025-05-07T19:46:20.6633667Z 2025-05-07T19:46:20.6633856Z  2025-05-07T19:46:20.6634107Z 2025-05-07T19:46:20.6634111Z 2025-05-07T19:46:20.6634114Z 2025-05-07T19:46:20.6634122Z 2025-05-07T19:46:20.6634125Z 2025-05-07T19:46:20.6634129Z 2025-05-07T19:46:20.6634132Z 2025-05-07T19:46:20.6634406Z  2025-05-07T19:46:20.6634638Z 2025-05-07T19:46:20.6634642Z 2025-05-07T19:46:20.6634645Z 2025-05-07T19:46:20.6634649Z 2025-05-07T19:46:20.6634652Z 2025-05-07T19:46:20.6634656Z 2025-05-07T19:46:20.6634659Z 2025-05-07T19:46:20.6634663Z 2025-05-07T19:46:20.6634872Z  2025-05-07T19:46:20.6635117Z 2025-05-07T19:46:20.6635121Z 2025-05-07T19:46:20.6635125Z 2025-05-07T19:46:20.6635129Z 2025-05-07T19:46:20.6635132Z 2025-05-07T19:46:20.6635367Z 2025-05-07T19:46:20.6635372Z 2025-05-07T19:46:20.6635376Z 2025-05-07T19:46:20.6635379Z 2025-05-07T19:46:20.6635605Z  2025-05-07T19:46:20.6635841Z 2025-05-07T19:46:20.6635845Z 2025-05-07T19:46:20.6635956Z 2025-05-07T19:46:20.6635960Z 2025-05-07T19:46:20.6635963Z 2025-05-07T19:46:20.6635967Z 2025-05-07T19:46:20.6635970Z 2025-05-07T19:46:20.6635974Z 2025-05-07T19:46:20.6635977Z 2025-05-07T19:46:20.6635980Z 2025-05-07T19:46:20.6636199Z  2025-05-07T19:46:20.6636439Z 2025-05-07T19:46:20.6636443Z 2025-05-07T19:46:20.6636446Z 2025-05-07T19:46:20.6636450Z 2025-05-07T19:46:20.6636454Z 2025-05-07T19:46:20.6636458Z 2025-05-07T19:46:20.6636461Z 2025-05-07T19:46:20.6636465Z 2025-05-07T19:46:20.6636468Z 2025-05-07T19:46:20.6636472Z 2025-05-07T19:46:20.6636475Z 2025-05-07T19:46:20.6636705Z  2025-05-07T19:46:20.6636950Z 2025-05-07T19:46:20.6636954Z 2025-05-07T19:46:20.6636957Z 2025-05-07T19:46:20.6636961Z 2025-05-07T19:46:20.6636965Z 2025-05-07T19:46:20.6636969Z 2025-05-07T19:46:20.6636972Z 2025-05-07T19:46:20.6636980Z 2025-05-07T19:46:20.6636983Z 2025-05-07T19:46:20.6636987Z 2025-05-07T19:46:20.6636990Z 2025-05-07T19:46:20.6636994Z 2025-05-07T19:46:20.6637226Z  2025-05-07T19:46:20.6637472Z 2025-05-07T19:46:20.6637476Z 2025-05-07T19:46:20.6637479Z 2025-05-07T19:46:20.6637483Z 2025-05-07T19:46:20.6637487Z 2025-05-07T19:46:20.6637490Z 2025-05-07T19:46:20.6637494Z 2025-05-07T19:46:20.6637497Z 2025-05-07T19:46:20.6637501Z 2025-05-07T19:46:20.6637504Z 2025-05-07T19:46:20.6637528Z 2025-05-07T19:46:20.6637531Z 2025-05-07T19:46:20.6637535Z 2025-05-07T19:46:20.6637753Z  2025-05-07T19:46:20.6638009Z 2025-05-07T19:46:20.6638013Z 2025-05-07T19:46:20.6638016Z 2025-05-07T19:46:20.6638020Z 2025-05-07T19:46:20.6638023Z 2025-05-07T19:46:20.6638027Z 2025-05-07T19:46:20.6638031Z 2025-05-07T19:46:20.6638053Z 2025-05-07T19:46:20.6638057Z 2025-05-07T19:46:20.6638064Z 2025-05-07T19:46:20.6638068Z 2025-05-07T19:46:20.6638071Z 2025-05-07T19:46:20.6638075Z 2025-05-07T19:46:20.6638078Z 2025-05-07T19:46:20.6638306Z  2025-05-07T19:46:20.6638558Z 2025-05-07T19:46:20.6638562Z 2025-05-07T19:46:20.6638583Z 2025-05-07T19:46:20.6638587Z 2025-05-07T19:46:20.6638590Z 2025-05-07T19:46:20.6638594Z 2025-05-07T19:46:20.6638597Z 2025-05-07T19:46:20.6638601Z 2025-05-07T19:46:20.6638604Z 2025-05-07T19:46:20.6638608Z 2025-05-07T19:46:20.6638611Z 2025-05-07T19:46:20.6638615Z 2025-05-07T19:46:20.6638619Z 2025-05-07T19:46:20.6638623Z 2025-05-07T19:46:20.6638630Z 2025-05-07T19:46:20.6638853Z  2025-05-07T19:46:20.6639127Z 2025-05-07T19:46:20.6639130Z 2025-05-07T19:46:20.6639134Z 2025-05-07T19:46:20.6639137Z 2025-05-07T19:46:20.6639141Z 2025-05-07T19:46:20.6639148Z 2025-05-07T19:46:20.6639152Z 2025-05-07T19:46:20.6639156Z 2025-05-07T19:46:20.6639159Z 2025-05-07T19:46:20.6639163Z 2025-05-07T19:46:20.6639167Z 2025-05-07T19:46:20.6639170Z 2025-05-07T19:46:20.6639174Z 2025-05-07T19:46:20.6639177Z 2025-05-07T19:46:20.6639181Z 2025-05-07T19:46:20.6639184Z 2025-05-07T19:46:20.6639431Z  2025-05-07T19:46:20.6639693Z 2025-05-07T19:46:20.6639696Z 2025-05-07T19:46:20.6639701Z 2025-05-07T19:46:20.6639704Z 2025-05-07T19:46:20.6639708Z 2025-05-07T19:46:20.6639711Z 2025-05-07T19:46:20.6639715Z 2025-05-07T19:46:20.6639718Z 2025-05-07T19:46:20.6639722Z 2025-05-07T19:46:20.6639793Z 2025-05-07T19:46:20.6639797Z 2025-05-07T19:46:20.6639801Z 2025-05-07T19:46:20.6639804Z 2025-05-07T19:46:20.6639808Z 2025-05-07T19:46:20.6639831Z 2025-05-07T19:46:20.6639834Z 2025-05-07T19:46:20.6639838Z 2025-05-07T19:46:20.6640074Z  2025-05-07T19:46:20.6640400Z 2025-05-07T19:46:20.6640404Z 2025-05-07T19:46:20.6640407Z 2025-05-07T19:46:20.6640410Z 2025-05-07T19:46:20.6640414Z 2025-05-07T19:46:20.6640417Z 2025-05-07T19:46:20.6640421Z 2025-05-07T19:46:20.6640446Z 2025-05-07T19:46:20.6640450Z 2025-05-07T19:46:20.6640453Z 2025-05-07T19:46:20.6640457Z 2025-05-07T19:46:20.6640460Z 2025-05-07T19:46:20.6640464Z 2025-05-07T19:46:20.6640467Z 2025-05-07T19:46:20.6640471Z 2025-05-07T19:46:20.6640474Z 2025-05-07T19:46:20.6640477Z 2025-05-07T19:46:20.6640481Z 2025-05-07T19:46:20.6640730Z  2025-05-07T19:46:20.6641018Z 2025-05-07T19:46:20.6641021Z 2025-05-07T19:46:20.6641134Z  2025-05-07T19:46:20.6641259Z 2025-05-07T19:46:20.6641263Z 2025-05-07T19:46:20.6641389Z  2025-05-07T19:46:20.6641511Z 2025-05-07T19:46:20.6641515Z 2025-05-07T19:46:20.6641519Z 2025-05-07T19:46:20.6641713Z  2025-05-07T19:46:20.6641837Z 2025-05-07T19:46:20.6641841Z 2025-05-07T19:46:20.6641845Z 2025-05-07T19:46:20.6641848Z 2025-05-07T19:46:20.6641959Z  2025-05-07T19:46:20.6642112Z 2025-05-07T19:46:20.6642116Z 2025-05-07T19:46:20.6642119Z 2025-05-07T19:46:20.6642123Z 2025-05-07T19:46:20.6642126Z 2025-05-07T19:46:20.6642243Z  2025-05-07T19:46:20.6642381Z 2025-05-07T19:46:20.6642385Z 2025-05-07T19:46:20.6642408Z 2025-05-07T19:46:20.6642411Z 2025-05-07T19:46:20.6642415Z 2025-05-07T19:46:20.6642418Z 2025-05-07T19:46:20.6642539Z  2025-05-07T19:46:20.6642680Z 2025-05-07T19:46:20.6642683Z 2025-05-07T19:46:20.6642687Z 2025-05-07T19:46:20.6642694Z 2025-05-07T19:46:20.6642697Z 2025-05-07T19:46:20.6642701Z 2025-05-07T19:46:20.6642705Z 2025-05-07T19:46:20.6642847Z  2025-05-07T19:46:20.6643000Z 2025-05-07T19:46:20.6643003Z 2025-05-07T19:46:20.6643007Z 2025-05-07T19:46:20.6643010Z 2025-05-07T19:46:20.6643018Z 2025-05-07T19:46:20.6643021Z 2025-05-07T19:46:20.6643025Z 2025-05-07T19:46:20.6643028Z 2025-05-07T19:46:20.6643173Z  2025-05-07T19:46:20.6643336Z 2025-05-07T19:46:20.6643340Z 2025-05-07T19:46:20.6643344Z 2025-05-07T19:46:20.6643347Z 2025-05-07T19:46:20.6643351Z 2025-05-07T19:46:20.6643354Z 2025-05-07T19:46:20.6643358Z 2025-05-07T19:46:20.6643361Z 2025-05-07T19:46:20.6643365Z 2025-05-07T19:46:20.6643523Z  2025-05-07T19:46:20.6643694Z 2025-05-07T19:46:20.6643698Z 2025-05-07T19:46:20.6643701Z 2025-05-07T19:46:20.6643705Z 2025-05-07T19:46:20.6643709Z 2025-05-07T19:46:20.6643712Z 2025-05-07T19:46:20.6643716Z 2025-05-07T19:46:20.6643719Z 2025-05-07T19:46:20.6643726Z 2025-05-07T19:46:20.6643730Z 2025-05-07T19:46:20.6643902Z  2025-05-07T19:46:20.6644076Z 2025-05-07T19:46:20.6644080Z 2025-05-07T19:46:20.6644083Z 2025-05-07T19:46:20.6644087Z 2025-05-07T19:46:20.6644091Z 2025-05-07T19:46:20.6644094Z 2025-05-07T19:46:20.6644101Z 2025-05-07T19:46:20.6644105Z 2025-05-07T19:46:20.6644108Z 2025-05-07T19:46:20.6644112Z 2025-05-07T19:46:20.6644115Z 2025-05-07T19:46:20.6644273Z  2025-05-07T19:46:20.6644459Z 2025-05-07T19:46:20.6644463Z 2025-05-07T19:46:20.6644467Z 2025-05-07T19:46:20.6644470Z 2025-05-07T19:46:20.6644474Z 2025-05-07T19:46:20.6644477Z 2025-05-07T19:46:20.6644481Z 2025-05-07T19:46:20.6644484Z 2025-05-07T19:46:20.6644488Z 2025-05-07T19:46:20.6644492Z 2025-05-07T19:46:20.6644495Z 2025-05-07T19:46:20.6644499Z 2025-05-07T19:46:20.6644660Z  2025-05-07T19:46:20.6644854Z 2025-05-07T19:46:20.6644858Z 2025-05-07T19:46:20.6644862Z 2025-05-07T19:46:20.6644990Z 2025-05-07T19:46:20.6644995Z 2025-05-07T19:46:20.6644998Z 2025-05-07T19:46:20.6645002Z 2025-05-07T19:46:20.6645005Z 2025-05-07T19:46:20.6645009Z 2025-05-07T19:46:20.6645012Z 2025-05-07T19:46:20.6645016Z 2025-05-07T19:46:20.6645020Z 2025-05-07T19:46:20.6645024Z 2025-05-07T19:46:20.6645273Z  2025-05-07T19:46:20.6645479Z 2025-05-07T19:46:20.6645482Z 2025-05-07T19:46:20.6645486Z 2025-05-07T19:46:20.6645489Z 2025-05-07T19:46:20.6645493Z 2025-05-07T19:46:20.6645497Z 2025-05-07T19:46:20.6645500Z 2025-05-07T19:46:20.6645504Z 2025-05-07T19:46:20.6645508Z 2025-05-07T19:46:20.6645511Z 2025-05-07T19:46:20.6645515Z 2025-05-07T19:46:20.6645518Z 2025-05-07T19:46:20.6645522Z 2025-05-07T19:46:20.6645543Z 2025-05-07T19:46:20.6645701Z  2025-05-07T19:46:20.6645910Z 2025-05-07T19:46:20.6645913Z 2025-05-07T19:46:20.6645917Z 2025-05-07T19:46:20.6645920Z 2025-05-07T19:46:20.6645924Z 2025-05-07T19:46:20.6645931Z 2025-05-07T19:46:20.6645935Z 2025-05-07T19:46:20.6645939Z 2025-05-07T19:46:20.6645942Z 2025-05-07T19:46:20.6645966Z 2025-05-07T19:46:20.6645970Z 2025-05-07T19:46:20.6645973Z 2025-05-07T19:46:20.6645977Z 2025-05-07T19:46:20.6645980Z 2025-05-07T19:46:20.6645983Z 2025-05-07T19:46:20.6646163Z  2025-05-07T19:46:20.6646377Z 2025-05-07T19:46:20.6646381Z 2025-05-07T19:46:20.6646385Z 2025-05-07T19:46:20.6646389Z 2025-05-07T19:46:20.6646392Z 2025-05-07T19:46:20.6646415Z 2025-05-07T19:46:20.6646419Z 2025-05-07T19:46:20.6646423Z 2025-05-07T19:46:20.6646426Z 2025-05-07T19:46:20.6646430Z 2025-05-07T19:46:20.6646433Z 2025-05-07T19:46:20.6646437Z 2025-05-07T19:46:20.6646440Z 2025-05-07T19:46:20.6646444Z 2025-05-07T19:46:20.6646447Z 2025-05-07T19:46:20.6646451Z 2025-05-07T19:46:20.6646623Z  2025-05-07T19:46:20.6646868Z 2025-05-07T19:46:20.6646871Z 2025-05-07T19:46:20.6646875Z 2025-05-07T19:46:20.6646882Z 2025-05-07T19:46:20.6646886Z 2025-05-07T19:46:20.6646890Z 2025-05-07T19:46:20.6646893Z 2025-05-07T19:46:20.6646897Z 2025-05-07T19:46:20.6646900Z 2025-05-07T19:46:20.6646904Z 2025-05-07T19:46:20.6646907Z 2025-05-07T19:46:20.6646911Z 2025-05-07T19:46:20.6646914Z 2025-05-07T19:46:20.6646921Z 2025-05-07T19:46:20.6646924Z 2025-05-07T19:46:20.6646928Z 2025-05-07T19:46:20.6646932Z 2025-05-07T19:46:20.6647117Z  2025-05-07T19:46:20.6647347Z 2025-05-07T19:46:20.6647351Z 2025-05-07T19:46:20.6647355Z 2025-05-07T19:46:20.6647358Z 2025-05-07T19:46:20.6647362Z 2025-05-07T19:46:20.6647365Z 2025-05-07T19:46:20.6647369Z 2025-05-07T19:46:20.6647372Z 2025-05-07T19:46:20.6647376Z 2025-05-07T19:46:20.6647379Z 2025-05-07T19:46:20.6647383Z 2025-05-07T19:46:20.6647386Z 2025-05-07T19:46:20.6647390Z 2025-05-07T19:46:20.6647393Z 2025-05-07T19:46:20.6647397Z 2025-05-07T19:46:20.6647401Z 2025-05-07T19:46:20.6647404Z 2025-05-07T19:46:20.6647435Z 2025-05-07T19:46:20.6647622Z  2025-05-07T19:46:20.6647850Z 2025-05-07T19:46:20.6647854Z 2025-05-07T19:46:20.6647956Z  2025-05-07T19:46:20.6648085Z 2025-05-07T19:46:20.6648089Z 2025-05-07T19:46:20.6648192Z  2025-05-07T19:46:20.6648309Z 2025-05-07T19:46:20.6648316Z 2025-05-07T19:46:20.6648320Z 2025-05-07T19:46:20.6648447Z  2025-05-07T19:46:20.6648563Z 2025-05-07T19:46:20.6648566Z 2025-05-07T19:46:20.6648570Z 2025-05-07T19:46:20.6648574Z 2025-05-07T19:46:20.6648684Z  2025-05-07T19:46:20.6648827Z 2025-05-07T19:46:20.6648830Z 2025-05-07T19:46:20.6648834Z 2025-05-07T19:46:20.6648837Z 2025-05-07T19:46:20.6648841Z 2025-05-07T19:46:20.6648950Z  2025-05-07T19:46:20.6649085Z 2025-05-07T19:46:20.6649107Z 2025-05-07T19:46:20.6649111Z 2025-05-07T19:46:20.6649114Z 2025-05-07T19:46:20.6649117Z 2025-05-07T19:46:20.6649121Z 2025-05-07T19:46:20.6649237Z  2025-05-07T19:46:20.6649374Z 2025-05-07T19:46:20.6649440Z 2025-05-07T19:46:20.6649444Z 2025-05-07T19:46:20.6649447Z 2025-05-07T19:46:20.6649450Z 2025-05-07T19:46:20.6649454Z 2025-05-07T19:46:20.6649476Z 2025-05-07T19:46:20.6649594Z  2025-05-07T19:46:20.6649753Z 2025-05-07T19:46:20.6649757Z 2025-05-07T19:46:20.6649821Z 2025-05-07T19:46:20.6649824Z 2025-05-07T19:46:20.6649828Z 2025-05-07T19:46:20.6649831Z 2025-05-07T19:46:20.6649835Z 2025-05-07T19:46:20.6649838Z 2025-05-07T19:46:20.6649987Z  2025-05-07T19:46:20.6650146Z 2025-05-07T19:46:20.6650149Z 2025-05-07T19:46:20.6650153Z 2025-05-07T19:46:20.6650156Z 2025-05-07T19:46:20.6650160Z 2025-05-07T19:46:20.6650164Z 2025-05-07T19:46:20.6650167Z 2025-05-07T19:46:20.6650171Z 2025-05-07T19:46:20.6650174Z 2025-05-07T19:46:20.6650457Z  2025-05-07T19:46:20.6650624Z 2025-05-07T19:46:20.6650628Z 2025-05-07T19:46:20.6650632Z 2025-05-07T19:46:20.6650635Z 2025-05-07T19:46:20.6650639Z 2025-05-07T19:46:20.6650646Z 2025-05-07T19:46:20.6650650Z 2025-05-07T19:46:20.6650654Z 2025-05-07T19:46:20.6650657Z 2025-05-07T19:46:20.6650661Z 2025-05-07T19:46:20.6650817Z  2025-05-07T19:46:20.6650998Z 2025-05-07T19:46:20.6651002Z 2025-05-07T19:46:20.6651005Z 2025-05-07T19:46:20.6651009Z 2025-05-07T19:46:20.6651016Z 2025-05-07T19:46:20.6651020Z 2025-05-07T19:46:20.6651023Z 2025-05-07T19:46:20.6651027Z 2025-05-07T19:46:20.6651030Z 2025-05-07T19:46:20.6651034Z 2025-05-07T19:46:20.6651037Z 2025-05-07T19:46:20.6651192Z  2025-05-07T19:46:20.6651393Z 2025-05-07T19:46:20.6651396Z 2025-05-07T19:46:20.6651400Z 2025-05-07T19:46:20.6651403Z 2025-05-07T19:46:20.6651407Z 2025-05-07T19:46:20.6651411Z 2025-05-07T19:46:20.6651414Z 2025-05-07T19:46:20.6651418Z 2025-05-07T19:46:20.6651421Z 2025-05-07T19:46:20.6651425Z 2025-05-07T19:46:20.6651429Z 2025-05-07T19:46:20.6651432Z 2025-05-07T19:46:20.6651643Z  2025-05-07T19:46:20.6651929Z 2025-05-07T19:46:20.6651933Z 2025-05-07T19:46:20.6651936Z 2025-05-07T19:46:20.6651939Z 2025-05-07T19:46:20.6651943Z 2025-05-07T19:46:20.6651947Z 2025-05-07T19:46:20.6651950Z 2025-05-07T19:46:20.6651954Z 2025-05-07T19:46:20.6651957Z 2025-05-07T19:46:20.6651961Z 2025-05-07T19:46:20.6651968Z 2025-05-07T19:46:20.6651971Z 2025-05-07T19:46:20.6651975Z 2025-05-07T19:46:20.6652143Z  2025-05-07T19:46:20.6652363Z 2025-05-07T19:46:20.6652368Z 2025-05-07T19:46:20.6652374Z 2025-05-07T19:46:20.6652381Z 2025-05-07T19:46:20.6652387Z 2025-05-07T19:46:20.6652393Z 2025-05-07T19:46:20.6652398Z 2025-05-07T19:46:20.6652404Z 2025-05-07T19:46:20.6652410Z 2025-05-07T19:46:20.6652415Z 2025-05-07T19:46:20.6652422Z 2025-05-07T19:46:20.6652448Z 2025-05-07T19:46:20.6652453Z 2025-05-07T19:46:20.6652460Z 2025-05-07T19:46:20.6652652Z  2025-05-07T19:46:20.6652866Z 2025-05-07T19:46:20.6652869Z 2025-05-07T19:46:20.6652872Z 2025-05-07T19:46:20.6652880Z 2025-05-07T19:46:20.6652884Z 2025-05-07T19:46:20.6652887Z 2025-05-07T19:46:20.6652891Z 2025-05-07T19:46:20.6652894Z 2025-05-07T19:46:20.6652916Z 2025-05-07T19:46:20.6652919Z 2025-05-07T19:46:20.6652922Z 2025-05-07T19:46:20.6652926Z 2025-05-07T19:46:20.6652929Z 2025-05-07T19:46:20.6652936Z 2025-05-07T19:46:20.6652940Z 2025-05-07T19:46:20.6653100Z  2025-05-07T19:46:20.6653315Z 2025-05-07T19:46:20.6653318Z 2025-05-07T19:46:20.6653322Z 2025-05-07T19:46:20.6653344Z 2025-05-07T19:46:20.6653347Z 2025-05-07T19:46:20.6653351Z 2025-05-07T19:46:20.6653354Z 2025-05-07T19:46:20.6653358Z 2025-05-07T19:46:20.6653361Z 2025-05-07T19:46:20.6653365Z 2025-05-07T19:46:20.6653368Z 2025-05-07T19:46:20.6653372Z 2025-05-07T19:46:20.6653375Z 2025-05-07T19:46:20.6653379Z 2025-05-07T19:46:20.6653382Z 2025-05-07T19:46:20.6653386Z 2025-05-07T19:46:20.6653545Z  2025-05-07T19:46:20.6653782Z 2025-05-07T19:46:20.6653866Z 2025-05-07T19:46:20.6653870Z 2025-05-07T19:46:20.6653874Z 2025-05-07T19:46:20.6653877Z 2025-05-07T19:46:20.6653881Z 2025-05-07T19:46:20.6653884Z 2025-05-07T19:46:20.6653888Z 2025-05-07T19:46:20.6653891Z 2025-05-07T19:46:20.6653894Z 2025-05-07T19:46:20.6653898Z 2025-05-07T19:46:20.6653968Z 2025-05-07T19:46:20.6653971Z 2025-05-07T19:46:20.6653975Z 2025-05-07T19:46:20.6653978Z 2025-05-07T19:46:20.6653981Z 2025-05-07T19:46:20.6653985Z 2025-05-07T19:46:20.6654167Z  2025-05-07T19:46:20.6654391Z 2025-05-07T19:46:20.6654395Z 2025-05-07T19:46:20.6654398Z 2025-05-07T19:46:20.6654402Z 2025-05-07T19:46:20.6654405Z 2025-05-07T19:46:20.6654409Z 2025-05-07T19:46:20.6654412Z 2025-05-07T19:46:20.6654416Z 2025-05-07T19:46:20.6654419Z 2025-05-07T19:46:20.6654423Z 2025-05-07T19:46:20.6654426Z 2025-05-07T19:46:20.6654430Z 2025-05-07T19:46:20.6654433Z 2025-05-07T19:46:20.6654437Z 2025-05-07T19:46:20.6654440Z 2025-05-07T19:46:20.6654466Z 2025-05-07T19:46:20.6654470Z 2025-05-07T19:46:20.6654474Z 2025-05-07T19:46:20.6654644Z  2025-05-07T19:46:20.6654873Z 2025-05-07T19:46:20.6654877Z 2025-05-07T19:46:20.6655001Z  2025-05-07T19:46:20.6655112Z 2025-05-07T19:46:20.6655120Z 2025-05-07T19:46:20.6655226Z  2025-05-07T19:46:20.6655345Z 2025-05-07T19:46:20.6655349Z 2025-05-07T19:46:20.6655371Z 2025-05-07T19:46:20.6655478Z  2025-05-07T19:46:20.6655597Z 2025-05-07T19:46:20.6655600Z 2025-05-07T19:46:20.6655604Z 2025-05-07T19:46:20.6655608Z 2025-05-07T19:46:20.6655718Z  2025-05-07T19:46:20.6655860Z 2025-05-07T19:46:20.6655863Z 2025-05-07T19:46:20.6655867Z 2025-05-07T19:46:20.6655870Z 2025-05-07T19:46:20.6655874Z 2025-05-07T19:46:20.6655987Z  2025-05-07T19:46:20.6656140Z 2025-05-07T19:46:20.6656144Z 2025-05-07T19:46:20.6656148Z 2025-05-07T19:46:20.6656151Z 2025-05-07T19:46:20.6656155Z 2025-05-07T19:46:20.6656158Z 2025-05-07T19:46:20.6656276Z  2025-05-07T19:46:20.6656417Z 2025-05-07T19:46:20.6656420Z 2025-05-07T19:46:20.6656424Z 2025-05-07T19:46:20.6656427Z 2025-05-07T19:46:20.6656431Z 2025-05-07T19:46:20.6656451Z 2025-05-07T19:46:20.6656454Z 2025-05-07T19:46:20.6656576Z  2025-05-07T19:46:20.6656727Z 2025-05-07T19:46:20.6656731Z 2025-05-07T19:46:20.6656735Z 2025-05-07T19:46:20.6656738Z 2025-05-07T19:46:20.6656742Z 2025-05-07T19:46:20.6656745Z 2025-05-07T19:46:20.6656749Z 2025-05-07T19:46:20.6656752Z 2025-05-07T19:46:20.6656898Z  2025-05-07T19:46:20.6657059Z 2025-05-07T19:46:20.6657063Z 2025-05-07T19:46:20.6657066Z 2025-05-07T19:46:20.6657070Z 2025-05-07T19:46:20.6657073Z 2025-05-07T19:46:20.6657077Z 2025-05-07T19:46:20.6657080Z 2025-05-07T19:46:20.6657084Z 2025-05-07T19:46:20.6657087Z 2025-05-07T19:46:20.6657237Z  2025-05-07T19:46:20.6657407Z 2025-05-07T19:46:20.6657410Z 2025-05-07T19:46:20.6657414Z 2025-05-07T19:46:20.6657421Z 2025-05-07T19:46:20.6657424Z 2025-05-07T19:46:20.6657427Z 2025-05-07T19:46:20.6657431Z 2025-05-07T19:46:20.6657434Z 2025-05-07T19:46:20.6657438Z 2025-05-07T19:46:20.6657441Z 2025-05-07T19:46:20.6657590Z  2025-05-07T19:46:20.6657763Z 2025-05-07T19:46:20.6657771Z 2025-05-07T19:46:20.6657775Z 2025-05-07T19:46:20.6657778Z 2025-05-07T19:46:20.6657782Z 2025-05-07T19:46:20.6657785Z 2025-05-07T19:46:20.6657788Z 2025-05-07T19:46:20.6657792Z 2025-05-07T19:46:20.6657795Z 2025-05-07T19:46:20.6657799Z 2025-05-07T19:46:20.6657802Z 2025-05-07T19:46:20.6657964Z  2025-05-07T19:46:20.6658151Z 2025-05-07T19:46:20.6658154Z 2025-05-07T19:46:20.6658158Z 2025-05-07T19:46:20.6658161Z 2025-05-07T19:46:20.6658165Z 2025-05-07T19:46:20.6658169Z 2025-05-07T19:46:20.6658172Z 2025-05-07T19:46:20.6658176Z 2025-05-07T19:46:20.6658179Z 2025-05-07T19:46:20.6658182Z 2025-05-07T19:46:20.6658186Z 2025-05-07T19:46:20.6658189Z 2025-05-07T19:46:20.6658456Z  2025-05-07T19:46:20.6658653Z 2025-05-07T19:46:20.6658657Z 2025-05-07T19:46:20.6658660Z 2025-05-07T19:46:20.6658663Z 2025-05-07T19:46:20.6658667Z 2025-05-07T19:46:20.6658670Z 2025-05-07T19:46:20.6658674Z 2025-05-07T19:46:20.6658677Z 2025-05-07T19:46:20.6658733Z 2025-05-07T19:46:20.6658737Z 2025-05-07T19:46:20.6658740Z 2025-05-07T19:46:20.6658744Z 2025-05-07T19:46:20.6658748Z 2025-05-07T19:46:20.6658908Z  2025-05-07T19:46:20.6659110Z 2025-05-07T19:46:20.6659113Z 2025-05-07T19:46:20.6659117Z 2025-05-07T19:46:20.6659120Z 2025-05-07T19:46:20.6659124Z 2025-05-07T19:46:20.6659127Z 2025-05-07T19:46:20.6659131Z 2025-05-07T19:46:20.6659134Z 2025-05-07T19:46:20.6659138Z 2025-05-07T19:46:20.6659142Z 2025-05-07T19:46:20.6659145Z 2025-05-07T19:46:20.6659167Z 2025-05-07T19:46:20.6659170Z 2025-05-07T19:46:20.6659174Z 2025-05-07T19:46:20.6659322Z  2025-05-07T19:46:20.6659534Z 2025-05-07T19:46:20.6659538Z 2025-05-07T19:46:20.6659541Z 2025-05-07T19:46:20.6659545Z 2025-05-07T19:46:20.6659549Z 2025-05-07T19:46:20.6659552Z 2025-05-07T19:46:20.6659556Z 2025-05-07T19:46:20.6659560Z 2025-05-07T19:46:20.6659581Z 2025-05-07T19:46:20.6659584Z 2025-05-07T19:46:20.6659589Z 2025-05-07T19:46:20.6659596Z 2025-05-07T19:46:20.6659600Z 2025-05-07T19:46:20.6659603Z 2025-05-07T19:46:20.6659607Z 2025-05-07T19:46:20.6659761Z  2025-05-07T19:46:20.6659972Z 2025-05-07T19:46:20.6659975Z 2025-05-07T19:46:20.6659979Z 2025-05-07T19:46:20.6660001Z 2025-05-07T19:46:20.6660004Z 2025-05-07T19:46:20.6660008Z 2025-05-07T19:46:20.6660011Z 2025-05-07T19:46:20.6660014Z 2025-05-07T19:46:20.6660018Z 2025-05-07T19:46:20.6660021Z 2025-05-07T19:46:20.6660025Z 2025-05-07T19:46:20.6660028Z 2025-05-07T19:46:20.6660032Z 2025-05-07T19:46:20.6660036Z 2025-05-07T19:46:20.6660039Z 2025-05-07T19:46:20.6660043Z 2025-05-07T19:46:20.6660209Z  2025-05-07T19:46:20.6660461Z 2025-05-07T19:46:20.6660465Z 2025-05-07T19:46:20.6660469Z 2025-05-07T19:46:20.6660472Z 2025-05-07T19:46:20.6660475Z 2025-05-07T19:46:20.6660479Z 2025-05-07T19:46:20.6660483Z 2025-05-07T19:46:20.6660487Z 2025-05-07T19:46:20.6660490Z 2025-05-07T19:46:20.6660497Z 2025-05-07T19:46:20.6660501Z 2025-05-07T19:46:20.6660504Z 2025-05-07T19:46:20.6660507Z 2025-05-07T19:46:20.6660511Z 2025-05-07T19:46:20.6660514Z 2025-05-07T19:46:20.6660518Z 2025-05-07T19:46:20.6660521Z 2025-05-07T19:46:20.6660703Z  2025-05-07T19:46:20.6660926Z 2025-05-07T19:46:20.6660931Z 2025-05-07T19:46:20.6660934Z 2025-05-07T19:46:20.6660938Z 2025-05-07T19:46:20.6660941Z 2025-05-07T19:46:20.6660945Z 2025-05-07T19:46:20.6660948Z 2025-05-07T19:46:20.6660952Z 2025-05-07T19:46:20.6660955Z 2025-05-07T19:46:20.6660959Z 2025-05-07T19:46:20.6660963Z 2025-05-07T19:46:20.6660967Z 2025-05-07T19:46:20.6660970Z 2025-05-07T19:46:20.6660994Z 2025-05-07T19:46:20.6660998Z 2025-05-07T19:46:20.6661001Z 2025-05-07T19:46:20.6661005Z 2025-05-07T19:46:20.6661008Z 2025-05-07T19:46:20.6661176Z  2025-05-07T19:46:20.6661403Z 2025-05-07T19:46:20.6661407Z 2025-05-07T19:46:20.6661530Z  2025-05-07T19:46:20.6661640Z 2025-05-07T19:46:20.6661644Z 2025-05-07T19:46:20.6661747Z  2025-05-07T19:46:20.6661881Z 2025-05-07T19:46:20.6661884Z 2025-05-07T19:46:20.6661888Z 2025-05-07T19:46:20.6661994Z  2025-05-07T19:46:20.6662110Z 2025-05-07T19:46:20.6662113Z 2025-05-07T19:46:20.6662117Z 2025-05-07T19:46:20.6662120Z 2025-05-07T19:46:20.6662248Z  2025-05-07T19:46:20.6662373Z 2025-05-07T19:46:20.6662377Z 2025-05-07T19:46:20.6662380Z 2025-05-07T19:46:20.6662384Z 2025-05-07T19:46:20.6662387Z 2025-05-07T19:46:20.6662498Z  2025-05-07T19:46:20.6662647Z 2025-05-07T19:46:20.6662650Z 2025-05-07T19:46:20.6662654Z 2025-05-07T19:46:20.6662657Z 2025-05-07T19:46:20.6662751Z 2025-05-07T19:46:20.6662755Z 2025-05-07T19:46:20.6662873Z  2025-05-07T19:46:20.6663010Z 2025-05-07T19:46:20.6663014Z 2025-05-07T19:46:20.6663017Z 2025-05-07T19:46:20.6663037Z 2025-05-07T19:46:20.6663041Z 2025-05-07T19:46:20.6663044Z 2025-05-07T19:46:20.6664443Z 2025-05-07T19:46:20.6664565Z  2025-05-07T19:46:20.6664989Z 2025-05-07T19:46:20.6664993Z 2025-05-07T19:46:20.6665018Z 2025-05-07T19:46:20.6665022Z 2025-05-07T19:46:20.6665026Z 2025-05-07T19:46:20.6665029Z 2025-05-07T19:46:20.6665033Z 2025-05-07T19:46:20.6665036Z 2025-05-07T19:46:20.6665176Z  2025-05-07T19:46:20.6665342Z 2025-05-07T19:46:20.6665345Z 2025-05-07T19:46:20.6665349Z 2025-05-07T19:46:20.6665352Z 2025-05-07T19:46:20.6665494Z 2025-05-07T19:46:20.6665498Z 2025-05-07T19:46:20.6665501Z 2025-05-07T19:46:20.6665505Z 2025-05-07T19:46:20.6665508Z 2025-05-07T19:46:20.6665643Z  2025-05-07T19:46:20.6665812Z 2025-05-07T19:46:20.6665823Z 2025-05-07T19:46:20.6665826Z 2025-05-07T19:46:20.6665830Z 2025-05-07T19:46:20.6665833Z 2025-05-07T19:46:20.6665837Z 2025-05-07T19:46:20.6665861Z 2025-05-07T19:46:20.6665864Z 2025-05-07T19:46:20.6665867Z 2025-05-07T19:46:20.6665871Z 2025-05-07T19:46:20.6666006Z  2025-05-07T19:46:20.6666187Z 2025-05-07T19:46:20.6666191Z 2025-05-07T19:46:20.6666268Z 2025-05-07T19:46:20.6666272Z 2025-05-07T19:46:20.6666276Z 2025-05-07T19:46:20.6666279Z 2025-05-07T19:46:20.6666283Z 2025-05-07T19:46:20.6666287Z 2025-05-07T19:46:20.6666290Z 2025-05-07T19:46:20.6666294Z 2025-05-07T19:46:20.6666297Z 2025-05-07T19:46:20.6666433Z  2025-05-07T19:46:20.6666639Z 2025-05-07T19:46:20.6666642Z 2025-05-07T19:46:20.6666646Z 2025-05-07T19:46:20.6666649Z 2025-05-07T19:46:20.6666653Z 2025-05-07T19:46:20.6666656Z 2025-05-07T19:46:20.6666660Z 2025-05-07T19:46:20.6666663Z 2025-05-07T19:46:20.6666667Z 2025-05-07T19:46:20.6666670Z 2025-05-07T19:46:20.6666677Z 2025-05-07T19:46:20.6666680Z 2025-05-07T19:46:20.6666841Z  2025-05-07T19:46:20.6667035Z 2025-05-07T19:46:20.6667038Z 2025-05-07T19:46:20.6667042Z 2025-05-07T19:46:20.6667045Z 2025-05-07T19:46:20.6667049Z 2025-05-07T19:46:20.6667052Z 2025-05-07T19:46:20.6667059Z 2025-05-07T19:46:20.6667063Z 2025-05-07T19:46:20.6667066Z 2025-05-07T19:46:20.6667070Z 2025-05-07T19:46:20.6667073Z 2025-05-07T19:46:20.6667077Z 2025-05-07T19:46:20.6667081Z 2025-05-07T19:46:20.6667244Z  2025-05-07T19:46:20.6667445Z 2025-05-07T19:46:20.6667449Z 2025-05-07T19:46:20.6667453Z 2025-05-07T19:46:20.6667456Z 2025-05-07T19:46:20.6667460Z 2025-05-07T19:46:20.6667463Z 2025-05-07T19:46:20.6667467Z 2025-05-07T19:46:20.6667470Z 2025-05-07T19:46:20.6667474Z 2025-05-07T19:46:20.6667477Z 2025-05-07T19:46:20.6667480Z 2025-05-07T19:46:20.6667484Z 2025-05-07T19:46:20.6667487Z 2025-05-07T19:46:20.6667491Z 2025-05-07T19:46:20.6667666Z  2025-05-07T19:46:20.6667874Z 2025-05-07T19:46:20.6667878Z 2025-05-07T19:46:20.6667881Z 2025-05-07T19:46:20.6667885Z 2025-05-07T19:46:20.6667888Z 2025-05-07T19:46:20.6667892Z 2025-05-07T19:46:20.6667895Z 2025-05-07T19:46:20.6667898Z 2025-05-07T19:46:20.6667906Z 2025-05-07T19:46:20.6667910Z 2025-05-07T19:46:20.6667913Z 2025-05-07T19:46:20.6667937Z 2025-05-07T19:46:20.6667940Z 2025-05-07T19:46:20.6667943Z 2025-05-07T19:46:20.6667947Z 2025-05-07T19:46:20.6668105Z  2025-05-07T19:46:20.6668322Z 2025-05-07T19:46:20.6668325Z 2025-05-07T19:46:20.6668329Z 2025-05-07T19:46:20.6668332Z 2025-05-07T19:46:20.6668336Z 2025-05-07T19:46:20.6668340Z 2025-05-07T19:46:20.6668343Z 2025-05-07T19:46:20.6668366Z 2025-05-07T19:46:20.6668370Z 2025-05-07T19:46:20.6668373Z 2025-05-07T19:46:20.6668376Z 2025-05-07T19:46:20.6668380Z 2025-05-07T19:46:20.6668383Z 2025-05-07T19:46:20.6668387Z 2025-05-07T19:46:20.6668391Z 2025-05-07T19:46:20.6668519Z 2025-05-07T19:46:20.6668687Z  2025-05-07T19:46:20.6668913Z 2025-05-07T19:46:20.6668936Z 2025-05-07T19:46:20.6668940Z 2025-05-07T19:46:20.6668943Z 2025-05-07T19:46:20.6668946Z 2025-05-07T19:46:20.6668950Z 2025-05-07T19:46:20.6669041Z 2025-05-07T19:46:20.6669045Z 2025-05-07T19:46:20.6669049Z 2025-05-07T19:46:20.6669052Z 2025-05-07T19:46:20.6669056Z 2025-05-07T19:46:20.6669060Z 2025-05-07T19:46:20.6669063Z 2025-05-07T19:46:20.6669067Z 2025-05-07T19:46:20.6669070Z 2025-05-07T19:46:20.6669074Z 2025-05-07T19:46:20.6669077Z 2025-05-07T19:46:20.6669244Z  2025-05-07T19:46:20.6669491Z 2025-05-07T19:46:20.6669495Z 2025-05-07T19:46:20.6669498Z 2025-05-07T19:46:20.6669502Z 2025-05-07T19:46:20.6669505Z 2025-05-07T19:46:20.6669509Z 2025-05-07T19:46:20.6669512Z 2025-05-07T19:46:20.6669516Z 2025-05-07T19:46:20.6669520Z 2025-05-07T19:46:20.6669523Z 2025-05-07T19:46:20.6669527Z 2025-05-07T19:46:20.6669534Z 2025-05-07T19:46:20.6669537Z 2025-05-07T19:46:20.6669541Z 2025-05-07T19:46:20.6669545Z 2025-05-07T19:46:20.6669548Z 2025-05-07T19:46:20.6669552Z 2025-05-07T19:46:20.6669555Z 2025-05-07T19:46:20.6669750Z  2025-05-07T19:46:20.6669985Z 2025-05-07T19:46:20.6669989Z 2025-05-07T19:46:20.6670096Z  2025-05-07T19:46:20.6670230Z 2025-05-07T19:46:20.6670233Z 2025-05-07T19:46:20.6670343Z  2025-05-07T19:46:20.6670458Z 2025-05-07T19:46:20.6670461Z 2025-05-07T19:46:20.6670465Z 2025-05-07T19:46:20.6670593Z  2025-05-07T19:46:20.6670711Z 2025-05-07T19:46:20.6670715Z 2025-05-07T19:46:20.6670719Z 2025-05-07T19:46:20.6670722Z 2025-05-07T19:46:20.6670838Z  2025-05-07T19:46:20.6670986Z 2025-05-07T19:46:20.6670990Z 2025-05-07T19:46:20.6670994Z 2025-05-07T19:46:20.6670997Z 2025-05-07T19:46:20.6671001Z 2025-05-07T19:46:20.6671115Z  2025-05-07T19:46:20.6671251Z 2025-05-07T19:46:20.6671259Z 2025-05-07T19:46:20.6671262Z 2025-05-07T19:46:20.6671285Z 2025-05-07T19:46:20.6671289Z 2025-05-07T19:46:20.6671292Z 2025-05-07T19:46:20.6671493Z  2025-05-07T19:46:20.6671632Z 2025-05-07T19:46:20.6671636Z 2025-05-07T19:46:20.6671639Z 2025-05-07T19:46:20.6671643Z 2025-05-07T19:46:20.6671650Z 2025-05-07T19:46:20.6671654Z 2025-05-07T19:46:20.6671657Z 2025-05-07T19:46:20.6671801Z  2025-05-07T19:46:20.6671951Z 2025-05-07T19:46:20.6671954Z 2025-05-07T19:46:20.6671958Z 2025-05-07T19:46:20.6671961Z 2025-05-07T19:46:20.6671965Z 2025-05-07T19:46:20.6671968Z 2025-05-07T19:46:20.6671971Z 2025-05-07T19:46:20.6671975Z 2025-05-07T19:46:20.6672134Z  done 2025-05-07T19:46:20.8776140Z Preparing transaction: \ | done 2025-05-07T19:46:21.5802341Z Verifying transaction: - \ | / - \ | done 2025-05-07T19:46:21.8855637Z Executing transaction: - \ | done 2025-05-07T19:46:23.6325117Z [INSTALL] Fixing file placements for CUDA 12.6.3+ ... 2025-05-07T19:46:23.6326345Z [INSTALL] Creating symlinks: libnvToolsExt.so 2025-05-07T19:46:23.6328537Z + ln -sf /github/home/miniconda/envs/build_binary/lib/libnvToolsExt.so.1 /github/home/miniconda/envs/build_binary/lib/libnvToolsExt.so 2025-05-07T19:46:23.6330384Z 2025-05-07T19:46:23.6341295Z 2025-05-07T19:46:23.6343599Z + ln -sf /github/home/miniconda/envs/build_binary/targets/x86_64-linux/lib/libnvToolsExt.so.1 /github/home/miniconda/envs/build_binary/targets/x86_64-linux/lib/libnvToolsExt.so 2025-05-07T19:46:23.6355057Z 2025-05-07T19:46:23.6355074Z 2025-05-07T19:46:23.6355520Z [INSTALL] Copying nvtx3 headers ... 2025-05-07T19:46:23.6361089Z + cp -r /github/home/miniconda/envs/build_binary/nsight-compute-2024.3.2/host/target-linux-x64/nvtx/include/nvtx3/nvToolsExt.h /github/home/miniconda/envs/build_binary/nsight-compute-2024.3.2/host/target-linux-x64/nvtx/include/nvtx3/nvToolsExtCuda.h /github/home/miniconda/envs/build_binary/nsight-compute-2024.3.2/host/target-linux-x64/nvtx/include/nvtx3/nvToolsExtCudaRt.h /github/home/miniconda/envs/build_binary/nsight-compute-2024.3.2/host/target-linux-x64/nvtx/include/nvtx3/nvToolsExtOpenCL.h /github/home/miniconda/envs/build_binary/nsight-compute-2024.3.2/host/target-linux-x64/nvtx/include/nvtx3/nvToolsExtSync.h /github/home/miniconda/envs/build_binary/nsight-compute-2024.3.2/host/target-linux-x64/nvtx/include/nvtx3/nvtx3.hpp /github/home/miniconda/envs/build_binary/nsight-compute-2024.3.2/host/target-linux-x64/nvtx/include/nvtx3/nvtxDetail /github/home/miniconda/envs/build_binary/include/ 2025-05-07T19:46:23.6366077Z 2025-05-07T19:46:24.0853069Z 2025-05-07T19:46:24.0857787Z + cp -r /github/home/miniconda/envs/build_binary/nsight-compute-2024.3.2/host/target-linux-x64/nvtx/include/nvtx3/nvToolsExt.h /github/home/miniconda/envs/build_binary/nsight-compute-2024.3.2/host/target-linux-x64/nvtx/include/nvtx3/nvToolsExtCuda.h /github/home/miniconda/envs/build_binary/nsight-compute-2024.3.2/host/target-linux-x64/nvtx/include/nvtx3/nvToolsExtCudaRt.h /github/home/miniconda/envs/build_binary/nsight-compute-2024.3.2/host/target-linux-x64/nvtx/include/nvtx3/nvToolsExtOpenCL.h /github/home/miniconda/envs/build_binary/nsight-compute-2024.3.2/host/target-linux-x64/nvtx/include/nvtx3/nvToolsExtSync.h /github/home/miniconda/envs/build_binary/nsight-compute-2024.3.2/host/target-linux-x64/nvtx/include/nvtx3/nvtx3.hpp /github/home/miniconda/envs/build_binary/nsight-compute-2024.3.2/host/target-linux-x64/nvtx/include/nvtx3/nvtxDetail /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include/ 2025-05-07T19:46:24.0862394Z 2025-05-07T19:46:24.0874592Z 2025-05-07T19:46:24.0874952Z [INSTALL] Appending libcuda.so path to LD_LIBRARY_PATH ... 2025-05-07T19:46:24.1287102Z [ENV] Appending to LD_LIBRARY_PATH: /github/home/miniconda/envs/build_binary/targets/x86_64-linux/lib/stubs ... 2025-05-07T19:46:25.7958156Z + conda env config vars set -n build_binary LD_LIBRARY_PATH=/github/home/miniconda/envs/build_binary/lib:/github/home/miniconda/envs/build_binary/targets/x86_64-linux/lib/stubs 2025-05-07T19:46:25.7958948Z 2025-05-07T19:46:26.2129648Z 2025-05-07T19:46:26.2142019Z [INSTALL] Setting environment variable NVML_LIB_PATH ... 2025-05-07T19:46:26.2507731Z + conda env config vars set -n build_binary NVML_LIB_PATH=/github/home/miniconda/envs/build_binary/lib/stubs/libnvidia-ml.so 2025-05-07T19:46:26.2508390Z 2025-05-07T19:46:26.6674565Z 2025-05-07T19:46:26.6674974Z [INSTALL] Setting environment variable CUDA_INCLUDE_DIRS ... 2025-05-07T19:46:26.6676079Z + conda env config vars set -n build_binary CUDA_INCLUDE_DIRS="/github/home/miniconda/envs/build_binary/include/:/github/home/miniconda/envs/build_binary/targets/x86_64-linux/include/" 2025-05-07T19:46:26.6676907Z 2025-05-07T19:46:27.0826732Z 2025-05-07T19:46:28.7888181Z [CHECK] cuda_runtime.h found in CONDA_PREFIX PATH (file): /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include/cuda_runtime.h 2025-05-07T19:46:30.4956816Z [CHECK] libcuda.so found in CONDA_PREFIX PATH (file): /github/home/miniconda/envs/build_binary/targets/x86_64-linux/lib/stubs/libcuda.so 2025-05-07T19:46:32.1996901Z [CHECK] libnvToolsExt.so found in CONDA_PREFIX PATH (symbolic link): /github/home/miniconda/envs/build_binary/lib/libnvToolsExt.so 2025-05-07T19:46:32.1999493Z /github/home/miniconda/envs/build_binary/targets/x86_64-linux/lib/libnvToolsExt.so 2025-05-07T19:46:33.9305615Z [CHECK] libnvidia-ml.so found in CONDA_PREFIX PATH (file): /github/home/miniconda/envs/build_binary/targets/x86_64-linux/lib/stubs/libnvidia-ml.so 2025-05-07T19:46:35.5140196Z /github/home/miniconda/envs/build_binary/bin/nvcc 2025-05-07T19:46:35.5140699Z 2025-05-07T19:46:35.5715443Z [CHECK] Binary nvcc found in PATH 2025-05-07T19:46:38.8033717Z clang version 16.0.6 (https://github.com/conda-forge/clangdev-feedstock db6970f6bb85e49860ed8bab43ebf165b5c55cc4) 2025-05-07T19:46:38.8034409Z Target: x86_64-conda-linux-gnu 2025-05-07T19:46:38.8034707Z Thread model: posix 2025-05-07T19:46:38.8035034Z InstalledDir: /github/home/miniconda/envs/build_binary/bin 2025-05-07T19:46:38.8035999Z Configuration file: /github/home/miniconda/envs/build_binary/bin/x86_64-conda-linux-gnu-clang.cfg 2025-05-07T19:46:38.8036609Z 2025-05-07T19:46:38.8594797Z [INSTALL] Resetting compiler symlinks to clang ... 2025-05-07T19:46:42.1315810Z + ln -sf /github/home/miniconda/envs/build_binary/bin/clang /github/home/miniconda/envs/build_binary/bin/cc 2025-05-07T19:46:42.1317833Z 2025-05-07T19:46:42.1338610Z 2025-05-07T19:46:42.1355508Z + ln -sf /github/home/miniconda/envs/build_binary/bin/clang /github/home/miniconda/envs/build_binary/bin/gcc 2025-05-07T19:46:42.1357036Z 2025-05-07T19:46:42.1367666Z 2025-05-07T19:46:42.1394629Z + ln -sf /github/home/miniconda/envs/build_binary/bin/clang++ /github/home/miniconda/envs/build_binary/bin/c++ 2025-05-07T19:46:42.1395168Z 2025-05-07T19:46:42.1412211Z 2025-05-07T19:46:42.1433948Z + ln -sf /github/home/miniconda/envs/build_binary/bin/clang++ /github/home/miniconda/envs/build_binary/bin/g++ 2025-05-07T19:46:42.1434502Z 2025-05-07T19:46:42.1449888Z 2025-05-07T19:46:42.1450447Z + ls -la /github/home/miniconda/envs/build_binary/etc/conda/activate.d 2025-05-07T19:46:42.1450839Z 2025-05-07T19:46:42.1468076Z total 20 2025-05-07T19:46:42.1468452Z drwxr-xr-x. 2 root root 154 May 7 19:46 . 2025-05-07T19:46:42.1468887Z drwxr-xr-x. 5 root root 62 May 7 19:44 .. 2025-05-07T19:46:42.1469448Z -rw-r--r--. 2 root root 3778 Jun 10 2024 activate-binutils_linux-64.sh 2025-05-07T19:46:42.1470069Z -rw-r--r--. 2 root root 136 Mar 27 01:27 libglib_activate.sh 2025-05-07T19:46:42.1470515Z -rw-r--r--. 2 root root 873 Jun 5 2024 libxml2_activate.sh 2025-05-07T19:46:42.1470937Z -rw-r--r--. 2 root root 499 Nov 30 04:26 openjdk_activate.sh 2025-05-07T19:46:42.1471776Z -rw-r--r--. 2 root root 2932 Nov 20 20:32 ~cuda-nvcc_activate.sh 2025-05-07T19:46:42.1472087Z 2025-05-07T19:46:42.1472334Z [INSTALL] Removing the -ccbin=CXX hook from NVCC activation scripts ... 2025-05-07T19:46:42.1473038Z + sed -i /-ccbin=/d /github/home/miniconda/envs/build_binary/etc/conda/activate.d/*cuda-nvcc_activate.sh 2025-05-07T19:46:42.1473518Z 2025-05-07T19:46:42.1493601Z 2025-05-07T19:46:42.1493781Z + conda run -n build_binary c++ --version | grep -i clang 2025-05-07T19:46:42.1494060Z 2025-05-07T19:46:43.8372512Z clang version 16.0.6 (https://github.com/conda-forge/clangdev-feedstock db6970f6bb85e49860ed8bab43ebf165b5c55cc4) 2025-05-07T19:46:43.8373488Z Configuration file: /github/home/miniconda/envs/build_binary/bin/x86_64-conda-linux-gnu-clang++.cfg 2025-05-07T19:46:43.8375972Z 2025-05-07T19:46:43.8376214Z [BUILD] Setting Clang as the NVCC host compiler: 2025-05-07T19:46:45.5359363Z [BUILD] Setting prepend flags for NVCC ... 2025-05-07T19:46:45.5362133Z + conda env config vars set -n build_binary NVCC_PREPEND_FLAGS="-allow-unsupported-compiler -Xcompiler -stdlib=libstdc++ -ccbin /github/home/miniconda/envs/build_binary/bin/c++" 2025-05-07T19:46:45.5364373Z 2025-05-07T19:46:45.9511833Z 2025-05-07T19:46:45.9512549Z + conda run -n build_binary printenv NVCC_PREPEND_FLAGS 2025-05-07T19:46:45.9513457Z 2025-05-07T19:46:47.5773627Z -allow-unsupported-compiler -Xcompiler -stdlib=libstdc++ -ccbin /github/home/miniconda/envs/build_binary/bin/c++ 2025-05-07T19:46:47.5774250Z 2025-05-07T19:46:47.6547822Z 2025-05-07T19:46:47.6548658Z [INFO] Printing out all preprocessor defines in nvcc ... 2025-05-07T19:46:47.6549897Z + conda run -n build_binary nvcc --compiler-options -dM -E -x cu - < /dev/null 2025-05-07T19:46:47.6550247Z 2025-05-07T19:46:49.3088030Z #define ADJ_ESTERROR 0x0008 2025-05-07T19:46:49.3088937Z #define ADJ_FREQUENCY 0x0002 2025-05-07T19:46:49.3089742Z #define ADJ_MAXERROR 0x0004 2025-05-07T19:46:49.3090484Z #define ADJ_MICRO 0x1000 2025-05-07T19:46:49.3091216Z #define ADJ_NANO 0x2000 2025-05-07T19:46:49.3091905Z #define ADJ_OFFSET 0x0001 2025-05-07T19:46:49.3092716Z #define ADJ_OFFSET_SINGLESHOT 0x8001 2025-05-07T19:46:49.3093645Z #define ADJ_OFFSET_SS_READ 0xa001 2025-05-07T19:46:49.3093935Z #define ADJ_STATUS 0x0010 2025-05-07T19:46:49.3094207Z #define ADJ_TAI 0x0080 2025-05-07T19:46:49.3094822Z #define ADJ_TICK 0x4000 2025-05-07T19:46:49.3095100Z #define ADJ_TIMECONST 0x0020 2025-05-07T19:46:49.3095409Z #define AIO_PRIO_DELTA_MAX 20 2025-05-07T19:46:49.3095702Z #define BC_BASE_MAX _POSIX2_BC_BASE_MAX 2025-05-07T19:46:49.3096044Z #define BC_DIM_MAX _POSIX2_BC_DIM_MAX 2025-05-07T19:46:49.3096526Z #define BC_SCALE_MAX _POSIX2_BC_SCALE_MAX 2025-05-07T19:46:49.3096879Z #define BC_STRING_MAX _POSIX2_BC_STRING_MAX 2025-05-07T19:46:49.3097208Z #define BIG_ENDIAN __BIG_ENDIAN 2025-05-07T19:46:49.3097509Z #define BUFSIZ _IO_BUFSIZ 2025-05-07T19:46:49.3097773Z #define BYTE_ORDER __BYTE_ORDER 2025-05-07T19:46:49.3098082Z #define CHARCLASS_NAME_MAX 2048 2025-05-07T19:46:49.3098383Z #define CHAR_BIT __CHAR_BIT__ 2025-05-07T19:46:49.3098665Z #define CHAR_MAX __SCHAR_MAX__ 2025-05-07T19:46:49.3098959Z #define CHAR_MIN SCHAR_MIN 2025-05-07T19:46:49.3099230Z #define CLOCKS_PER_SEC 1000000l 2025-05-07T19:46:49.3099526Z #define CLOCK_BOOTTIME 7 2025-05-07T19:46:49.3099793Z #define CLOCK_BOOTTIME_ALARM 9 2025-05-07T19:46:49.3100093Z #define CLOCK_MONOTONIC 1 2025-05-07T19:46:49.3100358Z #define CLOCK_MONOTONIC_COARSE 6 2025-05-07T19:46:49.3100670Z #define CLOCK_MONOTONIC_RAW 4 2025-05-07T19:46:49.3100962Z #define CLOCK_PROCESS_CPUTIME_ID 2 2025-05-07T19:46:49.3101281Z #define CLOCK_REALTIME 0 2025-05-07T19:46:49.3101591Z #define CLOCK_REALTIME_ALARM 8 2025-05-07T19:46:49.3101885Z #define CLOCK_REALTIME_COARSE 5 2025-05-07T19:46:49.3102200Z #define CLOCK_TAI 11 2025-05-07T19:46:49.3102461Z #define CLOCK_THREAD_CPUTIME_ID 3 2025-05-07T19:46:49.3102779Z #define COLL_WEIGHTS_MAX 255 2025-05-07T19:46:49.3103047Z #define CUDARTAPI 2025-05-07T19:46:49.3103318Z #define CUDARTAPI_CDECL 2025-05-07T19:46:49.3103570Z #define CUDART_CB 2025-05-07T19:46:49.3103835Z #define CUDART_DEVICE __device__ 2025-05-07T19:46:49.3104122Z #define CUDART_VERSION 12060 2025-05-07T19:46:49.3104430Z #define CUDA_DOUBLE_MATH_FUNCTIONS 1 2025-05-07T19:46:49.3104762Z #define CUDA_IPC_HANDLE_SIZE 64 2025-05-07T19:46:49.3105062Z #define CU_UUID_HAS_BEEN_DEFINED 2025-05-07T19:46:49.3105409Z #define DELAYTIMER_MAX 2147483647 2025-05-07T19:46:49.3105704Z #define DOMAIN 1 2025-05-07T19:46:49.3105949Z #define EOF (-1) 2025-05-07T19:46:49.3106177Z #define EXIT_FAILURE 1 2025-05-07T19:46:49.3106450Z #define EXIT_SUCCESS 0 2025-05-07T19:46:49.3106718Z #define EXPR_NEST_MAX _POSIX2_EXPR_NEST_MAX 2025-05-07T19:46:49.3107105Z #define FD_CLR(fd,fdsetp) __FD_CLR (fd, fdsetp) 2025-05-07T19:46:49.3107499Z #define FD_ISSET(fd,fdsetp) __FD_ISSET (fd, fdsetp) 2025-05-07T19:46:49.3107929Z #define FD_SET(fd,fdsetp) __FD_SET (fd, fdsetp) 2025-05-07T19:46:49.3108318Z #define FD_SETSIZE __FD_SETSIZE 2025-05-07T19:46:49.3108640Z #define FD_ZERO(fdsetp) __FD_ZERO (fdsetp) 2025-05-07T19:46:49.3108989Z #define FILENAME_MAX 4096 2025-05-07T19:46:49.3109248Z #define FOPEN_MAX 16 2025-05-07T19:46:49.3109529Z #define FP_ILOGB0 (-2147483647 - 1) 2025-05-07T19:46:49.3109843Z #define FP_ILOGBNAN (-2147483647 - 1) 2025-05-07T19:46:49.3110160Z #define FP_INFINITE 1 2025-05-07T19:46:49.3110420Z #define FP_NAN 0 2025-05-07T19:46:49.3110693Z #define FP_NORMAL 4 2025-05-07T19:46:49.3110946Z #define FP_SUBNORMAL 3 2025-05-07T19:46:49.3111236Z #define FP_ZERO 2 2025-05-07T19:46:49.3111674Z #define HOST_NAME_MAX 64 2025-05-07T19:46:49.3111935Z #define HUGE 3.40282347e+38F 2025-05-07T19:46:49.3112255Z #define HUGE_VAL (__builtin_huge_val()) 2025-05-07T19:46:49.3112587Z #define HUGE_VALF (__builtin_huge_valf()) 2025-05-07T19:46:49.3112935Z #define HUGE_VALL (__builtin_huge_vall()) 2025-05-07T19:46:49.3113271Z #define INFINITY (__builtin_inff()) 2025-05-07T19:46:49.3113590Z #define INT_MAX __INT_MAX__ 2025-05-07T19:46:49.3113878Z #define INT_MIN (-__INT_MAX__ -1) 2025-05-07T19:46:49.3114181Z #define IOV_MAX 1024 2025-05-07T19:46:49.3114443Z #define LINE_MAX _POSIX2_LINE_MAX 2025-05-07T19:46:49.3114774Z #define LITTLE_ENDIAN __LITTLE_ENDIAN 2025-05-07T19:46:49.3115133Z #define LLONG_MAX __LONG_LONG_MAX__ 2025-05-07T19:46:49.3115471Z #define LLONG_MIN (-__LONG_LONG_MAX__-1LL) 2025-05-07T19:46:49.3115919Z #define LOGIN_NAME_MAX 256 2025-05-07T19:46:49.3116232Z #define LONG_BIT 64 2025-05-07T19:46:49.3116496Z #define LONG_LONG_MAX __LONG_LONG_MAX__ 2025-05-07T19:46:49.3116863Z #define LONG_LONG_MIN (-__LONG_LONG_MAX__-1LL) 2025-05-07T19:46:49.3117209Z #define LONG_MAX __LONG_MAX__ 2025-05-07T19:46:49.3117613Z #define LONG_MIN (-__LONG_MAX__ -1L) 2025-05-07T19:46:49.3117954Z #define L_ctermid 9 2025-05-07T19:46:49.3118194Z #define L_cuserid 9 2025-05-07T19:46:49.3118451Z #define L_tmpnam 20 2025-05-07T19:46:49.3118690Z #define MATH_ERREXCEPT 2 2025-05-07T19:46:49.3118977Z #define MATH_ERRNO 1 2025-05-07T19:46:49.3119228Z #define MAX_CANON 255 2025-05-07T19:46:49.3119499Z #define MAX_INPUT 255 2025-05-07T19:46:49.3119781Z #define MB_CUR_MAX (__ctype_get_mb_cur_max ()) 2025-05-07T19:46:49.3120130Z #define MB_LEN_MAX 16 2025-05-07T19:46:49.3120395Z #define MOD_CLKA ADJ_OFFSET_SINGLESHOT 2025-05-07T19:46:49.3120716Z #define MOD_CLKB ADJ_TICK 2025-05-07T19:46:49.3121014Z #define MOD_ESTERROR ADJ_ESTERROR 2025-05-07T19:46:49.3121319Z #define MOD_FREQUENCY ADJ_FREQUENCY 2025-05-07T19:46:49.3121640Z #define MOD_MAXERROR ADJ_MAXERROR 2025-05-07T19:46:49.3121931Z #define MOD_MICRO ADJ_MICRO 2025-05-07T19:46:49.3122622Z #define MOD_NANO ADJ_NANO 2025-05-07T19:46:49.3123006Z #define MOD_OFFSET ADJ_OFFSET 2025-05-07T19:46:49.3123296Z #define MOD_STATUS ADJ_STATUS 2025-05-07T19:46:49.3123560Z #define MOD_TAI ADJ_TAI 2025-05-07T19:46:49.3123830Z #define MOD_TIMECONST ADJ_TIMECONST 2025-05-07T19:46:49.3124114Z #define MQ_PRIO_MAX 32768 2025-05-07T19:46:49.3124403Z #define M_1_PI 0.31830988618379067154 2025-05-07T19:46:49.3124769Z #define M_1_PIl 0.318309886183790671537767526745028724L 2025-05-07T19:46:49.3125123Z #define M_2_PI 0.63661977236758134308 2025-05-07T19:46:49.3125484Z #define M_2_PIl 0.636619772367581343075535053490057448L 2025-05-07T19:46:49.3125845Z #define M_2_SQRTPI 1.12837916709551257390 2025-05-07T19:46:49.3126235Z #define M_2_SQRTPIl 1.128379167095512573896158903121545172L 2025-05-07T19:46:49.3126618Z #define M_E 2.7182818284590452354 2025-05-07T19:46:49.3126976Z #define M_El 2.718281828459045235360287471352662498L 2025-05-07T19:46:49.3127305Z #define M_LN10 2.30258509299404568402 2025-05-07T19:46:49.3127642Z #define M_LN10l 2.302585092994045684017991454684364208L 2025-05-07T19:46:49.3127990Z #define M_LN2 0.69314718055994530942 2025-05-07T19:46:49.3128309Z #define M_LN2l 0.693147180559945309417232121458176568L 2025-05-07T19:46:49.3128656Z #define M_LOG10E 0.43429448190325182765 2025-05-07T19:46:49.3128991Z #define M_LOG10El 0.434294481903251827651128918916605082L 2025-05-07T19:46:49.3129355Z #define M_LOG2E 1.4426950408889634074 2025-05-07T19:46:49.3129682Z #define M_LOG2El 1.442695040888963407359924681001892137L 2025-05-07T19:46:49.3130041Z #define M_PI 3.14159265358979323846 2025-05-07T19:46:49.3130331Z #define M_PI_2 1.57079632679489661923 2025-05-07T19:46:49.3130671Z #define M_PI_2l 1.570796326794896619231321691639751442L 2025-05-07T19:46:49.3131020Z #define M_PI_4 0.78539816339744830962 2025-05-07T19:46:49.3131352Z #define M_PI_4l 0.785398163397448309615660845819875721L 2025-05-07T19:46:49.3131739Z #define M_PIl 3.141592653589793238462643383279502884L 2025-05-07T19:46:49.3132088Z #define M_SQRT1_2 0.70710678118654752440 2025-05-07T19:46:49.3132447Z #define M_SQRT1_2l 0.707106781186547524400844362104849039L 2025-05-07T19:46:49.3132798Z #define M_SQRT2 1.41421356237309504880 2025-05-07T19:46:49.3133143Z #define M_SQRT2l 1.414213562373095048801688724209698079L 2025-05-07T19:46:49.3133472Z #define NAME_MAX 255 2025-05-07T19:46:49.3133737Z #define NAN (__builtin_nanf ("")) 2025-05-07T19:46:49.3134035Z #define NFDBITS __NFDBITS 2025-05-07T19:46:49.3134293Z #define NGROUPS_MAX 65536 2025-05-07T19:46:49.3134572Z #define NL_ARGMAX _POSIX_ARG_MAX 2025-05-07T19:46:49.3134866Z #define NL_LANGMAX _POSIX2_LINE_MAX 2025-05-07T19:46:49.3135177Z #define NL_MSGMAX INT_MAX 2025-05-07T19:46:49.3135426Z #define NL_NMAX INT_MAX 2025-05-07T19:46:49.3135690Z #define NL_SETMAX INT_MAX 2025-05-07T19:46:49.3136059Z #define NL_TEXTMAX INT_MAX 2025-05-07T19:46:49.3136384Z #define NULL __null 2025-05-07T19:46:49.3136610Z #define NZERO 20 2025-05-07T19:46:49.3136856Z #define OVERFLOW 3 2025-05-07T19:46:49.3137102Z #define PATH_MAX 4096 2025-05-07T19:46:49.3137371Z #define PDP_ENDIAN __PDP_ENDIAN 2025-05-07T19:46:49.3137867Z #define PIPE_BUF 4096 2025-05-07T19:46:49.3138252Z #define PLOSS 6 2025-05-07T19:46:49.3138649Z #define PTHREAD_DESTRUCTOR_ITERATIONS _POSIX_THREAD_DESTRUCTOR_ITERATIONS 2025-05-07T19:46:49.3139104Z #define PTHREAD_KEYS_MAX 1024 2025-05-07T19:46:49.3139404Z #define PTHREAD_STACK_MIN 16384 2025-05-07T19:46:49.3139689Z #define P_tmpdir "/tmp" 2025-05-07T19:46:49.3139959Z #define RAND_MAX 2147483647 2025-05-07T19:46:49.3140224Z #define RE_DUP_MAX (0x7fff) 2025-05-07T19:46:49.3140499Z #define RTSIG_MAX 32 2025-05-07T19:46:49.3140765Z #define SCHAR_MAX __SCHAR_MAX__ 2025-05-07T19:46:49.3141051Z #define SCHAR_MIN (-__SCHAR_MAX__-1) 2025-05-07T19:46:49.3141362Z #define SEEK_CUR 1 2025-05-07T19:46:49.3141597Z #define SEEK_DATA 3 2025-05-07T19:46:49.3141854Z #define SEEK_END 2 2025-05-07T19:46:49.3142085Z #define SEEK_HOLE 4 2025-05-07T19:46:49.3142333Z #define SEEK_SET 0 2025-05-07T19:46:49.3142576Z #define SEM_VALUE_MAX (2147483647) 2025-05-07T19:46:49.3142888Z #define SHRT_MAX __SHRT_MAX__ 2025-05-07T19:46:49.3143170Z #define SHRT_MIN (-__SHRT_MAX__ -1) 2025-05-07T19:46:49.3143472Z #define SING 2 2025-05-07T19:46:49.3143702Z #define SSIZE_MAX LONG_MAX 2025-05-07T19:46:49.3143989Z #define STA_CLK 0x8000 2025-05-07T19:46:49.3144263Z #define STA_CLOCKERR 0x1000 2025-05-07T19:46:49.3144532Z #define STA_DEL 0x0020 2025-05-07T19:46:49.3144794Z #define STA_FLL 0x0008 2025-05-07T19:46:49.3145044Z #define STA_FREQHOLD 0x0080 2025-05-07T19:46:49.3145323Z #define STA_INS 0x0010 2025-05-07T19:46:49.3145571Z #define STA_MODE 0x4000 2025-05-07T19:46:49.3145838Z #define STA_NANO 0x2000 2025-05-07T19:46:49.3146088Z #define STA_PLL 0x0001 2025-05-07T19:46:49.3146357Z #define STA_PPSERROR 0x0800 2025-05-07T19:46:49.3146632Z #define STA_PPSFREQ 0x0002 2025-05-07T19:46:49.3146930Z #define STA_PPSJITTER 0x0200 2025-05-07T19:46:49.3147236Z #define STA_PPSSIGNAL 0x0100 2025-05-07T19:46:49.3147520Z #define STA_PPSTIME 0x0004 2025-05-07T19:46:49.3147811Z #define STA_PPSWANDER 0x0400 2025-05-07T19:46:49.3148516Z #define STA_RONLY (STA_PPSSIGNAL | STA_PPSJITTER | STA_PPSWANDER | STA_PPSERROR | STA_CLOCKERR | STA_NANO | STA_MODE | STA_CLK) 2025-05-07T19:46:49.3149138Z #define STA_UNSYNC 0x0040 2025-05-07T19:46:49.3149398Z #define TIMER_ABSTIME 1 2025-05-07T19:46:49.3149661Z #define TIME_UTC 1 2025-05-07T19:46:49.3149886Z #define TLOSS 5 2025-05-07T19:46:49.3150129Z #define TMP_MAX 238328 2025-05-07T19:46:49.3150384Z #define TTY_NAME_MAX 32 2025-05-07T19:46:49.3150663Z #define UCHAR_MAX (__SCHAR_MAX__*2 +1) 2025-05-07T19:46:49.3150985Z #define UINT_MAX (__INT_MAX__ *2U +1U) 2025-05-07T19:46:49.3151312Z #define ULLONG_MAX (__LONG_LONG_MAX__*2ULL+1ULL) 2025-05-07T19:46:49.3152003Z #define ULONG_LONG_MAX (__LONG_LONG_MAX__*2ULL+1ULL) 2025-05-07T19:46:49.3152375Z #define ULONG_MAX (__LONG_MAX__ *2UL+1UL) 2025-05-07T19:46:49.3152705Z #define UNDERFLOW 4 2025-05-07T19:46:49.3152953Z #define USHRT_MAX (__SHRT_MAX__ *2 +1) 2025-05-07T19:46:49.3153263Z #define WCONTINUED 8 2025-05-07T19:46:49.3153502Z #define WEXITED 4 2025-05-07T19:46:49.3153848Z #define WEXITSTATUS(status) __WEXITSTATUS (__WAIT_INT (status)) 2025-05-07T19:46:49.3154349Z #define WIFCONTINUED(status) __WIFCONTINUED (__WAIT_INT (status)) 2025-05-07T19:46:49.3154844Z #define WIFEXITED(status) __WIFEXITED (__WAIT_INT (status)) 2025-05-07T19:46:49.3155325Z #define WIFSIGNALED(status) __WIFSIGNALED (__WAIT_INT (status)) 2025-05-07T19:46:49.3155804Z #define WIFSTOPPED(status) __WIFSTOPPED (__WAIT_INT (status)) 2025-05-07T19:46:49.3156199Z #define WNOHANG 1 2025-05-07T19:46:49.3156439Z #define WNOWAIT 0x01000000 2025-05-07T19:46:49.3156712Z #define WORD_BIT 32 2025-05-07T19:46:49.3156944Z #define WSTOPPED 2 2025-05-07T19:46:49.3157270Z #define WSTOPSIG(status) __WSTOPSIG (__WAIT_INT (status)) 2025-05-07T19:46:49.3157821Z #define WTERMSIG(status) __WTERMSIG (__WAIT_INT (status)) 2025-05-07T19:46:49.3158205Z #define WUNTRACED 2 2025-05-07T19:46:49.3158470Z #define XATTR_LIST_MAX 65536 2025-05-07T19:46:49.3158748Z #define XATTR_NAME_MAX 255 2025-05-07T19:46:49.3159035Z #define XATTR_SIZE_MAX 65536 2025-05-07T19:46:49.3159423Z #define X_TLOSS 1.41484755040568800000e+16 2025-05-07T19:46:49.3159748Z #define _ACRTIMP 2025-05-07T19:46:49.3159974Z #define _ALLOCA_H 1 2025-05-07T19:46:49.3160223Z #define _ASSERT_H 1 2025-05-07T19:46:49.3160465Z #define _ATFILE_SOURCE 1 2025-05-07T19:46:49.3160741Z #define _BITS_BYTESWAP_H 1 2025-05-07T19:46:49.3161009Z #define _BITS_POSIX1_LIM_H 1 2025-05-07T19:46:49.3161300Z #define _BITS_POSIX2_LIM_H 1 2025-05-07T19:46:49.3161600Z #define _BITS_PTHREADTYPES_H 1 2025-05-07T19:46:49.3161881Z #define _BITS_TIMEX_H 1 2025-05-07T19:46:49.3162147Z #define _BITS_TIME_H 1 2025-05-07T19:46:49.3162397Z #define _BITS_TYPESIZES_H 1 2025-05-07T19:46:49.3162679Z #define _BITS_TYPES_H 1 2025-05-07T19:46:49.3162927Z #define _BSD_SOURCE 1 2025-05-07T19:46:49.3163189Z #define _CONCEPT_CHECK_H 1 2025-05-07T19:46:49.3163458Z #define _CPP_TYPE_TRAITS_H 1 2025-05-07T19:46:49.3163747Z #define _CRTIMP 2025-05-07T19:46:49.3163973Z #define _CTYPE_H 1 2025-05-07T19:46:49.3164215Z #define _ENDIAN_H 1 2025-05-07T19:46:49.3164466Z #define _EXCEPTION_DEFINES_H 1 2025-05-07T19:46:49.3165013Z #define _EXT_NUMERIC_TRAITS 1 2025-05-07T19:46:49.3165311Z #define _EXT_TYPE_TRAITS 1 2025-05-07T19:46:49.3165574Z #define _FEATURES_H 1 2025-05-07T19:46:49.3165843Z #define _FUNCTEXCEPT_H 1 2025-05-07T19:46:49.3166101Z #define _GCC_LIMITS_H_ 2025-05-07T19:46:49.3166416Z #define _GLIBCXX11_DEPRECATED _GLIBCXX_DEPRECATED 2025-05-07T19:46:49.3166909Z #define _GLIBCXX11_DEPRECATED_SUGGEST(ALT) _GLIBCXX_DEPRECATED_SUGGEST(ALT) 2025-05-07T19:46:49.3167390Z #define _GLIBCXX11_USE_C99_COMPLEX 1 2025-05-07T19:46:49.3167695Z #define _GLIBCXX11_USE_C99_MATH 1 2025-05-07T19:46:49.3168007Z #define _GLIBCXX11_USE_C99_STDIO 1 2025-05-07T19:46:49.3168333Z #define _GLIBCXX11_USE_C99_STDLIB 1 2025-05-07T19:46:49.3168629Z #define _GLIBCXX11_USE_C99_WCHAR 1 2025-05-07T19:46:49.3168949Z #define _GLIBCXX14_CONSTEXPR constexpr 2025-05-07T19:46:49.3169268Z #define _GLIBCXX17_CONSTEXPR constexpr 2025-05-07T19:46:49.3169625Z #define _GLIBCXX17_DEPRECATED [[__deprecated__]] 2025-05-07T19:46:49.3170106Z #define _GLIBCXX17_DEPRECATED_SUGGEST(ALT) _GLIBCXX_DEPRECATED_SUGGEST(ALT) 2025-05-07T19:46:49.3170574Z #define _GLIBCXX17_INLINE inline 2025-05-07T19:46:49.3170865Z #define _GLIBCXX20_CONSTEXPR 2025-05-07T19:46:49.3171175Z #define _GLIBCXX20_DEPRECATED(MSG) 2025-05-07T19:46:49.3171496Z #define _GLIBCXX20_DEPRECATED_SUGGEST(ALT) 2025-05-07T19:46:49.3171838Z #define _GLIBCXX98_USE_C99_COMPLEX 1 2025-05-07T19:46:49.3172155Z #define _GLIBCXX98_USE_C99_MATH 1 2025-05-07T19:46:49.3172444Z #define _GLIBCXX98_USE_C99_STDIO 1 2025-05-07T19:46:49.3172756Z #define _GLIBCXX98_USE_C99_STDLIB 1 2025-05-07T19:46:49.3173053Z #define _GLIBCXX98_USE_C99_WCHAR 1 2025-05-07T19:46:49.3173456Z #define _GLIBCXX_ABI_TAG_CXX11 __attribute ((__abi_tag__ ("cxx11"))) 2025-05-07T19:46:49.3173871Z #define _GLIBCXX_ATOMIC_BUILTINS 1 2025-05-07T19:46:49.3174213Z #define _GLIBCXX_BEGIN_EXTERN_C extern "C" { 2025-05-07T19:46:49.3174552Z #define _GLIBCXX_BEGIN_NAMESPACE_ALGO 2025-05-07T19:46:49.3174907Z #define _GLIBCXX_BEGIN_NAMESPACE_CONTAINER 2025-05-07T19:46:49.3175322Z #define _GLIBCXX_BEGIN_NAMESPACE_CXX11 namespace __cxx11 { 2025-05-07T19:46:49.3175710Z #define _GLIBCXX_BEGIN_NAMESPACE_LDBL 2025-05-07T19:46:49.3176173Z #define _GLIBCXX_BEGIN_NAMESPACE_LDBL_OR_CXX11 _GLIBCXX_BEGIN_NAMESPACE_CXX11 2025-05-07T19:46:49.3176763Z #define _GLIBCXX_BEGIN_NAMESPACE_VERSION 2025-05-07T19:46:49.3177098Z #define _GLIBCXX_BITS_SPECFUN_H 1 2025-05-07T19:46:49.3177403Z #define _GLIBCXX_BITS_STD_ABS_H 2025-05-07T19:46:49.3177697Z #define _GLIBCXX_CMATH 1 2025-05-07T19:46:49.3177990Z #define _GLIBCXX_CONST __attribute__ ((__const__)) 2025-05-07T19:46:49.3178353Z #define _GLIBCXX_CONSTEXPR constexpr 2025-05-07T19:46:49.3178822Z #define _GLIBCXX_CPU_DEFINES 1 2025-05-07T19:46:49.3179108Z #define _GLIBCXX_CSTDLIB 1 2025-05-07T19:46:49.3179392Z #define _GLIBCXX_CXX_CONFIG_H 1 2025-05-07T19:46:49.3179687Z #define _GLIBCXX_DARWIN_USE_64_BIT_INODE 1 2025-05-07T19:46:49.3180040Z #define _GLIBCXX_DEBUG_ASSERT(_Condition) 2025-05-07T19:46:49.3180465Z #define _GLIBCXX_DEBUG_ASSERTIONS_H 1 2025-05-07T19:46:49.3180800Z #define _GLIBCXX_DEBUG_MACRO_SWITCH_H 1 2025-05-07T19:46:49.3181125Z #define _GLIBCXX_DEBUG_ONLY(_Statement) 2025-05-07T19:46:49.3181487Z #define _GLIBCXX_DEBUG_PEDASSERT(_Condition) 2025-05-07T19:46:49.3181989Z #define _GLIBCXX_DEFAULT_ABI_TAG _GLIBCXX_ABI_TAG_CXX11 2025-05-07T19:46:49.3182427Z #define _GLIBCXX_DEPRECATED __attribute__ ((__deprecated__)) 2025-05-07T19:46:49.3183008Z #define _GLIBCXX_DEPRECATED_SUGGEST(ALT) __attribute__ ((__deprecated__ ("use '" ALT "' instead"))) 2025-05-07T19:46:49.3183529Z #define _GLIBCXX_DOUBLE_IS_IEEE_BINARY64 1 2025-05-07T19:46:49.3183857Z #define _GLIBCXX_END_EXTERN_C } 2025-05-07T19:46:49.3184143Z #define _GLIBCXX_END_NAMESPACE_ALGO 2025-05-07T19:46:49.3184473Z #define _GLIBCXX_END_NAMESPACE_CONTAINER 2025-05-07T19:46:49.3184789Z #define _GLIBCXX_END_NAMESPACE_CXX11 } 2025-05-07T19:46:49.3185112Z #define _GLIBCXX_END_NAMESPACE_LDBL 2025-05-07T19:46:49.3185535Z #define _GLIBCXX_END_NAMESPACE_LDBL_OR_CXX11 _GLIBCXX_END_NAMESPACE_CXX11 2025-05-07T19:46:49.3185963Z #define _GLIBCXX_END_NAMESPACE_VERSION 2025-05-07T19:46:49.3186293Z #define _GLIBCXX_EXTERN_TEMPLATE 1 2025-05-07T19:46:49.3186574Z #define _GLIBCXX_FAST_MATH 0 2025-05-07T19:46:49.3186859Z #define _GLIBCXX_FLOAT_IS_IEEE_BINARY32 1 2025-05-07T19:46:49.3187255Z #define _GLIBCXX_FORWARD(_Tp,__val) std::forward<_Tp>(__val) 2025-05-07T19:46:49.3187637Z #define _GLIBCXX_FULLY_DYNAMIC_STRING 0 2025-05-07T19:46:49.3187929Z #define _GLIBCXX_FWDREF(_Tp) _Tp&& 2025-05-07T19:46:49.3188223Z #define _GLIBCXX_HAS_GTHREADS 1 2025-05-07T19:46:49.3189101Z #define _GLIBCXX_HAS_NESTED_TYPE(_NTYPE) template> struct __has_##_NTYPE : false_type { }; template struct __has_##_NTYPE<_Tp, __void_t> : true_type { }; 2025-05-07T19:46:49.3189987Z #define _GLIBCXX_HAVE_ACOSF 1 2025-05-07T19:46:49.3190265Z #define _GLIBCXX_HAVE_ACOSL 1 2025-05-07T19:46:49.3190530Z #define _GLIBCXX_HAVE_ALIGNED_ALLOC 1 2025-05-07T19:46:49.3190841Z #define _GLIBCXX_HAVE_ARPA_INET_H 1 2025-05-07T19:46:49.3191118Z #define _GLIBCXX_HAVE_ASINF 1 2025-05-07T19:46:49.3191488Z #define _GLIBCXX_HAVE_ASINL 1 2025-05-07T19:46:49.3191971Z #define _GLIBCXX_HAVE_AS_SYMVER_DIRECTIVE 1 2025-05-07T19:46:49.3192304Z #define _GLIBCXX_HAVE_ATAN2F 1 2025-05-07T19:46:49.3192676Z #define _GLIBCXX_HAVE_ATAN2L 1 2025-05-07T19:46:49.3192955Z #define _GLIBCXX_HAVE_ATANF 1 2025-05-07T19:46:49.3193251Z #define _GLIBCXX_HAVE_ATANL 1 2025-05-07T19:46:49.3193547Z #define _GLIBCXX_HAVE_ATOMIC_LOCK_POLICY 1 2025-05-07T19:46:49.3193909Z #define _GLIBCXX_HAVE_ATTRIBUTE_VISIBILITY 1 2025-05-07T19:46:49.3194249Z #define _GLIBCXX_HAVE_AT_QUICK_EXIT 1 2025-05-07T19:46:49.3194602Z #define _GLIBCXX_HAVE_BUILTIN_HAS_UNIQ_OBJ_REP 1 2025-05-07T19:46:49.3194984Z #define _GLIBCXX_HAVE_BUILTIN_IS_AGGREGATE 1 2025-05-07T19:46:49.3195361Z #define _GLIBCXX_HAVE_BUILTIN_IS_CONSTANT_EVALUATED 1 2025-05-07T19:46:49.3195745Z #define _GLIBCXX_HAVE_BUILTIN_IS_SAME 1 2025-05-07T19:46:49.3196071Z #define _GLIBCXX_HAVE_BUILTIN_LAUNDER 1 2025-05-07T19:46:49.3196400Z #define _GLIBCXX_HAVE_CEILF 1 2025-05-07T19:46:49.3196683Z #define _GLIBCXX_HAVE_CEILL 1 2025-05-07T19:46:49.3196983Z #define _GLIBCXX_HAVE_COMPLEX_H 1 2025-05-07T19:46:49.3197279Z #define _GLIBCXX_HAVE_COSF 1 2025-05-07T19:46:49.3197570Z #define _GLIBCXX_HAVE_COSHF 1 2025-05-07T19:46:49.3197848Z #define _GLIBCXX_HAVE_COSHL 1 2025-05-07T19:46:49.3198139Z #define _GLIBCXX_HAVE_COSL 1 2025-05-07T19:46:49.3198434Z #define _GLIBCXX_HAVE_DIRENT_H 1 2025-05-07T19:46:49.3198722Z #define _GLIBCXX_HAVE_DLFCN_H 1 2025-05-07T19:46:49.3199023Z #define _GLIBCXX_HAVE_ENDIAN_H 1 2025-05-07T19:46:49.3199442Z #define _GLIBCXX_HAVE_EXCEPTION_PTR_SINCE_GCC46 1 2025-05-07T19:46:49.3199820Z #define _GLIBCXX_HAVE_EXECINFO_H 1 2025-05-07T19:46:49.3200121Z #define _GLIBCXX_HAVE_EXPF 1 2025-05-07T19:46:49.3200421Z #define _GLIBCXX_HAVE_EXPL 1 2025-05-07T19:46:49.3200698Z #define _GLIBCXX_HAVE_FABSF 1 2025-05-07T19:46:49.3201064Z #define _GLIBCXX_HAVE_FABSL 1 2025-05-07T19:46:49.3201346Z #define _GLIBCXX_HAVE_FCNTL_H 1 2025-05-07T19:46:49.3201648Z #define _GLIBCXX_HAVE_FENV_H 1 2025-05-07T19:46:49.3201947Z #define _GLIBCXX_HAVE_FINITE 1 2025-05-07T19:46:49.3202228Z #define _GLIBCXX_HAVE_FINITEF 1 2025-05-07T19:46:49.3202529Z #define _GLIBCXX_HAVE_FINITEL 1 2025-05-07T19:46:49.3202811Z #define _GLIBCXX_HAVE_FLOAT_H 1 2025-05-07T19:46:49.3203112Z #define _GLIBCXX_HAVE_FLOORF 1 2025-05-07T19:46:49.3203387Z #define _GLIBCXX_HAVE_FLOORL 1 2025-05-07T19:46:49.3203683Z #define _GLIBCXX_HAVE_FMODF 1 2025-05-07T19:46:49.3203957Z #define _GLIBCXX_HAVE_FMODL 1 2025-05-07T19:46:49.3204254Z #define _GLIBCXX_HAVE_FREXPF 1 2025-05-07T19:46:49.3204534Z #define _GLIBCXX_HAVE_FREXPL 1 2025-05-07T19:46:49.3204831Z #define _GLIBCXX_HAVE_GETIPINFO 1 2025-05-07T19:46:49.3205140Z #define _GLIBCXX_HAVE_GETS 1 2025-05-07T19:46:49.3205415Z #define _GLIBCXX_HAVE_HYPOT 1 2025-05-07T19:46:49.3205708Z #define _GLIBCXX_HAVE_HYPOTF 1 2025-05-07T19:46:49.3205995Z #define _GLIBCXX_HAVE_HYPOTL 1 2025-05-07T19:46:49.3206287Z #define _GLIBCXX_HAVE_ICONV 1 2025-05-07T19:46:49.3206562Z #define _GLIBCXX_HAVE_INT64_T 1 2025-05-07T19:46:49.3206871Z #define _GLIBCXX_HAVE_INT64_T_LONG 1 2025-05-07T19:46:49.3207177Z #define _GLIBCXX_HAVE_INTTYPES_H 1 2025-05-07T19:46:49.3207487Z #define _GLIBCXX_HAVE_ISINF 1 2025-05-07T19:46:49.3207770Z #define _GLIBCXX_HAVE_ISINFF 1 2025-05-07T19:46:49.3208069Z #define _GLIBCXX_HAVE_ISINFL 1 2025-05-07T19:46:49.3208374Z #define _GLIBCXX_HAVE_ISNAN 1 2025-05-07T19:46:49.3208655Z #define _GLIBCXX_HAVE_ISNANF 1 2025-05-07T19:46:49.3208945Z #define _GLIBCXX_HAVE_ISNANL 1 2025-05-07T19:46:49.3209345Z #define _GLIBCXX_HAVE_ISWBLANK 1 2025-05-07T19:46:49.3209763Z #define _GLIBCXX_HAVE_LC_MESSAGES 1 2025-05-07T19:46:49.3210043Z #define _GLIBCXX_HAVE_LDEXPF 1 2025-05-07T19:46:49.3210319Z #define _GLIBCXX_HAVE_LDEXPL 1 2025-05-07T19:46:49.3210584Z #define _GLIBCXX_HAVE_LIMIT_AS 1 2025-05-07T19:46:49.3210877Z #define _GLIBCXX_HAVE_LIMIT_DATA 1 2025-05-07T19:46:49.3211163Z #define _GLIBCXX_HAVE_LIMIT_FSIZE 1 2025-05-07T19:46:49.3211461Z #define _GLIBCXX_HAVE_LIMIT_RSS 1 2025-05-07T19:46:49.3211759Z #define _GLIBCXX_HAVE_LIMIT_VMEM 0 2025-05-07T19:46:49.3212052Z #define _GLIBCXX_HAVE_LINK 1 2025-05-07T19:46:49.3212354Z #define _GLIBCXX_HAVE_LINUX_FUTEX 1 2025-05-07T19:46:49.3212668Z #define _GLIBCXX_HAVE_LINUX_RANDOM_H 1 2025-05-07T19:46:49.3213021Z #define _GLIBCXX_HAVE_LINUX_TYPES_H 1 2025-05-07T19:46:49.3213336Z #define _GLIBCXX_HAVE_LOCALE_H 1 2025-05-07T19:46:49.3213661Z #define _GLIBCXX_HAVE_LOG10F 1 2025-05-07T19:46:49.3213945Z #define _GLIBCXX_HAVE_LOG10L 1 2025-05-07T19:46:49.3214262Z #define _GLIBCXX_HAVE_LOGF 1 2025-05-07T19:46:49.3214551Z #define _GLIBCXX_HAVE_LOGL 1 2025-05-07T19:46:49.3214860Z #define _GLIBCXX_HAVE_MBSTATE_T 1 2025-05-07T19:46:49.3215190Z #define _GLIBCXX_HAVE_MEMALIGN 1 2025-05-07T19:46:49.3215487Z #define _GLIBCXX_HAVE_MEMORY_H 1 2025-05-07T19:46:49.3215813Z #define _GLIBCXX_HAVE_MODF 1 2025-05-07T19:46:49.3216091Z #define _GLIBCXX_HAVE_MODFF 1 2025-05-07T19:46:49.3216385Z #define _GLIBCXX_HAVE_MODFL 1 2025-05-07T19:46:49.3216654Z #define _GLIBCXX_HAVE_NETDB_H 1 2025-05-07T19:46:49.3216946Z #define _GLIBCXX_HAVE_NETINET_IN_H 1 2025-05-07T19:46:49.3217242Z #define _GLIBCXX_HAVE_NETINET_TCP_H 1 2025-05-07T19:46:49.3217564Z #define _GLIBCXX_HAVE_OBSOLETE_ISINF 1 2025-05-07T19:46:49.3217886Z #define _GLIBCXX_HAVE_OBSOLETE_ISNAN 1 2025-05-07T19:46:49.3218178Z #define _GLIBCXX_HAVE_POLL 1 2025-05-07T19:46:49.3218468Z #define _GLIBCXX_HAVE_POLL_H 1 2025-05-07T19:46:49.3218746Z #define _GLIBCXX_HAVE_POSIX_MEMALIGN 1 2025-05-07T19:46:49.3219057Z #define _GLIBCXX_HAVE_POSIX_SEMAPHORE 1 2025-05-07T19:46:49.3219435Z #define _GLIBCXX_HAVE_POWF 1 2025-05-07T19:46:49.3219735Z #define _GLIBCXX_HAVE_POWL 1 2025-05-07T19:46:49.3220006Z #define _GLIBCXX_HAVE_QUICK_EXIT 1 2025-05-07T19:46:49.3220328Z #define _GLIBCXX_HAVE_READLINK 1 2025-05-07T19:46:49.3220613Z #define _GLIBCXX_HAVE_SETENV 1 2025-05-07T19:46:49.3220910Z #define _GLIBCXX_HAVE_SINCOS 1 2025-05-07T19:46:49.3221281Z #define _GLIBCXX_HAVE_SINCOSF 1 2025-05-07T19:46:49.3221558Z #define _GLIBCXX_HAVE_SINCOSL 1 2025-05-07T19:46:49.3221858Z #define _GLIBCXX_HAVE_SINF 1 2025-05-07T19:46:49.3222131Z #define _GLIBCXX_HAVE_SINHF 1 2025-05-07T19:46:49.3222432Z #define _GLIBCXX_HAVE_SINHL 1 2025-05-07T19:46:49.3222709Z #define _GLIBCXX_HAVE_SINL 1 2025-05-07T19:46:49.3223016Z #define _GLIBCXX_HAVE_SOCKATMARK 1 2025-05-07T19:46:49.3223314Z #define _GLIBCXX_HAVE_SQRTF 1 2025-05-07T19:46:49.3223621Z #define _GLIBCXX_HAVE_SQRTL 1 2025-05-07T19:46:49.3223907Z #define _GLIBCXX_HAVE_STDALIGN_H 1 2025-05-07T19:46:49.3224227Z #define _GLIBCXX_HAVE_STDBOOL_H 1 2025-05-07T19:46:49.3224546Z #define _GLIBCXX_HAVE_STDINT_H 1 2025-05-07T19:46:49.3224829Z #define _GLIBCXX_HAVE_STDLIB_H 1 2025-05-07T19:46:49.3225145Z #define _GLIBCXX_HAVE_STRERROR_L 1 2025-05-07T19:46:49.3225437Z #define _GLIBCXX_HAVE_STRERROR_R 1 2025-05-07T19:46:49.3225731Z #define _GLIBCXX_HAVE_STRINGS_H 1 2025-05-07T19:46:49.3226003Z #define _GLIBCXX_HAVE_STRING_H 1 2025-05-07T19:46:49.3226288Z #define _GLIBCXX_HAVE_STRTOF 1 2025-05-07T19:46:49.3226553Z #define _GLIBCXX_HAVE_STRTOLD 1 2025-05-07T19:46:49.3226855Z #define _GLIBCXX_HAVE_STRUCT_DIRENT_D_TYPE 1 2025-05-07T19:46:49.3227343Z #define _GLIBCXX_HAVE_STRXFRM_L 1 2025-05-07T19:46:49.3227649Z #define _GLIBCXX_HAVE_SYMLINK 1 2025-05-07T19:46:49.3228023Z #define _GLIBCXX_HAVE_SYMVER_SYMBOL_RENAMING_RUNTIME_SUPPORT 1 2025-05-07T19:46:49.3228417Z #define _GLIBCXX_HAVE_SYS_IOCTL_H 1 2025-05-07T19:46:49.3228732Z #define _GLIBCXX_HAVE_SYS_IPC_H 1 2025-05-07T19:46:49.3229022Z #define _GLIBCXX_HAVE_SYS_PARAM_H 1 2025-05-07T19:46:49.3229336Z #define _GLIBCXX_HAVE_SYS_RESOURCE_H 1 2025-05-07T19:46:49.3229643Z #define _GLIBCXX_HAVE_SYS_SEM_H 1 2025-05-07T19:46:49.3229956Z #define _GLIBCXX_HAVE_SYS_SOCKET_H 1 2025-05-07T19:46:49.3230413Z #define _GLIBCXX_HAVE_SYS_STATVFS_H 1 2025-05-07T19:46:49.3230729Z #define _GLIBCXX_HAVE_SYS_STAT_H 1 2025-05-07T19:46:49.3231025Z #define _GLIBCXX_HAVE_SYS_SYSINFO_H 1 2025-05-07T19:46:49.3231345Z #define _GLIBCXX_HAVE_SYS_TIME_H 1 2025-05-07T19:46:49.3231754Z #define _GLIBCXX_HAVE_SYS_TYPES_H 1 2025-05-07T19:46:49.3232235Z #define _GLIBCXX_HAVE_SYS_UIO_H 1 2025-05-07T19:46:49.3232544Z #define _GLIBCXX_HAVE_S_ISREG 1 2025-05-07T19:46:49.3232907Z #define _GLIBCXX_HAVE_TANF 1 2025-05-07T19:46:49.3233198Z #define _GLIBCXX_HAVE_TANHF 1 2025-05-07T19:46:49.3233476Z #define _GLIBCXX_HAVE_TANHL 1 2025-05-07T19:46:49.3233772Z #define _GLIBCXX_HAVE_TANL 1 2025-05-07T19:46:49.3234047Z #define _GLIBCXX_HAVE_TGMATH_H 1 2025-05-07T19:46:49.3234347Z #define _GLIBCXX_HAVE_TLS 1 2025-05-07T19:46:49.3234620Z #define _GLIBCXX_HAVE_TRUNCATE 1 2025-05-07T19:46:49.3234923Z #define _GLIBCXX_HAVE_UNISTD_H 1 2025-05-07T19:46:49.3235226Z #define _GLIBCXX_HAVE_USELOCALE 1 2025-05-07T19:46:49.3235515Z #define _GLIBCXX_HAVE_UTIME_H 1 2025-05-07T19:46:49.3235808Z #define _GLIBCXX_HAVE_VFWSCANF 1 2025-05-07T19:46:49.3236092Z #define _GLIBCXX_HAVE_VSWSCANF 1 2025-05-07T19:46:49.3236389Z #define _GLIBCXX_HAVE_VWSCANF 1 2025-05-07T19:46:49.3236671Z #define _GLIBCXX_HAVE_WCHAR_H 1 2025-05-07T19:46:49.3236969Z #define _GLIBCXX_HAVE_WCSTOF 1 2025-05-07T19:46:49.3237250Z #define _GLIBCXX_HAVE_WCTYPE_H 1 2025-05-07T19:46:49.3237549Z #define _GLIBCXX_HAVE_WRITEV 1 2025-05-07T19:46:49.3237835Z #define _GLIBCXX_HAVE_XLOCALE_H 1 2025-05-07T19:46:49.3238138Z #define _GLIBCXX_HOSTED 1 2025-05-07T19:46:49.3238418Z #define _GLIBCXX_ICONV_CONST 2025-05-07T19:46:49.3238698Z #define _GLIBCXX_INLINE_VERSION 0 2025-05-07T19:46:49.3239017Z #define _GLIBCXX_LT_OBJDIR ".libs/" 2025-05-07T19:46:49.3239547Z #define _GLIBCXX_MAKE_MOVE_IF_NOEXCEPT_ITERATOR(_Iter) std::__make_move_if_noexcept_iterator(_Iter) 2025-05-07T19:46:49.3240339Z #define _GLIBCXX_MAKE_MOVE_ITERATOR(_Iter) std::make_move_iterator(_Iter) 2025-05-07T19:46:49.3240781Z #define _GLIBCXX_MANGLE_SIZE_T m 2025-05-07T19:46:49.3241084Z #define _GLIBCXX_MATH_H 1 2025-05-07T19:46:49.3241371Z #define _GLIBCXX_MOVE(__val) std::move(__val) 2025-05-07T19:46:49.3241792Z #define _GLIBCXX_MOVE3(_Tp,_Up,_Vp) std::move(_Tp, _Up, _Vp) 2025-05-07T19:46:49.3242404Z #define _GLIBCXX_MOVE_BACKWARD3(_Tp,_Up,_Vp) std::move_backward(_Tp, _Up, _Vp) 2025-05-07T19:46:49.3242875Z #define _GLIBCXX_NAMESPACE_CXX11 __cxx11:: 2025-05-07T19:46:49.3243222Z #define _GLIBCXX_NAMESPACE_LDBL 2025-05-07T19:46:49.3243613Z #define _GLIBCXX_NAMESPACE_LDBL_OR_CXX11 _GLIBCXX_NAMESPACE_CXX11 2025-05-07T19:46:49.3244338Z #define _GLIBCXX_NATIVE_THREAD_ID (__gthread_active_p() ? __gthread_self() : (__gthread_t)1) 2025-05-07T19:46:49.3244819Z #define _GLIBCXX_NODISCARD [[__nodiscard__]] 2025-05-07T19:46:49.3245153Z #define _GLIBCXX_NOEXCEPT noexcept 2025-05-07T19:46:49.3245509Z #define _GLIBCXX_NOEXCEPT_IF(...) noexcept(__VA_ARGS__) 2025-05-07T19:46:49.3245863Z #define _GLIBCXX_NOEXCEPT_PARM , bool _NE 2025-05-07T19:46:49.3246204Z #define _GLIBCXX_NOEXCEPT_QUAL noexcept (_NE) 2025-05-07T19:46:49.3246574Z #define _GLIBCXX_NORETURN __attribute__ ((__noreturn__)) 2025-05-07T19:46:49.3246959Z #define _GLIBCXX_NOTHROW _GLIBCXX_USE_NOEXCEPT 2025-05-07T19:46:49.3247380Z #define _GLIBCXX_NO_OBSOLETE_ISINF_ISNAN_DYNAMIC __GLIBC_PREREQ(2,23) 2025-05-07T19:46:49.3247802Z #define _GLIBCXX_NUMERIC_LIMITS 1 2025-05-07T19:46:49.3248084Z #define _GLIBCXX_OS_DEFINES 1 2025-05-07T19:46:49.3248368Z #define _GLIBCXX_PACKAGE_BUGREPORT "" 2025-05-07T19:46:49.3248709Z #define _GLIBCXX_PACKAGE_NAME "package-unused" 2025-05-07T19:46:49.3249114Z #define _GLIBCXX_PACKAGE_STRING "package-unused version-unused" 2025-05-07T19:46:49.3249535Z #define _GLIBCXX_PACKAGE_TARNAME "libstdc++" 2025-05-07T19:46:49.3249850Z #define _GLIBCXX_PACKAGE_URL "" 2025-05-07T19:46:49.3250202Z #define _GLIBCXX_PACKAGE__GLIBCXX_VERSION "version-unused" 2025-05-07T19:46:49.3250573Z #define _GLIBCXX_PREDEFINED_OPS_H 1 2025-05-07T19:46:49.3250887Z #define _GLIBCXX_PSEUDO_VISIBILITY(V) 2025-05-07T19:46:49.3251210Z #define _GLIBCXX_PURE __attribute__ ((__pure__)) 2025-05-07T19:46:49.3251546Z #define _GLIBCXX_RELEASE 11 2025-05-07T19:46:49.3251827Z #define _GLIBCXX_RES_LIMITS 1 2025-05-07T19:46:49.3252096Z #define _GLIBCXX_STDC_HEADERS 1 2025-05-07T19:46:49.3252390Z #define _GLIBCXX_STDIO_EOF -1 2025-05-07T19:46:49.3252656Z #define _GLIBCXX_STDIO_SEEK_CUR 1 2025-05-07T19:46:49.3252948Z #define _GLIBCXX_STDIO_SEEK_END 2 2025-05-07T19:46:49.3253217Z #define _GLIBCXX_STDLIB_H 1 2025-05-07T19:46:49.3253478Z #define _GLIBCXX_STD_A std 2025-05-07T19:46:49.3253721Z #define _GLIBCXX_STD_C std 2025-05-07T19:46:49.3253981Z #define _GLIBCXX_SYMVER 1 2025-05-07T19:46:49.3254222Z #define _GLIBCXX_SYMVER_GNU 1 2025-05-07T19:46:49.3254534Z #define _GLIBCXX_SYNCHRONIZATION_HAPPENS_AFTER(A) 2025-05-07T19:46:49.3254914Z #define _GLIBCXX_SYNCHRONIZATION_HAPPENS_BEFORE(A) 2025-05-07T19:46:49.3255247Z #define _GLIBCXX_THROW(_EXC) 2025-05-07T19:46:49.3255556Z #define _GLIBCXX_THROW_OR_ABORT(_EXC) (throw (_EXC)) 2025-05-07T19:46:49.3255903Z #define _GLIBCXX_TR1_BESSEL_FUNCTION_TCC 1 2025-05-07T19:46:49.3256224Z #define _GLIBCXX_TR1_BETA_FUNCTION_TCC 1 2025-05-07T19:46:49.3256522Z #define _GLIBCXX_TR1_ELL_INTEGRAL_TCC 1 2025-05-07T19:46:49.3256846Z #define _GLIBCXX_TR1_EXP_INTEGRAL_TCC 1 2025-05-07T19:46:49.3257140Z #define _GLIBCXX_TR1_GAMMA_TCC 1 2025-05-07T19:46:49.3257447Z #define _GLIBCXX_TR1_HYPERGEOMETRIC_TCC 1 2025-05-07T19:46:49.3257782Z #define _GLIBCXX_TR1_LEGENDRE_FUNCTION_TCC 1 2025-05-07T19:46:49.3258111Z #define _GLIBCXX_TR1_MODIFIED_BESSEL_FUNC_TCC 1 2025-05-07T19:46:49.3258447Z #define _GLIBCXX_TR1_POLY_HERMITE_TCC 1 2025-05-07T19:46:49.3258744Z #define _GLIBCXX_TR1_POLY_LAGUERRE_TCC 1 2025-05-07T19:46:49.3259055Z #define _GLIBCXX_TR1_RIEMANN_ZETA_TCC 1 2025-05-07T19:46:49.3259369Z #define _GLIBCXX_TR1_SPECIAL_FUNCTION_UTIL_H 1 2025-05-07T19:46:49.3259690Z #define _GLIBCXX_TXN_SAFE 2025-05-07T19:46:49.3260025Z #define _GLIBCXX_TXN_SAFE_DYN 2025-05-07T19:46:49.3260504Z #define _GLIBCXX_TYPE_TRAITS 1 2025-05-07T19:46:49.3260788Z #define _GLIBCXX_USE_ALLOCATOR_NEW 1 2025-05-07T19:46:49.3261108Z #define _GLIBCXX_USE_C99 1 2025-05-07T19:46:49.3261458Z #define _GLIBCXX_USE_C99_COMPLEX _GLIBCXX11_USE_C99_COMPLEX 2025-05-07T19:46:49.3261910Z #define _GLIBCXX_USE_C99_COMPLEX_TR1 1 2025-05-07T19:46:49.3262229Z #define _GLIBCXX_USE_C99_CTYPE_TR1 1 2025-05-07T19:46:49.3262525Z #define _GLIBCXX_USE_C99_FENV_TR1 1 2025-05-07T19:46:49.3262840Z #define _GLIBCXX_USE_C99_INTTYPES_TR1 1 2025-05-07T19:46:49.3263175Z #define _GLIBCXX_USE_C99_INTTYPES_WCHAR_T_TR1 1 2025-05-07T19:46:49.3263566Z #define _GLIBCXX_USE_C99_MATH _GLIBCXX11_USE_C99_MATH 2025-05-07T19:46:49.3263915Z #define _GLIBCXX_USE_C99_MATH_TR1 1 2025-05-07T19:46:49.3264230Z #define _GLIBCXX_USE_C99_STDINT_TR1 1 2025-05-07T19:46:49.3264589Z #define _GLIBCXX_USE_C99_STDIO _GLIBCXX11_USE_C99_STDIO 2025-05-07T19:46:49.3265352Z #define _GLIBCXX_USE_C99_STDLIB _GLIBCXX11_USE_C99_STDLIB 2025-05-07T19:46:49.3265796Z #define _GLIBCXX_USE_C99_WCHAR _GLIBCXX11_USE_C99_WCHAR 2025-05-07T19:46:49.3266167Z #define _GLIBCXX_USE_CLOCK_MONOTONIC 1 2025-05-07T19:46:49.3266506Z #define _GLIBCXX_USE_CLOCK_REALTIME 1 2025-05-07T19:46:49.3266831Z #define _GLIBCXX_USE_CONSTEXPR constexpr 2025-05-07T19:46:49.3267174Z #define _GLIBCXX_USE_CXX11_ABI 1 2025-05-07T19:46:49.3267468Z #define _GLIBCXX_USE_DECIMAL_FLOAT 1 2025-05-07T19:46:49.3267787Z #define _GLIBCXX_USE_DEPRECATED 1 2025-05-07T19:46:49.3268105Z #define _GLIBCXX_USE_DEV_RANDOM 1 2025-05-07T19:46:49.3268399Z #define _GLIBCXX_USE_DUAL_ABI 1 2025-05-07T19:46:49.3268696Z #define _GLIBCXX_USE_FCHMOD 1 2025-05-07T19:46:49.3268975Z #define _GLIBCXX_USE_FCHMODAT 1 2025-05-07T19:46:49.3269267Z #define _GLIBCXX_USE_FLOAT128 1 2025-05-07T19:46:49.3269551Z #define _GLIBCXX_USE_GETTIMEOFDAY 1 2025-05-07T19:46:49.3269871Z #define _GLIBCXX_USE_GET_NPROCS 1 2025-05-07T19:46:49.3270155Z #define _GLIBCXX_USE_INT128 1 2025-05-07T19:46:49.3270446Z #define _GLIBCXX_USE_LFS 1 2025-05-07T19:46:49.3270717Z #define _GLIBCXX_USE_LONG_LONG 1 2025-05-07T19:46:49.3271015Z #define _GLIBCXX_USE_LSTAT 1 2025-05-07T19:46:49.3271306Z #define _GLIBCXX_USE_NANOSLEEP 1 2025-05-07T19:46:49.3271718Z #define _GLIBCXX_USE_NOEXCEPT noexcept 2025-05-07T19:46:49.3272062Z #define _GLIBCXX_USE_PTHREAD_RWLOCK_T 1 2025-05-07T19:46:49.3272384Z #define _GLIBCXX_USE_RANDOM_TR1 1 2025-05-07T19:46:49.3272694Z #define _GLIBCXX_USE_REALPATH 1 2025-05-07T19:46:49.3272977Z #define _GLIBCXX_USE_SCHED_YIELD 1 2025-05-07T19:46:49.3273313Z #define _GLIBCXX_USE_SC_NPROCESSORS_ONLN 1 2025-05-07T19:46:49.3273650Z #define _GLIBCXX_USE_SENDFILE 1 2025-05-07T19:46:49.3273960Z #define _GLIBCXX_USE_STD_SPEC_FUNCS 1 2025-05-07T19:46:49.3274270Z #define _GLIBCXX_USE_ST_MTIM 1 2025-05-07T19:46:49.3274645Z #define _GLIBCXX_USE_TBB_PAR_BACKEND __has_include() 2025-05-07T19:46:49.3275051Z #define _GLIBCXX_USE_TMPNAM 1 2025-05-07T19:46:49.3275329Z #define _GLIBCXX_USE_UTIME 1 2025-05-07T19:46:49.3275625Z #define _GLIBCXX_USE_UTIMENSAT 1 2025-05-07T19:46:49.3275909Z #define _GLIBCXX_USE_WCHAR_T 1 2025-05-07T19:46:49.3276218Z #define _GLIBCXX_USE_WEAK_REF __GXX_WEAK__ 2025-05-07T19:46:49.3276529Z #define _GLIBCXX_UTILITY 1 2025-05-07T19:46:49.3276809Z #define _GLIBCXX_VERBOSE 1 2025-05-07T19:46:49.3277182Z #define _GLIBCXX_VISIBILITY(V) __attribute__ ((__visibility__ (#V))) 2025-05-07T19:46:49.3277619Z #define _GLIBCXX_WEAK_DEFINITION 2025-05-07T19:46:49.3277933Z #define _GLIBCXX_X86_RDRAND 1 2025-05-07T19:46:49.3278211Z #define _GLIBCXX_X86_RDSEED 1 2025-05-07T19:46:49.3278496Z #define _GNU_SOURCE 1 2025-05-07T19:46:49.3278758Z #define _GTHREAD_USE_MUTEX_TIMEDLOCK 1 2025-05-07T19:46:49.3279082Z #define _G_BUFSIZ 8192 2025-05-07T19:46:49.3279328Z #define _G_HAVE_MMAP 1 2025-05-07T19:46:49.3279601Z #define _G_HAVE_MREMAP 1 2025-05-07T19:46:49.3279924Z #define _G_HAVE_ST_BLKSIZE defined (_STATBUF_ST_BLKSIZE) 2025-05-07T19:46:49.3280322Z #define _G_IO_IO_FILE_VERSION 0x20001 2025-05-07T19:46:49.3280778Z #define _G_config_h 1 2025-05-07T19:46:49.3281055Z #define _G_va_list __gnuc_va_list 2025-05-07T19:46:49.3281365Z #define _INITIALIZER_LIST 2025-05-07T19:46:49.3281627Z #define _IOFBF 0 2025-05-07T19:46:49.3281871Z #define _IOLBF 1 2025-05-07T19:46:49.3282092Z #define _IONBF 2 2025-05-07T19:46:49.3282432Z #define _IOS_APPEND 8 2025-05-07T19:46:49.3282673Z #define _IOS_ATEND 4 2025-05-07T19:46:49.3282935Z #define _IOS_BIN 128 2025-05-07T19:46:49.3283173Z #define _IOS_INPUT 1 2025-05-07T19:46:49.3283442Z #define _IOS_NOCREATE 32 2025-05-07T19:46:49.3283819Z #define _IOS_NOREPLACE 64 2025-05-07T19:46:49.3284091Z #define _IOS_OUTPUT 2 2025-05-07T19:46:49.3284326Z #define _IOS_TRUNC 16 2025-05-07T19:46:49.3284583Z #define _IO_BAD_SEEN 0x4000 2025-05-07T19:46:49.3284905Z #define _IO_BE(expr,res) __builtin_expect ((expr), res) 2025-05-07T19:46:49.3285246Z #define _IO_BOOLALPHA 0200000 2025-05-07T19:46:49.3285526Z #define _IO_BUFSIZ _G_BUFSIZ 2025-05-07T19:46:49.3285788Z #define _IO_CURRENTLY_PUTTING 0x800 2025-05-07T19:46:49.3286077Z #define _IO_DEC 020 2025-05-07T19:46:49.3286302Z #define _IO_DELETE_DONT_CLOSE 0x40 2025-05-07T19:46:49.3286589Z #define _IO_DONT_CLOSE 0100000 2025-05-07T19:46:49.3286847Z #define _IO_EOF_SEEN 0x10 2025-05-07T19:46:49.3287101Z #define _IO_ERR_SEEN 0x20 2025-05-07T19:46:49.3287344Z #define _IO_FIXED 010000 2025-05-07T19:46:49.3287620Z #define _IO_FLAGS2_MMAP 1 2025-05-07T19:46:49.3288049Z #define _IO_FLAGS2_NOTCANCEL 2 2025-05-07T19:46:49.3288341Z #define _IO_FLAGS2_USER_WBUF 8 2025-05-07T19:46:49.3288661Z #define _IO_HAVE_ST_BLKSIZE _G_HAVE_ST_BLKSIZE 2025-05-07T19:46:49.3288983Z #define _IO_HEX 0100 2025-05-07T19:46:49.3289238Z #define _IO_INTERNAL 010 2025-05-07T19:46:49.3289495Z #define _IO_IN_BACKUP 0x100 2025-05-07T19:46:49.3289779Z #define _IO_IS_APPENDING 0x1000 2025-05-07T19:46:49.3290053Z #define _IO_IS_FILEBUF 0x2000 2025-05-07T19:46:49.3315258Z #define _IO_LEFT 02 2025-05-07T19:46:49.3315634Z #define _IO_LINE_BUF 0x200 2025-05-07T19:46:49.3315898Z #define _IO_LINKED 0x80 2025-05-07T19:46:49.3316200Z #define _IO_MAGIC 0xFBAD0000 2025-05-07T19:46:49.3316481Z #define _IO_MAGIC_MASK 0xFFFF0000 2025-05-07T19:46:49.3316756Z #define _IO_NO_READS 4 2025-05-07T19:46:49.3316982Z #define _IO_NO_WRITES 8 2025-05-07T19:46:49.3317215Z #define _IO_OCT 040 2025-05-07T19:46:49.3317623Z #define _IO_PENDING_OUTPUT_COUNT(_fp) ((_fp)->_IO_write_ptr - (_fp)->_IO_write_base) 2025-05-07T19:46:49.3318066Z #define _IO_RIGHT 04 2025-05-07T19:46:49.3318305Z #define _IO_SCIENTIFIC 04000 2025-05-07T19:46:49.3318559Z #define _IO_SHOWBASE 0200 2025-05-07T19:46:49.3318806Z #define _IO_SHOWPOINT 0400 2025-05-07T19:46:49.3319051Z #define _IO_SHOWPOS 02000 2025-05-07T19:46:49.3319305Z #define _IO_SKIPWS 01 2025-05-07T19:46:49.3319543Z #define _IO_STDIO 040000 2025-05-07T19:46:49.3319800Z #define _IO_STDIO_H 2025-05-07T19:46:49.3320047Z #define _IO_TIED_PUT_GET 0x400 2025-05-07T19:46:49.3320319Z #define _IO_UNBUFFERED 2 2025-05-07T19:46:49.3320591Z #define _IO_UNIFIED_JUMPTABLES 1 2025-05-07T19:46:49.3320877Z #define _IO_UNITBUF 020000 2025-05-07T19:46:49.3321154Z #define _IO_UPPERCASE 01000 2025-05-07T19:46:49.3321411Z #define _IO_USER_BUF 1 2025-05-07T19:46:49.3321665Z #define _IO_USER_LOCK 0x8000 2025-05-07T19:46:49.3321939Z #define _IO_cleanup_region_end(_Doit) 2025-05-07T19:46:49.3322278Z #define _IO_cleanup_region_start(_fct,_fp) 2025-05-07T19:46:49.3322688Z #define _IO_feof_unlocked(__fp) (((__fp)->_flags & _IO_EOF_SEEN) != 0) 2025-05-07T19:46:49.3323200Z #define _IO_ferror_unlocked(__fp) (((__fp)->_flags & _IO_ERR_SEEN) != 0) 2025-05-07T19:46:49.3323608Z #define _IO_file_flags _flags 2025-05-07T19:46:49.3323895Z #define _IO_flockfile(_fp) 2025-05-07T19:46:49.3324179Z #define _IO_fpos64_t _G_fpos64_t 2025-05-07T19:46:49.3324453Z #define _IO_fpos_t _G_fpos_t 2025-05-07T19:46:49.3324724Z #define _IO_ftrylockfile(_fp) 2025-05-07T19:46:49.3325005Z #define _IO_funlockfile(_fp) 2025-05-07T19:46:49.3325723Z #define _IO_getc_unlocked(_fp) (_IO_BE ((_fp)->_IO_read_ptr >= (_fp)->_IO_read_end, 0) ? __uflow (_fp) : *(unsigned char *) (_fp)->_IO_read_ptr++) 2025-05-07T19:46:49.3326297Z #define _IO_iconv_t _G_iconv_t 2025-05-07T19:46:49.3326574Z #define _IO_off64_t __off64_t 2025-05-07T19:46:49.3326837Z #define _IO_off_t __off_t 2025-05-07T19:46:49.3327145Z #define _IO_peekc(_fp) _IO_peekc_unlocked (_fp) 2025-05-07T19:46:49.3329540Z #define _IO_peekc_unlocked(_fp) (_IO_BE ((_fp)->_IO_read_ptr >= (_fp)->_IO_read_end, 0) && __underflow (_fp) == EOF ? EOF : *(unsigned char *) (_fp)->_IO_read_ptr) 2025-05-07T19:46:49.3330141Z #define _IO_pid_t __pid_t 2025-05-07T19:46:49.3330928Z #define _IO_putc_unlocked(_ch,_fp) (_IO_BE ((_fp)->_IO_write_ptr >= (_fp)->_IO_write_end, 0) ? __overflow (_fp, (unsigned char) (_ch)) : (unsigned char) (*(_fp)->_IO_write_ptr++ = (_ch))) 2025-05-07T19:46:49.3331704Z #define _IO_size_t size_t 2025-05-07T19:46:49.3331946Z #define _IO_ssize_t __ssize_t 2025-05-07T19:46:49.3332229Z #define _IO_stderr ((_IO_FILE*)(&_IO_2_1_stderr_)) 2025-05-07T19:46:49.3332592Z #define _IO_stdin ((_IO_FILE*)(&_IO_2_1_stdin_)) 2025-05-07T19:46:49.3332943Z #define _IO_stdout ((_IO_FILE*)(&_IO_2_1_stdout_)) 2025-05-07T19:46:49.3333252Z #define _IO_uid_t __uid_t 2025-05-07T19:46:49.3333510Z #define _IO_va_list __gnuc_va_list 2025-05-07T19:46:49.3333764Z #define _IO_wint_t wint_t 2025-05-07T19:46:49.3334010Z #define _ISOC11_SOURCE 1 2025-05-07T19:46:49.3334252Z #define _ISOC95_SOURCE 1 2025-05-07T19:46:49.3334504Z #define _ISOC99_SOURCE 1 2025-05-07T19:46:49.3334828Z #define _ISbit(bit) ((bit) < 8 ? ((1 << (bit)) << 8) : ((1 << (bit)) >> 8)) 2025-05-07T19:46:49.3335222Z #define _LARGEFILE64_SOURCE 1 2025-05-07T19:46:49.3335497Z #define _LARGEFILE_SOURCE 1 2025-05-07T19:46:49.3335743Z #define _LIBC_LIMITS_H_ 1 2025-05-07T19:46:49.3336001Z #define _LINUX_LIMITS_H 2025-05-07T19:46:49.3336235Z #define _LP64 1 2025-05-07T19:46:49.3336458Z #define _MATH_H 1 2025-05-07T19:46:49.3336673Z #define _MATH_H_MATHDEF 1 2025-05-07T19:46:49.3336926Z #define _MOVE_H 1 2025-05-07T19:46:49.3337141Z #define _Mfloat_ float 2025-05-07T19:46:49.3337404Z #define _Mlong_double_ long double 2025-05-07T19:46:49.3337670Z #define _NEW 2025-05-07T19:46:49.3337908Z #define _OLD_STDIO_MAGIC 0xFABC0000 2025-05-07T19:46:49.3338189Z #define _POSIX2_BC_BASE_MAX 99 2025-05-07T19:46:49.3338467Z #define _POSIX2_BC_DIM_MAX 2048 2025-05-07T19:46:49.3338738Z #define _POSIX2_BC_SCALE_MAX 99 2025-05-07T19:46:49.3339002Z #define _POSIX2_BC_STRING_MAX 1000 2025-05-07T19:46:49.3339302Z #define _POSIX2_CHARCLASS_NAME_MAX 14 2025-05-07T19:46:49.3339587Z #define _POSIX2_COLL_WEIGHTS_MAX 2 2025-05-07T19:46:49.3339876Z #define _POSIX2_EXPR_NEST_MAX 32 2025-05-07T19:46:49.3340145Z #define _POSIX2_LINE_MAX 2048 2025-05-07T19:46:49.3340415Z #define _POSIX2_RE_DUP_MAX 255 2025-05-07T19:46:49.3340674Z #define _POSIX_AIO_LISTIO_MAX 2 2025-05-07T19:46:49.3340948Z #define _POSIX_AIO_MAX 1 2025-05-07T19:46:49.3341187Z #define _POSIX_ARG_MAX 4096 2025-05-07T19:46:49.3341449Z #define _POSIX_CHILD_MAX 25 2025-05-07T19:46:49.3341721Z #define _POSIX_CLOCKRES_MIN 20000000 2025-05-07T19:46:49.3342008Z #define _POSIX_C_SOURCE 200809L 2025-05-07T19:46:49.3342290Z #define _POSIX_DELAYTIMER_MAX 32 2025-05-07T19:46:49.3342576Z #define _POSIX_FD_SETSIZE _POSIX_OPEN_MAX 2025-05-07T19:46:49.3342901Z #define _POSIX_HIWAT _POSIX_PIPE_BUF 2025-05-07T19:46:49.3343181Z #define _POSIX_HOST_NAME_MAX 255 2025-05-07T19:46:49.3343465Z #define _POSIX_LINK_MAX 8 2025-05-07T19:46:49.3343713Z #define _POSIX_LOGIN_NAME_MAX 9 2025-05-07T19:46:49.3343987Z #define _POSIX_MAX_CANON 255 2025-05-07T19:46:49.3344240Z #define _POSIX_MAX_INPUT 255 2025-05-07T19:46:49.3344511Z #define _POSIX_MQ_OPEN_MAX 8 2025-05-07T19:46:49.3344783Z #define _POSIX_MQ_PRIO_MAX 32 2025-05-07T19:46:49.3345040Z #define _POSIX_NAME_MAX 14 2025-05-07T19:46:49.3345304Z #define _POSIX_NGROUPS_MAX 8 2025-05-07T19:46:49.3345556Z #define _POSIX_OPEN_MAX 20 2025-05-07T19:46:49.3345816Z #define _POSIX_PATH_MAX 256 2025-05-07T19:46:49.3346066Z #define _POSIX_PIPE_BUF 512 2025-05-07T19:46:49.3346323Z #define _POSIX_QLIMIT 1 2025-05-07T19:46:49.3346645Z #define _POSIX_RE_DUP_MAX 255 2025-05-07T19:46:49.3346930Z #define _POSIX_RTSIG_MAX 8 2025-05-07T19:46:49.3347181Z #define _POSIX_SEM_NSEMS_MAX 256 2025-05-07T19:46:49.3347470Z #define _POSIX_SEM_VALUE_MAX 32767 2025-05-07T19:46:49.3347768Z #define _POSIX_SIGQUEUE_MAX 32 2025-05-07T19:46:49.3348208Z #define _POSIX_SOURCE 1 2025-05-07T19:46:49.3348551Z #define _POSIX_SSIZE_MAX 32767 2025-05-07T19:46:49.3348824Z #define _POSIX_STREAM_MAX 8 2025-05-07T19:46:49.3349108Z #define _POSIX_SYMLINK_MAX 255 2025-05-07T19:46:49.3349380Z #define _POSIX_SYMLOOP_MAX 8 2025-05-07T19:46:49.3349697Z #define _POSIX_THREAD_DESTRUCTOR_ITERATIONS 4 2025-05-07T19:46:49.3350027Z #define _POSIX_THREAD_KEYS_MAX 128 2025-05-07T19:46:49.3350344Z #define _POSIX_THREAD_THREADS_MAX 64 2025-05-07T19:46:49.3350634Z #define _POSIX_TIMER_MAX 32 2025-05-07T19:46:49.3350919Z #define _POSIX_TTY_NAME_MAX 9 2025-05-07T19:46:49.3351206Z #define _POSIX_TZNAME_MAX 6 2025-05-07T19:46:49.3351578Z #define _POSIX_UIO_MAXIOV 16 2025-05-07T19:46:49.3352126Z #define _PSTL_ASSERT(_Condition) __glibcxx_assert(_Condition) 2025-05-07T19:46:49.3352717Z #define _PSTL_ASSERT_MSG(_Condition,_Message) __glibcxx_assert(_Condition) 2025-05-07T19:46:49.3353365Z #define _PSTL_CLANG_VERSION (__clang_major__ * 10000 + __clang_minor__ * 100 + __clang_patchlevel__) 2025-05-07T19:46:49.3353881Z #define _PSTL_CONFIG_H 2025-05-07T19:46:49.3354370Z #define _PSTL_CPP11_STD_ROTATE_BROKEN ((__GLIBCXX__ && __GLIBCXX__ < 20150716) || (_MSC_VER && _MSC_VER < 1800)) 2025-05-07T19:46:49.3355261Z #define _PSTL_CPP14_2RANGE_MISMATCH_EQUAL_PRESENT (_MSC_VER >= 1900 || __cplusplus >= 201300L || __cpp_lib_robust_nonmodifying_seq_ops == 201304) 2025-05-07T19:46:49.3356091Z #define _PSTL_CPP14_INTEGER_SEQUENCE_PRESENT (_MSC_VER >= 1900 || __cplusplus >= 201402L) 2025-05-07T19:46:49.3356911Z #define _PSTL_CPP14_MAKE_REVERSE_ITERATOR_PRESENT (_MSC_VER >= 1900 || __cplusplus >= 201402L || __cpp_lib_make_reverse_iterator == 201402) 2025-05-07T19:46:49.3357910Z #define _PSTL_CPP14_VARIABLE_TEMPLATES_PRESENT (!__INTEL_COMPILER || __INTEL_COMPILER >= 1700) && (_MSC_FULL_VER >= 190023918 || __cplusplus >= 201402L) 2025-05-07T19:46:49.3358687Z #define _PSTL_CPP17_EXECUTION_POLICIES_PRESENT (_MSC_VER >= 1912) 2025-05-07T19:46:49.3359170Z #define _PSTL_EARLYEXIT_PRESENT (__INTEL_COMPILER >= 1800) 2025-05-07T19:46:49.3359691Z #define _PSTL_GCC_VERSION (__GNUC__ * 10000 + __GNUC_MINOR__ * 100 + __GNUC_PATCHLEVEL__) 2025-05-07T19:46:49.3360176Z #define _PSTL_HIDE_FROM_ABI_POP 2025-05-07T19:46:49.3360472Z #define _PSTL_HIDE_FROM_ABI_PUSH 2025-05-07T19:46:49.3360854Z #define _PSTL_ICC_18_OMP_SIMD_BROKEN (__INTEL_COMPILER == 1800) 2025-05-07T19:46:49.3361307Z #define _PSTL_MONOTONIC_PRESENT (__INTEL_COMPILER >= 1800) 2025-05-07T19:46:49.3361702Z #define _PSTL_PAR_BACKEND_SERIAL 2025-05-07T19:46:49.3362000Z #define _PSTL_PRAGMA(x) _Pragma(# x) 2025-05-07T19:46:49.3362691Z #define _PSTL_PRAGMA_DECLARE_REDUCTION(NAME,OP) _PSTL_PRAGMA(omp declare reduction(NAME:OP : omp_out(omp_in)) initializer(omp_priv = omp_orig)) 2025-05-07T19:46:49.3363488Z #define _PSTL_PRAGMA_DECLARE_SIMD _PSTL_PRAGMA(omp declare simd) 2025-05-07T19:46:49.3363898Z #define _PSTL_PRAGMA_FORCEINLINE 2025-05-07T19:46:49.3364276Z #define _PSTL_PRAGMA_LOCATION " [Parallel STL message]: " 2025-05-07T19:46:49.3364651Z #define _PSTL_PRAGMA_MESSAGE(x) 2025-05-07T19:46:49.3365366Z #define _PSTL_PRAGMA_MESSAGE_IMPL(x) _PSTL_PRAGMA(message(_PSTL_STRING_CONCAT(_PSTL_PRAGMA_LOCATION, x))) 2025-05-07T19:46:49.3365936Z #define _PSTL_PRAGMA_MESSAGE_POLICIES(x) 2025-05-07T19:46:49.3366313Z #define _PSTL_PRAGMA_SIMD _PSTL_PRAGMA(omp simd) 2025-05-07T19:46:49.3366684Z #define _PSTL_PRAGMA_SIMD_EARLYEXIT 2025-05-07T19:46:49.3367023Z #define _PSTL_PRAGMA_SIMD_EXCLUSIVE_SCAN(PRM) 2025-05-07T19:46:49.3367409Z #define _PSTL_PRAGMA_SIMD_INCLUSIVE_SCAN(PRM) 2025-05-07T19:46:49.3367783Z #define _PSTL_PRAGMA_SIMD_ORDERED_MONOTONIC(PRM) 2025-05-07T19:46:49.3368229Z #define _PSTL_PRAGMA_SIMD_ORDERED_MONOTONIC_2ARGS(PRM1,PRM2) 2025-05-07T19:46:49.3368941Z #define _PSTL_PRAGMA_SIMD_REDUCTION(PRM) _PSTL_PRAGMA(omp simd reduction(PRM)) 2025-05-07T19:46:49.3369429Z #define _PSTL_PRAGMA_SIMD_SCAN(PRM) 2025-05-07T19:46:49.3369760Z #define _PSTL_PRAGMA_VECTOR_UNALIGNED 2025-05-07T19:46:49.3370089Z #define _PSTL_STRING(x) _PSTL_STRING_AUX(x) 2025-05-07T19:46:49.3370428Z #define _PSTL_STRING_AUX(x) #x 2025-05-07T19:46:49.3370797Z #define _PSTL_STRING_CONCAT(x,y) x #y 2025-05-07T19:46:49.3371120Z #define _PSTL_UDR_PRESENT 0 2025-05-07T19:46:49.3371576Z #define _PSTL_UDS_PRESENT (__INTEL_COMPILER >= 1900 && __INTEL_COMPILER_BUILD_DATE >= 20180626) 2025-05-07T19:46:49.3372097Z #define _PSTL_USAGE_WARNINGS 0 2025-05-07T19:46:49.3372417Z #define _PSTL_USE_NONTEMPORAL_STORES_IF_ALLOWED 2025-05-07T19:46:49.3372776Z #define _PSTL_VERSION 12000 2025-05-07T19:46:49.3373100Z #define _PSTL_VERSION_MAJOR (_PSTL_VERSION / 1000) 2025-05-07T19:46:49.3373503Z #define _PSTL_VERSION_MINOR ((_PSTL_VERSION % 1000) / 10) 2025-05-07T19:46:49.3373918Z #define _PSTL_VERSION_PATCH (_PSTL_VERSION % 10) 2025-05-07T19:46:49.3374250Z #define _PTRDIFF_T 2025-05-07T19:46:49.3374502Z #define _PTR_TRAITS_H 1 2025-05-07T19:46:49.3374759Z #define _SIGSET_H_types 1 2025-05-07T19:46:49.3375122Z #define _SIGSET_NWORDS (1024 / (8 * sizeof (unsigned long int))) 2025-05-07T19:46:49.3375503Z #define _SIZE_T 2025-05-07T19:46:49.3375760Z #define _STDC_PREDEF_H 1 2025-05-07T19:46:49.3376013Z #define _STDIO_H 1 2025-05-07T19:46:49.3376271Z #define _STDIO_USES_IOSTREAM 2025-05-07T19:46:49.3376669Z #define _STDLIB_H 1 2025-05-07T19:46:49.3376895Z #define _STL_ALGOBASE_H 1 2025-05-07T19:46:49.3377285Z #define _STL_ITERATOR_BASE_FUNCS_H 1 2025-05-07T19:46:49.3377567Z #define _STL_ITERATOR_BASE_TYPES_H 1 2025-05-07T19:46:49.3377855Z #define _STL_ITERATOR_H 1 2025-05-07T19:46:49.3378087Z #define _STL_PAIR_H 1 2025-05-07T19:46:49.3378329Z #define _STL_RELOPS_H 1 2025-05-07T19:46:49.3378555Z #define _STRING_H 1 2025-05-07T19:46:49.3378786Z #define _STRUCT_TIMEVAL 1 2025-05-07T19:46:49.3379020Z #define _SVID_SOURCE 1 2025-05-07T19:46:49.3379267Z #define _SYS_CDEFS_H 1 2025-05-07T19:46:49.3379494Z #define _SYS_SELECT_H 1 2025-05-07T19:46:49.3379745Z #define _SYS_SYSMACROS_H 1 2025-05-07T19:46:49.3380001Z #define _SYS_TYPES_H 1 2025-05-07T19:46:49.3380222Z #define _TIME_H 1 2025-05-07T19:46:49.3380456Z #define _VA_LIST_DEFINED 2025-05-07T19:46:49.3380693Z #define _XLOCALE_H 1 2025-05-07T19:46:49.3381125Z #define _XOPEN_IOV_MAX _POSIX_UIO_MAXIOV 2025-05-07T19:46:49.3381423Z #define _XOPEN_LIM_H 1 2025-05-07T19:46:49.3381680Z #define _XOPEN_SOURCE 700 2025-05-07T19:46:49.3381943Z #define _XOPEN_SOURCE_EXTENDED 1 2025-05-07T19:46:49.3382325Z #define __ASMNAME(cname) __ASMNAME2 (__USER_LABEL_PREFIX__, cname) 2025-05-07T19:46:49.3382790Z #define __ASMNAME2(prefix,cname) __STRING (prefix) cname 2025-05-07T19:46:49.3383186Z #define __ASSERT_FUNCTION __PRETTY_FUNCTION__ 2025-05-07T19:46:49.3383527Z #define __ASSERT_VOID_CAST static_cast 2025-05-07T19:46:49.3383854Z #define __ATOMIC_ACQUIRE 2 2025-05-07T19:46:49.3384123Z #define __ATOMIC_ACQ_REL 4 2025-05-07T19:46:49.3384378Z #define __ATOMIC_CONSUME 1 2025-05-07T19:46:49.3384647Z #define __ATOMIC_RELAXED 0 2025-05-07T19:46:49.3384896Z #define __ATOMIC_RELEASE 3 2025-05-07T19:46:49.3385162Z #define __ATOMIC_SEQ_CST 5 2025-05-07T19:46:49.3385424Z #define __BEGIN_DECLS extern "C" { 2025-05-07T19:46:49.3385724Z #define __BEGIN_NAMESPACE_C99 2025-05-07T19:46:49.3385990Z #define __BEGIN_NAMESPACE_STD 2025-05-07T19:46:49.3386256Z #define __BIGGEST_ALIGNMENT__ 16 2025-05-07T19:46:49.3386530Z #define __BIG_ENDIAN 4321 2025-05-07T19:46:49.3386781Z #define __BITINT_MAXWIDTH__ 8388608 2025-05-07T19:46:49.3387068Z #define __BIT_TYPES_DEFINED__ 1 2025-05-07T19:46:49.3387341Z #define __BLKCNT64_T_TYPE __SQUAD_TYPE 2025-05-07T19:46:49.3387655Z #define __BLKCNT_T_TYPE __SYSCALL_SLONG_TYPE 2025-05-07T19:46:49.3387982Z #define __BLKSIZE_T_TYPE __SYSCALL_SLONG_TYPE 2025-05-07T19:46:49.3388290Z #define __BOOL_WIDTH__ 8 2025-05-07T19:46:49.3388535Z #define __BYTE_ORDER __LITTLE_ENDIAN 2025-05-07T19:46:49.3388937Z #define __BYTE_ORDER__ __ORDER_LITTLE_ENDIAN__ 2025-05-07T19:46:49.3389270Z #define __CHANNEL_DESCRIPTOR_H__ 2025-05-07T19:46:49.3389553Z #define __CHAR16_TYPE__ unsigned short 2025-05-07T19:46:49.3389850Z #define __CHAR32_TYPE__ unsigned int 2025-05-07T19:46:49.3390117Z #define __CHAR_BIT__ 8 2025-05-07T19:46:49.3390426Z #define __CLANG_ATOMIC_BOOL_LOCK_FREE 2 2025-05-07T19:46:49.3390730Z #define __CLANG_ATOMIC_CHAR16_T_LOCK_FREE 2 2025-05-07T19:46:49.3391055Z #define __CLANG_ATOMIC_CHAR32_T_LOCK_FREE 2 2025-05-07T19:46:49.3391362Z #define __CLANG_ATOMIC_CHAR_LOCK_FREE 2 2025-05-07T19:46:49.3391951Z #define __CLANG_ATOMIC_INT_LOCK_FREE 2 2025-05-07T19:46:49.3392267Z #define __CLANG_ATOMIC_LLONG_LOCK_FREE 2 2025-05-07T19:46:49.3392576Z #define __CLANG_ATOMIC_LONG_LOCK_FREE 2 2025-05-07T19:46:49.3392889Z #define __CLANG_ATOMIC_POINTER_LOCK_FREE 2 2025-05-07T19:46:49.3393208Z #define __CLANG_ATOMIC_SHORT_LOCK_FREE 2 2025-05-07T19:46:49.3393533Z #define __CLANG_ATOMIC_WCHAR_T_LOCK_FREE 2 2025-05-07T19:46:49.3393837Z #define __CLANG_LIMITS_H 2025-05-07T19:46:49.3394103Z #define __CLANG_MAX_ALIGN_T_DEFINED 2025-05-07T19:46:49.3394390Z #define __CLOCKID_T_TYPE __S32_TYPE 2025-05-07T19:46:49.3394697Z #define __CLOCK_T_TYPE __SYSCALL_SLONG_TYPE 2025-05-07T19:46:49.3395006Z #define __COMMON_FUNCTIONS_H__ 2025-05-07T19:46:49.3395280Z #define __COMPAR_FN_T 2025-05-07T19:46:49.3395525Z #define __CONCAT(x,y) x ## y 2025-05-07T19:46:49.3395789Z #define __CONSTANT_CFSTRINGS__ 1 2025-05-07T19:46:49.3396069Z #define __CUDACC_VER_BUILD__ 85 2025-05-07T19:46:49.3396337Z #define __CUDACC_VER_MAJOR__ 12 2025-05-07T19:46:49.3396607Z #define __CUDACC_VER_MINOR__ 6 2025-05-07T19:46:49.3397223Z #define __CUDACC_VER__ "__CUDACC_VER__ is no longer supported. Use __CUDACC_VER_MAJOR__, __CUDACC_VER_MINOR__, and __CUDACC_VER_BUILD__ instead." 2025-05-07T19:46:49.3397858Z #define __CUDACC__ 1 2025-05-07T19:46:49.3398097Z #define __CUDART_API_PTDS(api) api 2025-05-07T19:46:49.3398398Z #define __CUDART_API_PTSZ(api) api 2025-05-07T19:46:49.3398859Z #define __CUDART_API_VERSION ((__CUDA_API_VER_MAJOR__ * 1000) + (__CUDA_API_VER_MINOR__ * 10)) 2025-05-07T19:46:49.3399336Z #define __CUDA_API_VER_MAJOR__ 12 2025-05-07T19:46:49.3399621Z #define __CUDA_API_VER_MINOR__ 6 2025-05-07T19:46:49.3399976Z #define __CUDA_ARCH_HAS_FEATURE__(_FEAT) __CUDA_ARCH_FEAT_##_FEAT 2025-05-07T19:46:49.3400371Z #define __CUDA_ARCH_LIST__ 520 2025-05-07T19:46:49.3400636Z #define __CUDA_ARCH__ 520 2025-05-07T19:46:49.3400898Z #define __CUDA_DEVICE_RUNTIME_API_H__ 2025-05-07T19:46:49.3401191Z #define __CUDA_MATH_CRTIMP 2025-05-07T19:46:49.3401458Z #define __CUDA_RUNTIME_API_H__ 2025-05-07T19:46:49.3401726Z #define __CUDA_RUNTIME_H__ 2025-05-07T19:46:49.3401976Z #define __DADDR_T_TYPE __S32_TYPE 2025-05-07T19:46:49.3402253Z #define __DBL_DECIMAL_DIG__ 17 2025-05-07T19:46:49.3402538Z #define __DBL_DENORM_MIN__ 4.9406564584124654e-324 2025-05-07T19:46:49.3402858Z #define __DBL_DIG__ 15 2025-05-07T19:46:49.3403117Z #define __DBL_EPSILON__ 2.2204460492503131e-16 2025-05-07T19:46:49.3403436Z #define __DBL_HAS_DENORM__ 1 2025-05-07T19:46:49.3403696Z #define __DBL_HAS_INFINITY__ 1 2025-05-07T19:46:49.3403974Z #define __DBL_HAS_QUIET_NAN__ 1 2025-05-07T19:46:49.3404336Z #define __DBL_MANT_DIG__ 53 2025-05-07T19:46:49.3404580Z #define __DBL_MAX_10_EXP__ 308 2025-05-07T19:46:49.3404829Z #define __DBL_MAX_EXP__ 1024 2025-05-07T19:46:49.3405077Z #define __DBL_MAX__ 1.7976931348623157e+308 2025-05-07T19:46:49.3405358Z #define __DBL_MIN_10_EXP__ (-307) 2025-05-07T19:46:49.3405612Z #define __DBL_MIN_EXP__ (-1021) 2025-05-07T19:46:49.3405861Z #define __DBL_MIN__ 2.2250738585072014e-308 2025-05-07T19:46:49.3406151Z #define __DECIMAL_DIG__ __LDBL_DECIMAL_DIG__ 2025-05-07T19:46:49.3406443Z #define __DELETE_THROW throw() 2025-05-07T19:46:49.3406683Z #define __DEPRECATED 1 2025-05-07T19:46:49.3406928Z #define __DEVICE_ATOMIC_FUNCTIONS_HPP__ 2025-05-07T19:46:49.3407227Z #define __DEVICE_ATOMIC_FUNCTIONS_H__ 2025-05-07T19:46:49.3407509Z #define __DEVICE_DOUBLE_FUNCTIONS_HPP__ 2025-05-07T19:46:49.3407874Z #define __DEVICE_DOUBLE_FUNCTIONS_H__ 2025-05-07T19:46:49.3408156Z #define __DEVICE_FUNCTIONS_HPP__ 2025-05-07T19:46:49.3408619Z #define __DEVICE_FUNCTIONS_H__ 2025-05-07T19:46:49.3408890Z #define __DEVICE_LAUNCH_PARAMETERS_H__ 2025-05-07T19:46:49.3409192Z #define __DEVICE_TYPES_H__ 2025-05-07T19:46:49.3409531Z #define __DEV_T_TYPE __UQUAD_TYPE 2025-05-07T19:46:49.3409812Z #define __DRIVER_FUNCTIONS_H__ 2025-05-07T19:46:49.3410067Z #define __DRIVER_TYPES_H__ 2025-05-07T19:46:49.3410312Z #define __ELF__ 1 2025-05-07T19:46:49.3410526Z #define __END_DECLS } 2025-05-07T19:46:49.3410758Z #define __END_NAMESPACE_C99 2025-05-07T19:46:49.3411014Z #define __END_NAMESPACE_STD 2025-05-07T19:46:49.3411255Z #define __EXCEPTIONS 1 2025-05-07T19:46:49.3411484Z #define __EXCEPTION_H 1 2025-05-07T19:46:49.3411727Z #define __FDS_BITS(set) ((set)->fds_bits) 2025-05-07T19:46:49.3412131Z #define __FD_CLR(d,set) ((void) (__FDS_BITS (set)[__FD_ELT (d)] &= ~__FD_MASK (d))) 2025-05-07T19:46:49.3412537Z #define __FD_ELT(d) ((d) / __NFDBITS) 2025-05-07T19:46:49.3412926Z #define __FD_ISSET(d,set) ((__FDS_BITS (set)[__FD_ELT (d)] & __FD_MASK (d)) != 0) 2025-05-07T19:46:49.3413371Z #define __FD_MASK(d) ((__fd_mask) 1 << ((d) % __NFDBITS)) 2025-05-07T19:46:49.3413818Z #define __FD_SET(d,set) ((void) (__FDS_BITS (set)[__FD_ELT (d)] |= __FD_MASK (d))) 2025-05-07T19:46:49.3414224Z #define __FD_SETSIZE 1024 2025-05-07T19:46:49.3414886Z #define __FD_ZERO(fdsp) do { int __d0, __d1; __asm__ __volatile__ ("cld; rep; " __FD_ZERO_STOS : "=c" (__d0), "=D" (__d1) : "a" (0), "0" (sizeof (fd_set) / sizeof (__fd_mask)), "1" (&__FDS_BITS (fdsp)[0]) : "memory"); } while (0) 2025-05-07T19:46:49.3415606Z #define __FD_ZERO_STOS "stosq" 2025-05-07T19:46:49.3415866Z #define __FILE_defined 1 2025-05-07T19:46:49.3416118Z #define __FINITE_MATH_ONLY__ 0 2025-05-07T19:46:49.3416368Z #define __FLOAT128__ 1 2025-05-07T19:46:49.3416619Z #define __FLOAT_WORD_ORDER __BYTE_ORDER 2025-05-07T19:46:49.3416918Z #define __FLT16_DECIMAL_DIG__ 5 2025-05-07T19:46:49.3417219Z #define __FLT16_DENORM_MIN__ 5.9604644775390625e-8F16 2025-05-07T19:46:49.3417541Z #define __FLT16_DIG__ 3 2025-05-07T19:46:49.3417782Z #define __FLT16_EPSILON__ 9.765625e-4F16 2025-05-07T19:46:49.3418076Z #define __FLT16_HAS_DENORM__ 1 2025-05-07T19:46:49.3418329Z #define __FLT16_HAS_INFINITY__ 1 2025-05-07T19:46:49.3418606Z #define __FLT16_HAS_QUIET_NAN__ 1 2025-05-07T19:46:49.3418875Z #define __FLT16_MANT_DIG__ 11 2025-05-07T19:46:49.3419135Z #define __FLT16_MAX_10_EXP__ 4 2025-05-07T19:46:49.3419399Z #define __FLT16_MAX_EXP__ 16 2025-05-07T19:46:49.3419650Z #define __FLT16_MAX__ 6.5504e+4F16 2025-05-07T19:46:49.3419926Z #define __FLT16_MIN_10_EXP__ (-4) 2025-05-07T19:46:49.3420193Z #define __FLT16_MIN_EXP__ (-13) 2025-05-07T19:46:49.3420461Z #define __FLT16_MIN__ 6.103515625e-5F16 2025-05-07T19:46:49.3420850Z #define __FLT_DECIMAL_DIG__ 9 2025-05-07T19:46:49.3421110Z #define __FLT_DENORM_MIN__ 1.40129846e-45F 2025-05-07T19:46:49.3421372Z #define __FLT_DIG__ 6 2025-05-07T19:46:49.3421603Z #define __FLT_EPSILON__ 1.19209290e-7F 2025-05-07T19:46:49.3421877Z #define __FLT_HAS_DENORM__ 1 2025-05-07T19:46:49.3422122Z #define __FLT_HAS_INFINITY__ 1 2025-05-07T19:46:49.3422372Z #define __FLT_HAS_QUIET_NAN__ 1 2025-05-07T19:46:49.3422609Z #define __FLT_MANT_DIG__ 24 2025-05-07T19:46:49.3422851Z #define __FLT_MAX_10_EXP__ 38 2025-05-07T19:46:49.3423090Z #define __FLT_MAX_EXP__ 128 2025-05-07T19:46:49.3423331Z #define __FLT_MAX__ 3.40282347e+38F 2025-05-07T19:46:49.3423593Z #define __FLT_MIN_10_EXP__ (-37) 2025-05-07T19:46:49.3423851Z #define __FLT_MIN_EXP__ (-125) 2025-05-07T19:46:49.3424098Z #define __FLT_MIN__ 1.17549435e-38F 2025-05-07T19:46:49.3424352Z #define __FLT_RADIX__ 2 2025-05-07T19:46:49.3424587Z #define __FSBLKCNT64_T_TYPE __UQUAD_TYPE 2025-05-07T19:46:49.3424897Z #define __FSBLKCNT_T_TYPE __SYSCALL_ULONG_TYPE 2025-05-07T19:46:49.3425214Z #define __FSFILCNT64_T_TYPE __UQUAD_TYPE 2025-05-07T19:46:49.3425516Z #define __FSFILCNT_T_TYPE __SYSCALL_ULONG_TYPE 2025-05-07T19:46:49.3425909Z #define __FSID_T_TYPE struct { int __val[2]; } 2025-05-07T19:46:49.3426229Z #define __FSWORD_T_TYPE __SYSCALL_SLONG_TYPE 2025-05-07T19:46:49.3426510Z #define __FXSR__ 1 2025-05-07T19:46:49.3426880Z #define __GCC_ASM_FLAG_OUTPUTS__ 1 2025-05-07T19:46:49.3427157Z #define __GCC_ATOMIC_BOOL_LOCK_FREE 2 2025-05-07T19:46:49.3427443Z #define __GCC_ATOMIC_CHAR16_T_LOCK_FREE 2 2025-05-07T19:46:49.3427801Z #define __GCC_ATOMIC_CHAR32_T_LOCK_FREE 2 2025-05-07T19:46:49.3428085Z #define __GCC_ATOMIC_CHAR_LOCK_FREE 2 2025-05-07T19:46:49.3428363Z #define __GCC_ATOMIC_INT_LOCK_FREE 2 2025-05-07T19:46:49.3428647Z #define __GCC_ATOMIC_LLONG_LOCK_FREE 2 2025-05-07T19:46:49.3428927Z #define __GCC_ATOMIC_LONG_LOCK_FREE 2 2025-05-07T19:46:49.3429210Z #define __GCC_ATOMIC_POINTER_LOCK_FREE 2 2025-05-07T19:46:49.3429492Z #define __GCC_ATOMIC_SHORT_LOCK_FREE 2 2025-05-07T19:46:49.3429800Z #define __GCC_ATOMIC_TEST_AND_SET_TRUEVAL 1 2025-05-07T19:46:49.3430109Z #define __GCC_ATOMIC_WCHAR_T_LOCK_FREE 2 2025-05-07T19:46:49.3430412Z #define __GCC_HAVE_DWARF2_CFI_ASM 1 2025-05-07T19:46:49.3430704Z #define __GCC_HAVE_SYNC_COMPARE_AND_SWAP_1 1 2025-05-07T19:46:49.3431032Z #define __GCC_HAVE_SYNC_COMPARE_AND_SWAP_2 1 2025-05-07T19:46:49.3431359Z #define __GCC_HAVE_SYNC_COMPARE_AND_SWAP_4 1 2025-05-07T19:46:49.3431761Z #define __GCC_HAVE_SYNC_COMPARE_AND_SWAP_8 1 2025-05-07T19:46:49.3432273Z #define __GID_T_TYPE __U32_TYPE 2025-05-07T19:46:49.3432565Z #define __GLIBCXX_BITSIZE_INT_N_0 128 2025-05-07T19:46:49.3432891Z #define __GLIBCXX_TYPE_INT_N_0 __int128 2025-05-07T19:46:49.3433196Z #define __GLIBCXX__ 20230528 2025-05-07T19:46:49.3433489Z #define __GLIBC_HAVE_LONG_LONG 1 2025-05-07T19:46:49.3433769Z #define __GLIBC_MINOR__ 17 2025-05-07T19:46:49.3434200Z #define __GLIBC_PREREQ(maj,min) ((__GLIBC__ << 16) + __GLIBC_MINOR__ >= ((maj) << 16) + (min)) 2025-05-07T19:46:49.3434665Z #define __GLIBC__ 2 2025-05-07T19:46:49.3434898Z #define __GNUC_GNU_INLINE__ 1 2025-05-07T19:46:49.3435180Z #define __GNUC_MINOR__ 2 2025-05-07T19:46:49.3435437Z #define __GNUC_PATCHLEVEL__ 1 2025-05-07T19:46:49.3435864Z #define __GNUC_PREREQ(maj,min) ((__GNUC__ << 16) + __GNUC_MINOR__ >= ((maj) << 16) + (min)) 2025-05-07T19:46:49.3436303Z #define __GNUC_VA_LIST 2025-05-07T19:46:49.3436554Z #define __GNUC__ 4 2025-05-07T19:46:49.3436773Z #define __GNUG__ 4 2025-05-07T19:46:49.3437013Z #define __GNU_LIBRARY__ 6 2025-05-07T19:46:49.3437276Z #define __GXX_ABI_VERSION 1002 2025-05-07T19:46:49.3437572Z #define __GXX_EXPERIMENTAL_CXX0X__ 1 2025-05-07T19:46:49.3437877Z #define __GXX_RTTI 1 2025-05-07T19:46:49.3438110Z #define __GXX_WEAK__ 1 2025-05-07T19:46:49.3438356Z #define __HAVE_COLUMN 2025-05-07T19:46:49.3438596Z #define __HOST_CONFIG_H__ 2025-05-07T19:46:49.3438865Z #define __HOST_DEFINES_H__ 2025-05-07T19:46:49.3439125Z #define __ID_T_TYPE __U32_TYPE 2025-05-07T19:46:49.3439412Z #define __INO64_T_TYPE __UQUAD_TYPE 2025-05-07T19:46:49.3439707Z #define __INO_T_MATCHES_INO64_T 1 2025-05-07T19:46:49.3440017Z #define __INO_T_TYPE __SYSCALL_ULONG_TYPE 2025-05-07T19:46:49.3440325Z #define __INT16_C_SUFFIX__ 2025-05-07T19:46:49.3440595Z #define __INT16_FMTd__ "hd" 2025-05-07T19:46:49.3440863Z #define __INT16_FMTi__ "hi" 2025-05-07T19:46:49.3441114Z #define __INT16_MAX__ 32767 2025-05-07T19:46:49.3441388Z #define __INT16_TYPE__ short 2025-05-07T19:46:49.3441650Z #define __INT32_C_SUFFIX__ 2025-05-07T19:46:49.3441922Z #define __INT32_FMTd__ "d" 2025-05-07T19:46:49.3442174Z #define __INT32_FMTi__ "i" 2025-05-07T19:46:49.3442442Z #define __INT32_MAX__ 2147483647 2025-05-07T19:46:49.3442712Z #define __INT32_TYPE__ int 2025-05-07T19:46:49.3442988Z #define __INT64_C_SUFFIX__ L 2025-05-07T19:46:49.3443253Z #define __INT64_FMTd__ "ld" 2025-05-07T19:46:49.3443531Z #define __INT64_FMTi__ "li" 2025-05-07T19:46:49.3443805Z #define __INT64_MAX__ 9223372036854775807L 2025-05-07T19:46:49.3444125Z #define __INT64_TYPE__ long int 2025-05-07T19:46:49.3444526Z #define __INT8_C_SUFFIX__ 2025-05-07T19:46:49.3444773Z #define __INT8_FMTd__ "hhd" 2025-05-07T19:46:49.3445039Z #define __INT8_FMTi__ "hhi" 2025-05-07T19:46:49.3445369Z #define __INT8_MAX__ 127 2025-05-07T19:46:49.3445643Z #define __INT8_TYPE__ signed char 2025-05-07T19:46:49.3445921Z #define __INTMAX_C_SUFFIX__ L 2025-05-07T19:46:49.3446207Z #define __INTMAX_FMTd__ "ld" 2025-05-07T19:46:49.3446461Z #define __INTMAX_FMTi__ "li" 2025-05-07T19:46:49.3446739Z #define __INTMAX_MAX__ 9223372036854775807L 2025-05-07T19:46:49.3447095Z #define __INTMAX_TYPE__ long int 2025-05-07T19:46:49.3447370Z #define __INTMAX_WIDTH__ 64 2025-05-07T19:46:49.3447628Z #define __INTPTR_FMTd__ "ld" 2025-05-07T19:46:49.3447875Z #define __INTPTR_FMTi__ "li" 2025-05-07T19:46:49.3448140Z #define __INTPTR_MAX__ 9223372036854775807L 2025-05-07T19:46:49.3448434Z #define __INTPTR_TYPE__ long int 2025-05-07T19:46:49.3448707Z #define __INTPTR_WIDTH__ 64 2025-05-07T19:46:49.3448955Z #define __INT_FAST16_FMTd__ "hd" 2025-05-07T19:46:49.3449221Z #define __INT_FAST16_FMTi__ "hi" 2025-05-07T19:46:49.3449477Z #define __INT_FAST16_MAX__ 32767 2025-05-07T19:46:49.3449581Z #define __INT_FAST16_TYPE__ short 2025-05-07T19:46:49.3449675Z #define __INT_FAST16_WIDTH__ 16 2025-05-07T19:46:49.3449763Z #define __INT_FAST32_FMTd__ "d" 2025-05-07T19:46:49.3449864Z #define __INT_FAST32_FMTi__ "i" 2025-05-07T19:46:49.3449957Z #define __INT_FAST32_MAX__ 2147483647 2025-05-07T19:46:49.3450050Z #define __INT_FAST32_TYPE__ int 2025-05-07T19:46:49.3450141Z #define __INT_FAST32_WIDTH__ 32 2025-05-07T19:46:49.3450244Z #define __INT_FAST64_FMTd__ "ld" 2025-05-07T19:46:49.3450335Z #define __INT_FAST64_FMTi__ "li" 2025-05-07T19:46:49.3450450Z #define __INT_FAST64_MAX__ 9223372036854775807L 2025-05-07T19:46:49.3450560Z #define __INT_FAST64_TYPE__ long int 2025-05-07T19:46:49.3450653Z #define __INT_FAST64_WIDTH__ 64 2025-05-07T19:46:49.3450748Z #define __INT_FAST8_FMTd__ "hhd" 2025-05-07T19:46:49.3450839Z #define __INT_FAST8_FMTi__ "hhi" 2025-05-07T19:46:49.3450939Z #define __INT_FAST8_MAX__ 127 2025-05-07T19:46:49.3451046Z #define __INT_FAST8_TYPE__ signed char 2025-05-07T19:46:49.3451140Z #define __INT_FAST8_WIDTH__ 8 2025-05-07T19:46:49.3451246Z #define __INT_LEAST16_FMTd__ "hd" 2025-05-07T19:46:49.3451339Z #define __INT_LEAST16_FMTi__ "hi" 2025-05-07T19:46:49.3451435Z #define __INT_LEAST16_MAX__ 32767 2025-05-07T19:46:49.3451530Z #define __INT_LEAST16_TYPE__ short 2025-05-07T19:46:49.3451626Z #define __INT_LEAST16_WIDTH__ 16 2025-05-07T19:46:49.3451726Z #define __INT_LEAST32_FMTd__ "d" 2025-05-07T19:46:49.3451818Z #define __INT_LEAST32_FMTi__ "i" 2025-05-07T19:46:49.3452038Z #define __INT_LEAST32_MAX__ 2147483647 2025-05-07T19:46:49.3452124Z #define __INT_LEAST32_TYPE__ int 2025-05-07T19:46:49.3452211Z #define __INT_LEAST32_WIDTH__ 32 2025-05-07T19:46:49.3452298Z #define __INT_LEAST64_FMTd__ "ld" 2025-05-07T19:46:49.3452399Z #define __INT_LEAST64_FMTi__ "li" 2025-05-07T19:46:49.3452510Z #define __INT_LEAST64_MAX__ 9223372036854775807L 2025-05-07T19:46:49.3452606Z #define __INT_LEAST64_TYPE__ long int 2025-05-07T19:46:49.3452701Z #define __INT_LEAST64_WIDTH__ 64 2025-05-07T19:46:49.3452790Z #define __INT_LEAST8_FMTd__ "hhd" 2025-05-07T19:46:49.3452883Z #define __INT_LEAST8_FMTi__ "hhi" 2025-05-07T19:46:49.3452973Z #define __INT_LEAST8_MAX__ 127 2025-05-07T19:46:49.3453080Z #define __INT_LEAST8_TYPE__ signed char 2025-05-07T19:46:49.3453166Z #define __INT_LEAST8_WIDTH__ 8 2025-05-07T19:46:49.3453255Z #define __INT_MAX__ 2147483647 2025-05-07T19:46:49.3453350Z #define __INT_WIDTH__ 32 2025-05-07T19:46:49.3453442Z #define __KERNEL_STRICT_NAMES 2025-05-07T19:46:49.3453535Z #define __KEY_T_TYPE __S32_TYPE 2025-05-07T19:46:49.3453622Z #define __LDBL_DECIMAL_DIG__ 21 2025-05-07T19:46:49.3453765Z #define __LDBL_DENORM_MIN__ 3.64519953188247460253e-4951L 2025-05-07T19:46:49.3453845Z #define __LDBL_DIG__ 18 2025-05-07T19:46:49.3453961Z #define __LDBL_EPSILON__ 1.08420217248550443401e-19L 2025-05-07T19:46:49.3454056Z #define __LDBL_HAS_DENORM__ 1 2025-05-07T19:46:49.3454144Z #define __LDBL_HAS_INFINITY__ 1 2025-05-07T19:46:49.3454233Z #define __LDBL_HAS_QUIET_NAN__ 1 2025-05-07T19:46:49.3454325Z #define __LDBL_MANT_DIG__ 64 2025-05-07T19:46:49.3454463Z #define __LDBL_MAX_10_EXP__ 4932 2025-05-07T19:46:49.3454551Z #define __LDBL_MAX_EXP__ 16384 2025-05-07T19:46:49.3454660Z #define __LDBL_MAX__ 1.18973149535723176502e+4932L 2025-05-07T19:46:49.3454758Z #define __LDBL_MIN_10_EXP__ (-4931) 2025-05-07T19:46:49.3454846Z #define __LDBL_MIN_EXP__ (-16381) 2025-05-07T19:46:49.3454999Z #define __LDBL_MIN__ 3.36210314311209350626e-4932L 2025-05-07T19:46:49.3455116Z #define __LDBL_REDIR(name,proto) name proto 2025-05-07T19:46:49.3455240Z #define __LDBL_REDIR1(name,proto,alias) name proto 2025-05-07T19:46:49.3455404Z #define __LDBL_REDIR1_NTH(name,proto,alias) name proto __THROW 2025-05-07T19:46:49.3455492Z #define __LDBL_REDIR_DECL(name) 2025-05-07T19:46:49.3455638Z #define __LDBL_REDIR_NTH(name,proto) name proto __THROW 2025-05-07T19:46:49.3455713Z #define __LEAF 2025-05-07T19:46:49.3455796Z #define __LEAF_ATTR 2025-05-07T19:46:49.3455896Z #define __LIBRARY_TYPES_H__ 2025-05-07T19:46:49.3455979Z #define __LITTLE_ENDIAN 1234 2025-05-07T19:46:49.3456068Z #define __LITTLE_ENDIAN__ 1 2025-05-07T19:46:49.3456155Z #define __LLONG_WIDTH__ 64 2025-05-07T19:46:49.3456268Z #define __LONG_LONG_MAX__ 9223372036854775807LL 2025-05-07T19:46:49.3456363Z #define __LONG_LONG_PAIR(HI,LO) LO, HI 2025-05-07T19:46:49.3456456Z #define __LONG_MAX__ 9223372036854775807L 2025-05-07T19:46:49.3456557Z #define __LONG_WIDTH__ 64 2025-05-07T19:46:49.3456633Z #define __LP64__ 1 2025-05-07T19:46:49.3456945Z #define __MATHCALLX(function,suffix,args,attrib) __MATHDECLX (_Mdouble_,function,suffix, args, attrib) 2025-05-07T19:46:49.3457570Z #define __MATHDECLX(type,function,suffix,args,attrib) __MATHDECL_1(type, function,suffix, args) __attribute__ (attrib); __MATHDECL_1(type, __CONCAT(__,function),suffix, args) __attribute__ (attrib) 2025-05-07T19:46:49.3457661Z #define __MATH_DECLARE_LDOUBLE 1 2025-05-07T19:46:49.3457755Z #define __MATH_FUNCTIONS_HPP__ 2025-05-07T19:46:49.3457844Z #define __MATH_FUNCTIONS_H__ 2025-05-07T19:46:49.3457935Z #define __MMX__ 1 2025-05-07T19:46:49.3458028Z #define __MODE_T_TYPE __U32_TYPE 2025-05-07T19:46:49.3458112Z #define __N(msgid) (msgid) 2025-05-07T19:46:49.3458237Z #define __NFDBITS (8 * (int) sizeof (__fd_mask)) 2025-05-07T19:46:49.3458345Z #define __NLINK_T_TYPE __SYSCALL_ULONG_TYPE 2025-05-07T19:46:49.3458422Z #define __NO_CTYPE 1 2025-05-07T19:46:49.3458503Z #define __NO_INLINE__ 1 2025-05-07T19:46:49.3458604Z #define __NO_MATH_INLINES 1 2025-05-07T19:46:49.3458708Z #define __NTH(fct) __LEAF_ATTR fct throw () 2025-05-07T19:46:49.3458805Z #define __NVCC_DIAG_PRAGMA_SUPPORT__ 1 2025-05-07T19:46:49.3458900Z #define __NVCC__ 1 2025-05-07T19:46:49.3458992Z #define __NV_GLIBCXX_VERSION 40800 2025-05-07T19:46:49.3459079Z #define __NV_LEGACY_LAUNCH 1 2025-05-07T19:46:49.3459175Z #define __NV_NO_HOST_COMPILER_CHECK 1 2025-05-07T19:46:49.3459284Z #define __OBJC_BOOL_IS_BOOL 0 2025-05-07T19:46:49.3459373Z #define __OFF64_T_TYPE __SQUAD_TYPE 2025-05-07T19:46:49.3459465Z #define __OFF_T_MATCHES_OFF64_T 1 2025-05-07T19:46:49.3459580Z #define __OFF_T_TYPE __SYSCALL_SLONG_TYPE 2025-05-07T19:46:49.3459704Z #define __OPENCL_MEMORY_SCOPE_ALL_SVM_DEVICES 3 2025-05-07T19:46:49.3459798Z #define __OPENCL_MEMORY_SCOPE_DEVICE 2 2025-05-07T19:46:49.3459900Z #define __OPENCL_MEMORY_SCOPE_SUB_GROUP 4 2025-05-07T19:46:49.3460013Z #define __OPENCL_MEMORY_SCOPE_WORK_GROUP 1 2025-05-07T19:46:49.3460115Z #define __OPENCL_MEMORY_SCOPE_WORK_ITEM 0 2025-05-07T19:46:49.3460205Z #define __ORDER_BIG_ENDIAN__ 4321 2025-05-07T19:46:49.3460302Z #define __ORDER_LITTLE_ENDIAN__ 1234 2025-05-07T19:46:49.3460392Z #define __ORDER_PDP_ENDIAN__ 3412 2025-05-07T19:46:49.3460473Z #define __P(args) args 2025-05-07T19:46:49.3460568Z #define __PDP_ENDIAN 3412 2025-05-07T19:46:49.3460642Z #define __PIC__ 2 2025-05-07T19:46:49.3460731Z #define __PID_T_TYPE __S32_TYPE 2025-05-07T19:46:49.3460805Z #define __PIE__ 2 2025-05-07T19:46:49.3460897Z #define __PMT(args) args 2025-05-07T19:46:49.3460988Z #define __POINTER_WIDTH__ 64 2025-05-07T19:46:49.3461081Z #define __PRAGMA_REDEFINE_EXTNAME 1 2025-05-07T19:46:49.3461240Z #define __PTHREAD_MUTEX_HAVE_PREV 1 2025-05-07T19:46:49.3461360Z #define __PTHREAD_RWLOCK_INT_FLAGS_SHARED 1 2025-05-07T19:46:49.3461450Z #define __PTHREAD_SPINS 0, 0 2025-05-07T19:46:49.3461539Z #define __PTRDIFF_FMTd__ "ld" 2025-05-07T19:46:49.3461639Z #define __PTRDIFF_FMTi__ "li" 2025-05-07T19:46:49.3461743Z #define __PTRDIFF_MAX__ 9223372036854775807L 2025-05-07T19:46:49.3461887Z #define __PTRDIFF_TYPE__ long int 2025-05-07T19:46:49.3461982Z #define __PTRDIFF_WIDTH__ 64 2025-05-07T19:46:49.3462192Z #define __REDIRECT(name,proto,alias) name proto __asm__ (__ASMNAME (#alias)) 2025-05-07T19:46:49.3462391Z #define __REDIRECT_LDBL(name,proto,alias) __REDIRECT (name, proto, alias) 2025-05-07T19:46:49.3462631Z #define __REDIRECT_NTH(name,proto,alias) name proto __THROW __asm__ (__ASMNAME (#alias)) 2025-05-07T19:46:49.3462900Z #define __REDIRECT_NTHNL(name,proto,alias) name proto __THROWNL __asm__ (__ASMNAME (#alias)) 2025-05-07T19:46:49.3463128Z #define __REDIRECT_NTH_LDBL(name,proto,alias) __REDIRECT_NTH (name, proto, alias) 2025-05-07T19:46:49.3463220Z #define __REGISTER_PREFIX__ 2025-05-07T19:46:49.3463323Z #define __RLIM64_T_TYPE __UQUAD_TYPE 2025-05-07T19:46:49.3463427Z #define __RLIM_T_TYPE __SYSCALL_ULONG_TYPE 2025-05-07T19:46:49.3463514Z #define __S16_TYPE short int 2025-05-07T19:46:49.3463608Z #define __S32_TYPE int 2025-05-07T19:46:49.3463701Z #define __S64_TYPE long int 2025-05-07T19:46:49.3463787Z #define __SCHAR_MAX__ 127 2025-05-07T19:46:49.3463866Z #define __SEG_FS 1 2025-05-07T19:46:49.3463950Z #define __SEG_GS 1 2025-05-07T19:46:49.3464031Z #define __SHRT_MAX__ 32767 2025-05-07T19:46:49.3464114Z #define __SHRT_WIDTH__ 16 2025-05-07T19:46:49.3464209Z #define __SIG_ATOMIC_MAX__ 2147483647 2025-05-07T19:46:49.3464304Z #define __SIG_ATOMIC_WIDTH__ 32 2025-05-07T19:46:49.3464394Z #define __SIZEOF_DOUBLE__ 8 2025-05-07T19:46:49.3464486Z #define __SIZEOF_FLOAT128__ 16 2025-05-07T19:46:49.3464577Z #define __SIZEOF_FLOAT__ 4 2025-05-07T19:46:49.3464807Z #define __SIZEOF_INT128__ 16 2025-05-07T19:46:49.3464891Z #define __SIZEOF_INT__ 4 2025-05-07T19:46:49.3464986Z #define __SIZEOF_LONG_DOUBLE__ 16 2025-05-07T19:46:49.3465267Z #define __SIZEOF_LONG_LONG__ 8 2025-05-07T19:46:49.3465362Z #define __SIZEOF_LONG__ 8 2025-05-07T19:46:49.3465459Z #define __SIZEOF_POINTER__ 8 2025-05-07T19:46:49.3465577Z #define __SIZEOF_PTHREAD_ATTR_T 56 2025-05-07T19:46:49.3465695Z #define __SIZEOF_PTHREAD_BARRIERATTR_T 4 2025-05-07T19:46:49.3465803Z #define __SIZEOF_PTHREAD_BARRIER_T 32 2025-05-07T19:46:49.3465929Z #define __SIZEOF_PTHREAD_CONDATTR_T 4 2025-05-07T19:46:49.3466033Z #define __SIZEOF_PTHREAD_COND_T 48 2025-05-07T19:46:49.3466140Z #define __SIZEOF_PTHREAD_MUTEXATTR_T 4 2025-05-07T19:46:49.3466246Z #define __SIZEOF_PTHREAD_MUTEX_T 40 2025-05-07T19:46:49.3466369Z #define __SIZEOF_PTHREAD_RWLOCKATTR_T 8 2025-05-07T19:46:49.3466470Z #define __SIZEOF_PTHREAD_RWLOCK_T 56 2025-05-07T19:46:49.3466570Z #define __SIZEOF_PTRDIFF_T__ 8 2025-05-07T19:46:49.3466674Z #define __SIZEOF_SHORT__ 2 2025-05-07T19:46:49.3466768Z #define __SIZEOF_SIZE_T__ 8 2025-05-07T19:46:49.3466867Z #define __SIZEOF_WCHAR_T__ 4 2025-05-07T19:46:49.3466966Z #define __SIZEOF_WINT_T__ 4 2025-05-07T19:46:49.3467074Z #define __SIZE_FMTX__ "lX" 2025-05-07T19:46:49.3467167Z #define __SIZE_FMTo__ "lo" 2025-05-07T19:46:49.3467260Z #define __SIZE_FMTu__ "lu" 2025-05-07T19:46:49.3467369Z #define __SIZE_FMTx__ "lx" 2025-05-07T19:46:49.3467476Z #define __SIZE_MAX__ 18446744073709551615UL 2025-05-07T19:46:49.3467582Z #define __SIZE_TYPE__ long unsigned int 2025-05-07T19:46:49.3467677Z #define __SIZE_WIDTH__ 64 2025-05-07T19:46:49.3467780Z #define __SLONG32_TYPE int 2025-05-07T19:46:49.3467882Z #define __SLONGWORD_TYPE long int 2025-05-07T19:46:49.3467991Z #define __SM_20_ATOMIC_FUNCTIONS_HPP__ 2025-05-07T19:46:49.3468114Z #define __SM_20_ATOMIC_FUNCTIONS_H__ 2025-05-07T19:46:49.3468213Z #define __SM_20_INTRINSICS_HPP__ 2025-05-07T19:46:49.3468312Z #define __SM_20_INTRINSICS_H__ 2025-05-07T19:46:49.3468410Z #define __SM_30_INTRINSICS_HPP__ 2025-05-07T19:46:49.3468626Z #define __SM_30_INTRINSICS_H__ 2025-05-07T19:46:49.3468736Z #define __SM_32_ATOMIC_FUNCTIONS_HPP__ 2025-05-07T19:46:49.3468841Z #define __SM_32_ATOMIC_FUNCTIONS_H__ 2025-05-07T19:46:49.3468951Z #define __SM_32_INTRINSICS_HPP__ 2025-05-07T19:46:49.3469052Z #define __SM_32_INTRINSICS_H__ 2025-05-07T19:46:49.3469228Z #define __SM_35_ATOMIC_FUNCTIONS_H__ 2025-05-07T19:46:49.3469332Z #define __SM_35_INTRINSICS_H__ 2025-05-07T19:46:49.3469458Z #define __SM_60_ATOMIC_FUNCTIONS_HPP__ 2025-05-07T19:46:49.3469573Z #define __SM_60_ATOMIC_FUNCTIONS_H__ 2025-05-07T19:46:49.3469675Z #define __SM_61_INTRINSICS_HPP__ 2025-05-07T19:46:49.3469790Z #define __SM_61_INTRINSICS_H__ 2025-05-07T19:46:49.3469888Z #define __SM_70_RT_HPP__ 2025-05-07T19:46:49.3469979Z #define __SM_70_RT_H__ 2025-05-07T19:46:49.3470069Z #define __SM_80_RT_HPP__ 2025-05-07T19:46:49.3470175Z #define __SM_80_RT_H__ 2025-05-07T19:46:49.3470270Z #define __SM_90_RT_HPP__ 2025-05-07T19:46:49.3470361Z #define __SM_90_RT_H__ 2025-05-07T19:46:49.3470480Z #define __SQUAD_TYPE long int 2025-05-07T19:46:49.3470575Z #define __SSE2_MATH__ 1 2025-05-07T19:46:49.3470661Z #define __SSE2__ 1 2025-05-07T19:46:49.3470748Z #define __SSE_MATH__ 1 2025-05-07T19:46:49.3470847Z #define __SSE__ 1 2025-05-07T19:46:49.3470955Z #define __SSIZE_T_TYPE __SWORD_TYPE 2025-05-07T19:46:49.3471086Z #define __STDCPP_DEFAULT_NEW_ALIGNMENT__ 16UL 2025-05-07T19:46:49.3471221Z #define __STDCPP_MATH_SPEC_FUNCS__ 201003L 2025-05-07T19:46:49.3471326Z #define __STDCPP_THREADS__ 1 2025-05-07T19:46:49.3471496Z #define __STDC_HOSTED__ 1 2025-05-07T19:46:49.3471601Z #define __STDC_IEC_559_COMPLEX__ 1 2025-05-07T19:46:49.3471720Z #define __STDC_IEC_559__ 1 2025-05-07T19:46:49.3471821Z #define __STDC_ISO_10646__ 201103L 2025-05-07T19:46:49.3471920Z #define __STDC_NO_THREADS__ 1 2025-05-07T19:46:49.3472027Z #define __STDC_UTF_16__ 1 2025-05-07T19:46:49.3472118Z #define __STDC_UTF_32__ 1 2025-05-07T19:46:49.3472200Z #define __STDC__ 1 2025-05-07T19:46:49.3472285Z #define __STDDEF_H 2025-05-07T19:46:49.3472384Z #define __STRING(x) #x 2025-05-07T19:46:49.3472498Z #define __SURFACE_INDIRECT_FUNCTIONS_H__ 2025-05-07T19:46:49.3472593Z #define __SURFACE_TYPES_H__ 2025-05-07T19:46:49.3472729Z #define __SUSECONDS_T_TYPE __SYSCALL_SLONG_TYPE 2025-05-07T19:46:49.3472825Z #define __SWORD_TYPE long int 2025-05-07T19:46:49.3472951Z #define __SYSCALL_SLONG_TYPE __SLONGWORD_TYPE 2025-05-07T19:46:49.3473075Z #define __SYSCALL_ULONG_TYPE __ULONGWORD_TYPE 2025-05-07T19:46:49.3473177Z #define __SYSCALL_WORDSIZE 64 2025-05-07T19:46:49.3473289Z #define __TEXTURE_INDIRECT_FUNCTIONS_H__ 2025-05-07T19:46:49.3473384Z #define __TEXTURE_TYPES_H__ 2025-05-07T19:46:49.3473478Z #define __THROW throw () 2025-05-07T19:46:49.3473569Z #define __THROWNL throw () 2025-05-07T19:46:49.3473668Z #define __TIMER_T_TYPE void * 2025-05-07T19:46:49.3473781Z #define __TIME_T_TYPE __SYSCALL_SLONG_TYPE 2025-05-07T19:46:49.3473893Z #define __U16_TYPE unsigned short int 2025-05-07T19:46:49.3473987Z #define __U32_TYPE unsigned int 2025-05-07T19:46:49.3474093Z #define __U64_TYPE unsigned long int 2025-05-07T19:46:49.3474195Z #define __UID_T_TYPE __U32_TYPE 2025-05-07T19:46:49.3474288Z #define __UINT16_C_SUFFIX__ 2025-05-07T19:46:49.3474378Z #define __UINT16_FMTX__ "hX" 2025-05-07T19:46:49.3474468Z #define __UINT16_FMTo__ "ho" 2025-05-07T19:46:49.3474574Z #define __UINT16_FMTu__ "hu" 2025-05-07T19:46:49.3474662Z #define __UINT16_FMTx__ "hx" 2025-05-07T19:46:49.3474749Z #define __UINT16_MAX__ 65535 2025-05-07T19:46:49.3474866Z #define __UINT16_TYPE__ unsigned short 2025-05-07T19:46:49.3474955Z #define __UINT32_C_SUFFIX__ U 2025-05-07T19:46:49.3475041Z #define __UINT32_FMTX__ "X" 2025-05-07T19:46:49.3475133Z #define __UINT32_FMTo__ "o" 2025-05-07T19:46:49.3475239Z #define __UINT32_FMTu__ "u" 2025-05-07T19:46:49.3475325Z #define __UINT32_FMTx__ "x" 2025-05-07T19:46:49.3475429Z #define __UINT32_MAX__ 4294967295U 2025-05-07T19:46:49.3475546Z #define __UINT32_TYPE__ unsigned int 2025-05-07T19:46:49.3475640Z #define __UINT64_C_SUFFIX__ UL 2025-05-07T19:46:49.3475795Z #define __UINT64_FMTX__ "lX" 2025-05-07T19:46:49.3475891Z #define __UINT64_FMTo__ "lo" 2025-05-07T19:46:49.3475997Z #define __UINT64_FMTu__ "lu" 2025-05-07T19:46:49.3476088Z #define __UINT64_FMTx__ "lx" 2025-05-07T19:46:49.3476200Z #define __UINT64_MAX__ 18446744073709551615UL 2025-05-07T19:46:49.3476399Z #define __UINT64_TYPE__ long unsigned int 2025-05-07T19:46:49.3476489Z #define __UINT8_C_SUFFIX__ 2025-05-07T19:46:49.3476584Z #define __UINT8_FMTX__ "hhX" 2025-05-07T19:46:49.3476685Z #define __UINT8_FMTo__ "hho" 2025-05-07T19:46:49.3476776Z #define __UINT8_FMTu__ "hhu" 2025-05-07T19:46:49.3476870Z #define __UINT8_FMTx__ "hhx" 2025-05-07T19:46:49.3476958Z #define __UINT8_MAX__ 255 2025-05-07T19:46:49.3477078Z #define __UINT8_TYPE__ unsigned char 2025-05-07T19:46:49.3477175Z #define __UINTMAX_C_SUFFIX__ UL 2025-05-07T19:46:49.3477270Z #define __UINTMAX_FMTX__ "lX" 2025-05-07T19:46:49.3477378Z #define __UINTMAX_FMTo__ "lo" 2025-05-07T19:46:49.3477474Z #define __UINTMAX_FMTu__ "lu" 2025-05-07T19:46:49.3477570Z #define __UINTMAX_FMTx__ "lx" 2025-05-07T19:46:49.3477682Z #define __UINTMAX_MAX__ 18446744073709551615UL 2025-05-07T19:46:49.3477815Z #define __UINTMAX_TYPE__ long unsigned int 2025-05-07T19:46:49.3477908Z #define __UINTMAX_WIDTH__ 64 2025-05-07T19:46:49.3478001Z #define __UINTPTR_FMTX__ "lX" 2025-05-07T19:46:49.3478116Z #define __UINTPTR_FMTo__ "lo" 2025-05-07T19:46:49.3478208Z #define __UINTPTR_FMTu__ "lu" 2025-05-07T19:46:49.3478304Z #define __UINTPTR_FMTx__ "lx" 2025-05-07T19:46:49.3478419Z #define __UINTPTR_MAX__ 18446744073709551615UL 2025-05-07T19:46:49.3478544Z #define __UINTPTR_TYPE__ long unsigned int 2025-05-07T19:46:49.3478639Z #define __UINTPTR_WIDTH__ 64 2025-05-07T19:46:49.3478736Z #define __UINT_FAST16_FMTX__ "hX" 2025-05-07T19:46:49.3478845Z #define __UINT_FAST16_FMTo__ "ho" 2025-05-07T19:46:49.3478939Z #define __UINT_FAST16_FMTu__ "hu" 2025-05-07T19:46:49.3479033Z #define __UINT_FAST16_FMTx__ "hx" 2025-05-07T19:46:49.3479124Z #define __UINT_FAST16_MAX__ 65535 2025-05-07T19:46:49.3479249Z #define __UINT_FAST16_TYPE__ unsigned short 2025-05-07T19:46:49.3479347Z #define __UINT_FAST32_FMTX__ "X" 2025-05-07T19:46:49.3479442Z #define __UINT_FAST32_FMTo__ "o" 2025-05-07T19:46:49.3479547Z #define __UINT_FAST32_FMTu__ "u" 2025-05-07T19:46:49.3479642Z #define __UINT_FAST32_FMTx__ "x" 2025-05-07T19:46:49.3479745Z #define __UINT_FAST32_MAX__ 4294967295U 2025-05-07T19:46:49.3479851Z #define __UINT_FAST32_TYPE__ unsigned int 2025-05-07T19:46:49.3479957Z #define __UINT_FAST64_FMTX__ "lX" 2025-05-07T19:46:49.3480056Z #define __UINT_FAST64_FMTo__ "lo" 2025-05-07T19:46:49.3480149Z #define __UINT_FAST64_FMTu__ "lu" 2025-05-07T19:46:49.3480251Z #define __UINT_FAST64_FMTx__ "lx" 2025-05-07T19:46:49.3480378Z #define __UINT_FAST64_MAX__ 18446744073709551615UL 2025-05-07T19:46:49.3480499Z #define __UINT_FAST64_TYPE__ long unsigned int 2025-05-07T19:46:49.3480591Z #define __UINT_FAST8_FMTX__ "hhX" 2025-05-07T19:46:49.3480702Z #define __UINT_FAST8_FMTo__ "hho" 2025-05-07T19:46:49.3480800Z #define __UINT_FAST8_FMTu__ "hhu" 2025-05-07T19:46:49.3480893Z #define __UINT_FAST8_FMTx__ "hhx" 2025-05-07T19:46:49.3480996Z #define __UINT_FAST8_MAX__ 255 2025-05-07T19:46:49.3481104Z #define __UINT_FAST8_TYPE__ unsigned char 2025-05-07T19:46:49.3481201Z #define __UINT_LEAST16_FMTX__ "hX" 2025-05-07T19:46:49.3481310Z #define __UINT_LEAST16_FMTo__ "ho" 2025-05-07T19:46:49.3481408Z #define __UINT_LEAST16_FMTu__ "hu" 2025-05-07T19:46:49.3481504Z #define __UINT_LEAST16_FMTx__ "hx" 2025-05-07T19:46:49.3481596Z #define __UINT_LEAST16_MAX__ 65535 2025-05-07T19:46:49.3481722Z #define __UINT_LEAST16_TYPE__ unsigned short 2025-05-07T19:46:49.3481818Z #define __UINT_LEAST32_FMTX__ "X" 2025-05-07T19:46:49.3481916Z #define __UINT_LEAST32_FMTo__ "o" 2025-05-07T19:46:49.3482019Z #define __UINT_LEAST32_FMTu__ "u" 2025-05-07T19:46:49.3482113Z #define __UINT_LEAST32_FMTx__ "x" 2025-05-07T19:46:49.3482211Z #define __UINT_LEAST32_MAX__ 4294967295U 2025-05-07T19:46:49.3482323Z #define __UINT_LEAST32_TYPE__ unsigned int 2025-05-07T19:46:49.3482476Z #define __UINT_LEAST64_FMTX__ "lX" 2025-05-07T19:46:49.3482568Z #define __UINT_LEAST64_FMTo__ "lo" 2025-05-07T19:46:49.3482665Z #define __UINT_LEAST64_FMTu__ "lu" 2025-05-07T19:46:49.3482768Z #define __UINT_LEAST64_FMTx__ "lx" 2025-05-07T19:46:49.3482889Z #define __UINT_LEAST64_MAX__ 18446744073709551615UL 2025-05-07T19:46:49.3483065Z #define __UINT_LEAST64_TYPE__ long unsigned int 2025-05-07T19:46:49.3483165Z #define __UINT_LEAST8_FMTX__ "hhX" 2025-05-07T19:46:49.3483266Z #define __UINT_LEAST8_FMTo__ "hho" 2025-05-07T19:46:49.3483364Z #define __UINT_LEAST8_FMTu__ "hhu" 2025-05-07T19:46:49.3483455Z #define __UINT_LEAST8_FMTx__ "hhx" 2025-05-07T19:46:49.3483557Z #define __UINT_LEAST8_MAX__ 255 2025-05-07T19:46:49.3483670Z #define __UINT_LEAST8_TYPE__ unsigned char 2025-05-07T19:46:49.3483771Z #define __ULONG32_TYPE unsigned int 2025-05-07T19:46:49.3484003Z #define __ULONGWORD_TYPE unsigned long int 2025-05-07T19:46:49.3484119Z #define __UQUAD_TYPE unsigned long int 2025-05-07T19:46:49.3484215Z #define __USECONDS_T_TYPE __U32_TYPE 2025-05-07T19:46:49.3484305Z #define __USER_LABEL_PREFIX__ 2025-05-07T19:46:49.3484397Z #define __USE_ANSI 1 2025-05-07T19:46:49.3484478Z #define __USE_ATFILE 1 2025-05-07T19:46:49.3484555Z #define __USE_BSD 1 2025-05-07T19:46:49.3484645Z #define __USE_FORTIFY_LEVEL 0 2025-05-07T19:46:49.3484739Z #define __USE_GNU 1 2025-05-07T19:46:49.3484815Z #define __USE_ISOC11 1 2025-05-07T19:46:49.3484898Z #define __USE_ISOC95 1 2025-05-07T19:46:49.3484987Z #define __USE_ISOC99 1 2025-05-07T19:46:49.3485069Z #define __USE_ISOCXX11 1 2025-05-07T19:46:49.3485155Z #define __USE_LARGEFILE 1 2025-05-07T19:46:49.3485239Z #define __USE_LARGEFILE64 1 2025-05-07T19:46:49.3485327Z #define __USE_MISC 1 2025-05-07T19:46:49.3485406Z #define __USE_POSIX 1 2025-05-07T19:46:49.3485495Z #define __USE_POSIX199309 1 2025-05-07T19:46:49.3485594Z #define __USE_POSIX199506 1 2025-05-07T19:46:49.3485673Z #define __USE_POSIX2 1 2025-05-07T19:46:49.3485751Z #define __USE_SVID 1 2025-05-07T19:46:49.3485831Z #define __USE_UNIX98 1 2025-05-07T19:46:49.3485924Z #define __USE_XOPEN 1 2025-05-07T19:46:49.3486006Z #define __USE_XOPEN2K 1 2025-05-07T19:46:49.3486090Z #define __USE_XOPEN2K8 1 2025-05-07T19:46:49.3486192Z #define __USE_XOPEN2K8XSI 1 2025-05-07T19:46:49.3486279Z #define __USE_XOPEN2KXSI 1 2025-05-07T19:46:49.3486366Z #define __USE_XOPEN_EXTENDED 1 2025-05-07T19:46:49.3486465Z #define __USING_NAMESPACE_C99(name) 2025-05-07T19:46:49.3486577Z #define __USING_NAMESPACE_STD(name) 2025-05-07T19:46:49.3486675Z #define __UWORD_TYPE unsigned long int 2025-05-07T19:46:49.3486770Z #define __VECTOR_FUNCTIONS_HPP__ 2025-05-07T19:46:49.3486877Z #define __VECTOR_FUNCTIONS_H__ 2025-05-07T19:46:49.3486966Z #define __VECTOR_TYPES_H__ 2025-05-07T19:46:49.3487385Z #define __VERSION__ "Clang 16.0.6 (https://github.com/conda-forge/clangdev-feedstock db6970f6bb85e49860ed8bab43ebf165b5c55cc4)" 2025-05-07T19:46:49.3487499Z #define __WAIT_INT(status) (*(int *) &(status)) 2025-05-07T19:46:49.3487602Z #define __WAIT_STATUS void * 2025-05-07T19:46:49.3487694Z #define __WAIT_STATUS_DEFN void * 2025-05-07T19:46:49.3487780Z #define __WALL 0x40000000 2025-05-07T19:46:49.3487874Z #define __WCHAR_MAX__ 2147483647 2025-05-07T19:46:49.3487958Z #define __WCHAR_TYPE__ int 2025-05-07T19:46:49.3488041Z #define __WCHAR_WIDTH__ 32 2025-05-07T19:46:49.3488120Z #define __WCLONE 0x80000000 2025-05-07T19:46:49.3488255Z #define __WCOREDUMP(status) ((status) & __WCOREFLAG) 2025-05-07T19:46:49.3488337Z #define __WCOREFLAG 0x80 2025-05-07T19:46:49.3488473Z #define __WEXITSTATUS(status) (((status) & 0xff00) >> 8) 2025-05-07T19:46:49.3488632Z #define __WIFCONTINUED(status) ((status) == __W_CONTINUED) 2025-05-07T19:46:49.3488765Z #define __WIFEXITED(status) (__WTERMSIG(status) == 0) 2025-05-07T19:46:49.3488972Z #define __WIFSIGNALED(status) (((signed char) (((status) & 0x7f) + 1) >> 1) > 0) 2025-05-07T19:46:49.3489120Z #define __WIFSTOPPED(status) (((status) & 0xff) == 0x7f) 2025-05-07T19:46:49.3489214Z #define __WINT_MAX__ 4294967295U 2025-05-07T19:46:49.3489374Z #define __WINT_TYPE__ unsigned int 2025-05-07T19:46:49.3489462Z #define __WINT_UNSIGNED__ 1 2025-05-07T19:46:49.3489561Z #define __WINT_WIDTH__ 32 2025-05-07T19:46:49.3489645Z #define __WNOTHREAD 0x20000000 2025-05-07T19:46:49.3489728Z #define __WORDSIZE 64 2025-05-07T19:46:49.3489841Z #define __WORDSIZE_TIME64_COMPAT32 1 2025-05-07T19:46:49.3490391Z #define __WSTOPSIG(status) __WEXITSTATUS(status) 2025-05-07T19:46:49.3490495Z #define __WTERMSIG(status) ((status) & 0x7f) 2025-05-07T19:46:49.3490587Z #define __W_CONTINUED 0xffff 2025-05-07T19:46:49.3490712Z #define __W_EXITCODE(ret,sig) ((ret) << 8 | (sig)) 2025-05-07T19:46:49.3490816Z #define __W_STOPCODE(sig) ((sig) << 8 | 0x7f) 2025-05-07T19:46:49.3490896Z #define ____FILE_defined 1 2025-05-07T19:46:49.3490999Z #define ____mbstate_t_defined 1 2025-05-07T19:46:49.3491113Z #define __align__(n) __attribute__((aligned(n))) 2025-05-07T19:46:49.3491293Z #define __always_inline __inline __attribute__ ((__always_inline__)) 2025-05-07T19:46:49.3491372Z #define __amd64 1 2025-05-07T19:46:49.3491462Z #define __amd64__ 1 2025-05-07T19:46:49.3491563Z #define __annotate__(a) __attribute__((a)) 2025-05-07T19:46:49.3491663Z #define __attribute_artificial__ 2025-05-07T19:46:49.3491806Z #define __attribute_const__ __attribute__ ((__const__)) 2025-05-07T19:46:49.3491983Z #define __attribute_deprecated__ __attribute__ ((__deprecated__)) 2025-05-07T19:46:49.3492175Z #define __attribute_format_arg__(x) __attribute__ ((__format_arg__ (x))) 2025-05-07T19:46:49.3492425Z #define __attribute_format_strfmon__(a,b) __attribute__ ((__format__ (__strfmon__, a, b))) 2025-05-07T19:46:49.3492564Z #define __attribute_malloc__ __attribute__ ((__malloc__)) 2025-05-07T19:46:49.3492718Z #define __attribute_noinline__ __attribute__ ((__noinline__)) 2025-05-07T19:46:49.3492843Z #define __attribute_pure__ __attribute__ ((__pure__)) 2025-05-07T19:46:49.3492973Z #define __attribute_used__ __attribute__ ((__used__)) 2025-05-07T19:46:49.3493196Z #define __attribute_warn_unused_result__ __attribute__ ((__warn_unused_result__)) 2025-05-07T19:46:49.3493283Z #define __blkcnt_t_defined 2025-05-07T19:46:49.3493384Z #define __blksize_t_defined 2025-05-07T19:46:49.3493568Z #define __bos(ptr) __builtin_object_size (ptr, __USE_FORTIFY_LEVEL > 1) 2025-05-07T19:46:49.3493692Z #define __bos0(ptr) __builtin_object_size (ptr, 0) 2025-05-07T19:46:49.3493787Z #define __bounded 2025-05-07T19:46:49.3494374Z #define __bswap_16(x) (__extension__ ({ unsigned short int __v, __x = (unsigned short int) (x); if (__builtin_constant_p (__x)) __v = __bswap_constant_16 (__x); else __asm__ ("rorw $8, %w0" : "=r" (__v) : "0" (__x) : "cc"); __v; })) 2025-05-07T19:46:49.3494843Z #define __bswap_32(x) (__extension__ ({ unsigned int __v, __x = (x); if (__builtin_constant_p (__x)) __v = __bswap_constant_32 (__x); else __asm__ ("bswap %0" : "=r" (__v) : "0" (__x)); __v; })) 2025-05-07T19:46:49.3495304Z #define __bswap_64(x) (__extension__ ({ __uint64_t __v, __x = (x); if (__builtin_constant_p (__x)) __v = __bswap_constant_64 (__x); else __asm__ ("bswap %q0" : "=r" (__v) : "0" (__x)); __v; })) 2025-05-07T19:46:49.3495552Z #define __bswap_constant_16(x) ((unsigned short int) ((((x) >> 8) & 0xff) | (((x) & 0xff) << 8))) 2025-05-07T19:46:49.3495865Z #define __bswap_constant_32(x) ((((x) & 0xff000000) >> 24) | (((x) & 0x00ff0000) >> 8) | (((x) & 0x0000ff00) << 8) | (((x) & 0x000000ff) << 24)) 2025-05-07T19:46:49.3496785Z #define __bswap_constant_64(x) (__extension__ ((((x) & 0xff00000000000000ull) >> 56) | (((x) & 0x00ff000000000000ull) >> 40) | (((x) & 0x0000ff0000000000ull) >> 24) | (((x) & 0x000000ff00000000ull) >> 8) | (((x) & 0x00000000ff000000ull) << 8) | (((x) & 0x0000000000ff0000ull) << 24) | (((x) & 0x000000000000ff00ull) << 40) | (((x) & 0x00000000000000ffull) << 56))) 2025-05-07T19:46:49.3496890Z #define __builtin_align__(a) __align__(a) 2025-05-07T19:46:49.3496980Z #define __catch(X) catch(X) 2025-05-07T19:46:49.3497071Z #define __cdecl 2025-05-07T19:46:49.3497151Z #define __clang__ 1 2025-05-07T19:46:49.3497258Z #define __clang_literal_encoding__ "UTF-8" 2025-05-07T19:46:49.3497400Z #define __clang_major__ 16 2025-05-07T19:46:49.3497495Z #define __clang_minor__ 0 2025-05-07T19:46:49.3497589Z #define __clang_patchlevel__ 6 2025-05-07T19:46:49.3497989Z #define __clang_version__ "16.0.6 (https://github.com/conda-forge/clangdev-feedstock db6970f6bb85e49860ed8bab43ebf165b5c55cc4)" 2025-05-07T19:46:49.3498173Z #define __clang_wide_literal_encoding__ "UTF-32" 2025-05-07T19:46:49.3498261Z #define __clock_t_defined 1 2025-05-07T19:46:49.3498351Z #define __clockid_t_defined 1 2025-05-07T19:46:49.3498550Z #define __cluster_dims__(...) __attribute__((cluster_dims(__VA_ARGS__))) 2025-05-07T19:46:49.3498642Z #define __code_model_small__ 1 2025-05-07T19:46:49.3498745Z #define __constant__ __location__(constant) 2025-05-07T19:46:49.3498834Z #define __cplusplus 201703L 2025-05-07T19:46:49.3498947Z #define __cpp_aggregate_bases 201603L 2025-05-07T19:46:49.3499043Z #define __cpp_aggregate_nsdmi 201304L 2025-05-07T19:46:49.3499137Z #define __cpp_alias_templates 200704L 2025-05-07T19:46:49.3499232Z #define __cpp_aligned_new 201606L 2025-05-07T19:46:49.3499324Z #define __cpp_attributes 200809L 2025-05-07T19:46:49.3499417Z #define __cpp_binary_literals 201304L 2025-05-07T19:46:49.3499514Z #define __cpp_capture_star_this 201603L 2025-05-07T19:46:49.3499612Z #define __cpp_constexpr 201603L 2025-05-07T19:46:49.3499724Z #define __cpp_constexpr_in_decltype 201711L 2025-05-07T19:46:49.3499812Z #define __cpp_decltype 200707L 2025-05-07T19:46:49.3499922Z #define __cpp_decltype_auto 201304L 2025-05-07T19:46:49.3500020Z #define __cpp_deduction_guides 201703L 2025-05-07T19:46:49.3500139Z #define __cpp_delegating_constructors 200604L 2025-05-07T19:46:49.3500238Z #define __cpp_digit_separators 201309L 2025-05-07T19:46:49.3500362Z #define __cpp_enumerator_attributes 201411L 2025-05-07T19:46:49.3500456Z #define __cpp_exceptions 199711L 2025-05-07T19:46:49.3500557Z #define __cpp_fold_expressions 201603L 2025-05-07T19:46:49.3500678Z #define __cpp_generic_lambdas 201304L 2025-05-07T19:46:49.3500809Z #define __cpp_guaranteed_copy_elision 201606L 2025-05-07T19:46:49.3500905Z #define __cpp_hex_float 201603L 2025-05-07T19:46:49.3501027Z #define __cpp_if_constexpr 201606L 2025-05-07T19:46:49.3501147Z #define __cpp_impl_destroying_delete 201806L 2025-05-07T19:46:49.3501269Z #define __cpp_inheriting_constructors 201511L 2025-05-07T19:46:49.3501375Z #define __cpp_init_captures 201304L 2025-05-07T19:46:49.3501510Z #define __cpp_initializer_lists 200806L 2025-05-07T19:46:49.3501620Z #define __cpp_inline_variables 201606L 2025-05-07T19:46:49.3501712Z #define __cpp_lambdas 200907L 2025-05-07T19:46:49.3501847Z #define __cpp_lib_addressof_constexpr 201603 2025-05-07T19:46:49.3501956Z #define __cpp_lib_array_constexpr 201803L 2025-05-07T19:46:49.3502054Z #define __cpp_lib_as_const 201510 2025-05-07T19:46:49.3502155Z #define __cpp_lib_bool_constant 201505 2025-05-07T19:46:49.3502280Z #define __cpp_lib_exchange_function 201304 2025-05-07T19:46:49.3502433Z #define __cpp_lib_has_unique_object_representations 201606 2025-05-07T19:46:49.3502526Z #define __cpp_lib_hypot 201603 2025-05-07T19:46:49.3502636Z #define __cpp_lib_integer_sequence 201304 2025-05-07T19:46:49.3502764Z #define __cpp_lib_integral_constant_callable 201304 2025-05-07T19:46:49.3502863Z #define __cpp_lib_is_aggregate 201703 2025-05-07T19:46:49.3502953Z #define __cpp_lib_is_final 201402L 2025-05-07T19:46:49.3503063Z #define __cpp_lib_is_invocable 201703 2025-05-07T19:46:49.3503165Z #define __cpp_lib_is_null_pointer 201309 2025-05-07T19:46:49.3503263Z #define __cpp_lib_is_swappable 201603 2025-05-07T19:46:49.3503364Z #define __cpp_lib_launder 201606 2025-05-07T19:46:49.3503463Z #define __cpp_lib_logical_traits 201510 2025-05-07T19:46:49.3503587Z #define __cpp_lib_make_reverse_iterator 201402 2025-05-07T19:46:49.3503716Z #define __cpp_lib_math_special_functions 201603L 2025-05-07T19:46:49.3503992Z #define __cpp_lib_result_of_sfinae 201210 2025-05-07T19:46:49.3504131Z #define __cpp_lib_robust_nonmodifying_seq_ops 201304 2025-05-07T19:46:49.3504331Z #define __cpp_lib_transformation_trait_aliases 201304 2025-05-07T19:46:49.3504450Z #define __cpp_lib_tuple_element_t 201402L 2025-05-07T19:46:49.3504558Z #define __cpp_lib_tuples_by_type 201304 2025-05-07T19:46:49.3504707Z #define __cpp_lib_type_trait_variable_templates 201510L 2025-05-07T19:46:49.3504809Z #define __cpp_lib_void_t 201411 2025-05-07T19:46:49.3504984Z #define __cpp_named_character_escapes 202207L 2025-05-07T19:46:49.3505101Z #define __cpp_namespace_attributes 201411L 2025-05-07T19:46:49.3505235Z #define __cpp_nested_namespace_definitions 201411L 2025-05-07T19:46:49.3505369Z #define __cpp_noexcept_function_type 201510L 2025-05-07T19:46:49.3505484Z #define __cpp_nontype_template_args 201411L 2025-05-07T19:46:49.3505627Z #define __cpp_nontype_template_parameter_auto 201606L 2025-05-07T19:46:49.3505734Z #define __cpp_nsdmi 200809L 2025-05-07T19:46:49.3505840Z #define __cpp_range_based_for 201603L 2025-05-07T19:46:49.3505947Z #define __cpp_raw_strings 200710L 2025-05-07T19:46:49.3506056Z #define __cpp_ref_qualifiers 200710L 2025-05-07T19:46:49.3506176Z #define __cpp_return_type_deduction 201304L 2025-05-07T19:46:49.3506267Z #define __cpp_rtti 199711L 2025-05-07T19:46:49.3506372Z #define __cpp_rvalue_references 200610L 2025-05-07T19:46:49.3506476Z #define __cpp_static_assert 201411L 2025-05-07T19:46:49.3506586Z #define __cpp_static_call_operator 202207L 2025-05-07T19:46:49.3506700Z #define __cpp_structured_bindings 201606L 2025-05-07T19:46:49.3506808Z #define __cpp_template_auto 201606L 2025-05-07T19:46:49.3506924Z #define __cpp_threadsafe_static_init 200806L 2025-05-07T19:46:49.3507030Z #define __cpp_unicode_characters 200704L 2025-05-07T19:46:49.3507134Z #define __cpp_unicode_literals 200710L 2025-05-07T19:46:49.3507255Z #define __cpp_user_defined_literals 200809L 2025-05-07T19:46:49.3507361Z #define __cpp_variable_templates 201304L 2025-05-07T19:46:49.3507468Z #define __cpp_variadic_templates 200704L 2025-05-07T19:46:49.3507578Z #define __cpp_variadic_using 201611L 2025-05-07T19:46:49.3507688Z #define __cudaCDP2DeviceGetAttribute 2025-05-07T19:46:49.3507801Z #define __cudaCDP2DeviceGetCacheConfig 2025-05-07T19:46:49.3507902Z #define __cudaCDP2DeviceGetLimit 2025-05-07T19:46:49.3508033Z #define __cudaCDP2DeviceGetSharedMemConfig 2025-05-07T19:46:49.3508143Z #define __cudaCDP2EventCreateWithFlags 2025-05-07T19:46:49.3508250Z #define __cudaCDP2EventDestroy 2025-05-07T19:46:49.3508362Z #define __cudaCDP2EventRecord 2025-05-07T19:46:49.3508472Z #define __cudaCDP2EventRecordWithFlags 2025-05-07T19:46:49.3508596Z #define __cudaCDP2EventRecordWithFlags_ptsz 2025-05-07T19:46:49.3508702Z #define __cudaCDP2EventRecord_ptsz 2025-05-07T19:46:49.3508801Z #define __cudaCDP2Free 2025-05-07T19:46:49.3508906Z #define __cudaCDP2FuncGetAttributes 2025-05-07T19:46:49.3508998Z #define __cudaCDP2GetDevice 2025-05-07T19:46:49.3509106Z #define __cudaCDP2GetDeviceCount 2025-05-07T19:46:49.3509202Z #define __cudaCDP2GetErrorName 2025-05-07T19:46:49.3509302Z #define __cudaCDP2GetErrorString 2025-05-07T19:46:49.3509402Z #define __cudaCDP2GetLastError 2025-05-07T19:46:49.3509521Z #define __cudaCDP2GetParameterBuffer 2025-05-07T19:46:49.3509632Z #define __cudaCDP2GetParameterBufferV2 2025-05-07T19:46:49.3509726Z #define __cudaCDP2LaunchDevice 2025-05-07T19:46:49.3509834Z #define __cudaCDP2LaunchDeviceV2 2025-05-07T19:46:49.3509942Z #define __cudaCDP2LaunchDeviceV2_ptsz 2025-05-07T19:46:49.3510044Z #define __cudaCDP2LaunchDevice_ptsz 2025-05-07T19:46:49.3510131Z #define __cudaCDP2Malloc 2025-05-07T19:46:49.3510237Z #define __cudaCDP2Memcpy2DAsync 2025-05-07T19:46:49.3510338Z #define __cudaCDP2Memcpy2DAsync_ptsz 2025-05-07T19:46:49.3510434Z #define __cudaCDP2Memcpy3DAsync 2025-05-07T19:46:49.3510549Z #define __cudaCDP2Memcpy3DAsync_ptsz 2025-05-07T19:46:49.3510652Z #define __cudaCDP2MemcpyAsync 2025-05-07T19:46:49.3510757Z #define __cudaCDP2MemcpyAsync_ptsz 2025-05-07T19:46:49.3510864Z #define __cudaCDP2Memset2DAsync 2025-05-07T19:46:49.3510972Z #define __cudaCDP2Memset2DAsync_ptsz 2025-05-07T19:46:49.3511123Z #define __cudaCDP2Memset3DAsync 2025-05-07T19:46:49.3511229Z #define __cudaCDP2Memset3DAsync_ptsz 2025-05-07T19:46:49.3511343Z #define __cudaCDP2MemsetAsync 2025-05-07T19:46:49.3511539Z #define __cudaCDP2MemsetAsync_ptsz 2025-05-07T19:46:49.3511738Z #define __cudaCDP2OccupancyMaxActiveBlocksPerMultiprocessor 2025-05-07T19:46:49.3512246Z #define __cudaCDP2OccupancyMaxActiveBlocksPerMultiprocessorWithFlags 2025-05-07T19:46:49.3512355Z #define __cudaCDP2PeekAtLastError 2025-05-07T19:46:49.3512547Z #define __cudaCDP2RuntimeGetVersion 2025-05-07T19:46:49.3512663Z #define __cudaCDP2StreamCreateWithFlags 2025-05-07T19:46:49.3512780Z #define __cudaCDP2StreamDestroy 2025-05-07T19:46:49.3512885Z #define __cudaCDP2StreamWaitEvent 2025-05-07T19:46:49.3513001Z #define __cudaCDP2StreamWaitEvent_ptsz 2025-05-07T19:46:49.3513108Z #define __cudaGet_blockDim() blockDim 2025-05-07T19:46:49.3513210Z #define __cudaGet_blockIdx() blockIdx 2025-05-07T19:46:49.3513310Z #define __cudaGet_gridDim() gridDim 2025-05-07T19:46:49.3513424Z #define __cudaGet_threadIdx() threadIdx 2025-05-07T19:46:49.3513539Z #define __cudaGet_warpSize() warpSize 2025-05-07T19:46:49.3513688Z #define __cudart_builtin__ __location__(cudart_builtin) 2025-05-07T19:46:49.3513783Z #define __daddr_t_defined 2025-05-07T19:46:49.3513881Z #define __dev_t_defined 2025-05-07T19:46:49.3513987Z #define __device__ __location__(device) 2025-05-07T19:46:49.3514133Z #define __device_builtin__ __location__(device_builtin) 2025-05-07T19:46:49.3514385Z #define __device_builtin_surface_type__ __location__(device_builtin_surface_type) 2025-05-07T19:46:49.3514622Z #define __device_builtin_texture_type__ __location__(device_builtin_texture_type) 2025-05-07T19:46:49.3514765Z #define __errordecl(name,msg) extern void name (void) 2025-05-07T19:46:49.3514904Z #define __exctype(name) extern int name (int) __THROW 2025-05-07T19:46:49.3515102Z #define __exctype_l(name) extern int name (int, __locale_t) __THROW 2025-05-07T19:46:49.3515190Z #define __export__ 2025-05-07T19:46:49.3515452Z #define __extern_always_inline extern __always_inline __attribute__ ((__gnu_inline__)) 2025-05-07T19:46:49.3515669Z #define __extern_inline extern __inline __attribute__ ((__gnu_inline__)) 2025-05-07T19:46:49.3515752Z #define __flexarr [] 2025-05-07T19:46:49.3515932Z #define __forceinline__ __inline__ __attribute__((always_inline)) 2025-05-07T19:46:49.3516166Z #define __fortify_function __extern_always_inline __attribute_artificial__ 2025-05-07T19:46:49.3516265Z #define __fsblkcnt_t_defined 2025-05-07T19:46:49.3516359Z #define __fsfilcnt_t_defined 2025-05-07T19:46:49.3516447Z #define __gid_t_defined 2025-05-07T19:46:49.3516617Z #define __glibc_likely(cond) __builtin_expect((cond), 1) 2025-05-07T19:46:49.3516776Z #define __glibc_unlikely(cond) __builtin_expect((cond), 0) 2025-05-07T19:46:49.3517020Z #define __glibcxx_assert(cond) do { __glibcxx_constexpr_assert(cond); } while (false) 2025-05-07T19:46:49.3517142Z #define __glibcxx_class_requires(_a,_b) 2025-05-07T19:46:49.3517268Z #define __glibcxx_class_requires2(_a,_b,_c) 2025-05-07T19:46:49.3517395Z #define __glibcxx_class_requires3(_a,_b,_c,_d) 2025-05-07T19:46:49.3517527Z #define __glibcxx_class_requires4(_a,_b,_c,_d,_e) 2025-05-07T19:46:49.3517915Z #define __glibcxx_constexpr_assert(cond) if (__builtin_is_constant_evaluated() && !bool(cond)) __builtin_unreachable() 2025-05-07T19:46:49.3518125Z #define __glibcxx_digits10_b(T,B) (__glibcxx_digits_b (T,B) * 643L / 2136) 2025-05-07T19:46:49.3518298Z #define __glibcxx_digits_b(T,B) (B - __glibcxx_signed_b (T,B)) 2025-05-07T19:46:49.3518428Z #define __glibcxx_function_requires(...) 2025-05-07T19:46:49.3518535Z #define __glibcxx_integral_traps true 2025-05-07T19:46:49.3518854Z #define __glibcxx_max_b(T,B) (__glibcxx_signed_b (T,B) ? (((((T)1 << (__glibcxx_digits_b (T,B) - 1)) - 1) << 1) + 1) : ~(T)0) 2025-05-07T19:46:49.3519119Z #define __glibcxx_min_b(T,B) (__glibcxx_signed_b (T,B) ? -__glibcxx_max_b (T,B) - 1 : (T)0) 2025-05-07T19:46:49.3519327Z #define __glibcxx_requires_can_decrement_range(_First1,_Last1,_First2) 2025-05-07T19:46:49.3519535Z #define __glibcxx_requires_can_increment(_First,_Size) 2025-05-07T19:46:49.3519755Z #define __glibcxx_requires_can_increment_range(_First1,_Last1,_First2) 2025-05-07T19:46:49.3519876Z #define __glibcxx_requires_cond(_Cond,_Msg) 2025-05-07T19:46:49.3520004Z #define __glibcxx_requires_heap(_First,_Last) 2025-05-07T19:46:49.3520214Z #define __glibcxx_requires_heap_pred(_First,_Last,_Pred) 2025-05-07T19:46:49.3520365Z #define __glibcxx_requires_irreflexive(_First,_Last) 2025-05-07T19:46:49.3520517Z #define __glibcxx_requires_irreflexive2(_First,_Last) 2025-05-07T19:46:49.3520708Z #define __glibcxx_requires_irreflexive_pred(_First,_Last,_Pred) 2025-05-07T19:46:49.3520906Z #define __glibcxx_requires_irreflexive_pred2(_First,_Last,_Pred) 2025-05-07T19:46:49.3521065Z #define __glibcxx_requires_non_empty_range(_First,_Last) 2025-05-07T19:46:49.3521177Z #define __glibcxx_requires_nonempty() 2025-05-07T19:46:49.3521386Z #define __glibcxx_requires_partitioned_lower(_First,_Last,_Value) 2025-05-07T19:46:49.3521622Z #define __glibcxx_requires_partitioned_lower_pred(_First,_Last,_Value,_Pred) 2025-05-07T19:46:49.3521818Z #define __glibcxx_requires_partitioned_upper(_First,_Last,_Value) 2025-05-07T19:46:49.3522055Z #define __glibcxx_requires_partitioned_upper_pred(_First,_Last,_Value,_Pred) 2025-05-07T19:46:49.3522199Z #define __glibcxx_requires_sorted(_First,_Last) 2025-05-07T19:46:49.3522367Z #define __glibcxx_requires_sorted_pred(_First,_Last,_Pred) 2025-05-07T19:46:49.3522540Z #define __glibcxx_requires_sorted_set(_First1,_Last1,_First2) 2025-05-07T19:46:49.3522767Z #define __glibcxx_requires_sorted_set_pred(_First1,_Last1,_First2,_Pred) 2025-05-07T19:46:49.3522884Z #define __glibcxx_requires_string(_String) 2025-05-07T19:46:49.3523026Z #define __glibcxx_requires_string_len(_String,_Len) 2025-05-07T19:46:49.3523156Z #define __glibcxx_requires_subscript(_N) 2025-05-07T19:46:49.3523297Z #define __glibcxx_requires_valid_range(_First,_Last) 2025-05-07T19:46:49.3523414Z #define __glibcxx_signed_b(T,B) ((T)(-1) < 0) 2025-05-07T19:46:49.3523518Z #define __global__ __location__(global) 2025-05-07T19:46:49.3523633Z #define __gnu_linux__ 1 2025-05-07T19:46:49.3523771Z #define __grid_constant__ __location__(grid_constant) 2025-05-07T19:46:49.3523873Z #define __have_pthread_attr_t 1 2025-05-07T19:46:49.3523992Z #define __host__ __location__(host) 2025-05-07T19:46:49.3524199Z #define __id_t_defined 2025-05-07T19:46:49.3524280Z #define __import__ 2025-05-07T19:46:49.3524418Z #define __inline_hint__ __attribute__((nv_inline_hint)) 2025-05-07T19:46:49.3524568Z #define __ino64_t_defined 2025-05-07T19:46:49.3524664Z #define __ino_t_defined 2025-05-07T19:46:49.3524748Z #define __int8_t_defined 2025-05-07T19:46:49.3524959Z #define __intN_t(N,MODE) typedef int int##N##_t __attribute__ ((__mode__ (MODE))) 2025-05-07T19:46:49.3525096Z #define __isalnum_l(c,l) __isctype_l((c), _ISalnum, (l)) 2025-05-07T19:46:49.3525243Z #define __isalpha_l(c,l) __isctype_l((c), _ISalpha, (l)) 2025-05-07T19:46:49.3525341Z #define __isascii(c) (((c) & ~0x7f) == 0) 2025-05-07T19:46:49.3525448Z #define __isascii_l(c,l) ((l), __isascii (c)) 2025-05-07T19:46:49.3525592Z #define __isblank_l(c,l) __isctype_l((c), _ISblank, (l)) 2025-05-07T19:46:49.3525727Z #define __iscntrl_l(c,l) __isctype_l((c), _IScntrl, (l)) 2025-05-07T19:46:49.3525989Z #define __isctype_l(c,type,locale) ((locale)->__ctype_b[(int) (c)] & (unsigned short int) type) 2025-05-07T19:46:49.3526134Z #define __isdigit_l(c,l) __isctype_l((c), _ISdigit, (l)) 2025-05-07T19:46:49.3526269Z #define __isgraph_l(c,l) __isctype_l((c), _ISgraph, (l)) 2025-05-07T19:46:49.3526456Z #define __isleap(year) ((year) % 4 == 0 && ((year) % 100 != 0 || (year) % 400 == 0)) 2025-05-07T19:46:49.3526595Z #define __islower_l(c,l) __isctype_l((c), _ISlower, (l)) 2025-05-07T19:46:49.3526733Z #define __isprint_l(c,l) __isctype_l((c), _ISprint, (l)) 2025-05-07T19:46:49.3526871Z #define __ispunct_l(c,l) __isctype_l((c), _ISpunct, (l)) 2025-05-07T19:46:49.3527060Z #define __isspace_l(c,l) __isctype_l((c), _ISspace, (l)) 2025-05-07T19:46:49.3527209Z #define __isupper_l(c,l) __isctype_l((c), _ISupper, (l)) 2025-05-07T19:46:49.3527353Z #define __isxdigit_l(c,l) __isctype_l((c), _ISxdigit, (l)) 2025-05-07T19:46:49.3527433Z #define __k8 1 2025-05-07T19:46:49.3527519Z #define __k8__ 1 2025-05-07T19:46:49.3527652Z #define __key_t_defined 2025-05-07T19:46:49.3527837Z #define __launch_bounds__(...) __annotate__(launch_bounds(__VA_ARGS__)) 2025-05-07T19:46:49.3527925Z #define __ldiv_t_defined 1 2025-05-07T19:46:49.3528018Z #define __linux 1 2025-05-07T19:46:49.3528181Z #define __linux__ 1 2025-05-07T19:46:49.3528267Z #define __lldiv_t_defined 1 2025-05-07T19:46:49.3528536Z #define __llvm__ 1 2025-05-07T19:46:49.3528637Z #define __location__(a) __annotate__(a) 2025-05-07T19:46:49.3528741Z #define __long_double_t long double 2025-05-07T19:46:49.3528843Z #define __malloc_and_calloc_defined 2025-05-07T19:46:49.3528958Z #define __managed__ __location__(managed) 2025-05-07T19:46:49.3529088Z #define __maxnreg__(a) __attribute__((maxnreg(a))) 2025-05-07T19:46:49.3529176Z #define __mode_t_defined 2025-05-07T19:46:49.3529272Z #define __need_IOV_MAX 2025-05-07T19:46:49.3529361Z #define __need_clockid_t 2025-05-07T19:46:49.3529448Z #define __nlink_t_defined 2025-05-07T19:46:49.3529566Z #define __no_return__ __attribute__((noreturn)) 2025-05-07T19:46:49.3529695Z #define __noinline__ __attribute__((noinline)) 2025-05-07T19:46:49.3529860Z #define __nonnull(params) __attribute__ ((__nonnull__ params)) 2025-05-07T19:46:49.3529964Z #define __nv_pure__ __location__(nv_pure) 2025-05-07T19:46:49.3530063Z #define __off64_t_defined 2025-05-07T19:46:49.3530302Z #define __off_t_defined 2025-05-07T19:46:49.3530380Z #define __pic__ 2 2025-05-07T19:46:49.3530465Z #define __pid_t_defined 2025-05-07T19:46:49.3530551Z #define __pie__ 2 2025-05-07T19:46:49.3530646Z #define __private_extern__ extern 2025-05-07T19:46:49.3530729Z #define __ptr_t void * 2025-05-07T19:46:49.3530822Z #define __ptrvalue 2025-05-07T19:46:49.3530912Z #define __restrict_arr 2025-05-07T19:46:49.3531047Z #define __seg_fs __attribute__((address_space(257))) 2025-05-07T19:46:49.3531180Z #define __seg_gs __attribute__((address_space(256))) 2025-05-07T19:46:49.3531292Z #define __shared__ __location__(shared) 2025-05-07T19:46:49.3531382Z #define __sigset_t_defined 2025-05-07T19:46:49.3531485Z #define __specialization_static 2025-05-07T19:46:49.3531587Z #define __ssize_t_defined 2025-05-07T19:46:49.3531674Z #define __stub_bdflush 2025-05-07T19:46:49.3531758Z #define __stub_chflags 2025-05-07T19:46:49.3531842Z #define __stub_fattach 2025-05-07T19:46:49.3531948Z #define __stub_fchflags 2025-05-07T19:46:49.3532036Z #define __stub_fdetach 2025-05-07T19:46:49.3532117Z #define __stub_getmsg 2025-05-07T19:46:49.3532212Z #define __stub_gtty 2025-05-07T19:46:49.3532297Z #define __stub_lchmod 2025-05-07T19:46:49.3532383Z #define __stub_putmsg 2025-05-07T19:46:49.3532464Z #define __stub_revoke 2025-05-07T19:46:49.3532564Z #define __stub_setlogin 2025-05-07T19:46:49.3532658Z #define __stub_sigreturn 2025-05-07T19:46:49.3532736Z #define __stub_sstk 2025-05-07T19:46:49.3532829Z #define __stub_stty 2025-05-07T19:46:49.3532925Z #define __suseconds_t_defined 2025-05-07T19:46:49.3533013Z #define __thread__ __thread 2025-05-07T19:46:49.3533117Z #define __throw_exception_again throw 2025-05-07T19:46:49.3533216Z #define __time_t_defined 1 2025-05-07T19:46:49.3533308Z #define __timer_t_defined 1 2025-05-07T19:46:49.3533402Z #define __timespec_defined 1 2025-05-07T19:46:49.3533505Z #define __toascii(c) ((c) & 0x7f) 2025-05-07T19:46:49.3533618Z #define __toascii_l(c,l) ((l), __toascii (c)) 2025-05-07T19:46:49.3534180Z #define __tobody(c,f,a,args) (__extension__ ({ int __res; if (sizeof (c) > 1) { if (__builtin_constant_p (c)) { int __c = (c); __res = __c < -128 || __c > 255 ? __c : (a)[__c]; } else __res = f args; } else __res = (a)[(int) (c)]; __res; })) 2025-05-07T19:46:49.3534271Z #define __try try 2025-05-07T19:46:49.3534351Z #define __tune_k8__ 1 2025-05-07T19:46:49.3534548Z #define __u_char_defined 2025-05-07T19:46:49.3534818Z #define __u_intN_t(N,MODE) typedef unsigned int u_int##N##_t __attribute__ ((__mode__ (MODE))) 2025-05-07T19:46:49.3534915Z #define __uid_t_defined 2025-05-07T19:46:49.3534999Z #define __unbounded 2025-05-07T19:46:49.3535079Z #define __unix 1 2025-05-07T19:46:49.3535224Z #define __unix__ 1 2025-05-07T19:46:49.3535316Z #define __useconds_t_defined 2025-05-07T19:46:49.3535403Z #define __warnattr(msg) 2025-05-07T19:46:49.3535540Z #define __warndecl(name,msg) extern void name (void) 2025-05-07T19:46:49.3535634Z #define __wur 2025-05-07T19:46:49.3535722Z #define __x86_64 1 2025-05-07T19:46:49.3535804Z #define __x86_64__ 1 2025-05-07T19:46:49.3535980Z #define _tolower(c) ((int) (*__ctype_tolower_loc ())[(int) (c)]) 2025-05-07T19:46:49.3536149Z #define _toupper(c) ((int) (*__ctype_toupper_loc ())[(int) (c)]) 2025-05-07T19:46:49.3536265Z #define alloca(size) __builtin_alloca (size) 2025-05-07T19:46:49.3536622Z #define assert(expr) ((expr) ? __ASSERT_VOID_CAST (0) : __assert_fail (__STRING(expr), __FILE__, __LINE__, __ASSERT_FUNCTION)) 2025-05-07T19:46:49.3537035Z #define assert_perror(errnum) (!(errnum) ? __ASSERT_VOID_CAST (0) : __assert_perror_fail ((errnum), __FILE__, __LINE__, __ASSERT_FUNCTION)) 2025-05-07T19:46:49.3537135Z #define be16toh(x) __bswap_16 (x) 2025-05-07T19:46:49.3537234Z #define be32toh(x) __bswap_32 (x) 2025-05-07T19:46:49.3537342Z #define be64toh(x) __bswap_64 (x) 2025-05-07T19:46:49.3537453Z #define cudaArrayColorAttachment 0x20 2025-05-07T19:46:49.3537552Z #define cudaArrayCubemap 0x04 2025-05-07T19:46:49.3537666Z #define cudaArrayDefault 0x00 2025-05-07T19:46:49.3537776Z #define cudaArrayDeferredMapping 0x80 2025-05-07T19:46:49.3537873Z #define cudaArrayLayered 0x01 2025-05-07T19:46:49.3537971Z #define cudaArraySparse 0x40 2025-05-07T19:46:49.3538134Z #define cudaArraySparsePropertiesSingleMipTail 0x1 2025-05-07T19:46:49.3538243Z #define cudaArraySurfaceLoadStore 0x02 2025-05-07T19:46:49.3538351Z #define cudaArrayTextureGather 0x08 2025-05-07T19:46:49.3538545Z #define cudaCooperativeLaunchMultiDeviceNoPostSync 0x02 2025-05-07T19:46:49.3538712Z #define cudaCooperativeLaunchMultiDeviceNoPreSync 0x01 2025-05-07T19:46:49.3538815Z #define cudaCpuDeviceId ((int)-1) 2025-05-07T19:46:49.3538925Z #define cudaDeviceBlockingSync 0x04 2025-05-07T19:46:49.3539050Z #define cudaDeviceLmemResizeToMax 0x10 2025-05-07T19:46:49.3539149Z #define cudaDeviceMapHost 0x08 2025-05-07T19:46:49.3539244Z #define cudaDeviceMask 0xff 2025-05-07T19:46:49.3539364Z #define cudaDeviceScheduleAuto 0x00 2025-05-07T19:46:49.3539487Z #define cudaDeviceScheduleBlockingSync 0x04 2025-05-07T19:46:49.3539595Z #define cudaDeviceScheduleMask 0x07 2025-05-07T19:46:49.3539704Z #define cudaDeviceScheduleSpin 0x01 2025-05-07T19:46:49.3539827Z #define cudaDeviceScheduleYield 0x02 2025-05-07T19:46:49.3539928Z #define cudaDeviceSyncMemops 0x80 2025-05-07T19:46:49.3540032Z #define cudaEventBlockingSync 0x01 2025-05-07T19:46:49.3540143Z #define cudaEventDefault 0x00 2025-05-07T19:46:49.3540254Z #define cudaEventDisableTiming 0x02 2025-05-07T19:46:49.3540358Z #define cudaEventInterprocess 0x04 2025-05-07T19:46:49.3540478Z #define cudaEventRecordDefault 0x00 2025-05-07T19:46:49.3540591Z #define cudaEventRecordExternal 0x01 2025-05-07T19:46:49.3540691Z #define cudaEventWaitDefault 0x00 2025-05-07T19:46:49.3540796Z #define cudaEventWaitExternal 0x01 2025-05-07T19:46:49.3540923Z #define cudaExternalMemoryDedicated 0x1 2025-05-07T19:46:49.3541123Z #define cudaExternalSemaphoreSignalSkipNvSciBufMemSync 0x01 2025-05-07T19:46:49.3541307Z #define cudaExternalSemaphoreWaitSkipNvSciBufMemSync 0x02 2025-05-07T19:46:49.3541498Z #define cudaGetDeviceProperties cudaGetDeviceProperties_v2 2025-05-07T19:46:49.3541624Z #define cudaGraphKernelNodePortDefault 0 2025-05-07T19:46:49.3541769Z #define cudaGraphKernelNodePortLaunchCompletion 2 2025-05-07T19:46:49.3541902Z #define cudaGraphKernelNodePortProgrammatic 1 2025-05-07T19:46:49.3542017Z #define cudaHostAllocDefault 0x00 2025-05-07T19:46:49.3542179Z #define cudaHostAllocMapped 0x02 2025-05-07T19:46:49.3542398Z #define cudaHostAllocPortable 0x01 2025-05-07T19:46:49.3542520Z #define cudaHostAllocWriteCombined 0x04 2025-05-07T19:46:49.3542622Z #define cudaHostRegisterDefault 0x00 2025-05-07T19:46:49.3542727Z #define cudaHostRegisterIoMemory 0x04 2025-05-07T19:46:49.3542894Z #define cudaHostRegisterMapped 0x02 2025-05-07T19:46:49.3543010Z #define cudaHostRegisterPortable 0x01 2025-05-07T19:46:49.3543116Z #define cudaHostRegisterReadOnly 0x08 2025-05-07T19:46:49.3543223Z #define cudaInitDeviceFlagsAreValid 0x01 2025-05-07T19:46:49.3543333Z #define cudaInvalidDeviceId ((int)-2) 2025-05-07T19:46:49.3543464Z #define cudaIpcMemLazyEnablePeerAccess 0x01 2025-05-07T19:46:49.3543605Z #define cudaKernelNodeAttrID cudaLaunchAttributeID 2025-05-07T19:46:49.3543792Z #define cudaKernelNodeAttrValue cudaLaunchAttributeValue 2025-05-07T19:46:49.3544109Z #define cudaKernelNodeAttributeAccessPolicyWindow cudaLaunchAttributeAccessPolicyWindow 2025-05-07T19:46:49.3544407Z #define cudaKernelNodeAttributeClusterDimension cudaLaunchAttributeClusterDimension 2025-05-07T19:46:49.3544889Z #define cudaKernelNodeAttributeClusterSchedulingPolicyPreference cudaLaunchAttributeClusterSchedulingPolicyPreference 2025-05-07T19:46:49.3545151Z #define cudaKernelNodeAttributeCooperative cudaLaunchAttributeCooperative 2025-05-07T19:46:49.3545544Z #define cudaKernelNodeAttributeDeviceUpdatableKernelNode cudaLaunchAttributeDeviceUpdatableKernelNode 2025-05-07T19:46:49.3545809Z #define cudaKernelNodeAttributeMemSyncDomain cudaLaunchAttributeMemSyncDomain 2025-05-07T19:46:49.3546124Z #define cudaKernelNodeAttributeMemSyncDomainMap cudaLaunchAttributeMemSyncDomainMap 2025-05-07T19:46:49.3546556Z #define cudaKernelNodeAttributePreferredSharedMemoryCarveout cudaLaunchAttributePreferredSharedMemoryCarveout 2025-05-07T19:46:49.3546775Z #define cudaKernelNodeAttributePriority cudaLaunchAttributePriority 2025-05-07T19:46:49.3546898Z #define cudaMemAttachGlobal 0x01 2025-05-07T19:46:49.3547004Z #define cudaMemAttachHost 0x02 2025-05-07T19:46:49.3547108Z #define cudaMemAttachSingle 0x04 2025-05-07T19:46:49.3547233Z #define cudaNvSciSyncAttrSignal 0x1 2025-05-07T19:46:49.3547341Z #define cudaNvSciSyncAttrWait 0x2 2025-05-07T19:46:49.3547448Z #define cudaOccupancyDefault 0x00 2025-05-07T19:46:49.3547592Z #define cudaOccupancyDisableCachingOverride 0x01 2025-05-07T19:46:49.3547720Z #define cudaPeerAccessDefault 0x00 2025-05-07T19:46:49.3548059Z #define cudaSignalExternalSemaphoresAsync __CUDART_API_PTSZ(cudaSignalExternalSemaphoresAsync_v2) 2025-05-07T19:46:49.3548193Z #define cudaStreamAttrID cudaLaunchAttributeID 2025-05-07T19:46:49.3548362Z #define cudaStreamAttrValue cudaLaunchAttributeValue 2025-05-07T19:46:49.3548656Z #define cudaStreamAttributeAccessPolicyWindow cudaLaunchAttributeAccessPolicyWindow 2025-05-07T19:46:49.3548902Z #define cudaStreamAttributeMemSyncDomain cudaLaunchAttributeMemSyncDomain 2025-05-07T19:46:49.3549178Z #define cudaStreamAttributeMemSyncDomainMap cudaLaunchAttributeMemSyncDomainMap 2025-05-07T19:46:49.3549397Z #define cudaStreamAttributePriority cudaLaunchAttributePriority 2025-05-07T19:46:49.3549722Z #define cudaStreamAttributeSynchronizationPolicy cudaLaunchAttributeSynchronizationPolicy 2025-05-07T19:46:49.3549831Z #define cudaStreamDefault 0x00 2025-05-07T19:46:49.3549990Z #define cudaStreamFireAndForget ((cudaStream_t)0x4) 2025-05-07T19:46:49.3550248Z #define cudaStreamGetCaptureInfo __CUDART_API_PTSZ(cudaStreamGetCaptureInfo_v2) 2025-05-07T19:46:49.3550458Z #define cudaStreamGraphFireAndForget (cudaStream_t)0x0200000000000000 2025-05-07T19:46:49.3550729Z #define cudaStreamGraphFireAndForgetAsSibling (cudaStream_t)0x0300000000000000 2025-05-07T19:46:49.3550925Z #define cudaStreamGraphTailLaunch (cudaStream_t)0x0100000000000000 2025-05-07T19:46:49.3551050Z #define cudaStreamLegacy ((cudaStream_t)0x1) 2025-05-07T19:46:49.3551177Z #define cudaStreamNonBlocking 0x01 2025-05-07T19:46:49.3551309Z #define cudaStreamPerThread ((cudaStream_t)0x2) 2025-05-07T19:46:49.3551576Z #define cudaStreamTailLaunch ((cudaStream_t)0x3) 2025-05-07T19:46:49.3551687Z #define cudaSurfaceType1D 0x01 2025-05-07T19:46:49.3551987Z #define cudaSurfaceType1DLayered 0xF1 2025-05-07T19:46:49.3552098Z #define cudaSurfaceType2D 0x02 2025-05-07T19:46:49.3552217Z #define cudaSurfaceType2DLayered 0xF2 2025-05-07T19:46:49.3552342Z #define cudaSurfaceType3D 0x03 2025-05-07T19:46:49.3552516Z #define cudaSurfaceTypeCubemap 0x0C 2025-05-07T19:46:49.3552650Z #define cudaSurfaceTypeCubemapLayered 0xFC 2025-05-07T19:46:49.3552831Z #define cudaTextureType1D 0x01 2025-05-07T19:46:49.3552967Z #define cudaTextureType1DLayered 0xF1 2025-05-07T19:46:49.3553075Z #define cudaTextureType2D 0x02 2025-05-07T19:46:49.3553193Z #define cudaTextureType2DLayered 0xF2 2025-05-07T19:46:49.3553321Z #define cudaTextureType3D 0x03 2025-05-07T19:46:49.3553437Z #define cudaTextureTypeCubemap 0x0C 2025-05-07T19:46:49.3553566Z #define cudaTextureTypeCubemapLayered 0xFC 2025-05-07T19:46:49.3553915Z #define cudaWaitExternalSemaphoresAsync __CUDART_API_PTSZ(cudaWaitExternalSemaphoresAsync_v2) 2025-05-07T19:46:49.3554033Z #define getc(_fp) _IO_getc (_fp) 2025-05-07T19:46:49.3554130Z #define htobe16(x) __bswap_16 (x) 2025-05-07T19:46:49.3554227Z #define htobe32(x) __bswap_32 (x) 2025-05-07T19:46:49.3554340Z #define htobe64(x) __bswap_64 (x) 2025-05-07T19:46:49.3554431Z #define htole16(x) (x) 2025-05-07T19:46:49.3554526Z #define htole32(x) (x) 2025-05-07T19:46:49.3554615Z #define htole64(x) (x) 2025-05-07T19:46:49.3554750Z #define isalnum_l(c,l) __isalnum_l ((c), (l)) 2025-05-07T19:46:49.3554872Z #define isalpha_l(c,l) __isalpha_l ((c), (l)) 2025-05-07T19:46:49.3554974Z #define isascii(c) __isascii (c) 2025-05-07T19:46:49.3555105Z #define isascii_l(c,l) __isascii_l ((c), (l)) 2025-05-07T19:46:49.3555222Z #define isblank_l(c,l) __isblank_l ((c), (l)) 2025-05-07T19:46:49.3555344Z #define iscntrl_l(c,l) __iscntrl_l ((c), (l)) 2025-05-07T19:46:49.3555474Z #define isdigit_l(c,l) __isdigit_l ((c), (l)) 2025-05-07T19:46:49.3555592Z #define isgraph_l(c,l) __isgraph_l ((c), (l)) 2025-05-07T19:46:49.3555716Z #define islower_l(c,l) __islower_l ((c), (l)) 2025-05-07T19:46:49.3555838Z #define isprint_l(c,l) __isprint_l ((c), (l)) 2025-05-07T19:46:49.3555976Z #define ispunct_l(c,l) __ispunct_l ((c), (l)) 2025-05-07T19:46:49.3556097Z #define isspace_l(c,l) __isspace_l ((c), (l)) 2025-05-07T19:46:49.3556223Z #define isupper_l(c,l) __isupper_l ((c), (l)) 2025-05-07T19:46:49.3556369Z #define isxdigit_l(c,l) __isxdigit_l ((c), (l)) 2025-05-07T19:46:49.3556464Z #define le16toh(x) (x) 2025-05-07T19:46:49.3556559Z #define le32toh(x) (x) 2025-05-07T19:46:49.3556655Z #define le64toh(x) (x) 2025-05-07T19:46:49.3556759Z #define linux 1 2025-05-07T19:46:49.3556875Z #define major(dev) gnu_dev_major (dev) 2025-05-07T19:46:49.3557017Z #define makedev(maj,min) gnu_dev_makedev (maj, min) 2025-05-07T19:46:49.3557193Z #define math_errhandling (MATH_ERRNO | MATH_ERREXCEPT) 2025-05-07T19:46:49.3557309Z #define minor(dev) gnu_dev_minor (dev) 2025-05-07T19:46:49.3557439Z #define offsetof(t,d) __builtin_offsetof(t, d) 2025-05-07T19:46:49.3557562Z #define putc(_ch,_fp) _IO_putc (_ch, _fp) 2025-05-07T19:46:49.3557674Z #define stderr stderr 2025-05-07T19:46:49.3557768Z #define stdin stdin 2025-05-07T19:46:49.3557866Z #define stdout stdout 2025-05-07T19:46:49.3558400Z #define strdupa(s) (__extension__ ({ const char *__old = (s); size_t __len = strlen (__old) + 1; char *__new = (char *) __builtin_alloca (__len); (char *) memcpy (__new, __old, __len); })) 2025-05-07T19:46:49.3558970Z #define strndupa(s,n) (__extension__ ({ const char *__old = (s); size_t __len = strnlen (__old, (n)); char *__new = (char *) __builtin_alloca (__len + 1); __new[__len] = '\0'; (char *) memcpy (__new, __old, __len); })) 2025-05-07T19:46:49.3559080Z #define toascii(c) __toascii (c) 2025-05-07T19:46:49.3559218Z #define toascii_l(c,l) __toascii_l ((c), (l)) 2025-05-07T19:46:49.3559310Z #define unix 1 2025-05-07T19:46:49.3559452Z #define w_coredump __wait_terminated.__w_coredump 2025-05-07T19:46:49.3559599Z #define w_retcode __wait_terminated.__w_retcode 2025-05-07T19:46:49.3559784Z #define w_stopsig __wait_stopped.__w_stopsig 2025-05-07T19:46:49.3559908Z #define w_stopval __wait_stopped.__w_stopval 2025-05-07T19:46:49.3560042Z #define w_termsig __wait_terminated.__w_termsig 2025-05-07T19:46:49.3560049Z 2025-05-07T19:46:49.3652743Z 2025-05-07T19:46:49.3653709Z + conda run -n build_binary nvcc --version 2025-05-07T19:46:49.3653730Z 2025-05-07T19:46:50.9504727Z nvcc: NVIDIA (R) Cuda compiler driver 2025-05-07T19:46:50.9505149Z Copyright (c) 2005-2024 NVIDIA Corporation 2025-05-07T19:46:50.9505498Z Built on Tue_Oct_29_23:50:19_PDT_2024 2025-05-07T19:46:50.9505825Z Cuda compilation tools, release 12.6, V12.6.85 2025-05-07T19:46:50.9506191Z Build cuda_12.6.r12.6/compiler.35059454_0 2025-05-07T19:46:50.9506405Z 2025-05-07T19:46:51.0289149Z 2025-05-07T19:46:51.0299084Z which: no nvidia-smi in (CONDA=/github/home/miniconda:/github/home/miniconda/bin:/usr/local/sbin:/usr/local/bin:/usr/sbin:/usr/bin:/sbin:/bin) 2025-05-07T19:46:51.0301581Z [CHECK] nvidia-smi not found 2025-05-07T19:46:51.0301930Z [INSTALL] Successfully installed CUDA 12.6.3 2025-05-07T19:46:51.0403461Z ##[group]Run . $PRELUDE; install_pytorch_pip $BUILD_ENV nightly cuda/12.6.3 2025-05-07T19:46:51.0404059Z . $PRELUDE; install_pytorch_pip $BUILD_ENV nightly cuda/12.6.3 2025-05-07T19:46:51.0404743Z shell: bash --noprofile --norc -e -o pipefail {0} 2025-05-07T19:46:51.0405060Z env: 2025-05-07T19:46:51.0405287Z PRELUDE: .github/scripts/setup_env.bash 2025-05-07T19:46:51.0405575Z BUILD_ENV: build_binary 2025-05-07T19:46:51.0405819Z BUILD_TARGET: default 2025-05-07T19:46:51.0406036Z BUILD_VARIANT: cuda 2025-05-07T19:46:51.0406268Z BUILD_CUDA_VERSION: 12.6.3 2025-05-07T19:46:51.0406500Z ##[endgroup] 2025-05-07T19:46:51.4973217Z ################################################################################ 2025-05-07T19:46:51.4973643Z # Install PyTorch (PIP) 2025-05-07T19:46:51.4973907Z # 2025-05-07T19:46:51.4990690Z # [2025-05-07T19:46:51.498Z] + install_pytorch_pip build_binary nightly cuda/12.6.3 2025-05-07T19:46:51.4992487Z ################################################################################ 2025-05-07T19:46:51.4993166Z 2025-05-07T19:46:51.5016445Z [EXEC] [ATTEMPT 0/3] + conda install -n build_binary -c conda-forge --override-channels -y numpy 2025-05-07T19:46:52.4388383Z Channels: 2025-05-07T19:46:52.4388842Z - conda-forge 2025-05-07T19:46:52.4389114Z Platform: linux-64 2025-05-07T19:46:55.5401905Z Collecting package metadata (repodata.json): - \ | / done 2025-05-07T19:46:57.2612973Z Solving environment: \ | / - done 2025-05-07T19:46:57.5735630Z 2025-05-07T19:46:57.5736044Z ## Package Plan ## 2025-05-07T19:46:57.5736303Z 2025-05-07T19:46:57.5736702Z environment location: /github/home/miniconda/envs/build_binary 2025-05-07T19:46:57.5737042Z 2025-05-07T19:46:57.5737149Z added / updated specs: 2025-05-07T19:46:57.5737434Z - numpy 2025-05-07T19:46:57.5737556Z 2025-05-07T19:46:57.5737560Z 2025-05-07T19:46:57.5737687Z The following packages will be downloaded: 2025-05-07T19:46:57.5737930Z 2025-05-07T19:46:57.5738078Z package | build 2025-05-07T19:46:57.5738412Z ---------------------------|----------------- 2025-05-07T19:46:57.5738829Z libblas-3.9.0 |31_h59b9bed_openblas 16 KB conda-forge 2025-05-07T19:46:57.5739344Z libcblas-3.9.0 |31_he106b2a_openblas 16 KB conda-forge 2025-05-07T19:46:57.5739832Z liblapack-3.9.0 |31_h7ac8fdf_openblas 16 KB conda-forge 2025-05-07T19:46:57.5740304Z numpy-2.2.5 | py312h72c5963_0 8.1 MB conda-forge 2025-05-07T19:46:57.5740708Z ------------------------------------------------------------ 2025-05-07T19:46:57.5741073Z Total: 8.2 MB 2025-05-07T19:46:57.5741297Z 2025-05-07T19:46:57.5741433Z The following NEW packages will be INSTALLED: 2025-05-07T19:46:57.5741684Z 2025-05-07T19:46:57.5741917Z libblas conda-forge/linux-64::libblas-3.9.0-31_h59b9bed_openblas 2025-05-07T19:46:57.5742468Z libcblas conda-forge/linux-64::libcblas-3.9.0-31_he106b2a_openblas 2025-05-07T19:46:57.5743016Z liblapack conda-forge/linux-64::liblapack-3.9.0-31_h7ac8fdf_openblas 2025-05-07T19:46:57.5743536Z numpy conda-forge/linux-64::numpy-2.2.5-py312h72c5963_0 2025-05-07T19:46:57.5744112Z 2025-05-07T19:46:57.5744116Z 2025-05-07T19:46:57.5744120Z 2025-05-07T19:46:57.5744284Z Downloading and Extracting Packages: ...working... 2025-05-07T19:46:57.5753065Z numpy-2.2.5 | 8.1 MB | | 0% 2025-05-07T19:46:57.5753387Z 2025-05-07T19:46:57.5753819Z libblas-3.9.0 | 16 KB | | 0%  2025-05-07T19:46:57.5754129Z 2025-05-07T19:46:57.5754138Z 2025-05-07T19:46:57.5755779Z libcblas-3.9.0 | 16 KB | | 0%  2025-05-07T19:46:57.5756102Z 2025-05-07T19:46:57.5756111Z 2025-05-07T19:46:57.5756116Z 2025-05-07T19:46:57.7046916Z liblapack-3.9.0 | 16 KB | | 0%  2025-05-07T19:46:57.7048068Z 2025-05-07T19:46:57.7048587Z libblas-3.9.0 | 16 KB | ########## | 100%  2025-05-07T19:46:57.7048871Z 2025-05-07T19:46:57.7586619Z libblas-3.9.0 | 16 KB | ########## | 100%  2025-05-07T19:46:57.7587453Z 2025-05-07T19:46:57.7587514Z 2025-05-07T19:46:57.7607107Z libcblas-3.9.0 | 16 KB | #########7 | 98%  2025-05-07T19:46:57.7607486Z 2025-05-07T19:46:57.7607493Z 2025-05-07T19:46:57.7999055Z libcblas-3.9.0 | 16 KB | ########## | 100%  2025-05-07T19:46:57.7999885Z 2025-05-07T19:46:57.8000511Z libblas-3.9.0 | 16 KB | ########## | 100%  2025-05-07T19:46:57.8001267Z 2025-05-07T19:46:57.8001280Z 2025-05-07T19:46:57.8078188Z libcblas-3.9.0 | 16 KB | ########## | 100%  2025-05-07T19:46:57.8079449Z numpy-2.2.5 | 8.1 MB | | 0% 2025-05-07T19:46:57.8080171Z 2025-05-07T19:46:57.8080185Z 2025-05-07T19:46:57.8080198Z 2025-05-07T19:46:57.8083116Z liblapack-3.9.0 | 16 KB | #########7 | 98%  2025-05-07T19:46:57.8084003Z 2025-05-07T19:46:57.8084015Z 2025-05-07T19:46:57.8084025Z 2025-05-07T19:46:57.8300286Z liblapack-3.9.0 | 16 KB | ########## | 100%  2025-05-07T19:46:57.8301161Z 2025-05-07T19:46:57.8301239Z 2025-05-07T19:46:57.8301250Z 2025-05-07T19:46:57.8695502Z liblapack-3.9.0 | 16 KB | ########## | 100%  2025-05-07T19:46:58.2051286Z numpy-2.2.5 | 8.1 MB | ########## | 100% 2025-05-07T19:46:58.2051759Z numpy-2.2.5 | 8.1 MB | ########## | 100% 2025-05-07T19:46:58.2056279Z numpy-2.2.5 | 8.1 MB | ########## | 100% 2025-05-07T19:46:58.2056692Z 2025-05-07T19:46:58.2056915Z 2025-05-07T19:46:58.2057226Z  2025-05-07T19:46:58.2057457Z 2025-05-07T19:46:58.2057461Z 2025-05-07T19:46:58.2057649Z  2025-05-07T19:46:58.2057916Z 2025-05-07T19:46:58.2057920Z 2025-05-07T19:46:58.2057946Z 2025-05-07T19:46:58.2058149Z  done 2025-05-07T19:46:58.3068410Z Preparing transaction: | done 2025-05-07T19:46:58.5084928Z Verifying transaction: - \ done 2025-05-07T19:46:58.6099264Z Executing transaction: / done 2025-05-07T19:46:58.7192659Z ################################################################################ 2025-05-07T19:46:58.7193111Z # Install Package From PyTorch PIP: torch 2025-05-07T19:46:58.7193478Z # 2025-05-07T19:46:58.7212500Z # [2025-05-07T19:46:58.720Z] + install_from_pytorch_pip build_binary torch nightly cuda/12.6.3 2025-05-07T19:46:58.7213076Z ################################################################################ 2025-05-07T19:46:58.7213356Z 2025-05-07T19:46:58.7232404Z [EXEC] [ATTEMPT 0/3] + wget -q --timeout 1 pypi.org -O /dev/null 2025-05-07T19:46:58.8106385Z [CHECK] Network does not appear to be blocked. 2025-05-07T19:46:58.8106846Z ################################################################################ 2025-05-07T19:46:58.8107261Z # Prepare PIP Arguments (PyTorch PIP) 2025-05-07T19:46:58.8107573Z # 2025-05-07T19:46:58.8118920Z # [2025-05-07T19:46:58.811Z] + __prepare_pip_arguments torch nightly cuda/12.6.3 2025-05-07T19:46:58.8119649Z ################################################################################ 2025-05-07T19:46:58.8119913Z 2025-05-07T19:46:58.8144621Z [INSTALL] Extracted package (channel, version): (nightly, LATEST) 2025-05-07T19:46:58.8167171Z [INSTALL] Extracted package variant: cu126 2025-05-07T19:46:58.8182477Z [INSTALL] Using a non-RELEASE channel: nightly ... 2025-05-07T19:46:58.8183230Z [INSTALL] Extracted the full PIP channel: https://download.pytorch.org/whl/nightly/cu126/ 2025-05-07T19:46:58.8185738Z [INSTALL] Extracted the full PIP package: --pre torch 2025-05-07T19:46:58.8193180Z [INSTALL] Attempting to install [torch, LATEST] from PyTorch PIP using channel https://download.pytorch.org/whl/nightly/cu126/ ... 2025-05-07T19:46:58.8215962Z [EXEC] [ATTEMPT 0/3] + conda run -n build_binary pip install --pre torch --index-url https://download.pytorch.org/whl/nightly/cu126/ 2025-05-07T19:48:31.7900372Z WARNING: Running pip as the 'root' user can result in broken permissions and conflicting behaviour with the system package manager, possibly rendering your system unusable. It is recommended to use a virtual environment instead: https://pip.pypa.io/warnings/venv. Use the --root-user-action option if you know what you are doing and want to suppress this warning. 2025-05-07T19:48:31.7902004Z 2025-05-07T19:48:31.7902280Z Looking in indexes: https://download.pytorch.org/whl/nightly/cu126/ 2025-05-07T19:48:31.7902725Z Collecting torch 2025-05-07T19:48:31.7903477Z Downloading https://download.pytorch.org/whl/nightly/cu126/torch-2.8.0.dev20250507%2Bcu126-cp312-cp312-manylinux_2_28_x86_64.whl.metadata (30 kB) 2025-05-07T19:48:31.7904297Z Collecting filelock (from torch) 2025-05-07T19:48:31.7904874Z Downloading https://download.pytorch.org/whl/nightly/filelock-3.16.1-py3-none-any.whl (16 kB) 2025-05-07T19:48:31.7905920Z Requirement already satisfied: typing-extensions>=4.10.0 in /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages (from torch) (4.13.2) 2025-05-07T19:48:31.7907104Z Requirement already satisfied: setuptools in /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages (from torch) (78.1.1) 2025-05-07T19:48:31.7907878Z Collecting sympy>=1.13.3 (from torch) 2025-05-07T19:48:31.7908436Z Downloading https://download.pytorch.org/whl/nightly/sympy-1.13.3-py3-none-any.whl (6.2 MB) 2025-05-07T19:48:31.7909605Z ━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━ 6.2/6.2 MB 34.4 MB/s eta 0:00:00 2025-05-07T19:48:31.7910012Z Collecting networkx (from torch) 2025-05-07T19:48:31.7910532Z Downloading https://download.pytorch.org/whl/nightly/networkx-3.4.2-py3-none-any.whl (1.7 MB) 2025-05-07T19:48:31.7911398Z ━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━ 1.7/1.7 MB 15.3 MB/s eta 0:00:00 2025-05-07T19:48:31.7912444Z Requirement already satisfied: jinja2 in /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages (from torch) (3.1.6) 2025-05-07T19:48:31.7913191Z Collecting fsspec (from torch) 2025-05-07T19:48:31.7913771Z Downloading https://download.pytorch.org/whl/nightly/fsspec-2024.10.0-py3-none-any.whl (179 kB) 2025-05-07T19:48:31.7914422Z Collecting nvidia-cuda-nvrtc-cu12==12.6.77 (from torch) 2025-05-07T19:48:31.7915234Z Downloading https://download.pytorch.org/whl/nightly/cu126/nvidia_cuda_nvrtc_cu12-12.6.77-py3-none-manylinux2014_x86_64.whl (23.7 MB) 2025-05-07T19:48:31.7916117Z ━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━ 23.7/23.7 MB 50.5 MB/s eta 0:00:00 2025-05-07T19:48:31.7916601Z Collecting nvidia-cuda-runtime-cu12==12.6.77 (from torch) 2025-05-07T19:48:31.7917401Z Downloading https://download.pytorch.org/whl/nightly/cu126/nvidia_cuda_runtime_cu12-12.6.77-py3-none-manylinux2014_x86_64.whl (897 kB) 2025-05-07T19:48:31.7918408Z ━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━ 897.7/897.7 kB 15.6 MB/s eta 0:00:00 2025-05-07T19:48:31.7918845Z Collecting nvidia-cuda-cupti-cu12==12.6.80 (from torch) 2025-05-07T19:48:31.7919565Z Downloading https://download.pytorch.org/whl/nightly/cu126/nvidia_cuda_cupti_cu12-12.6.80-py3-none-manylinux2014_x86_64.whl (8.9 MB) 2025-05-07T19:48:31.7920811Z ━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━ 8.9/8.9 MB 67.6 MB/s eta 0:00:00 2025-05-07T19:48:31.7921207Z Collecting nvidia-cudnn-cu12==9.5.1.17 (from torch) 2025-05-07T19:48:31.7921938Z Downloading https://download.pytorch.org/whl/nightly/cu126/nvidia_cudnn_cu12-9.5.1.17-py3-none-manylinux_2_28_x86_64.whl (571.0 MB) 2025-05-07T19:48:31.7922757Z ━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━ 571.0/571.0 MB 39.8 MB/s eta 0:00:00 2025-05-07T19:48:31.7923160Z Collecting nvidia-cublas-cu12==12.6.4.1 (from torch) 2025-05-07T19:48:31.7923990Z Downloading https://download.pytorch.org/whl/nightly/cu126/nvidia_cublas_cu12-12.6.4.1-py3-none-manylinux2014_x86_64.manylinux_2_17_x86_64.whl (393.1 MB) 2025-05-07T19:48:31.7925029Z ━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━ 393.1/393.1 MB 56.8 MB/s eta 0:00:00 2025-05-07T19:48:31.7925457Z Collecting nvidia-cufft-cu12==11.3.0.4 (from torch) 2025-05-07T19:48:31.7926189Z Downloading https://download.pytorch.org/whl/nightly/cu126/nvidia_cufft_cu12-11.3.0.4-py3-none-manylinux2014_x86_64.whl (200.2 MB) 2025-05-07T19:48:31.7927000Z ━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━ 200.2/200.2 MB 76.1 MB/s eta 0:00:00 2025-05-07T19:48:31.7927432Z Collecting nvidia-curand-cu12==10.3.7.77 (from torch) 2025-05-07T19:48:31.7928145Z Downloading https://download.pytorch.org/whl/nightly/cu126/nvidia_curand_cu12-10.3.7.77-py3-none-manylinux2014_x86_64.whl (56.3 MB) 2025-05-07T19:48:31.7928967Z ━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━ 56.3/56.3 MB 52.9 MB/s eta 0:00:00 2025-05-07T19:48:31.7929379Z Collecting nvidia-cusolver-cu12==11.7.1.2 (from torch) 2025-05-07T19:48:31.7930152Z Downloading https://download.pytorch.org/whl/nightly/cu126/nvidia_cusolver_cu12-11.7.1.2-py3-none-manylinux2014_x86_64.whl (158.2 MB) 2025-05-07T19:48:31.7930991Z ━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━ 158.2/158.2 MB 63.0 MB/s eta 0:00:00 2025-05-07T19:48:31.7931400Z Collecting nvidia-cusparse-cu12==12.5.4.2 (from torch) 2025-05-07T19:48:31.7932172Z Downloading https://download.pytorch.org/whl/nightly/cu126/nvidia_cusparse_cu12-12.5.4.2-py3-none-manylinux2014_x86_64.whl (216.6 MB) 2025-05-07T19:48:31.7932982Z ━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━ 216.6/216.6 MB 70.2 MB/s eta 0:00:00 2025-05-07T19:48:31.7933417Z Collecting nvidia-cusparselt-cu12==0.6.3 (from torch) 2025-05-07T19:48:31.7934169Z Downloading https://download.pytorch.org/whl/nightly/cu126/nvidia_cusparselt_cu12-0.6.3-py3-none-manylinux2014_x86_64.whl (156.8 MB) 2025-05-07T19:48:31.7934968Z ━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━ 156.8/156.8 MB 71.4 MB/s eta 0:00:00 2025-05-07T19:48:31.7935381Z Collecting nvidia-nccl-cu12==2.26.2 (from torch) 2025-05-07T19:48:31.7936187Z Downloading https://download.pytorch.org/whl/nightly/cu126/nvidia_nccl_cu12-2.26.2-py3-none-manylinux2014_x86_64.manylinux_2_17_x86_64.whl.metadata (2.0 kB) 2025-05-07T19:48:31.7937013Z Collecting nvidia-nvtx-cu12==12.6.77 (from torch) 2025-05-07T19:48:31.7937719Z Downloading https://download.pytorch.org/whl/nightly/cu126/nvidia_nvtx_cu12-12.6.77-py3-none-manylinux2014_x86_64.whl (89 kB) 2025-05-07T19:48:31.7938429Z Collecting nvidia-nvjitlink-cu12==12.6.85 (from torch) 2025-05-07T19:48:31.7939298Z Downloading https://download.pytorch.org/whl/nightly/cu126/nvidia_nvjitlink_cu12-12.6.85-py3-none-manylinux2010_x86_64.manylinux_2_12_x86_64.whl (19.7 MB) 2025-05-07T19:48:31.7940192Z ━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━ 19.7/19.7 MB 54.0 MB/s eta 0:00:00 2025-05-07T19:48:31.7940625Z Collecting nvidia-cufile-cu12==1.11.1.6 (from torch) 2025-05-07T19:48:31.7941474Z Downloading https://download.pytorch.org/whl/nightly/cu126/nvidia_cufile_cu12-1.11.1.6-py3-none-manylinux2014_x86_64.manylinux_2_17_x86_64.whl.metadata (1.5 kB) 2025-05-07T19:48:31.7942319Z Collecting pytorch-triton==3.3.0+git96316ce5 (from torch) 2025-05-07T19:48:31.7943218Z Downloading https://download.pytorch.org/whl/nightly/pytorch_triton-3.3.0%2Bgit96316ce5-cp312-cp312-manylinux_2_27_x86_64.manylinux_2_28_x86_64.whl.metadata (1.6 kB) 2025-05-07T19:48:31.7944151Z Collecting mpmath<1.4,>=1.1.0 (from sympy>=1.13.3->torch) 2025-05-07T19:48:31.7944753Z Downloading https://download.pytorch.org/whl/nightly/mpmath-1.3.0-py3-none-any.whl (536 kB) 2025-05-07T19:48:31.7945437Z ━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━ 536.2/536.2 kB 2.7 MB/s eta 0:00:00 2025-05-07T19:48:31.7946212Z Requirement already satisfied: MarkupSafe>=2.0 in /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages (from jinja2->torch) (3.0.2) 2025-05-07T19:48:31.7947354Z Downloading https://download.pytorch.org/whl/nightly/cu126/torch-2.8.0.dev20250507%2Bcu126-cp312-cp312-manylinux_2_28_x86_64.whl (825.4 MB) 2025-05-07T19:48:31.7948181Z ━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━ 825.4/825.4 MB 25.2 MB/s eta 0:00:00 2025-05-07T19:48:31.7950512Z Downloading https://download.pytorch.org/whl/nightly/cu126/nvidia_cufile_cu12-1.11.1.6-py3-none-manylinux2014_x86_64.manylinux_2_17_x86_64.whl (1.1 MB) 2025-05-07T19:48:31.7951578Z ━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━ 1.1/1.1 MB 8.9 MB/s eta 0:00:00 2025-05-07T19:48:31.7952672Z Downloading https://download.pytorch.org/whl/nightly/cu126/nvidia_nccl_cu12-2.26.2-py3-none-manylinux2014_x86_64.manylinux_2_17_x86_64.whl (201.3 MB) 2025-05-07T19:48:31.7953657Z ━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━ 201.3/201.3 MB 75.1 MB/s eta 0:00:00 2025-05-07T19:48:31.7954541Z Downloading https://download.pytorch.org/whl/nightly/pytorch_triton-3.3.0%2Bgit96316ce5-cp312-cp312-manylinux_2_27_x86_64.manylinux_2_28_x86_64.whl (153.5 MB) 2025-05-07T19:48:31.7955545Z ━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━ 153.5/153.5 MB 106.4 MB/s eta 0:00:00 2025-05-07T19:48:31.7957436Z Installing collected packages: nvidia-cusparselt-cu12, mpmath, sympy, pytorch-triton, nvidia-nvtx-cu12, nvidia-nvjitlink-cu12, nvidia-nccl-cu12, nvidia-curand-cu12, nvidia-cufile-cu12, nvidia-cuda-runtime-cu12, nvidia-cuda-nvrtc-cu12, nvidia-cuda-cupti-cu12, nvidia-cublas-cu12, networkx, fsspec, filelock, nvidia-cusparse-cu12, nvidia-cufft-cu12, nvidia-cudnn-cu12, nvidia-cusolver-cu12, torch 2025-05-07T19:48:31.7959174Z 2025-05-07T19:48:31.7961113Z Successfully installed filelock-3.16.1 fsspec-2024.10.0 mpmath-1.3.0 networkx-3.4.2 nvidia-cublas-cu12-12.6.4.1 nvidia-cuda-cupti-cu12-12.6.80 nvidia-cuda-nvrtc-cu12-12.6.77 nvidia-cuda-runtime-cu12-12.6.77 nvidia-cudnn-cu12-9.5.1.17 nvidia-cufft-cu12-11.3.0.4 nvidia-cufile-cu12-1.11.1.6 nvidia-curand-cu12-10.3.7.77 nvidia-cusolver-cu12-11.7.1.2 nvidia-cusparse-cu12-12.5.4.2 nvidia-cusparselt-cu12-0.6.3 nvidia-nccl-cu12-2.26.2 nvidia-nvjitlink-cu12-12.6.85 nvidia-nvtx-cu12-12.6.77 pytorch-triton-3.3.0+git96316ce5 sympy-1.13.3 torch-2.8.0.dev20250507+cu126 2025-05-07T19:48:31.7963111Z 2025-05-07T19:48:33.7066567Z torch 2.8.0.dev20250507+cu126 2025-05-07T19:48:33.7067326Z [CHECK] The installed package [torch, nightly/LATEST] is the correct variant (cu126) 2025-05-07T19:48:36.8392677Z [CHECK] Python (sub-)package 'torch.distributed' found ... 2025-05-07T19:48:40.0197231Z [CHECK] NOTE: The installed version is: 2.8.0.dev20250507+cu126 2025-05-07T19:48:40.0199291Z [CHECK] NOTE: Checking _GLIBCXX_USE_CXX11_ABI ... 2025-05-07T19:48:43.0695884Z True 2025-05-07T19:48:43.0696542Z True 2025-05-07T19:48:43.0696866Z 2025-05-07T19:48:43.1280290Z [INSTALL] Successfully installed PyTorch through PyTorch PIP 2025-05-07T19:48:43.1345212Z ##[group]Run if . $PRELUDE && which conda; then collect_pytorch_env_info $BUILD_ENV; fi 2025-05-07T19:48:43.1345980Z if . $PRELUDE && which conda; then collect_pytorch_env_info $BUILD_ENV; fi 2025-05-07T19:48:43.1346669Z shell: bash --noprofile --norc -e -o pipefail {0} 2025-05-07T19:48:43.1346991Z env: 2025-05-07T19:48:43.1347203Z PRELUDE: .github/scripts/setup_env.bash 2025-05-07T19:48:43.1347508Z BUILD_ENV: build_binary 2025-05-07T19:48:43.1347761Z BUILD_TARGET: default 2025-05-07T19:48:43.1347988Z BUILD_VARIANT: cuda 2025-05-07T19:48:43.1348236Z BUILD_CUDA_VERSION: 12.6.3 2025-05-07T19:48:43.1348476Z ##[endgroup] 2025-05-07T19:48:43.5453091Z /github/home/miniconda/bin/conda 2025-05-07T19:48:43.5453559Z ################################################################################ 2025-05-07T19:48:43.5454019Z # Collect PyTorch Environment Information (for Reporting Issues) 2025-05-07T19:48:43.5454407Z # 2025-05-07T19:48:43.5466878Z # [2025-05-07T19:48:43.546Z] + collect_pytorch_env_info build_binary 2025-05-07T19:48:43.5467291Z ################################################################################ 2025-05-07T19:48:43.5467541Z 2025-05-07T19:48:43.5481559Z [EXEC] [ATTEMPT 0/3] + wget -q --timeout 1 pypi.org -O /dev/null 2025-05-07T19:48:43.6322172Z [CHECK] Network does not appear to be blocked. 2025-05-07T19:48:43.6327695Z [INFO] Downloading the PyTorch environment info collection script ... 2025-05-07T19:48:43.6332136Z + wget -q https://raw.githubusercontent.com/pytorch/pytorch/main/torch/utils/collect_env.py 2025-05-07T19:48:43.6332617Z 2025-05-07T19:48:43.7180482Z 2025-05-07T19:48:43.7182023Z [INFO] Collecting PyTorch environment info (will be needed for reporting issues to PyTorch) ... 2025-05-07T19:48:43.7205364Z [EXEC] [ATTEMPT 0/3] + conda run -n build_binary python collect_env.py 2025-05-07T19:48:49.1921577Z Collecting environment information... 2025-05-07T19:48:49.1922727Z PyTorch version: 2.8.0.dev20250507+cu126 2025-05-07T19:48:49.1923842Z Is debug build: False 2025-05-07T19:48:49.1924632Z CUDA used to build PyTorch: 12.6 2025-05-07T19:48:49.1925259Z ROCM used to build PyTorch: N/A 2025-05-07T19:48:49.1925462Z 2025-05-07T19:48:49.1925587Z OS: Amazon Linux 2023.7.20250428 (x86_64) 2025-05-07T19:48:49.1925963Z GCC version: Could not collect 2025-05-07T19:48:49.1926589Z Clang version: 16.0.6 (https://github.com/conda-forge/clangdev-feedstock db6970f6bb85e49860ed8bab43ebf165b5c55cc4) 2025-05-07T19:48:49.1927247Z CMake version: version 4.0.2 2025-05-07T19:48:49.1927550Z Libc version: glibc-2.34 2025-05-07T19:48:49.1927746Z 2025-05-07T19:48:49.1928203Z Python version: 3.12.2 | packaged by conda-forge | (main, Feb 16 2024, 20:50:58) [GCC 12.3.0] (64-bit runtime) 2025-05-07T19:48:49.1928914Z Python platform: Linux-6.1.130-139.222.amzn2023.x86_64-x86_64-with-glibc2.34 2025-05-07T19:48:49.1929367Z Is CUDA available: False 2025-05-07T19:48:49.1929673Z CUDA runtime version: 12.6.85 2025-05-07T19:48:49.1929966Z CUDA_MODULE_LOADING set to: N/A 2025-05-07T19:48:49.1930327Z GPU models and configuration: Could not collect 2025-05-07T19:48:49.1930694Z Nvidia driver version: Could not collect 2025-05-07T19:48:49.1931055Z cuDNN version: Could not collect 2025-05-07T19:48:49.1931354Z HIP runtime version: N/A 2025-05-07T19:48:49.1931776Z MIOpen runtime version: N/A 2025-05-07T19:48:49.1932075Z Is XNNPACK available: True 2025-05-07T19:48:49.1932246Z 2025-05-07T19:48:49.1932335Z CPU: 2025-05-07T19:48:49.1932592Z Architecture: x86_64 2025-05-07T19:48:49.1932939Z CPU op-mode(s): 32-bit, 64-bit 2025-05-07T19:48:49.1933379Z Address sizes: 46 bits physical, 48 bits virtual 2025-05-07T19:48:49.1933791Z Byte Order: Little Endian 2025-05-07T19:48:49.1934500Z CPU(s): 96 2025-05-07T19:48:49.1934818Z On-line CPU(s) list: 0-95 2025-05-07T19:48:49.1935180Z Vendor ID: GenuineIntel 2025-05-07T19:48:49.1935793Z Model name: Intel(R) Xeon(R) Platinum 8275CL CPU @ 3.00GHz 2025-05-07T19:48:49.1936192Z CPU family: 6 2025-05-07T19:48:49.1936519Z Model: 85 2025-05-07T19:48:49.1936819Z Thread(s) per core: 2 2025-05-07T19:48:49.1937159Z Core(s) per socket: 24 2025-05-07T19:48:49.1937470Z Socket(s): 2 2025-05-07T19:48:49.1937799Z Stepping: 7 2025-05-07T19:48:49.1938108Z BogoMIPS: 5999.99 2025-05-07T19:48:49.1940412Z Flags: fpu vme de pse tsc msr pae mce cx8 apic sep mtrr pge mca cmov pat pse36 clflush mmx fxsr sse sse2 ss ht syscall nx pdpe1gb rdtscp lm constant_tsc arch_perfmon rep_good nopl xtopology nonstop_tsc cpuid aperfmperf tsc_known_freq pni pclmulqdq monitor ssse3 fma cx16 pcid sse4_1 sse4_2 x2apic movbe popcnt tsc_deadline_timer aes xsave avx f16c rdrand hypervisor lahf_lm abm 3dnowprefetch invpcid_single pti fsgsbase tsc_adjust bmi1 avx2 smep bmi2 erms invpcid mpx avx512f avx512dq rdseed adx smap clflushopt clwb avx512cd avx512bw avx512vl xsaveopt xsavec xgetbv1 xsaves ida arat pku ospke avx512_vnni 2025-05-07T19:48:49.1942731Z Hypervisor vendor: KVM 2025-05-07T19:48:49.1943083Z Virtualization type: full 2025-05-07T19:48:49.1943439Z L1d cache: 1.5 MiB (48 instances) 2025-05-07T19:48:49.1943842Z L1i cache: 1.5 MiB (48 instances) 2025-05-07T19:48:49.1944221Z L2 cache: 48 MiB (48 instances) 2025-05-07T19:48:49.1944628Z L3 cache: 71.5 MiB (2 instances) 2025-05-07T19:48:49.1944974Z NUMA node(s): 2 2025-05-07T19:48:49.1945320Z NUMA node0 CPU(s): 0-23,48-71 2025-05-07T19:48:49.1945672Z NUMA node1 CPU(s): 24-47,72-95 2025-05-07T19:48:49.1946173Z Vulnerability Gather data sampling: Unknown: Dependent on hypervisor status 2025-05-07T19:48:49.1946762Z Vulnerability Itlb multihit: KVM: Mitigation: VMX unsupported 2025-05-07T19:48:49.1947256Z Vulnerability L1tf: Mitigation; PTE Inversion 2025-05-07T19:48:49.1947893Z Vulnerability Mds: Vulnerable: Clear CPU buffers attempted, no microcode; SMT Host state unknown 2025-05-07T19:48:49.1948472Z Vulnerability Meltdown: Mitigation; PTI 2025-05-07T19:48:49.1949102Z Vulnerability Mmio stale data: Vulnerable: Clear CPU buffers attempted, no microcode; SMT Host state unknown 2025-05-07T19:48:49.1949739Z Vulnerability Reg file data sampling: Not affected 2025-05-07T19:48:49.1950124Z Vulnerability Retbleed: Vulnerable 2025-05-07T19:48:49.1950530Z Vulnerability Spec rstack overflow: Not affected 2025-05-07T19:48:49.1950912Z Vulnerability Spec store bypass: Vulnerable 2025-05-07T19:48:49.1951613Z Vulnerability Spectre v1: Mitigation; usercopy/swapgs barriers and __user pointer sanitization 2025-05-07T19:48:49.1952675Z Vulnerability Spectre v2: Mitigation; Retpolines; STIBP disabled; RSB filling; PBRSB-eIBRS Not affected; BHI Retpoline 2025-05-07T19:48:49.1953392Z Vulnerability Srbds: Not affected 2025-05-07T19:48:49.1953825Z Vulnerability Tsx async abort: Not affected 2025-05-07T19:48:49.1954083Z 2025-05-07T19:48:49.1954202Z Versions of relevant libraries: 2025-05-07T19:48:49.1954531Z [pip3] numpy==2.2.5 2025-05-07T19:48:49.1954803Z [pip3] nvidia-cublas-cu12==12.6.4.1 2025-05-07T19:48:49.1955158Z [pip3] nvidia-cuda-cupti-cu12==12.6.80 2025-05-07T19:48:49.1955500Z [pip3] nvidia-cuda-nvrtc-cu12==12.6.77 2025-05-07T19:48:49.1955865Z [pip3] nvidia-cuda-runtime-cu12==12.6.77 2025-05-07T19:48:49.1956312Z [pip3] nvidia-cudnn-cu12==9.5.1.17 2025-05-07T19:48:49.1956661Z [pip3] nvidia-cufft-cu12==11.3.0.4 2025-05-07T19:48:49.1957010Z [pip3] nvidia-curand-cu12==10.3.7.77 2025-05-07T19:48:49.1957340Z [pip3] nvidia-cusolver-cu12==11.7.1.2 2025-05-07T19:48:49.1957811Z [pip3] nvidia-cusparse-cu12==12.5.4.2 2025-05-07T19:48:49.1958158Z [pip3] nvidia-cusparselt-cu12==0.6.3 2025-05-07T19:48:49.1958515Z [pip3] nvidia-nccl-cu12==2.26.2 2025-05-07T19:48:49.1958831Z [pip3] nvidia-nvjitlink-cu12==12.6.85 2025-05-07T19:48:49.1959196Z [pip3] nvidia-nvtx-cu12==12.6.77 2025-05-07T19:48:49.1959522Z [pip3] pytorch-triton==3.3.0+git96316ce5 2025-05-07T19:48:49.1959893Z [pip3] torch==2.8.0.dev20250507+cu126 2025-05-07T19:48:49.1960294Z [conda] cuda-cudart 12.6.77 h5888daf_0 conda-forge 2025-05-07T19:48:49.1960856Z [conda] cuda-cudart-dev 12.6.77 h5888daf_0 conda-forge 2025-05-07T19:48:49.1961449Z [conda] cuda-cudart-dev_linux-64 12.6.77 h3f2d84a_0 conda-forge 2025-05-07T19:48:49.1962032Z [conda] cuda-cudart-static 12.6.77 h5888daf_0 conda-forge 2025-05-07T19:48:49.1962653Z [conda] cuda-cudart-static_linux-64 12.6.77 h3f2d84a_0 conda-forge 2025-05-07T19:48:49.1963247Z [conda] cuda-cudart_linux-64 12.6.77 h3f2d84a_0 conda-forge 2025-05-07T19:48:49.1963808Z [conda] cuda-cupti 12.6.80 hbd13f7d_0 conda-forge 2025-05-07T19:48:49.1964355Z [conda] cuda-cupti-dev 12.6.80 h5888daf_0 conda-forge 2025-05-07T19:48:49.1965131Z [conda] cuda-libraries 12.6.3 ha770c72_0 conda-forge 2025-05-07T19:48:49.1965721Z [conda] cuda-libraries-dev 12.6.3 ha770c72_0 conda-forge 2025-05-07T19:48:49.1966265Z [conda] cuda-nvrtc 12.6.85 hbd13f7d_0 conda-forge 2025-05-07T19:48:49.1966822Z [conda] cuda-nvrtc-dev 12.6.85 h5888daf_0 conda-forge 2025-05-07T19:48:49.1967347Z [conda] cuda-nvtx 12.6.77 hbd13f7d_0 conda-forge 2025-05-07T19:48:49.1967882Z [conda] cuda-opencl 12.6.77 hbd13f7d_0 conda-forge 2025-05-07T19:48:49.1968440Z [conda] cuda-opencl-dev 12.6.77 h5888daf_0 conda-forge 2025-05-07T19:48:49.1968971Z [conda] cuda-runtime 12.6.3 ha804496_0 conda-forge 2025-05-07T19:48:49.1969502Z [conda] libcublas 12.6.4.1 h5888daf_1 conda-forge 2025-05-07T19:48:49.1970022Z [conda] libcublas-dev 12.6.4.1 h5888daf_1 conda-forge 2025-05-07T19:48:49.1970565Z [conda] libcufft 11.3.0.4 hbd13f7d_0 conda-forge 2025-05-07T19:48:49.1971104Z [conda] libcufft-dev 11.3.0.4 h5888daf_0 conda-forge 2025-05-07T19:48:49.1971620Z [conda] libcurand 10.3.7.77 hbd13f7d_0 conda-forge 2025-05-07T19:48:49.1972172Z [conda] libcurand-dev 10.3.7.77 h5888daf_0 conda-forge 2025-05-07T19:48:49.1972698Z [conda] libcusolver 11.7.1.2 h5888daf_1 conda-forge 2025-05-07T19:48:49.1973266Z [conda] libcusolver-dev 11.7.1.2 h5888daf_1 conda-forge 2025-05-07T19:48:49.1973801Z [conda] libcusparse 12.5.4.2 hbd13f7d_0 conda-forge 2025-05-07T19:48:49.1974361Z [conda] libcusparse-dev 12.5.4.2 h5888daf_0 conda-forge 2025-05-07T19:48:49.1974927Z [conda] libnvjitlink 12.6.85 hbd13f7d_0 conda-forge 2025-05-07T19:48:49.1975473Z [conda] libnvjitlink-dev 12.6.85 h5888daf_0 conda-forge 2025-05-07T19:48:49.1976016Z [conda] numpy 2.2.5 py312h72c5963_0 conda-forge 2025-05-07T19:48:49.1976517Z [conda] nvidia-cublas-cu12 12.6.4.1 pypi_0 pypi 2025-05-07T19:48:49.1977195Z [conda] nvidia-cuda-cupti-cu12 12.6.80 pypi_0 pypi 2025-05-07T19:48:49.1977853Z [conda] nvidia-cuda-nvrtc-cu12 12.6.77 pypi_0 pypi 2025-05-07T19:48:49.1978396Z [conda] nvidia-cuda-runtime-cu12 12.6.77 pypi_0 pypi 2025-05-07T19:48:49.1979009Z [conda] nvidia-cudnn-cu12 9.5.1.17 pypi_0 pypi 2025-05-07T19:48:49.1979497Z [conda] nvidia-cufft-cu12 11.3.0.4 pypi_0 pypi 2025-05-07T19:48:49.1980014Z [conda] nvidia-curand-cu12 10.3.7.77 pypi_0 pypi 2025-05-07T19:48:49.1980515Z [conda] nvidia-cusolver-cu12 11.7.1.2 pypi_0 pypi 2025-05-07T19:48:49.1981047Z [conda] nvidia-cusparse-cu12 12.5.4.2 pypi_0 pypi 2025-05-07T19:48:49.1981590Z [conda] nvidia-cusparselt-cu12 0.6.3 pypi_0 pypi 2025-05-07T19:48:49.1982089Z [conda] nvidia-nccl-cu12 2.26.2 pypi_0 pypi 2025-05-07T19:48:49.1982612Z [conda] nvidia-nvjitlink-cu12 12.6.85 pypi_0 pypi 2025-05-07T19:48:49.1983108Z [conda] nvidia-nvtx-cu12 12.6.77 pypi_0 pypi 2025-05-07T19:48:49.1983620Z [conda] pytorch-triton 3.3.0+git96316ce5 pypi_0 pypi 2025-05-07T19:48:49.1984102Z [conda] torch 2.8.0.dev20250507+cu126 pypi_0 pypi 2025-05-07T19:48:49.1984410Z 2025-05-07T19:48:49.2683428Z ##[group]Run . $PRELUDE; install_cudnn $BUILD_ENV "$(pwd)/build_only/cudnn" 12.6.3 2025-05-07T19:48:49.2684181Z . $PRELUDE; install_cudnn $BUILD_ENV "$(pwd)/build_only/cudnn" 12.6.3 2025-05-07T19:48:49.2684721Z shell: bash --noprofile --norc -e -o pipefail {0} 2025-05-07T19:48:49.2685050Z env: 2025-05-07T19:48:49.2685264Z PRELUDE: .github/scripts/setup_env.bash 2025-05-07T19:48:49.2685564Z BUILD_ENV: build_binary 2025-05-07T19:48:49.2685797Z BUILD_TARGET: default 2025-05-07T19:48:49.2686035Z BUILD_VARIANT: cuda 2025-05-07T19:48:49.2686255Z BUILD_CUDA_VERSION: 12.6.3 2025-05-07T19:48:49.2686529Z ##[endgroup] 2025-05-07T19:48:49.6917370Z ################################################################################ 2025-05-07T19:48:49.6918412Z # Install cuDNN 2025-05-07T19:48:49.6919090Z # 2025-05-07T19:48:49.6933957Z # [2025-05-07T19:48:49.692Z] + install_cudnn build_binary /__w/FBGEMM/FBGEMM/build_only/cudnn 12.6.3 2025-05-07T19:48:49.6934673Z ################################################################################ 2025-05-07T19:48:49.6934927Z 2025-05-07T19:48:49.6952440Z [EXEC] [ATTEMPT 0/3] + wget -q --timeout 1 pypi.org -O /dev/null 2025-05-07T19:48:49.7827105Z [CHECK] Network does not appear to be blocked. 2025-05-07T19:48:49.7828355Z [INSTALL] cuda_concat_version is determined to be: 126 2025-05-07T19:48:49.7829498Z + rm -rf /__w/FBGEMM/FBGEMM/build_only/cudnn 2025-05-07T19:48:49.7830166Z 2025-05-07T19:48:49.7839678Z 2025-05-07T19:48:49.7840359Z + mkdir -p /__w/FBGEMM/FBGEMM/build_only/cudnn 2025-05-07T19:48:49.7841132Z 2025-05-07T19:48:49.7855544Z 2025-05-07T19:48:49.7870272Z [INSTALL] Downloading cuDNN to /tmp/tmp.t2O7EmpC61 ... 2025-05-07T19:48:49.7891574Z [EXEC] [ATTEMPT 0/3] + wget -q https://developer.download.nvidia.com/compute/cudnn/redist/cudnn/linux-x86_64/cudnn-linux-x86_64-9.5.1.17_cuda12-archive.tar.xz -O cudnn.tar.xz 2025-05-07T19:48:53.7123688Z [INSTALL] Unpacking cuDNN ... 2025-05-07T19:48:53.7124190Z + tar -xvf cudnn.tar.xz 2025-05-07T19:48:53.7124384Z 2025-05-07T19:48:53.7152949Z cudnn-linux-x86_64-9.5.1.17_cuda12-archive/ 2025-05-07T19:48:53.7154124Z cudnn-linux-x86_64-9.5.1.17_cuda12-archive/lib/ 2025-05-07T19:48:53.7154588Z cudnn-linux-x86_64-9.5.1.17_cuda12-archive/lib/libcudnn_adv_static_v9.a 2025-05-07T19:48:58.3983319Z cudnn-linux-x86_64-9.5.1.17_cuda12-archive/lib/libcudnn_cnn_static_v9.a 2025-05-07T19:48:58.4620503Z cudnn-linux-x86_64-9.5.1.17_cuda12-archive/lib/libcudnn_engines_precompiled_static_v9.a 2025-05-07T19:49:06.0748154Z cudnn-linux-x86_64-9.5.1.17_cuda12-archive/lib/libcudnn_engines_runtime_compiled_static_v9.a 2025-05-07T19:49:06.3230316Z cudnn-linux-x86_64-9.5.1.17_cuda12-archive/lib/libcudnn_graph_static_v9.a 2025-05-07T19:49:06.3614872Z cudnn-linux-x86_64-9.5.1.17_cuda12-archive/lib/libcudnn_heuristic_static_v9.a 2025-05-07T19:49:06.9099217Z cudnn-linux-x86_64-9.5.1.17_cuda12-archive/lib/libcudnn_ops_static_v9.a 2025-05-07T19:49:09.0534374Z cudnn-linux-x86_64-9.5.1.17_cuda12-archive/lib/libcudnn_adv_static.a 2025-05-07T19:49:09.0536007Z cudnn-linux-x86_64-9.5.1.17_cuda12-archive/lib/libcudnn_cnn_static.a 2025-05-07T19:49:09.0536910Z cudnn-linux-x86_64-9.5.1.17_cuda12-archive/lib/libcudnn_engines_precompiled_static.a 2025-05-07T19:49:09.0537584Z cudnn-linux-x86_64-9.5.1.17_cuda12-archive/lib/libcudnn_engines_runtime_compiled_static.a 2025-05-07T19:49:09.0538194Z cudnn-linux-x86_64-9.5.1.17_cuda12-archive/lib/libcudnn_graph_static.a 2025-05-07T19:49:09.0538718Z cudnn-linux-x86_64-9.5.1.17_cuda12-archive/lib/libcudnn_heuristic_static.a 2025-05-07T19:49:09.0539370Z cudnn-linux-x86_64-9.5.1.17_cuda12-archive/lib/libcudnn_ops_static.a 2025-05-07T19:49:09.0539825Z cudnn-linux-x86_64-9.5.1.17_cuda12-archive/lib/libcudnn.so 2025-05-07T19:49:09.0540262Z cudnn-linux-x86_64-9.5.1.17_cuda12-archive/lib/libcudnn.so.9 2025-05-07T19:49:09.0540719Z cudnn-linux-x86_64-9.5.1.17_cuda12-archive/lib/libcudnn.so.9.5.1 2025-05-07T19:49:09.0543817Z cudnn-linux-x86_64-9.5.1.17_cuda12-archive/lib/libcudnn_adv.so 2025-05-07T19:49:09.0544623Z cudnn-linux-x86_64-9.5.1.17_cuda12-archive/lib/libcudnn_adv.so.9 2025-05-07T19:49:09.0545132Z cudnn-linux-x86_64-9.5.1.17_cuda12-archive/lib/libcudnn_adv.so.9.5.1 2025-05-07T19:49:13.6139164Z cudnn-linux-x86_64-9.5.1.17_cuda12-archive/lib/libcudnn_cnn.so 2025-05-07T19:49:13.6140694Z cudnn-linux-x86_64-9.5.1.17_cuda12-archive/lib/libcudnn_cnn.so.9.5.1 2025-05-07T19:49:13.6759819Z cudnn-linux-x86_64-9.5.1.17_cuda12-archive/lib/libcudnn_cnn.so.9 2025-05-07T19:49:13.6761579Z cudnn-linux-x86_64-9.5.1.17_cuda12-archive/lib/libcudnn_engines_precompiled.so.9.5.1 2025-05-07T19:49:20.9075161Z cudnn-linux-x86_64-9.5.1.17_cuda12-archive/lib/libcudnn_engines_precompiled.so.9 2025-05-07T19:49:20.9075831Z cudnn-linux-x86_64-9.5.1.17_cuda12-archive/lib/libcudnn_engines_precompiled.so 2025-05-07T19:49:20.9076491Z cudnn-linux-x86_64-9.5.1.17_cuda12-archive/lib/libcudnn_engines_runtime_compiled.so 2025-05-07T19:49:20.9077273Z cudnn-linux-x86_64-9.5.1.17_cuda12-archive/lib/libcudnn_engines_runtime_compiled.so.9.5.1 2025-05-07T19:49:21.1050711Z cudnn-linux-x86_64-9.5.1.17_cuda12-archive/lib/libcudnn_engines_runtime_compiled.so.9 2025-05-07T19:49:21.1051722Z cudnn-linux-x86_64-9.5.1.17_cuda12-archive/lib/libcudnn_graph.so.9 2025-05-07T19:49:21.1052234Z cudnn-linux-x86_64-9.5.1.17_cuda12-archive/lib/libcudnn_graph.so 2025-05-07T19:49:21.1052780Z cudnn-linux-x86_64-9.5.1.17_cuda12-archive/lib/libcudnn_graph.so.9.5.1 2025-05-07T19:49:21.1415485Z cudnn-linux-x86_64-9.5.1.17_cuda12-archive/lib/libcudnn_heuristic.so.9.5.1 2025-05-07T19:49:21.6873952Z cudnn-linux-x86_64-9.5.1.17_cuda12-archive/lib/libcudnn_heuristic.so.9 2025-05-07T19:49:21.6874869Z cudnn-linux-x86_64-9.5.1.17_cuda12-archive/lib/libcudnn_heuristic.so 2025-05-07T19:49:21.6875399Z cudnn-linux-x86_64-9.5.1.17_cuda12-archive/lib/libcudnn_ops.so.9 2025-05-07T19:49:21.6876041Z cudnn-linux-x86_64-9.5.1.17_cuda12-archive/lib/libcudnn_ops.so 2025-05-07T19:49:21.6876546Z cudnn-linux-x86_64-9.5.1.17_cuda12-archive/lib/libcudnn_ops.so.9.5.1 2025-05-07T19:49:23.8225611Z cudnn-linux-x86_64-9.5.1.17_cuda12-archive/include/ 2025-05-07T19:49:23.8227013Z cudnn-linux-x86_64-9.5.1.17_cuda12-archive/include/cudnn_v9.h 2025-05-07T19:49:23.8228455Z cudnn-linux-x86_64-9.5.1.17_cuda12-archive/include/cudnn_adv_v9.h 2025-05-07T19:49:23.8229968Z cudnn-linux-x86_64-9.5.1.17_cuda12-archive/include/cudnn_backend_v9.h 2025-05-07T19:49:23.8231658Z cudnn-linux-x86_64-9.5.1.17_cuda12-archive/include/cudnn_cnn_v9.h 2025-05-07T19:49:23.8232817Z cudnn-linux-x86_64-9.5.1.17_cuda12-archive/include/cudnn_graph_v9.h 2025-05-07T19:49:23.8233336Z cudnn-linux-x86_64-9.5.1.17_cuda12-archive/include/cudnn_ops_v9.h 2025-05-07T19:49:23.8234175Z cudnn-linux-x86_64-9.5.1.17_cuda12-archive/include/cudnn_version_v9.h 2025-05-07T19:49:23.8234702Z cudnn-linux-x86_64-9.5.1.17_cuda12-archive/include/cudnn.h 2025-05-07T19:49:23.8235175Z cudnn-linux-x86_64-9.5.1.17_cuda12-archive/include/cudnn_adv.h 2025-05-07T19:49:23.8235703Z cudnn-linux-x86_64-9.5.1.17_cuda12-archive/include/cudnn_backend.h 2025-05-07T19:49:23.8236215Z cudnn-linux-x86_64-9.5.1.17_cuda12-archive/include/cudnn_cnn.h 2025-05-07T19:49:23.8236741Z cudnn-linux-x86_64-9.5.1.17_cuda12-archive/include/cudnn_graph.h 2025-05-07T19:49:23.8237239Z cudnn-linux-x86_64-9.5.1.17_cuda12-archive/include/cudnn_ops.h 2025-05-07T19:49:23.8237770Z cudnn-linux-x86_64-9.5.1.17_cuda12-archive/include/cudnn_version.h 2025-05-07T19:49:23.8238258Z cudnn-linux-x86_64-9.5.1.17_cuda12-archive/LICENSE 2025-05-07T19:49:23.8244757Z 2025-05-07T19:49:23.8245553Z [INSTALL] Moving cuDNN files to /__w/FBGEMM/FBGEMM/build_only/cudnn ... 2025-05-07T19:49:23.8246686Z + rm -rf /__w/FBGEMM/FBGEMM/build_only/cudnn/include 2025-05-07T19:49:23.8246956Z 2025-05-07T19:49:23.8264186Z 2025-05-07T19:49:23.8265185Z + rm -rf /__w/FBGEMM/FBGEMM/build_only/cudnn/lib 2025-05-07T19:49:23.8265659Z 2025-05-07T19:49:23.8279192Z 2025-05-07T19:49:23.8280355Z + mv cudnn-linux-x86_64-9.5.1.17_cuda12-archive/include /__w/FBGEMM/FBGEMM/build_only/cudnn 2025-05-07T19:49:23.8281587Z 2025-05-07T19:49:23.8307107Z 2025-05-07T19:49:23.8307990Z + mv cudnn-linux-x86_64-9.5.1.17_cuda12-archive/lib /__w/FBGEMM/FBGEMM/build_only/cudnn 2025-05-07T19:49:23.8308417Z 2025-05-07T19:49:24.9689706Z 2025-05-07T19:49:24.9690284Z /__w/FBGEMM/FBGEMM 2025-05-07T19:49:24.9691100Z + rm -rf /tmp/tmp.t2O7EmpC61 2025-05-07T19:49:24.9691655Z 2025-05-07T19:49:25.4048487Z 2025-05-07T19:49:25.4061650Z [INSTALL] Set environment variables CUDNN_INCLUDE_DIR and CUDNN_LIBRARY ... 2025-05-07T19:49:25.4062702Z + conda env config vars set -n build_binary CUDNN_INCLUDE_DIR=/__w/FBGEMM/FBGEMM/build_only/cudnn/include CUDNN_LIBRARY=/__w/FBGEMM/FBGEMM/build_only/cudnn/lib 2025-05-07T19:49:25.4063442Z 2025-05-07T19:49:25.8233316Z 2025-05-07T19:49:25.8234141Z [INSTALL] Successfully installed cuDNN (for CUDA 12.6.3) 2025-05-07T19:49:25.8305163Z ##[group]Run . $PRELUDE; cd fbgemm_gpu; prepare_fbgemm_gpu_build $BUILD_ENV 2025-05-07T19:49:25.8305790Z . $PRELUDE; cd fbgemm_gpu; prepare_fbgemm_gpu_build $BUILD_ENV 2025-05-07T19:49:25.8306449Z shell: bash --noprofile --norc -e -o pipefail {0} 2025-05-07T19:49:25.8306814Z env: 2025-05-07T19:49:25.8307053Z PRELUDE: .github/scripts/setup_env.bash 2025-05-07T19:49:25.8307396Z BUILD_ENV: build_binary 2025-05-07T19:49:25.8307690Z BUILD_TARGET: default 2025-05-07T19:49:25.8307934Z BUILD_VARIANT: cuda 2025-05-07T19:49:25.8308210Z BUILD_CUDA_VERSION: 12.6.3 2025-05-07T19:49:25.8308472Z ##[endgroup] 2025-05-07T19:49:26.3074977Z ################################################################################ 2025-05-07T19:49:26.3075434Z # Prepare FBGEMM-GPU Build 2025-05-07T19:49:26.3075743Z # 2025-05-07T19:49:26.3092983Z # [2025-05-07T19:49:26.308Z] + prepare_fbgemm_gpu_build build_binary 2025-05-07T19:49:26.3094191Z ################################################################################ 2025-05-07T19:49:26.3094439Z 2025-05-07T19:49:26.3109825Z [EXEC] [ATTEMPT 0/3] + wget -q --timeout 1 pypi.org -O /dev/null 2025-05-07T19:49:26.3945170Z [CHECK] Network does not appear to be blocked. 2025-05-07T19:49:26.4056471Z [BUILD] Running git submodules update ... 2025-05-07T19:49:26.4056894Z [EXEC] [ATTEMPT 0/3] + git submodule sync 2025-05-07T19:49:26.4326843Z Synchronizing submodule url for '../external/asmjit' 2025-05-07T19:49:26.4327370Z Synchronizing submodule url for '../external/composable_kernel' 2025-05-07T19:49:26.4327871Z Synchronizing submodule url for '../external/cpuinfo' 2025-05-07T19:49:26.4328328Z Synchronizing submodule url for '../external/cutlass' 2025-05-07T19:49:26.4328756Z Synchronizing submodule url for '../external/googletest' 2025-05-07T19:49:26.4329243Z Synchronizing submodule url for '../external/hipify_torch' 2025-05-07T19:49:26.4329992Z Synchronizing submodule url for '../external/json' 2025-05-07T19:49:26.4363907Z [EXEC] [ATTEMPT 0/3] + git submodule update --init --recursive 2025-05-07T19:49:26.4798955Z [BUILD] Installing other build dependencies ... 2025-05-07T19:49:26.4820489Z [EXEC] [ATTEMPT 0/3] + conda run --no-capture-output -n build_binary python -m pip install -r requirements.txt 2025-05-07T19:49:28.3627008Z Collecting backports.tarfile (from -r requirements.txt (line 13)) 2025-05-07T19:49:28.3825843Z Downloading backports.tarfile-1.2.0-py3-none-any.whl.metadata (2.0 kB) 2025-05-07T19:49:28.3903190Z Requirement already satisfied: build in /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages (from -r requirements.txt (line 14)) (1.2.2.post1) 2025-05-07T19:49:28.4970400Z Collecting cmake (from -r requirements.txt (line 15)) 2025-05-07T19:49:28.4998820Z Downloading cmake-4.0.0-py3-none-manylinux_2_17_x86_64.manylinux2014_x86_64.whl.metadata (6.3 kB) 2025-05-07T19:49:28.5070280Z Requirement already satisfied: click in /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages (from -r requirements.txt (line 16)) (8.1.8) 2025-05-07T19:49:28.5074492Z Requirement already satisfied: hypothesis in /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages (from -r requirements.txt (line 17)) (6.131.14) 2025-05-07T19:49:28.5076552Z Requirement already satisfied: jinja2 in /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages (from -r requirements.txt (line 18)) (3.1.6) 2025-05-07T19:49:28.5077851Z Requirement already satisfied: mpmath==1.3.0 in /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages (from -r requirements.txt (line 19)) (1.3.0) 2025-05-07T19:49:28.5450002Z Collecting ninja (from -r requirements.txt (line 20)) 2025-05-07T19:49:28.5481564Z Downloading ninja-1.11.1.4-py3-none-manylinux_2_12_x86_64.manylinux2010_x86_64.whl.metadata (5.0 kB) 2025-05-07T19:49:28.5554770Z Requirement already satisfied: numpy>=2.0.2 in /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages (from -r requirements.txt (line 21)) (2.2.5) 2025-05-07T19:49:28.5697651Z Collecting pyre-extensions (from -r requirements.txt (line 22)) 2025-05-07T19:49:28.5731331Z Downloading pyre_extensions-0.0.32-py3-none-any.whl.metadata (4.0 kB) 2025-05-07T19:49:28.5796721Z Requirement already satisfied: pyyaml in /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages (from -r requirements.txt (line 23)) (6.0.2) 2025-05-07T19:49:28.5798106Z Requirement already satisfied: scikit-build in /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages (from -r requirements.txt (line 24)) (0.18.1) 2025-05-07T19:49:28.5809564Z Requirement already satisfied: setuptools in /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages (from -r requirements.txt (line 25)) (78.1.1) 2025-05-07T19:49:28.6013053Z Collecting setuptools_git_versioning (from -r requirements.txt (line 26)) 2025-05-07T19:49:28.6045353Z Downloading setuptools_git_versioning-2.1.0-py3-none-any.whl.metadata (6.1 kB) 2025-05-07T19:49:28.6223848Z Collecting tabulate (from -r requirements.txt (line 27)) 2025-05-07T19:49:28.6261606Z Downloading tabulate-0.9.0-py3-none-any.whl.metadata (34 kB) 2025-05-07T19:49:28.6539920Z Collecting patchelf (from -r requirements.txt (line 28)) 2025-05-07T19:49:28.6586315Z Downloading patchelf-0.17.2.2-py3-none-manylinux1_x86_64.manylinux_2_5_x86_64.musllinux_1_1_x86_64.whl.metadata (3.5 kB) 2025-05-07T19:49:28.6673300Z Requirement already satisfied: packaging>=19.1 in /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages (from build->-r requirements.txt (line 14)) (25.0) 2025-05-07T19:49:28.6675975Z Requirement already satisfied: pyproject_hooks in /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages (from build->-r requirements.txt (line 14)) (1.2.0) 2025-05-07T19:49:28.6721453Z Requirement already satisfied: attrs>=22.2.0 in /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages (from hypothesis->-r requirements.txt (line 17)) (25.3.0) 2025-05-07T19:49:28.6728553Z Requirement already satisfied: sortedcontainers<3.0.0,>=2.1.0 in /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages (from hypothesis->-r requirements.txt (line 17)) (2.4.0) 2025-05-07T19:49:28.6778631Z Requirement already satisfied: MarkupSafe>=2.0 in /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages (from jinja2->-r requirements.txt (line 18)) (3.0.2) 2025-05-07T19:49:28.6906674Z Collecting typing-inspect (from pyre-extensions->-r requirements.txt (line 22)) 2025-05-07T19:49:28.6941431Z Downloading typing_inspect-0.9.0-py3-none-any.whl.metadata (1.5 kB) 2025-05-07T19:49:28.7014764Z Requirement already satisfied: typing-extensions in /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages (from pyre-extensions->-r requirements.txt (line 22)) (4.13.2) 2025-05-07T19:49:28.7028592Z Requirement already satisfied: distro in /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages (from scikit-build->-r requirements.txt (line 24)) (1.9.0) 2025-05-07T19:49:28.7038384Z Requirement already satisfied: wheel>=0.32.0 in /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages (from scikit-build->-r requirements.txt (line 24)) (0.45.1) 2025-05-07T19:49:28.7314902Z Collecting mypy-extensions>=0.3.0 (from typing-inspect->pyre-extensions->-r requirements.txt (line 22)) 2025-05-07T19:49:28.7349692Z Downloading mypy_extensions-1.1.0-py3-none-any.whl.metadata (1.1 kB) 2025-05-07T19:49:28.7461400Z Downloading backports.tarfile-1.2.0-py3-none-any.whl (30 kB) 2025-05-07T19:49:28.7563064Z Downloading cmake-4.0.0-py3-none-manylinux_2_17_x86_64.manylinux2014_x86_64.whl (27.9 MB) 2025-05-07T19:49:28.9358614Z ━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━ 27.9/27.9 MB 156.9 MB/s eta 0:00:00 2025-05-07T19:49:28.9393530Z Downloading ninja-1.11.1.4-py3-none-manylinux_2_12_x86_64.manylinux2010_x86_64.whl (422 kB) 2025-05-07T19:49:28.9474864Z Downloading pyre_extensions-0.0.32-py3-none-any.whl (12 kB) 2025-05-07T19:49:28.9543174Z Downloading setuptools_git_versioning-2.1.0-py3-none-any.whl (10 kB) 2025-05-07T19:49:28.9608593Z Downloading tabulate-0.9.0-py3-none-any.whl (35 kB) 2025-05-07T19:49:28.9669065Z Downloading patchelf-0.17.2.2-py3-none-manylinux1_x86_64.manylinux_2_5_x86_64.musllinux_1_1_x86_64.whl (466 kB) 2025-05-07T19:49:28.9754255Z Downloading typing_inspect-0.9.0-py3-none-any.whl (8.8 kB) 2025-05-07T19:49:28.9821549Z Downloading mypy_extensions-1.1.0-py3-none-any.whl (5.0 kB) 2025-05-07T19:49:29.1322303Z Installing collected packages: tabulate, setuptools_git_versioning, patchelf, ninja, mypy-extensions, cmake, backports.tarfile, typing-inspect, pyre-extensions 2025-05-07T19:49:29.9877050Z 2025-05-07T19:49:29.9920522Z Successfully installed backports.tarfile-1.2.0 cmake-4.0.0 mypy-extensions-1.1.0 ninja-1.11.1.4 patchelf-0.17.2.2 pyre-extensions-0.0.32 setuptools_git_versioning-2.1.0 tabulate-0.9.0 typing-inspect-0.9.0 2025-05-07T19:49:29.9926362Z WARNING: Running pip as the 'root' user can result in broken permissions and conflicting behaviour with the system package manager, possibly rendering your system unusable. It is recommended to use a virtual environment instead: https://pip.pypa.io/warnings/venv. Use the --root-user-action option if you know what you are doing and want to suppress this warning. 2025-05-07T19:49:30.1171477Z ################################################################################ 2025-05-07T19:49:30.1172568Z # Install PyTorch (PyTorch PIP) 2025-05-07T19:49:30.1173123Z # 2025-05-07T19:49:30.1191767Z # [2025-05-07T19:49:30.118Z] + install_triton_pip build_binary 2025-05-07T19:49:30.1193035Z ################################################################################ 2025-05-07T19:49:30.1193719Z 2025-05-07T19:49:30.1194442Z [BUILD] Installing pytorch-triton nightly/3.2.0+git4b3bb1f8 from PIP ... 2025-05-07T19:49:30.1195773Z ################################################################################ 2025-05-07T19:49:30.1197357Z # Install Package From PyTorch PIP: pytorch-triton 2025-05-07T19:49:30.1198233Z # 2025-05-07T19:49:30.1208928Z # [2025-05-07T19:49:30.120Z] + install_from_pytorch_pip build_binary pytorch-triton nightly/3.2.0+git4b3bb1f8 2025-05-07T19:49:30.1210679Z ################################################################################ 2025-05-07T19:49:30.1211359Z 2025-05-07T19:49:30.1225645Z [EXEC] [ATTEMPT 0/3] + wget -q --timeout 1 pypi.org -O /dev/null 2025-05-07T19:49:30.2046814Z [CHECK] Network does not appear to be blocked. 2025-05-07T19:49:30.2047576Z ################################################################################ 2025-05-07T19:49:30.2048210Z # Prepare PIP Arguments (PyTorch PIP) 2025-05-07T19:49:30.2048705Z # 2025-05-07T19:49:30.2071466Z # [2025-05-07T19:49:30.206Z] + __prepare_pip_arguments pytorch-triton nightly/3.2.0+git4b3bb1f8 2025-05-07T19:49:30.2072059Z ################################################################################ 2025-05-07T19:49:30.2072326Z 2025-05-07T19:49:30.2123652Z [INSTALL] Extracted package (channel, version): (nightly, 3.2.0+git4b3bb1f8) 2025-05-07T19:49:30.2137071Z [INSTALL] Using a non-RELEASE channel: nightly ... 2025-05-07T19:49:30.2137865Z [INSTALL] Extracted the full PIP channel: https://download.pytorch.org/whl/nightly/ 2025-05-07T19:49:30.2140708Z [INSTALL] Extracted the full PIP package: --pre pytorch-triton==3.2.0+git4b3bb1f8 2025-05-07T19:49:30.2152905Z [INSTALL] Attempting to install [pytorch-triton, 3.2.0+git4b3bb1f8] from PyTorch PIP using channel https://download.pytorch.org/whl/nightly/ ... 2025-05-07T19:49:30.2171148Z [EXEC] [ATTEMPT 0/3] + conda run -n build_binary pip install --pre pytorch-triton==3.2.0+git4b3bb1f8 --index-url https://download.pytorch.org/whl/nightly/ 2025-05-07T19:49:35.4162011Z ERROR: pip's dependency resolver does not currently take into account all the packages that are installed. This behaviour is the source of the following dependency conflicts. 2025-05-07T19:49:35.4163892Z torch 2.8.0.dev20250507+cu126 requires pytorch-triton==3.3.0+git96316ce5; platform_system == "Linux" and platform_machine == "x86_64", but you have pytorch-triton 3.2.0+git4b3bb1f8 which is incompatible. 2025-05-07T19:49:35.4166365Z WARNING: Running pip as the 'root' user can result in broken permissions and conflicting behaviour with the system package manager, possibly rendering your system unusable. It is recommended to use a virtual environment instead: https://pip.pypa.io/warnings/venv. Use the --root-user-action option if you know what you are doing and want to suppress this warning. 2025-05-07T19:49:35.4167880Z 2025-05-07T19:49:35.4168138Z Looking in indexes: https://download.pytorch.org/whl/nightly/ 2025-05-07T19:49:35.4168619Z Collecting pytorch-triton==3.2.0+git4b3bb1f8 2025-05-07T19:49:35.4169515Z Downloading https://download.pytorch.org/whl/nightly/pytorch_triton-3.2.0%2Bgit4b3bb1f8-cp312-cp312-manylinux_2_27_x86_64.manylinux_2_28_x86_64.whl.metadata (1.3 kB) 2025-05-07T19:49:35.4170918Z Downloading https://download.pytorch.org/whl/nightly/pytorch_triton-3.2.0%2Bgit4b3bb1f8-cp312-cp312-manylinux_2_27_x86_64.manylinux_2_28_x86_64.whl (166.5 MB) 2025-05-07T19:49:35.4172305Z ━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━ 166.5/166.5 MB 190.8 MB/s eta 0:00:00 2025-05-07T19:49:35.4172733Z Installing collected packages: pytorch-triton 2025-05-07T19:49:35.4173149Z Attempting uninstall: pytorch-triton 2025-05-07T19:49:35.4173577Z Found existing installation: pytorch-triton 3.3.0+git96316ce5 2025-05-07T19:49:35.4174072Z Uninstalling pytorch-triton-3.3.0+git96316ce5: 2025-05-07T19:49:35.4174528Z Successfully uninstalled pytorch-triton-3.3.0+git96316ce5 2025-05-07T19:49:35.4175035Z Successfully installed pytorch-triton-3.2.0+git4b3bb1f8 2025-05-07T19:49:35.4175324Z 2025-05-07T19:49:37.3187676Z [CHECK] Python (sub-)package 'triton' found ... 2025-05-07T19:49:37.3188349Z [CHECK] Printing out the pytorch-triton version ... 2025-05-07T19:49:39.1343131Z ################################################################################ 2025-05-07T19:49:39.1344171Z [CHECK] The installed VERSION of pytorch-triton is: 3.2.0 2025-05-07T19:49:39.1344719Z ################################################################################ 2025-05-07T19:49:39.1344954Z 2025-05-07T19:49:40.8791126Z [CHECK] Python (sub-)package 'numpy' found ... 2025-05-07T19:49:42.7195379Z [CHECK] Python (sub-)package 'skbuild' found ... 2025-05-07T19:49:42.7196548Z [BUILD] Successfully ran git submodules update 2025-05-07T19:49:42.7272913Z ##[group]Run . $PRELUDE; cd fbgemm_gpu; build_fbgemm_gpu_package $BUILD_ENV nightly default/cuda 2025-05-07T19:49:42.7273683Z . $PRELUDE; cd fbgemm_gpu; build_fbgemm_gpu_package $BUILD_ENV nightly default/cuda 2025-05-07T19:49:42.7274331Z shell: bash --noprofile --norc -e -o pipefail {0} 2025-05-07T19:49:42.7274666Z env: 2025-05-07T19:49:42.7274940Z PRELUDE: .github/scripts/setup_env.bash 2025-05-07T19:49:42.7275255Z BUILD_ENV: build_binary 2025-05-07T19:49:42.7275527Z BUILD_TARGET: default 2025-05-07T19:49:42.7275799Z BUILD_VARIANT: cuda 2025-05-07T19:49:42.7276062Z BUILD_CUDA_VERSION: 12.6.3 2025-05-07T19:49:42.7276321Z ##[endgroup] 2025-05-07T19:49:43.1507942Z [BUILD] BUILD_TARGET_VARIANT: default/cuda 2025-05-07T19:49:43.1508380Z [BUILD] Extracted build target: default 2025-05-07T19:49:43.1508728Z [BUILD] Extracted build variant: cuda 2025-05-07T19:49:44.7441658Z /github/home/miniconda/envs/build_binary/bin/cc 2025-05-07T19:49:44.7442468Z 2025-05-07T19:49:44.8007722Z [CHECK] Binary cc found in PATH 2025-05-07T19:49:46.4014202Z /github/home/miniconda/envs/build_binary/bin/gcc 2025-05-07T19:49:46.4015061Z 2025-05-07T19:49:46.4787627Z [CHECK] Binary gcc found in PATH 2025-05-07T19:49:48.0622211Z /github/home/miniconda/envs/build_binary/bin/c++ 2025-05-07T19:49:48.0622593Z 2025-05-07T19:49:48.1211504Z [CHECK] Binary c++ found in PATH 2025-05-07T19:49:49.7070822Z /github/home/miniconda/envs/build_binary/bin/g++ 2025-05-07T19:49:49.7071867Z 2025-05-07T19:49:49.7652781Z [CHECK] Binary g++ found in PATH 2025-05-07T19:49:51.4061102Z [BUILD] Extracted and set Python tag: py312 2025-05-07T19:49:51.4061682Z [BUILD] Extracted and set Python platform name: manylinux_2_28_x86_64 2025-05-07T19:49:51.4297271Z core = 24 2025-05-07T19:49:51.4519713Z sockets = 2 2025-05-07T19:49:51.4520100Z [BUILD] Set multicore run option for setup.py: -j 48 2025-05-07T19:49:51.4520525Z [CHECK] LD_LIBRARY_PATH = 2025-05-07T19:49:51.4520830Z [BUILD] Running pre-build cleanups ... 2025-05-07T19:49:51.4521236Z + rm -rf dist 2025-05-07T19:49:51.4521377Z 2025-05-07T19:49:51.4546848Z 2025-05-07T19:49:51.4547307Z + conda run --no-capture-output -n build_binary python setup.py clean 2025-05-07T19:49:51.4547664Z 2025-05-07T19:49:54.4075458Z INFO:root:running clean 2025-05-07T19:49:54.4075979Z [SETUP.PY] ARGV: ['setup.py', 'clean'] 2025-05-07T19:49:54.4077131Z [SETUP.PY] Parsed setup.py arguments: Namespace(verbose=False, debug=False, dryrun=False, build_target='default', build_variant='cuda', package_channel='nightly', nvml_lib_path=None, nccl_lib_path=None, use_fb_only=False, cxxprefix=None) 2025-05-07T19:49:54.4078279Z [SETUP.PY] Other arguments: ['clean'] 2025-05-07T19:49:54.4078813Z [SETUP.PY] CUDA CUB directory environment variable not set. Using default CUB location. 2025-05-07T19:49:54.4079406Z [SETUP.PY] Using CUDA = /github/home/miniconda/envs/build_binary 2025-05-07T19:49:54.4080046Z [SETUP.PY] Generating version file at: /__w/FBGEMM/FBGEMM/fbgemm_gpu/fbgemm_gpu/docs/version.py 2025-05-07T19:49:54.4080846Z [SETUP.PY] Setting the FBGEMM build target: default ... 2025-05-07T19:49:54.4081263Z [SETUP.PY] Setting the FBGEMM build variant: cuda ... 2025-05-07T19:49:54.4082617Z [SETUP.PY] Passing CMake arguments: ['-DCMAKE_PREFIX_PATH=/github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch', '-D_GLIBCXX_USE_CXX11_ABI=1', '-DFBGEMM_BUILD_TARGET=default', '-DFBGEMM_BUILD_VARIANT=cuda', "-DCMAKE_C_FLAGS=''", "-DCMAKE_CXX_FLAGS=''"] 2025-05-07T19:49:54.7545848Z 2025-05-07T19:49:54.7546358Z [BUILD] Printing git status ... 2025-05-07T19:49:54.7550516Z + git status 2025-05-07T19:49:54.7550656Z 2025-05-07T19:49:55.2426136Z HEAD detached at pull/4066/merge 2025-05-07T19:49:55.2427055Z Untracked files: 2025-05-07T19:49:55.2427563Z (use "git add ..." to include in what will be committed) 2025-05-07T19:49:55.2428042Z ../build_only/ 2025-05-07T19:49:55.2428297Z ../collect_env.py 2025-05-07T19:49:55.2428590Z fbgemm_gpu/docs/version.py 2025-05-07T19:49:55.2428779Z 2025-05-07T19:49:55.2429423Z nothing added to commit but untracked files present (use "git add" to track) 2025-05-07T19:49:55.2429833Z 2025-05-07T19:49:55.2429937Z + git diff 2025-05-07T19:49:55.2430064Z 2025-05-07T19:49:55.2715005Z 2025-05-07T19:49:55.2715686Z ################################################################################ 2025-05-07T19:49:55.2716757Z # Configure FBGEMM-GPU Build 2025-05-07T19:49:55.2717546Z # 2025-05-07T19:49:55.2744260Z # [2025-05-07T19:49:55.273Z] + __configure_fbgemm_gpu_build 2025-05-07T19:49:55.2745478Z ################################################################################ 2025-05-07T19:49:55.2746203Z 2025-05-07T19:49:55.2748329Z [BUILD] Setting the build target: default ... 2025-05-07T19:49:55.2749615Z [BUILD] Configuring build as CUDA variant (this is the default behavior) ... 2025-05-07T19:49:56.9019096Z /github/home/miniconda/envs/build_binary/bin/nvcc 2025-05-07T19:49:56.9019444Z 2025-05-07T19:49:56.9809588Z [CHECK] Binary nvcc found in PATH 2025-05-07T19:49:58.6115524Z /__w/FBGEMM/FBGEMM/build_only/cudnn/include 2025-05-07T19:49:58.6115831Z 2025-05-07T19:49:58.6874857Z [CHECK] Environment variable CUDNN_INCLUDE_DIR is defined in the Conda environment 2025-05-07T19:50:00.3130678Z /__w/FBGEMM/FBGEMM/build_only/cudnn/lib 2025-05-07T19:50:00.3131062Z 2025-05-07T19:50:00.3892259Z [CHECK] Environment variable CUDNN_LIBRARY is defined in the Conda environment 2025-05-07T19:50:02.0069086Z /github/home/miniconda/envs/build_binary/lib/stubs/libnvidia-ml.so 2025-05-07T19:50:02.0069475Z 2025-05-07T19:50:02.0654987Z [CHECK] Environment variable NVML_LIB_PATH is defined in the Conda environment 2025-05-07T19:50:03.7397534Z [BUILD] Using the default architectures for CUDA nvcc: NVIDIA (R) Cuda compiler driver 2025-05-07T19:50:03.7398427Z Copyright (c) 2005-2024 NVIDIA Corporation 2025-05-07T19:50:03.7398775Z Built on Tue_Oct_29_23:50:19_PDT_2024 2025-05-07T19:50:03.7399148Z Cuda compilation tools, release 12.6, V12.6.85 2025-05-07T19:50:03.7399553Z Build cuda_12.6.r12.6/compiler.35059454_0 ... 2025-05-07T19:50:03.7399991Z [BUILD] Setting the following CUDA targets: 7.0;8.0;9.0;9.0a 2025-05-07T19:50:03.7400384Z [BUILD] Looking up NVML filepath ... 2025-05-07T19:50:05.4141161Z [BUILD] Looking up NCCL filepath ... 2025-05-07T19:50:08.8221463Z [BUILD] Setting NVCC verbose mode ... 2025-05-07T19:50:08.8222682Z + conda env config vars set -n build_binary NVCC_VERBOSE=1 2025-05-07T19:50:08.8223517Z 2025-05-07T19:50:09.2412953Z 2025-05-07T19:50:09.2413543Z [BUILD] Setting CUDA build args ... 2025-05-07T19:50:10.8863183Z [BUILD] Looking up CUDA version ... 2025-05-07T19:50:14.2188825Z + conda run -n build_binary c++ --version | grep -i clang 2025-05-07T19:50:14.2189194Z 2025-05-07T19:50:15.8754692Z clang version 16.0.6 (https://github.com/conda-forge/clangdev-feedstock db6970f6bb85e49860ed8bab43ebf165b5c55cc4) 2025-05-07T19:50:15.8757342Z Configuration file: /github/home/miniconda/envs/build_binary/bin/x86_64-conda-linux-gnu-clang++.cfg 2025-05-07T19:50:15.8758736Z 2025-05-07T19:50:15.8759093Z [BUILD] Setting NVCC flags ... 2025-05-07T19:50:15.8760580Z + conda env config vars set -n build_binary NVCC_PREPEND_FLAGS="-std=c++20 -Xcompiler -std=c++20 -Xcompiler -stdlib=libstdc++ -ccbin /github/home/miniconda/envs/build_binary/bin/c++ -allow-unsupported-compiler" 2025-05-07T19:50:15.8761508Z 2025-05-07T19:50:16.2836984Z 2025-05-07T19:50:16.2838173Z + conda run -n build_binary printenv NVCC_PREPEND_FLAGS 2025-05-07T19:50:16.2838510Z 2025-05-07T19:50:17.8670771Z -std=c++20 -Xcompiler -std=c++20 -Xcompiler -stdlib=libstdc++ -ccbin /github/home/miniconda/envs/build_binary/bin/c++ -allow-unsupported-compiler 2025-05-07T19:50:17.8671918Z 2025-05-07T19:50:17.9234630Z 2025-05-07T19:50:17.9235544Z [BUILD] Setting CUDA build args ... 2025-05-07T19:50:17.9235987Z + conda run -n build_binary c++ --version 2025-05-07T19:50:17.9236236Z 2025-05-07T19:50:19.5314920Z clang version 16.0.6 (https://github.com/conda-forge/clangdev-feedstock db6970f6bb85e49860ed8bab43ebf165b5c55cc4) 2025-05-07T19:50:19.5316534Z Target: x86_64-conda-linux-gnu 2025-05-07T19:50:19.5316829Z Thread model: posix 2025-05-07T19:50:19.5317171Z InstalledDir: /github/home/miniconda/envs/build_binary/bin 2025-05-07T19:50:19.5317800Z Configuration file: /github/home/miniconda/envs/build_binary/bin/x86_64-conda-linux-gnu-clang++.cfg 2025-05-07T19:50:19.5318280Z 2025-05-07T19:50:19.5881282Z 2025-05-07T19:50:19.5881701Z + conda run -n build_binary c++ --version | grep -i clang 2025-05-07T19:50:19.5882230Z 2025-05-07T19:50:21.2543271Z clang version 16.0.6 (https://github.com/conda-forge/clangdev-feedstock db6970f6bb85e49860ed8bab43ebf165b5c55cc4) 2025-05-07T19:50:21.2544269Z Configuration file: /github/home/miniconda/envs/build_binary/bin/x86_64-conda-linux-gnu-clang++.cfg 2025-05-07T19:50:21.2544743Z 2025-05-07T19:50:21.2544942Z [BUILD] Clang is available; configuring for Clang-based build ... 2025-05-07T19:50:22.9017775Z .github/scripts/fbgemm_gpu_build.bash: line 370: [: : integer expression expected 2025-05-07T19:50:22.9018516Z [BUILD] Enabling debug features in the build ... 2025-05-07T19:50:22.9021215Z [BUILD] FBGEMM_GPU build arguments have been set: --verbose --build-target=default --build-variant=cuda --nvml_lib_path=/github/home/miniconda/envs/build_binary/lib/stubs/libnvidia-ml.so --nccl_lib_path=/github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/nvidia/nccl/lib/libnccl.so.2 -DTORCH_CUDA_ARCH_LIST='7.0;8.0;9.0;9.0a' -DCUDA_TOOLKIT_ROOT_DIR=/github/home/miniconda/envs/build_binary/targets/x86_64-linux -DCUDAToolkit_ROOT=/github/home/miniconda/envs/build_binary/targets/x86_64-linux -DCMAKE_CXX_STANDARD=20 --cxxprefix=/github/home/miniconda/envs/build_binary --debug 2025-05-07T19:50:22.9023769Z ################################################################################ 2025-05-07T19:50:22.9024096Z # Build FBGEMM-GPU Package (Wheel) 2025-05-07T19:50:22.9024388Z # 2025-05-07T19:50:22.9034415Z # [2025-05-07T19:50:22.902Z] + build_fbgemm_gpu_package build_binary nightly default/cuda 2025-05-07T19:50:22.9035881Z ################################################################################ 2025-05-07T19:50:22.9036547Z 2025-05-07T19:50:22.9037136Z [BUILD] Building FBGEMM wheel (TARGET=default, VARIANT=cuda) ... 2025-05-07T19:50:22.9044739Z + conda run --no-capture-output -n build_binary python -m build --wheel --no-isolation --config-setting=--build-option=--verbose --config-setting=--build-option=--build-target=default --config-setting=--build-option=--build-variant=cuda --config-setting=--build-option=--nvml_lib_path=/github/home/miniconda/envs/build_binary/lib/stubs/libnvidia-ml.so --config-setting=--build-option=--nccl_lib_path=/github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/nvidia/nccl/lib/libnccl.so.2 --config-setting=--build-option=-DTORCH_CUDA_ARCH_LIST='7.0;8.0;9.0;9.0a' --config-setting=--build-option=-DCUDA_TOOLKIT_ROOT_DIR=/github/home/miniconda/envs/build_binary/targets/x86_64-linux --config-setting=--build-option=-DCUDAToolkit_ROOT=/github/home/miniconda/envs/build_binary/targets/x86_64-linux --config-setting=--build-option=-DCMAKE_CXX_STANDARD=20 --config-setting=--build-option=--cxxprefix=/github/home/miniconda/envs/build_binary --config-setting=--build-option=--debug --config-setting=--build-option=--package_channel=nightly --config-setting=--build-option=--python-tag=py312 --config-setting=--build-option=--plat-name=manylinux_2_28_x86_64 2025-05-07T19:50:22.9049357Z 2025-05-07T19:50:24.5330517Z * Getting build dependencies for wheel... 2025-05-07T19:50:25.9242584Z INFO:root:running egg_info 2025-05-07T19:50:25.9280981Z INFO:root:creating fbgemm_gpu_nightly.egg-info 2025-05-07T19:50:25.9281753Z INFO:root:writing fbgemm_gpu_nightly.egg-info/PKG-INFO 2025-05-07T19:50:25.9285814Z INFO:root:writing dependency_links to fbgemm_gpu_nightly.egg-info/dependency_links.txt 2025-05-07T19:50:25.9289194Z INFO:root:writing requirements to fbgemm_gpu_nightly.egg-info/requires.txt 2025-05-07T19:50:25.9291046Z INFO:root:writing top-level names to fbgemm_gpu_nightly.egg-info/top_level.txt 2025-05-07T19:50:25.9292147Z INFO:root:writing manifest file 'fbgemm_gpu_nightly.egg-info/SOURCES.txt' 2025-05-07T19:50:25.9356355Z INFO:root:reading manifest file 'fbgemm_gpu_nightly.egg-info/SOURCES.txt' 2025-05-07T19:50:25.9368598Z INFO:root:writing manifest file 'fbgemm_gpu_nightly.egg-info/SOURCES.txt' 2025-05-07T19:50:25.9372254Z [SETUP.PY] ARGV: ['setup.py', 'egg_info'] 2025-05-07T19:50:25.9375388Z [SETUP.PY] Parsed setup.py arguments: Namespace(verbose=False, debug=False, dryrun=False, build_target='default', build_variant='cuda', package_channel='nightly', nvml_lib_path=None, nccl_lib_path=None, use_fb_only=False, cxxprefix=None) 2025-05-07T19:50:25.9378677Z [SETUP.PY] Other arguments: ['egg_info'] 2025-05-07T19:50:25.9379194Z [SETUP.PY] CUDA CUB directory environment variable not set. Using default CUB location. 2025-05-07T19:50:25.9379744Z [SETUP.PY] Using CUDA = /github/home/miniconda/envs/build_binary 2025-05-07T19:50:25.9380346Z [SETUP.PY] Generating version file at: /__w/FBGEMM/FBGEMM/fbgemm_gpu/fbgemm_gpu/docs/version.py 2025-05-07T19:50:25.9380891Z [SETUP.PY] Setting the FBGEMM build target: default ... 2025-05-07T19:50:25.9381312Z [SETUP.PY] Setting the FBGEMM build variant: cuda ... 2025-05-07T19:50:25.9382491Z [SETUP.PY] Passing CMake arguments: ['-DCMAKE_PREFIX_PATH=/github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch', '-D_GLIBCXX_USE_CXX11_ABI=1', '-DFBGEMM_BUILD_TARGET=default', '-DFBGEMM_BUILD_VARIANT=cuda', "-DCMAKE_C_FLAGS=''", "-DCMAKE_CXX_FLAGS=''"] 2025-05-07T19:50:26.2491791Z * Building wheel... 2025-05-07T19:50:27.6383703Z [SETUP.PY] ARGV: ['setup.py', 'bdist_wheel', '--dist-dir', '/__w/FBGEMM/FBGEMM/fbgemm_gpu/dist/.tmp-8whs2af1', '--verbose', '--build-target=default', '--build-variant=cuda', '--nvml_lib_path=/github/home/miniconda/envs/build_binary/lib/stubs/libnvidia-ml.so', '--nccl_lib_path=/github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/nvidia/nccl/lib/libnccl.so.2', '-DTORCH_CUDA_ARCH_LIST=7.0;8.0;9.0;9.0a', '-DCUDA_TOOLKIT_ROOT_DIR=/github/home/miniconda/envs/build_binary/targets/x86_64-linux', '-DCUDAToolkit_ROOT=/github/home/miniconda/envs/build_binary/targets/x86_64-linux', '-DCMAKE_CXX_STANDARD=20', '--cxxprefix=/github/home/miniconda/envs/build_binary', '--debug', '--package_channel=nightly', '--python-tag=py312', '--plat-name=manylinux_2_28_x86_64'] 2025-05-07T19:50:27.6388257Z [SETUP.PY] Parsed setup.py arguments: Namespace(verbose=True, debug=True, dryrun=False, build_target='default', build_variant='cuda', package_channel='nightly', nvml_lib_path='/github/home/miniconda/envs/build_binary/lib/stubs/libnvidia-ml.so', nccl_lib_path='/github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/nvidia/nccl/lib/libnccl.so.2', use_fb_only=False, cxxprefix='/github/home/miniconda/envs/build_binary') 2025-05-07T19:50:27.6391476Z [SETUP.PY] Other arguments: ['bdist_wheel', '--dist-dir', '/__w/FBGEMM/FBGEMM/fbgemm_gpu/dist/.tmp-8whs2af1', '-DTORCH_CUDA_ARCH_LIST=7.0;8.0;9.0;9.0a', '-DCUDA_TOOLKIT_ROOT_DIR=/github/home/miniconda/envs/build_binary/targets/x86_64-linux', '-DCUDAToolkit_ROOT=/github/home/miniconda/envs/build_binary/targets/x86_64-linux', '-DCMAKE_CXX_STANDARD=20', '--python-tag=py312', '--plat-name=manylinux_2_28_x86_64'] 2025-05-07T19:50:27.6393560Z [SETUP.PY] CUDA CUB directory environment variable not set. Using default CUB location. 2025-05-07T19:50:27.6394153Z [SETUP.PY] Using CUDA = /github/home/miniconda/envs/build_binary 2025-05-07T19:50:27.6394783Z [SETUP.PY] Generating version file at: /__w/FBGEMM/FBGEMM/fbgemm_gpu/fbgemm_gpu/docs/version.py 2025-05-07T19:50:27.6395777Z [SETUP.PY] Setting the FBGEMM build target: default ... 2025-05-07T19:50:27.6396261Z [SETUP.PY] Setting the FBGEMM build variant: cuda ... 2025-05-07T19:50:27.6402486Z [SETUP.PY] Passing CMake arguments: ['-DCMAKE_PREFIX_PATH=/github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch', '-D_GLIBCXX_USE_CXX11_ABI=1', '-DCMAKE_VERBOSE_MAKEFILE=ON', '-DCMAKE_EXPORT_COMPILE_COMMANDS=TRUE', '-DFBGEMM_BUILD_TARGET=default', '-DFBGEMM_BUILD_VARIANT=cuda', '-DNVML_LIB_PATH=/github/home/miniconda/envs/build_binary/lib/stubs/libnvidia-ml.so', '-DNCCL_INCLUDE_DIRS=/github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/nvidia/nccl/include', '-DNCCL_LIBRARIES=/github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/nvidia/nccl/lib/libnccl.so.2', '-DCMAKE_C_COMPILER=/github/home/miniconda/envs/build_binary/bin/cc', '-DCMAKE_CXX_COMPILER=/github/home/miniconda/envs/build_binary/bin/c++', "-DCMAKE_C_FLAGS='-DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include'", "-DCMAKE_CXX_FLAGS='-DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include'", '-DTORCH_CUDA_ARCH_LIST=7.0;8.0;9.0;9.0a', '-DCUDA_TOOLKIT_ROOT_DIR=/github/home/miniconda/envs/build_binary/targets/x86_64-linux', '-DCUDAToolkit_ROOT=/github/home/miniconda/envs/build_binary/targets/x86_64-linux', '-DCMAKE_CXX_STANDARD=20'] 2025-05-07T19:50:27.6408350Z 2025-05-07T19:50:27.6408355Z 2025-05-07T19:50:27.6408524Z -------------------------------------------------------------------------------- 2025-05-07T19:50:27.6408930Z -- Trying 'Ninja' generator 2025-05-07T19:50:27.6409195Z -------------------------------- 2025-05-07T19:50:27.6409479Z --------------------------- 2025-05-07T19:50:27.6409724Z ---------------------- 2025-05-07T19:50:27.6410020Z ----------------- 2025-05-07T19:50:27.6410240Z ------------ 2025-05-07T19:50:27.6410467Z ------- 2025-05-07T19:50:27.6410665Z -- 2025-05-07T19:50:27.6803095Z CMake Deprecation Warning at CMakeLists.txt:1 (cmake_minimum_required): 2025-05-07T19:50:27.6804700Z Not searching for unused variables given on the command line. 2025-05-07T19:50:27.6806283Z Compatibility with CMake < 3.10 will be removed from a future version of 2025-05-07T19:50:27.6807561Z CMake. 2025-05-07T19:50:27.6807894Z 2025-05-07T19:50:27.6808555Z Update the VERSION argument value. Or, use the ... syntax 2025-05-07T19:50:27.6810222Z to tell CMake that the project requires at least but has been updated 2025-05-07T19:50:27.6811630Z to work with policies introduced by or earlier. 2025-05-07T19:50:27.6812512Z 2025-05-07T19:50:27.6812517Z 2025-05-07T19:50:27.7669198Z -- The C compiler identification is Clang 16.0.6 2025-05-07T19:50:27.7776783Z -- Detecting C compiler ABI info 2025-05-07T19:50:27.9075386Z -- Detecting C compiler ABI info - done 2025-05-07T19:50:27.9200263Z -- Check for working C compiler: /github/home/miniconda/envs/build_binary/bin/cc - skipped 2025-05-07T19:50:27.9202612Z -- Detecting C compile features 2025-05-07T19:50:27.9205383Z -- Detecting C compile features - done 2025-05-07T19:50:28.0711070Z -- The CXX compiler identification is Clang 16.0.6 2025-05-07T19:50:28.0794309Z -- Detecting CXX compiler ABI info 2025-05-07T19:50:28.2329712Z -- Detecting CXX compiler ABI info - done 2025-05-07T19:50:28.2460151Z -- Check for working CXX compiler: /github/home/miniconda/envs/build_binary/bin/c++ - skipped 2025-05-07T19:50:28.2461776Z -- Detecting CXX compile features 2025-05-07T19:50:28.2467862Z -- Detecting CXX compile features - done 2025-05-07T19:50:28.2481859Z -- Configuring done (0.6s) 2025-05-07T19:50:28.2535519Z -- Generating done (0.0s) 2025-05-07T19:50:28.2545714Z -- Build files have been written to: /__w/FBGEMM/FBGEMM/fbgemm_gpu/_cmake_test_compile/build 2025-05-07T19:50:28.2593798Z -- 2025-05-07T19:50:28.2594445Z ------- 2025-05-07T19:50:28.2595044Z ------------ 2025-05-07T19:50:28.2595670Z ----------------- 2025-05-07T19:50:28.2596286Z ---------------------- 2025-05-07T19:50:28.2596986Z --------------------------- 2025-05-07T19:50:28.2597715Z -------------------------------- 2025-05-07T19:50:28.2598564Z -- Trying 'Ninja' generator - success 2025-05-07T19:50:28.2599347Z -------------------------------------------------------------------------------- 2025-05-07T19:50:28.2599676Z 2025-05-07T19:50:28.2603478Z Configuring Project 2025-05-07T19:50:28.2603755Z Working directory: 2025-05-07T19:50:28.2604339Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.12/cmake-build 2025-05-07T19:50:28.2604792Z Command: 2025-05-07T19:50:28.2618317Z /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/cmake/data/bin/cmake /__w/FBGEMM/FBGEMM/fbgemm_gpu -G Ninja -DCMAKE_MAKE_PROGRAM:FILEPATH=/github/home/miniconda/envs/build_binary/bin/ninja --no-warn-unused-cli -DCMAKE_INSTALL_PREFIX:PATH=/__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.12/cmake-install -DPYTHON_VERSION_STRING:STRING=3.12.2 -DSKBUILD:INTERNAL=TRUE -DCMAKE_MODULE_PATH:PATH=/github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/skbuild/resources/cmake -DPYTHON_EXECUTABLE:PATH=/github/home/miniconda/envs/build_binary/bin/python -DPYTHON_INCLUDE_DIR:PATH=/github/home/miniconda/envs/build_binary/include/python3.12 -DPYTHON_LIBRARY:PATH=/github/home/miniconda/envs/build_binary/lib/libpython3.12.so -DPython_EXECUTABLE:PATH=/github/home/miniconda/envs/build_binary/bin/python -DPython_ROOT_DIR:PATH=/github/home/miniconda/envs/build_binary -DPython_FIND_REGISTRY:STRING=NEVER -DPython_INCLUDE_DIR:PATH=/github/home/miniconda/envs/build_binary/include/python3.12 -DPython_NumPy_INCLUDE_DIRS:PATH=/github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/numpy/_core/include -DPython3_EXECUTABLE:PATH=/github/home/miniconda/envs/build_binary/bin/python -DPython3_ROOT_DIR:PATH=/github/home/miniconda/envs/build_binary -DPython3_FIND_REGISTRY:STRING=NEVER -DPython3_INCLUDE_DIR:PATH=/github/home/miniconda/envs/build_binary/include/python3.12 -DPython3_NumPy_INCLUDE_DIRS:PATH=/github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/numpy/_core/include -DCMAKE_MAKE_PROGRAM:FILEPATH=/github/home/miniconda/envs/build_binary/bin/ninja -DCMAKE_PREFIX_PATH=/github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch -D_GLIBCXX_USE_CXX11_ABI=1 -DCMAKE_VERBOSE_MAKEFILE=ON -DCMAKE_EXPORT_COMPILE_COMMANDS=TRUE -DFBGEMM_BUILD_TARGET=default -DFBGEMM_BUILD_VARIANT=cuda -DNVML_LIB_PATH=/github/home/miniconda/envs/build_binary/lib/stubs/libnvidia-ml.so -DNCCL_INCLUDE_DIRS=/github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/nvidia/nccl/include -DNCCL_LIBRARIES=/github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/nvidia/nccl/lib/libnccl.so.2 -DCMAKE_C_COMPILER=/github/home/miniconda/envs/build_binary/bin/cc -DCMAKE_CXX_COMPILER=/github/home/miniconda/envs/build_binary/bin/c++ '-DCMAKE_C_FLAGS='"'"'-DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include'"'"'' '-DCMAKE_CXX_FLAGS='"'"'-DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include'"'"'' '-DTORCH_CUDA_ARCH_LIST=7.0;8.0;9.0;9.0a' -DCUDA_TOOLKIT_ROOT_DIR=/github/home/miniconda/envs/build_binary/targets/x86_64-linux -DCUDAToolkit_ROOT=/github/home/miniconda/envs/build_binary/targets/x86_64-linux -DCMAKE_CXX_STANDARD=20 '-DTORCH_CUDA_ARCH_LIST=7.0;8.0;9.0;9.0a' -DCUDA_TOOLKIT_ROOT_DIR=/github/home/miniconda/envs/build_binary/targets/x86_64-linux -DCUDAToolkit_ROOT=/github/home/miniconda/envs/build_binary/targets/x86_64-linux -DCMAKE_CXX_STANDARD=20 -DCMAKE_BUILD_TYPE:STRING=Release 2025-05-07T19:50:28.2631867Z 2025-05-07T19:50:28.3015734Z 2025-05-07T19:50:28.3015751Z 2025-05-07T19:50:28.3016263Z ================================================================================ 2025-05-07T19:50:28.3017355Z Default C compiler flags 2025-05-07T19:50:28.3018415Z (values may be overridden by CMAKE_CXX_STANDARD and CXX_STANDARD): 2025-05-07T19:50:28.3019351Z 2025-05-07T19:50:28.3021446Z -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include 2025-05-07T19:50:28.3022490Z ================================================================================ 2025-05-07T19:50:28.3022721Z 2025-05-07T19:50:28.3022725Z 2025-05-07T19:50:28.3022729Z 2025-05-07T19:50:28.3022841Z ================================================================================ 2025-05-07T19:50:28.3023194Z Default C++ compiler flags 2025-05-07T19:50:28.3023550Z (values may be overridden by CMAKE_CXX_STANDARD and CXX_STANDARD): 2025-05-07T19:50:28.3023880Z 2025-05-07T19:50:28.3024671Z -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include 2025-05-07T19:50:28.3025682Z ================================================================================ 2025-05-07T19:50:28.3025906Z 2025-05-07T19:50:28.3025910Z 2025-05-07T19:50:28.3025913Z 2025-05-07T19:50:28.3026027Z ================================================================================ 2025-05-07T19:50:28.3026352Z AVX2_FLAGS: 2025-05-07T19:50:28.3026474Z 2025-05-07T19:50:28.3026584Z -mavx2 2025-05-07T19:50:28.3026780Z -mf16c 2025-05-07T19:50:28.3026998Z -mfma 2025-05-07T19:50:28.3027194Z -fopenmp 2025-05-07T19:50:28.3027440Z ================================================================================ 2025-05-07T19:50:28.3027660Z 2025-05-07T19:50:28.3027668Z 2025-05-07T19:50:28.3027671Z 2025-05-07T19:50:28.3027784Z ================================================================================ 2025-05-07T19:50:28.3028109Z AVX512_FLAGS: 2025-05-07T19:50:28.3028235Z 2025-05-07T19:50:28.3028321Z -mavx2 2025-05-07T19:50:28.3028537Z -mf16c 2025-05-07T19:50:28.3028750Z -mfma 2025-05-07T19:50:28.3028943Z -mavx512f 2025-05-07T19:50:28.3029169Z -mavx512bw 2025-05-07T19:50:28.3029377Z -mavx512dq 2025-05-07T19:50:28.3029614Z -mavx512vl 2025-05-07T19:50:28.3029821Z -fopenmp 2025-05-07T19:50:28.3030081Z ================================================================================ 2025-05-07T19:50:28.3030302Z 2025-05-07T19:50:28.3030305Z 2025-05-07T19:50:28.3030309Z 2025-05-07T19:50:28.3030425Z ================================================================================ 2025-05-07T19:50:28.3030783Z The project is built using scikit-build 2025-05-07T19:50:28.3031125Z ================================================================================ 2025-05-07T19:50:28.3031491Z 2025-05-07T19:50:28.3031495Z 2025-05-07T19:50:28.3031498Z 2025-05-07T19:50:28.3031787Z ================================================================================ 2025-05-07T19:50:28.3032144Z Build Settings 2025-05-07T19:50:28.3032283Z 2025-05-07T19:50:28.3032399Z FBGEMM_BUILD_TARGET : default 2025-05-07T19:50:28.3032743Z FBGEMM_BUILD_VARIANT : cuda 2025-05-07T19:50:28.3032937Z 2025-05-07T19:50:28.3033082Z NVCC_VERBOSE : 2025-05-07T19:50:28.3033360Z CUDNN_INCLUDE_DIR : 2025-05-07T19:50:28.3033653Z CUDNN_LIBRARY : 2025-05-07T19:50:28.3034097Z NVML_LIB_PATH : /github/home/miniconda/envs/build_binary/lib/stubs/libnvidia-ml.so 2025-05-07T19:50:28.3034622Z TORCH_CUDA_ARCH_LIST : 7.0 2025-05-07T19:50:28.3034994Z Not searching for unused variables given on the command line. 2025-05-07T19:50:28.3035429Z 8.0 2025-05-07T19:50:28.3035633Z 9.0 2025-05-07T19:50:28.3035855Z 9.0a 2025-05-07T19:50:28.3035973Z 2025-05-07T19:50:28.3036099Z HIP_ROOT_DIR : 2025-05-07T19:50:28.3036523Z HIPCC_VERBOSE : 2025-05-07T19:50:28.3036827Z AMDGPU_TARGETS : 2025-05-07T19:50:28.3037102Z PYTORCH_ROCM_ARCH : 2025-05-07T19:50:28.3037418Z ================================================================================ 2025-05-07T19:50:28.3037660Z 2025-05-07T19:50:28.4597995Z -- The CXX compiler identification is Clang 16.0.6 2025-05-07T19:50:28.5354801Z -- The C compiler identification is Clang 16.0.6 2025-05-07T19:50:29.6130574Z -- The CUDA compiler identification is NVIDIA 12.6.85 with host compiler Clang 16.0.6 2025-05-07T19:50:29.6237144Z -- Detecting CXX compiler ABI info 2025-05-07T19:50:29.7768146Z -- Detecting CXX compiler ABI info - done 2025-05-07T19:50:29.7897701Z -- Check for working CXX compiler: /github/home/miniconda/envs/build_binary/bin/c++ - skipped 2025-05-07T19:50:29.7898355Z -- Detecting CXX compile features 2025-05-07T19:50:29.7905469Z -- Detecting CXX compile features - done 2025-05-07T19:50:29.7982337Z -- Detecting C compiler ABI info 2025-05-07T19:50:29.9246117Z -- Detecting C compiler ABI info - done 2025-05-07T19:50:29.9371297Z -- Check for working C compiler: /github/home/miniconda/envs/build_binary/bin/cc - skipped 2025-05-07T19:50:29.9372335Z -- Detecting C compile features 2025-05-07T19:50:29.9376936Z -- Detecting C compile features - done 2025-05-07T19:50:29.9427488Z -- Detecting CUDA compiler ABI info 2025-05-07T19:50:30.9741838Z -- Detecting CUDA compiler ABI info - done 2025-05-07T19:50:31.0269717Z -- Check for working CUDA compiler: /github/home/miniconda/envs/build_binary/bin/nvcc - skipped 2025-05-07T19:50:31.0302945Z -- Detecting CUDA compile features 2025-05-07T19:50:31.0303786Z -- Detecting CUDA compile features - done 2025-05-07T19:50:31.0329537Z -- Performing Test C_HAS_AVX_1 2025-05-07T19:50:31.3216814Z -- Performing Test C_HAS_AVX_1 - Failed 2025-05-07T19:50:31.3217440Z -- Performing Test C_HAS_AVX_2 2025-05-07T19:50:31.6605260Z -- Performing Test C_HAS_AVX_2 - Success 2025-05-07T19:50:31.6606305Z -- Performing Test C_HAS_AVX2_1 2025-05-07T19:50:31.9500568Z -- Performing Test C_HAS_AVX2_1 - Failed 2025-05-07T19:50:31.9500931Z -- Performing Test C_HAS_AVX2_2 2025-05-07T19:50:32.2855262Z -- Performing Test C_HAS_AVX2_2 - Success 2025-05-07T19:50:32.2856273Z -- Performing Test C_HAS_AVX512_1 2025-05-07T19:50:32.5761902Z -- Performing Test C_HAS_AVX512_1 - Failed 2025-05-07T19:50:32.5762935Z -- Performing Test C_HAS_AVX512_2 2025-05-07T19:50:32.9149842Z -- Performing Test C_HAS_AVX512_2 - Success 2025-05-07T19:50:32.9150906Z -- Performing Test CXX_HAS_AVX_1 2025-05-07T19:50:33.2037660Z -- Performing Test CXX_HAS_AVX_1 - Failed 2025-05-07T19:50:33.2038718Z -- Performing Test CXX_HAS_AVX_2 2025-05-07T19:50:33.5412232Z -- Performing Test CXX_HAS_AVX_2 - Success 2025-05-07T19:50:33.5413253Z -- Performing Test CXX_HAS_AVX2_1 2025-05-07T19:50:33.8304762Z -- Performing Test CXX_HAS_AVX2_1 - Failed 2025-05-07T19:50:33.8305775Z -- Performing Test CXX_HAS_AVX2_2 2025-05-07T19:50:34.1632623Z -- Performing Test CXX_HAS_AVX2_2 - Success 2025-05-07T19:50:34.1633687Z -- Performing Test CXX_HAS_AVX512_1 2025-05-07T19:50:34.4521930Z -- Performing Test CXX_HAS_AVX512_1 - Failed 2025-05-07T19:50:34.4522342Z -- Performing Test CXX_HAS_AVX512_2 2025-05-07T19:50:34.7979611Z -- Performing Test CXX_HAS_AVX512_2 - Success 2025-05-07T19:50:34.8152153Z -- Found CUDA: /github/home/miniconda/envs/build_binary/targets/x86_64-linux (found version "12.6") 2025-05-07T19:50:34.8187975Z -- Found CUDAToolkit: /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include (found version "12.6.85") 2025-05-07T19:50:34.8251819Z -- Performing Test CMAKE_HAVE_LIBC_PTHREAD 2025-05-07T19:50:34.9535843Z -- Performing Test CMAKE_HAVE_LIBC_PTHREAD - Success 2025-05-07T19:50:34.9546608Z -- Found Threads: TRUE 2025-05-07T19:50:34.9554430Z CMake Warning at /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/share/cmake/Caffe2/FindCUDAToolkit.cmake:957 (message): 2025-05-07T19:50:34.9555370Z Could not find librt library, needed by CUDA::cudart_static 2025-05-07T19:50:34.9556142Z Call Stack (most recent call first): 2025-05-07T19:50:34.9556910Z /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/share/cmake/Caffe2/public/cuda.cmake:59 (find_package) 2025-05-07T19:50:34.9558046Z /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/share/cmake/Caffe2/Caffe2Config.cmake:86 (include) 2025-05-07T19:50:34.9559538Z /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/share/cmake/Torch/TorchConfig.cmake:68 (find_package) 2025-05-07T19:50:34.9560348Z /__w/FBGEMM/FBGEMM/cmake/modules/PyTorchSetup.cmake:14 (find_package) 2025-05-07T19:50:34.9560809Z CMakeLists.txt:112 (include) 2025-05-07T19:50:34.9560988Z 2025-05-07T19:50:34.9560992Z 2025-05-07T19:50:35.0821301Z -- PyTorch: CUDA detected: 12.6 2025-05-07T19:50:35.0821845Z -- PyTorch: CUDA nvcc is: /github/home/miniconda/envs/build_binary/targets/x86_64-linux/bin/nvcc 2025-05-07T19:50:35.0822592Z -- PyTorch: CUDA toolkit directory: /github/home/miniconda/envs/build_binary/targets/x86_64-linux 2025-05-07T19:50:35.2583157Z -- PyTorch: Header version is: 12.6 2025-05-07T19:50:35.3562375Z -- Found Python: /github/home/miniconda/envs/build_binary/bin/python (found version "3.12.2") found components: Interpreter 2025-05-07T19:50:35.3576183Z CMake Warning at /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/share/cmake/Caffe2/public/cuda.cmake:140 (message): 2025-05-07T19:50:35.3578713Z -- USE_CUDNN is set to 0. Compiling without cuDNN support 2025-05-07T19:50:35.3580119Z -- USE_CUSPARSELT is set to 0. Compiling without cuSPARSELt support 2025-05-07T19:50:35.3580844Z -- USE_CUDSS is set to 0. Compiling without cuDSS support 2025-05-07T19:50:35.3581271Z -- USE_CUFILE is set to 0. Compiling without cuFile support 2025-05-07T19:50:35.3581715Z Failed to compute shorthash for libnvrtc.so 2025-05-07T19:50:35.3582061Z Call Stack (most recent call first): 2025-05-07T19:50:35.3582784Z /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/share/cmake/Caffe2/Caffe2Config.cmake:86 (include) 2025-05-07T19:50:35.3584016Z /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/share/cmake/Torch/TorchConfig.cmake:68 (find_package) 2025-05-07T19:50:35.3584825Z /__w/FBGEMM/FBGEMM/cmake/modules/PyTorchSetup.cmake:14 (find_package) 2025-05-07T19:50:35.3585276Z CMakeLists.txt:112 (include) 2025-05-07T19:50:35.3585458Z 2025-05-07T19:50:35.3585466Z 2025-05-07T19:50:35.3586001Z -- Added CUDA NVCC flags for: -gencode;arch=compute_70,code=sm_70;-gencode;arch=compute_80,code=sm_80;-gencode;arch=compute_90,code=sm_90;-gencode;arch=compute_90a,code=sm_90a 2025-05-07T19:50:35.3914764Z CMake Warning at /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/share/cmake/Torch/TorchConfig.cmake:22 (message): 2025-05-07T19:50:35.3917285Z static library kineto_LIBRARY-NOTFOUND not found. 2025-05-07T19:50:35.3918339Z Call Stack (most recent call first): 2025-05-07T19:50:35.3919796Z /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/share/cmake/Torch/TorchConfig.cmake:125 (append_torchlib_if_found) 2025-05-07T19:50:35.3920712Z /__w/FBGEMM/FBGEMM/cmake/modules/PyTorchSetup.cmake:14 (find_package) 2025-05-07T19:50:35.3921309Z CMakeLists.txt:112 (include) 2025-05-07T19:50:35.3921490Z 2025-05-07T19:50:35.3921495Z 2025-05-07T19:50:35.3922076Z -- Found Torch: /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/lib/libtorch.so 2025-05-07T19:50:35.3922592Z 2025-05-07T19:50:35.3922596Z 2025-05-07T19:50:35.3922719Z ================================================================================ 2025-05-07T19:50:35.3923080Z PyTorch Flags: 2025-05-07T19:50:35.3923340Z 2025-05-07T19:50:35.3923552Z TORCH_INCLUDE_DIRS: 2025-05-07T19:50:35.3924005Z /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include 2025-05-07T19:50:35.3924790Z /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include/torch/csrc/api/include 2025-05-07T19:50:35.3926811Z 2025-05-07T19:50:35.3927023Z TORCH_LIBRARIES: 2025-05-07T19:50:35.3927279Z torch 2025-05-07T19:50:35.3927493Z torch_library 2025-05-07T19:50:35.3927988Z /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/lib/libc10.so 2025-05-07T19:50:35.3928797Z /github/home/miniconda/envs/build_binary/targets/x86_64-linux/lib/libnvrtc.so 2025-05-07T19:50:35.3929573Z /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/lib/libc10_cuda.so 2025-05-07T19:50:35.3930118Z 2025-05-07T19:50:35.3930325Z TORCH_CUDA_OPTIONS: 2025-05-07T19:50:35.3930609Z --expt-relaxed-constexpr 2025-05-07T19:50:35.3930883Z -D__CUDA_NO_HALF_OPERATORS__ 2025-05-07T19:50:35.3931201Z -D__CUDA_NO_BFLOAT16_CONVERSIONS__ 2025-05-07T19:50:35.3931503Z -D__CUDA_NO_HALF2_OPERATORS__ 2025-05-07T19:50:35.3931827Z ================================================================================ 2025-05-07T19:50:35.3932052Z 2025-05-07T19:50:35.3932061Z 2025-05-07T19:50:35.3932064Z 2025-05-07T19:50:35.3932213Z ================================================================================ 2025-05-07T19:50:35.3932528Z NCCL Flags 2025-05-07T19:50:35.3932684Z 2025-05-07T19:50:35.3933053Z NCCL_INCLUDE_DIRS=/github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/nvidia/nccl/include 2025-05-07T19:50:35.3933926Z NCCL_LIBRARIES=/github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/nvidia/nccl/lib/libnccl.so.2 2025-05-07T19:50:35.3934574Z ================================================================================ 2025-05-07T19:50:35.3934799Z 2025-05-07T19:50:35.3934802Z 2025-05-07T19:50:35.3934806Z 2025-05-07T19:50:35.3934951Z ================================================================================ 2025-05-07T19:50:35.3935267Z CUDA Driver Path 2025-05-07T19:50:35.3935436Z 2025-05-07T19:50:35.3935787Z CUDA_DRIVER_LIBRARIES=/github/home/miniconda/envs/build_binary/targets/x86_64-linux/lib/stubs/libcuda.so 2025-05-07T19:50:35.3936357Z ================================================================================ 2025-05-07T19:50:35.3936610Z 2025-05-07T19:50:35.3936895Z -- Found NVML_LIB_PATH: /github/home/miniconda/envs/build_binary/lib/stubs/libnvidia-ml.so 2025-05-07T19:50:35.3948668Z 2025-05-07T19:50:35.3948794Z 2025-05-07T19:50:35.3949255Z ================================================================================ 2025-05-07T19:50:35.3950386Z GPU CPP Library Target: asmjit (SHARED) 2025-05-07T19:50:35.3951521Z 2025-05-07T19:50:35.3952093Z CPU_SRCS: 2025-05-07T19:50:35.3952459Z 2025-05-07T19:50:35.3952688Z 2025-05-07T19:50:35.3953233Z GPU_SRCS: 2025-05-07T19:50:35.3953560Z 2025-05-07T19:50:35.3953771Z 2025-05-07T19:50:35.3954343Z CUDA_SPECIFIC_SRCS: 2025-05-07T19:50:35.3954754Z 2025-05-07T19:50:35.3954969Z 2025-05-07T19:50:35.3955522Z HIP_SPECIFIC_SRCS: 2025-05-07T19:50:35.3955924Z 2025-05-07T19:50:35.3956133Z 2025-05-07T19:50:35.3956669Z OTHER_SRCS: 2025-05-07T19:50:35.3957797Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src/asmjit/arm/a64assembler.cpp 2025-05-07T19:50:35.3959608Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src/asmjit/arm/a64builder.cpp 2025-05-07T19:50:35.3961394Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src/asmjit/arm/a64compiler.cpp 2025-05-07T19:50:35.3963201Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src/asmjit/arm/a64emithelper.cpp 2025-05-07T19:50:35.3963903Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src/asmjit/arm/a64formatter.cpp 2025-05-07T19:50:35.3964516Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src/asmjit/arm/a64func.cpp 2025-05-07T19:50:35.3965331Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src/asmjit/arm/a64instapi.cpp 2025-05-07T19:50:35.3965968Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src/asmjit/arm/a64instdb.cpp 2025-05-07T19:50:35.3966574Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src/asmjit/arm/a64operand.cpp 2025-05-07T19:50:35.3967204Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src/asmjit/arm/a64rapass.cpp 2025-05-07T19:50:35.3968069Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src/asmjit/arm/armformatter.cpp 2025-05-07T19:50:35.3968701Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src/asmjit/core/archtraits.cpp 2025-05-07T19:50:35.3969348Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src/asmjit/core/assembler.cpp 2025-05-07T19:50:35.3969957Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src/asmjit/core/builder.cpp 2025-05-07T19:50:35.3970692Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src/asmjit/core/codeholder.cpp 2025-05-07T19:50:35.3971355Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src/asmjit/core/codewriter.cpp 2025-05-07T19:50:35.3971971Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src/asmjit/core/compiler.cpp 2025-05-07T19:50:35.3972727Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src/asmjit/core/constpool.cpp 2025-05-07T19:50:35.3973321Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src/asmjit/core/cpuinfo.cpp 2025-05-07T19:50:35.3973949Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src/asmjit/core/emithelper.cpp 2025-05-07T19:50:35.3974538Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src/asmjit/core/emitter.cpp 2025-05-07T19:50:35.3975153Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src/asmjit/core/emitterutils.cpp 2025-05-07T19:50:35.3975775Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src/asmjit/core/environment.cpp 2025-05-07T19:50:35.3976393Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src/asmjit/core/errorhandler.cpp 2025-05-07T19:50:35.3977024Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src/asmjit/core/formatter.cpp 2025-05-07T19:50:35.3977599Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src/asmjit/core/func.cpp 2025-05-07T19:50:35.3978220Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src/asmjit/core/funcargscontext.cpp 2025-05-07T19:50:35.3978833Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src/asmjit/core/globals.cpp 2025-05-07T19:50:35.3979430Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src/asmjit/core/inst.cpp 2025-05-07T19:50:35.3980022Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src/asmjit/core/instdb.cpp 2025-05-07T19:50:35.3980615Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src/asmjit/core/jitallocator.cpp 2025-05-07T19:50:35.3981246Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src/asmjit/core/jitruntime.cpp 2025-05-07T19:50:35.3981836Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src/asmjit/core/logger.cpp 2025-05-07T19:50:35.3982432Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src/asmjit/core/operand.cpp 2025-05-07T19:50:35.3983026Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src/asmjit/core/osutils.cpp 2025-05-07T19:50:35.3983601Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src/asmjit/core/ralocal.cpp 2025-05-07T19:50:35.3984196Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src/asmjit/core/rapass.cpp 2025-05-07T19:50:35.3984766Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src/asmjit/core/rastack.cpp 2025-05-07T19:50:35.3985359Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src/asmjit/core/string.cpp 2025-05-07T19:50:35.3985933Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src/asmjit/core/support.cpp 2025-05-07T19:50:35.3986527Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src/asmjit/core/target.cpp 2025-05-07T19:50:35.3987116Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src/asmjit/core/type.cpp 2025-05-07T19:50:35.3987689Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src/asmjit/core/virtmem.cpp 2025-05-07T19:50:35.3988275Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src/asmjit/core/zone.cpp 2025-05-07T19:50:35.3988845Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src/asmjit/core/zonehash.cpp 2025-05-07T19:50:35.3989457Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src/asmjit/core/zonelist.cpp 2025-05-07T19:50:35.3990049Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src/asmjit/core/zonestack.cpp 2025-05-07T19:50:35.3990668Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src/asmjit/core/zonetree.cpp 2025-05-07T19:50:35.3991499Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src/asmjit/core/zonevector.cpp 2025-05-07T19:50:35.3992302Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src/asmjit/x86/x86assembler.cpp 2025-05-07T19:50:35.3992944Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src/asmjit/x86/x86builder.cpp 2025-05-07T19:50:35.3993558Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src/asmjit/x86/x86compiler.cpp 2025-05-07T19:50:35.3994296Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src/asmjit/x86/x86emithelper.cpp 2025-05-07T19:50:35.3994962Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src/asmjit/x86/x86formatter.cpp 2025-05-07T19:50:35.3995569Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src/asmjit/x86/x86func.cpp 2025-05-07T19:50:35.3996198Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src/asmjit/x86/x86instapi.cpp 2025-05-07T19:50:35.3996807Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src/asmjit/x86/x86instdb.cpp 2025-05-07T19:50:35.3997435Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src/asmjit/x86/x86operand.cpp 2025-05-07T19:50:35.3998163Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src/asmjit/x86/x86rapass.cpp 2025-05-07T19:50:35.3998624Z 2025-05-07T19:50:35.3998823Z CC_FLAGS: 2025-05-07T19:50:35.3998968Z 2025-05-07T19:50:35.3999052Z 2025-05-07T19:50:35.3999272Z NVCC_FLAGS: 2025-05-07T19:50:35.3999402Z 2025-05-07T19:50:35.3999490Z 2025-05-07T19:50:35.3999704Z HIPCC_FLAGS: 2025-05-07T19:50:35.3999841Z 2025-05-07T19:50:35.3999928Z 2025-05-07T19:50:35.4000152Z INCLUDE_DIRS: 2025-05-07T19:50:35.4000402Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../include 2025-05-07T19:50:35.4000762Z /__w/FBGEMM/FBGEMM/fbgemm_gpu 2025-05-07T19:50:35.4001060Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include 2025-05-07T19:50:35.4001414Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../include 2025-05-07T19:50:35.4001944Z /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include 2025-05-07T19:50:35.4002739Z /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include/torch/csrc/api/include 2025-05-07T19:50:35.4003437Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src 2025-05-07T19:50:35.4003874Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include 2025-05-07T19:50:35.4004350Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include 2025-05-07T19:50:35.4004838Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include 2025-05-07T19:50:35.4005406Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include 2025-05-07T19:50:35.4005910Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include 2025-05-07T19:50:35.4006484Z /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/nvidia/nccl/include 2025-05-07T19:50:35.4007023Z 2025-05-07T19:50:35.4007244Z Selected Source Files: 2025-05-07T19:50:35.4007438Z 2025-05-07T19:50:35.4007531Z 2025-05-07T19:50:35.4007754Z HIPified Source Files: 2025-05-07T19:50:35.4007944Z 2025-05-07T19:50:35.4008027Z 2025-05-07T19:50:35.4008249Z Library Dependencies: 2025-05-07T19:50:35.4008539Z torch 2025-05-07T19:50:35.4008755Z torch_library 2025-05-07T19:50:35.4009227Z /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/lib/libc10.so 2025-05-07T19:50:35.4009945Z /github/home/miniconda/envs/build_binary/targets/x86_64-linux/lib/libnvrtc.so 2025-05-07T19:50:35.4010638Z /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/lib/libc10_cuda.so 2025-05-07T19:50:35.4011459Z /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/nvidia/nccl/lib/libnccl.so.2 2025-05-07T19:50:35.4012200Z /github/home/miniconda/envs/build_binary/targets/x86_64-linux/lib/stubs/libcuda.so 2025-05-07T19:50:35.4012823Z /github/home/miniconda/envs/build_binary/lib/stubs/libnvidia-ml.so 2025-05-07T19:50:35.4013251Z 2025-05-07T19:50:35.4013448Z Output Library: 2025-05-07T19:50:35.4013699Z asmjit 2025-05-07T19:50:35.4013896Z 2025-05-07T19:50:35.4014127Z Destination Directory: 2025-05-07T19:50:35.4014376Z fbgemm_gpu 2025-05-07T19:50:35.4014641Z ================================================================================ 2025-05-07T19:50:35.4014967Z 2025-05-07T19:50:35.4014972Z 2025-05-07T19:50:35.4014975Z 2025-05-07T19:50:35.4015094Z ================================================================================ 2025-05-07T19:50:35.4015468Z GPU CPP Library Target: fbgemm (SHARED) 2025-05-07T19:50:35.4015797Z 2025-05-07T19:50:35.4016006Z CPU_SRCS: 2025-05-07T19:50:35.4016128Z 2025-05-07T19:50:35.4016241Z 2025-05-07T19:50:35.4016505Z GPU_SRCS: 2025-05-07T19:50:35.4016633Z 2025-05-07T19:50:35.4016745Z 2025-05-07T19:50:35.4016954Z CUDA_SPECIFIC_SRCS: 2025-05-07T19:50:35.4017132Z 2025-05-07T19:50:35.4017216Z 2025-05-07T19:50:35.4017413Z HIP_SPECIFIC_SRCS: 2025-05-07T19:50:35.4017586Z 2025-05-07T19:50:35.4017671Z 2025-05-07T19:50:35.4017873Z OTHER_SRCS: 2025-05-07T19:50:35.4018178Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../src/EmbeddingSpMDM.cc 2025-05-07T19:50:35.4018655Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../src/EmbeddingSpMDMAutovec.cc 2025-05-07T19:50:35.4019160Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../src/EmbeddingSpMDMNBit.cc 2025-05-07T19:50:35.4019607Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../src/QuantUtils.cc 2025-05-07T19:50:35.4020059Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../src/RefImplementations.cc 2025-05-07T19:50:35.4020548Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../src/RowWiseSparseAdagradFused.cc 2025-05-07T19:50:35.4021045Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../src/SparseAdagrad.cc 2025-05-07T19:50:35.4021434Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../src/Utils.cc 2025-05-07T19:50:35.4021866Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../src/EmbeddingSpMDMAvx2.cc 2025-05-07T19:50:35.4022296Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../src/QuantUtilsAvx2.cc 2025-05-07T19:50:35.4022742Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../src/EmbeddingSpMDMAvx2.cc 2025-05-07T19:50:35.4023190Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../src/QuantUtilsAvx2.cc 2025-05-07T19:50:35.4023619Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../src/EmbeddingSpMDMAvx512.cc 2025-05-07T19:50:35.4024011Z 2025-05-07T19:50:35.4024209Z CC_FLAGS: 2025-05-07T19:50:35.4024330Z 2025-05-07T19:50:35.4024443Z 2025-05-07T19:50:35.4024637Z NVCC_FLAGS: 2025-05-07T19:50:35.4024786Z 2025-05-07T19:50:35.4024868Z 2025-05-07T19:50:35.4025067Z HIPCC_FLAGS: 2025-05-07T19:50:35.4025223Z 2025-05-07T19:50:35.4025308Z 2025-05-07T19:50:35.4025505Z INCLUDE_DIRS: 2025-05-07T19:50:35.4025770Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../include 2025-05-07T19:50:35.4026123Z /__w/FBGEMM/FBGEMM/fbgemm_gpu 2025-05-07T19:50:35.4026414Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include 2025-05-07T19:50:35.4026765Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../include 2025-05-07T19:50:35.4027261Z /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include 2025-05-07T19:50:35.4028055Z /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include/torch/csrc/api/include 2025-05-07T19:50:35.4028704Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src 2025-05-07T19:50:35.4029144Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include 2025-05-07T19:50:35.4029576Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include 2025-05-07T19:50:35.4030081Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include 2025-05-07T19:50:35.4030623Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include 2025-05-07T19:50:35.4031084Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include 2025-05-07T19:50:35.4031958Z /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/nvidia/nccl/include 2025-05-07T19:50:35.4032482Z 2025-05-07T19:50:35.4032823Z Selected Source Files: 2025-05-07T19:50:35.4032985Z 2025-05-07T19:50:35.4033070Z 2025-05-07T19:50:35.4033305Z HIPified Source Files: 2025-05-07T19:50:35.4033475Z 2025-05-07T19:50:35.4033588Z 2025-05-07T19:50:35.4033810Z Library Dependencies: 2025-05-07T19:50:35.4034089Z torch 2025-05-07T19:50:35.4034302Z torch_library 2025-05-07T19:50:35.4034787Z /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/lib/libc10.so 2025-05-07T19:50:35.4035488Z /github/home/miniconda/envs/build_binary/targets/x86_64-linux/lib/libnvrtc.so 2025-05-07T19:50:35.4036321Z /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/lib/libc10_cuda.so 2025-05-07T19:50:35.4037136Z /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/nvidia/nccl/lib/libnccl.so.2 2025-05-07T19:50:35.4037905Z /github/home/miniconda/envs/build_binary/targets/x86_64-linux/lib/stubs/libcuda.so 2025-05-07T19:50:35.4038412Z asmjit 2025-05-07T19:50:35.4038825Z /github/home/miniconda/envs/build_binary/lib/stubs/libnvidia-ml.so 2025-05-07T19:50:35.4039262Z 2025-05-07T19:50:35.4039467Z Output Library: 2025-05-07T19:50:35.4039744Z fbgemm 2025-05-07T19:50:35.4039957Z 2025-05-07T19:50:35.4040201Z Destination Directory: 2025-05-07T19:50:35.4040462Z fbgemm_gpu 2025-05-07T19:50:35.4040734Z ================================================================================ 2025-05-07T19:50:35.4040975Z 2025-05-07T19:50:35.4040979Z 2025-05-07T19:50:35.4040983Z 2025-05-07T19:50:35.4041135Z ================================================================================ 2025-05-07T19:50:35.4041498Z Running code generation script ... 2025-05-07T19:50:35.4042299Z /github/home/miniconda/envs/build_binary/bin/python /__w/FBGEMM/FBGEMM/fbgemm_gpu/codegen/genscript/generate_backward_split.py --opensource 2025-05-07T19:50:35.4043094Z ================================================================================ 2025-05-07T19:50:35.4043367Z 2025-05-07T19:50:35.9537081Z [ARGS PARSE] Parsed arguments: Namespace(install_dir='.', is_fbcode=False, is_rocm=False) 2025-05-07T19:50:35.9538076Z [GENERAATE BACKWARD SPLIT]: ['/__w/FBGEMM/FBGEMM/fbgemm_gpu/codegen/genscript/generate_backward_split.py', '--opensource'] 2025-05-07T19:50:35.9538857Z Written: gen_embedding_backward_dense_split_weighted_vbe_cuda.cu 2025-05-07T19:50:35.9539352Z Written: gen_embedding_backward_dense_split_weighted_cuda.cu 2025-05-07T19:50:35.9539976Z Written: gen_embedding_backward_dense_split_unweighted_nobag_cuda.cu 2025-05-07T19:50:35.9540479Z Written: gen_embedding_backward_dense_split_unweighted_vbe_cuda.cu 2025-05-07T19:50:35.9540981Z Written: gen_embedding_backward_dense_split_unweighted_cuda.cu 2025-05-07T19:50:35.9541453Z Written: gen_embedding_backward_dense_split_weighted_vbe_meta.cpp 2025-05-07T19:50:35.9541940Z Written: gen_embedding_backward_dense_split_weighted_meta.cpp 2025-05-07T19:50:35.9542445Z Written: gen_embedding_backward_dense_split_unweighted_nobag_meta.cpp 2025-05-07T19:50:35.9542954Z Written: gen_embedding_backward_dense_split_unweighted_vbe_meta.cpp 2025-05-07T19:50:35.9543458Z Written: gen_embedding_backward_dense_split_unweighted_meta.cpp 2025-05-07T19:50:35.9544142Z Written: gen_embedding_backward_dense_split_weighted_vbe_kernel_cta.cu 2025-05-07T19:50:35.9544803Z Written: gen_embedding_backward_dense_split_weighted_kernel_cta.cu 2025-05-07T19:50:35.9545319Z Written: gen_embedding_backward_dense_split_unweighted_nobag_kernel_cta.cu 2025-05-07T19:50:35.9545891Z Written: gen_embedding_backward_dense_split_unweighted_vbe_kernel_cta.cu 2025-05-07T19:50:35.9546436Z Written: gen_embedding_backward_dense_split_unweighted_kernel_cta.cu 2025-05-07T19:50:35.9546957Z Written: gen_embedding_backward_dense_split_weighted_vbe_kernel_warp.cu 2025-05-07T19:50:35.9547498Z Written: gen_embedding_backward_dense_split_weighted_kernel_warp.cu 2025-05-07T19:50:35.9548018Z Written: gen_embedding_backward_dense_split_unweighted_nobag_kernel_warp.cu 2025-05-07T19:50:35.9548595Z Written: gen_embedding_backward_dense_split_unweighted_vbe_kernel_warp.cu 2025-05-07T19:50:35.9549128Z Written: gen_embedding_backward_dense_split_unweighted_kernel_warp.cu 2025-05-07T19:50:35.9549641Z Written: gen_embedding_optimizer_dense_split_device_kernel.cuh 2025-05-07T19:50:35.9550081Z Written: gen_embedding_backward_split_dense.cpp 2025-05-07T19:50:35.9550456Z Written: gen_embedding_backward_dense_split_cpu.cpp 2025-05-07T19:50:35.9550898Z Written: gen_embedding_backward_adagrad_split_weighted_cuda.cu 2025-05-07T19:50:35.9551520Z Written: gen_embedding_backward_adagrad_split_unweighted_nobag_cuda.cu 2025-05-07T19:50:35.9552523Z Written: gen_embedding_backward_adagrad_split_unweighted_cuda.cu 2025-05-07T19:50:35.9553037Z Written: gen_embedding_backward_adagrad_split_weighted_meta.cpp 2025-05-07T19:50:35.9553603Z Written: gen_embedding_backward_adagrad_split_unweighted_nobag_meta.cpp 2025-05-07T19:50:35.9554182Z Written: gen_embedding_backward_adagrad_split_unweighted_meta.cpp 2025-05-07T19:50:35.9554713Z Written: gen_embedding_backward_adagrad_split_weighted_kernel_cta.cu 2025-05-07T19:50:35.9555480Z Written: gen_embedding_backward_adagrad_split_unweighted_nobag_kernel_cta.cu 2025-05-07T19:50:35.9556078Z Written: gen_embedding_backward_adagrad_split_unweighted_kernel_cta.cu 2025-05-07T19:50:35.9556669Z Written: gen_embedding_backward_adagrad_split_weighted_kernel_warp.cu 2025-05-07T19:50:35.9557248Z Written: gen_embedding_backward_adagrad_split_unweighted_nobag_kernel_warp.cu 2025-05-07T19:50:35.9557865Z Written: gen_embedding_backward_adagrad_split_unweighted_kernel_warp.cu 2025-05-07T19:50:35.9558441Z Written: gen_embedding_optimizer_adagrad_split_device_kernel.cuh 2025-05-07T19:50:35.9558910Z Written: gen_embedding_backward_split_adagrad.cpp 2025-05-07T19:50:35.9559356Z Written: gen_embedding_split_adagrad_pt2_autograd.cpp 2025-05-07T19:50:35.9559831Z Written: gen_embedding_backward_split_adagrad_pt2_cuda_wrapper.cpp 2025-05-07T19:50:35.9560294Z Written: lookup_adagrad.py 2025-05-07T19:50:35.9560633Z Written: gen_embedding_backward_adagrad_split_cpu.cpp 2025-05-07T19:50:35.9561091Z Written: gen_embedding_backward_split_adagrad_cpu.cpp 2025-05-07T19:50:35.9561559Z Written: gen_embedding_backward_split_adagrad_pt2_cpu_wrapper.cpp 2025-05-07T19:50:35.9562093Z Written: gen_embedding_backward_adam_split_weighted_vbe_cuda.cu 2025-05-07T19:50:35.9562612Z Written: gen_embedding_backward_adam_split_weighted_cuda.cu 2025-05-07T19:50:35.9563114Z Written: gen_embedding_backward_adam_split_unweighted_nobag_cuda.cu 2025-05-07T19:50:35.9563660Z Written: gen_embedding_backward_adam_split_unweighted_vbe_cuda.cu 2025-05-07T19:50:35.9564163Z Written: gen_embedding_backward_adam_split_unweighted_cuda.cu 2025-05-07T19:50:35.9564893Z Written: gen_embedding_backward_adam_split_weighted_vbe_meta.cpp 2025-05-07T19:50:35.9565388Z Written: gen_embedding_backward_adam_split_weighted_meta.cpp 2025-05-07T19:50:35.9565926Z Written: gen_embedding_backward_adam_split_unweighted_nobag_meta.cpp 2025-05-07T19:50:35.9566483Z Written: gen_embedding_backward_adam_split_unweighted_vbe_meta.cpp 2025-05-07T19:50:35.9566997Z Written: gen_embedding_backward_adam_split_unweighted_meta.cpp 2025-05-07T19:50:35.9567540Z Written: gen_embedding_backward_adam_split_weighted_vbe_kernel_cta.cu 2025-05-07T19:50:35.9568073Z Written: gen_embedding_backward_adam_split_weighted_kernel_cta.cu 2025-05-07T19:50:35.9568643Z Written: gen_embedding_backward_adam_split_unweighted_nobag_kernel_cta.cu 2025-05-07T19:50:35.9569222Z Written: gen_embedding_backward_adam_split_unweighted_vbe_kernel_cta.cu 2025-05-07T19:50:35.9569795Z Written: gen_embedding_backward_adam_split_unweighted_kernel_cta.cu 2025-05-07T19:50:35.9570363Z Written: gen_embedding_backward_adam_split_weighted_vbe_kernel_warp.cu 2025-05-07T19:50:35.9570899Z Written: gen_embedding_backward_adam_split_weighted_kernel_warp.cu 2025-05-07T19:50:35.9571472Z Written: gen_embedding_backward_adam_split_unweighted_nobag_kernel_warp.cu 2025-05-07T19:50:35.9572053Z Written: gen_embedding_backward_adam_split_unweighted_vbe_kernel_warp.cu 2025-05-07T19:50:35.9572642Z Written: gen_embedding_backward_adam_split_unweighted_kernel_warp.cu 2025-05-07T19:50:35.9573168Z Written: gen_embedding_optimizer_adam_split_device_kernel.cuh 2025-05-07T19:50:35.9573639Z Written: gen_embedding_backward_split_adam.cpp 2025-05-07T19:50:35.9574067Z Written: gen_embedding_split_adam_pt2_autograd.cpp 2025-05-07T19:50:35.9574531Z Written: gen_embedding_backward_split_adam_pt2_cuda_wrapper.cpp 2025-05-07T19:50:35.9574976Z Written: lookup_adam.py 2025-05-07T19:50:35.9575287Z Written: gen_embedding_backward_split_adam_cpu.cpp 2025-05-07T19:50:35.9575763Z Written: gen_embedding_backward_split_adam_pt2_cpu_wrapper.cpp 2025-05-07T19:50:35.9576534Z Written: gen_embedding_backward_lamb_split_weighted_cuda.cu 2025-05-07T19:50:35.9577149Z Written: gen_embedding_backward_lamb_split_unweighted_nobag_cuda.cu 2025-05-07T19:50:35.9577660Z Written: gen_embedding_backward_lamb_split_unweighted_cuda.cu 2025-05-07T19:50:35.9578114Z Written: gen_embedding_backward_lamb_split_weighted_meta.cpp 2025-05-07T19:50:35.9578704Z Written: gen_embedding_backward_lamb_split_unweighted_nobag_meta.cpp 2025-05-07T19:50:35.9598672Z Written: gen_embedding_backward_lamb_split_unweighted_meta.cpp 2025-05-07T19:50:35.9599221Z Written: gen_embedding_backward_lamb_split_weighted_kernel_cta.cu 2025-05-07T19:50:35.9599810Z Written: gen_embedding_backward_lamb_split_unweighted_nobag_kernel_cta.cu 2025-05-07T19:50:35.9600384Z Written: gen_embedding_backward_lamb_split_unweighted_kernel_cta.cu 2025-05-07T19:50:35.9600957Z Written: gen_embedding_backward_lamb_split_weighted_kernel_warp.cu 2025-05-07T19:50:35.9601518Z Written: gen_embedding_backward_lamb_split_unweighted_nobag_kernel_warp.cu 2025-05-07T19:50:35.9602134Z Written: gen_embedding_backward_lamb_split_unweighted_kernel_warp.cu 2025-05-07T19:50:35.9602691Z Written: gen_embedding_optimizer_lamb_split_device_kernel.cuh 2025-05-07T19:50:35.9603140Z Written: gen_embedding_backward_split_lamb.cpp 2025-05-07T19:50:35.9603580Z Written: gen_embedding_split_lamb_pt2_autograd.cpp 2025-05-07T19:50:35.9604172Z Written: gen_embedding_backward_split_lamb_pt2_cuda_wrapper.cpp 2025-05-07T19:50:35.9604597Z Written: lookup_lamb.py 2025-05-07T19:50:35.9604900Z Written: gen_embedding_backward_split_lamb_cpu.cpp 2025-05-07T19:50:35.9605358Z Written: gen_embedding_backward_split_lamb_pt2_cpu_wrapper.cpp 2025-05-07T19:50:35.9605828Z Written: gen_embedding_backward_lars_sgd_split_weighted_cuda.cu 2025-05-07T19:50:35.9606359Z Written: gen_embedding_backward_lars_sgd_split_unweighted_nobag_cuda.cu 2025-05-07T19:50:35.9606927Z Written: gen_embedding_backward_lars_sgd_split_unweighted_cuda.cu 2025-05-07T19:50:35.9607414Z Written: gen_embedding_backward_lars_sgd_split_weighted_meta.cpp 2025-05-07T19:50:35.9607944Z Written: gen_embedding_backward_lars_sgd_split_unweighted_nobag_meta.cpp 2025-05-07T19:50:35.9608461Z Written: gen_embedding_backward_lars_sgd_split_unweighted_meta.cpp 2025-05-07T19:50:35.9608987Z Written: gen_embedding_backward_lars_sgd_split_weighted_kernel_cta.cu 2025-05-07T19:50:35.9609575Z Written: gen_embedding_backward_lars_sgd_split_unweighted_nobag_kernel_cta.cu 2025-05-07T19:50:35.9610132Z Written: gen_embedding_backward_lars_sgd_split_unweighted_kernel_cta.cu 2025-05-07T19:50:35.9610689Z Written: gen_embedding_backward_lars_sgd_split_weighted_kernel_warp.cu 2025-05-07T19:50:35.9611232Z Written: gen_embedding_backward_lars_sgd_split_unweighted_nobag_kernel_warp.cu 2025-05-07T19:50:35.9611816Z Written: gen_embedding_backward_lars_sgd_split_unweighted_kernel_warp.cu 2025-05-07T19:50:35.9612329Z Written: gen_embedding_optimizer_lars_sgd_split_device_kernel.cuh 2025-05-07T19:50:35.9612792Z Written: gen_embedding_backward_split_lars_sgd.cpp 2025-05-07T19:50:35.9613217Z Written: gen_embedding_split_lars_sgd_pt2_autograd.cpp 2025-05-07T19:50:35.9613669Z Written: gen_embedding_backward_split_lars_sgd_pt2_cuda_wrapper.cpp 2025-05-07T19:50:35.9614114Z Written: lookup_lars_sgd.py 2025-05-07T19:50:35.9614442Z Written: gen_embedding_backward_split_lars_sgd_cpu.cpp 2025-05-07T19:50:35.9614927Z Written: gen_embedding_backward_split_lars_sgd_pt2_cpu_wrapper.cpp 2025-05-07T19:50:35.9615457Z Written: gen_embedding_backward_partial_rowwise_adam_split_weighted_cuda.cu 2025-05-07T19:50:35.9616080Z Written: gen_embedding_backward_partial_rowwise_adam_split_unweighted_nobag_cuda.cu 2025-05-07T19:50:35.9616704Z Written: gen_embedding_backward_partial_rowwise_adam_split_unweighted_cuda.cu 2025-05-07T19:50:35.9617273Z Written: gen_embedding_backward_partial_rowwise_adam_split_weighted_meta.cpp 2025-05-07T19:50:35.9617897Z Written: gen_embedding_backward_partial_rowwise_adam_split_unweighted_nobag_meta.cpp 2025-05-07T19:50:35.9618682Z Written: gen_embedding_backward_partial_rowwise_adam_split_unweighted_meta.cpp 2025-05-07T19:50:35.9619312Z Written: gen_embedding_backward_partial_rowwise_adam_split_weighted_kernel_cta.cu 2025-05-07T19:50:35.9619957Z Written: gen_embedding_backward_partial_rowwise_adam_split_unweighted_nobag_kernel_cta.cu 2025-05-07T19:50:35.9620636Z Written: gen_embedding_backward_partial_rowwise_adam_split_unweighted_kernel_cta.cu 2025-05-07T19:50:35.9621364Z Written: gen_embedding_backward_partial_rowwise_adam_split_weighted_kernel_warp.cu 2025-05-07T19:50:35.9622001Z Written: gen_embedding_backward_partial_rowwise_adam_split_unweighted_nobag_kernel_warp.cu 2025-05-07T19:50:35.9622685Z Written: gen_embedding_backward_partial_rowwise_adam_split_unweighted_kernel_warp.cu 2025-05-07T19:50:36.0442752Z Written: gen_embedding_optimizer_partial_rowwise_adam_split_device_kernel.cuh 2025-05-07T19:50:36.0444586Z Written: gen_embedding_backward_split_partial_rowwise_adam.cpp 2025-05-07T19:50:36.0446092Z Written: gen_embedding_split_partial_rowwise_adam_pt2_autograd.cpp 2025-05-07T19:50:36.0447777Z Written: gen_embedding_backward_split_partial_rowwise_adam_pt2_cuda_wrapper.cpp 2025-05-07T19:50:36.0449227Z Written: lookup_partial_rowwise_adam.py 2025-05-07T19:50:36.0450458Z Written: gen_embedding_backward_split_partial_rowwise_adam_cpu.cpp 2025-05-07T19:50:36.0452124Z Written: gen_embedding_backward_split_partial_rowwise_adam_pt2_cpu_wrapper.cpp 2025-05-07T19:50:36.0452731Z Written: gen_embedding_backward_partial_rowwise_lamb_split_weighted_cuda.cu 2025-05-07T19:50:36.0453324Z Written: gen_embedding_backward_partial_rowwise_lamb_split_unweighted_nobag_cuda.cu 2025-05-07T19:50:36.0453951Z Written: gen_embedding_backward_partial_rowwise_lamb_split_unweighted_cuda.cu 2025-05-07T19:50:36.0454527Z Written: gen_embedding_backward_partial_rowwise_lamb_split_weighted_meta.cpp 2025-05-07T19:50:36.0455150Z Written: gen_embedding_backward_partial_rowwise_lamb_split_unweighted_nobag_meta.cpp 2025-05-07T19:50:36.0455788Z Written: gen_embedding_backward_partial_rowwise_lamb_split_unweighted_meta.cpp 2025-05-07T19:50:36.0456392Z Written: gen_embedding_backward_partial_rowwise_lamb_split_weighted_kernel_cta.cu 2025-05-07T19:50:36.0457056Z Written: gen_embedding_backward_partial_rowwise_lamb_split_unweighted_nobag_kernel_cta.cu 2025-05-07T19:50:36.0457707Z Written: gen_embedding_backward_partial_rowwise_lamb_split_unweighted_kernel_cta.cu 2025-05-07T19:50:36.0458358Z Written: gen_embedding_backward_partial_rowwise_lamb_split_weighted_kernel_warp.cu 2025-05-07T19:50:36.0459029Z Written: gen_embedding_backward_partial_rowwise_lamb_split_unweighted_nobag_kernel_warp.cu 2025-05-07T19:50:36.0459679Z Written: gen_embedding_backward_partial_rowwise_lamb_split_unweighted_kernel_warp.cu 2025-05-07T19:50:36.0460306Z Written: gen_embedding_optimizer_partial_rowwise_lamb_split_device_kernel.cuh 2025-05-07T19:50:36.0460830Z Written: gen_embedding_backward_split_partial_rowwise_lamb.cpp 2025-05-07T19:50:36.0461336Z Written: gen_embedding_split_partial_rowwise_lamb_pt2_autograd.cpp 2025-05-07T19:50:36.0461883Z Written: gen_embedding_backward_split_partial_rowwise_lamb_pt2_cuda_wrapper.cpp 2025-05-07T19:50:36.0462381Z Written: lookup_partial_rowwise_lamb.py 2025-05-07T19:50:36.0462815Z Written: gen_embedding_backward_split_partial_rowwise_lamb_cpu.cpp 2025-05-07T19:50:36.0463348Z Written: gen_embedding_backward_split_partial_rowwise_lamb_pt2_cpu_wrapper.cpp 2025-05-07T19:50:36.0463937Z Written: gen_embedding_backward_rowwise_adagrad_ssd_weighted_vbe_cuda.cu 2025-05-07T19:50:36.0464477Z Written: gen_embedding_backward_rowwise_adagrad_split_weighted_vbe_cuda.cu 2025-05-07T19:50:36.0465480Z Written: gen_embedding_backward_rowwise_adagrad_ssd_weighted_cuda.cu 2025-05-07T19:50:36.0466057Z Written: gen_embedding_backward_rowwise_adagrad_split_weighted_cuda.cu 2025-05-07T19:50:36.0466676Z Written: gen_embedding_backward_rowwise_adagrad_ssd_unweighted_nobag_cuda.cu 2025-05-07T19:50:36.0467321Z Written: gen_embedding_backward_rowwise_adagrad_split_unweighted_nobag_cuda.cu 2025-05-07T19:50:36.0468167Z Written: gen_embedding_backward_rowwise_adagrad_ssd_unweighted_vbe_cuda.cu 2025-05-07T19:50:36.0468796Z Written: gen_embedding_backward_rowwise_adagrad_split_unweighted_vbe_cuda.cu 2025-05-07T19:50:36.0469384Z Written: gen_embedding_backward_rowwise_adagrad_ssd_unweighted_cuda.cu 2025-05-07T19:50:36.0469989Z Written: gen_embedding_backward_rowwise_adagrad_split_unweighted_cuda.cu 2025-05-07T19:50:36.0470741Z Written: gen_embedding_backward_rowwise_adagrad_ssd_weighted_vbe_meta.cpp 2025-05-07T19:50:36.0471440Z Written: gen_embedding_backward_rowwise_adagrad_split_weighted_vbe_meta.cpp 2025-05-07T19:50:36.0472059Z Written: gen_embedding_backward_rowwise_adagrad_ssd_weighted_meta.cpp 2025-05-07T19:50:36.0472623Z Written: gen_embedding_backward_rowwise_adagrad_split_weighted_meta.cpp 2025-05-07T19:50:36.0473253Z Written: gen_embedding_backward_rowwise_adagrad_ssd_unweighted_nobag_meta.cpp 2025-05-07T19:50:36.0473883Z Written: gen_embedding_backward_rowwise_adagrad_split_unweighted_nobag_meta.cpp 2025-05-07T19:50:36.0474543Z Written: gen_embedding_backward_rowwise_adagrad_ssd_unweighted_vbe_meta.cpp 2025-05-07T19:50:36.0475176Z Written: gen_embedding_backward_rowwise_adagrad_split_unweighted_vbe_meta.cpp 2025-05-07T19:50:36.0475774Z Written: gen_embedding_backward_rowwise_adagrad_ssd_unweighted_meta.cpp 2025-05-07T19:50:36.0476370Z Written: gen_embedding_backward_rowwise_adagrad_split_unweighted_meta.cpp 2025-05-07T19:50:36.0476981Z Written: gen_embedding_backward_rowwise_adagrad_ssd_weighted_vbe_kernel_cta.cu 2025-05-07T19:50:36.0477637Z Written: gen_embedding_backward_rowwise_adagrad_split_weighted_vbe_kernel_cta.cu 2025-05-07T19:50:36.0478252Z Written: gen_embedding_backward_rowwise_adagrad_ssd_weighted_kernel_cta.cu 2025-05-07T19:50:36.0478881Z Written: gen_embedding_backward_rowwise_adagrad_split_weighted_kernel_cta.cu 2025-05-07T19:50:36.0479539Z Written: gen_embedding_backward_rowwise_adagrad_ssd_unweighted_nobag_kernel_cta.cu 2025-05-07T19:50:36.0480201Z Written: gen_embedding_backward_rowwise_adagrad_split_unweighted_nobag_kernel_cta.cu 2025-05-07T19:50:36.0480895Z Written: gen_embedding_backward_rowwise_adagrad_ssd_unweighted_vbe_kernel_cta.cu 2025-05-07T19:50:36.0481542Z Written: gen_embedding_backward_rowwise_adagrad_split_unweighted_vbe_kernel_cta.cu 2025-05-07T19:50:36.0482209Z Written: gen_embedding_backward_rowwise_adagrad_ssd_unweighted_kernel_cta.cu 2025-05-07T19:50:36.0482830Z Written: gen_embedding_backward_rowwise_adagrad_split_unweighted_kernel_cta.cu 2025-05-07T19:50:36.0483476Z Written: gen_embedding_backward_rowwise_adagrad_ssd_weighted_vbe_kernel_warp.cu 2025-05-07T19:50:36.0484246Z Written: gen_embedding_backward_rowwise_adagrad_split_weighted_vbe_kernel_warp.cu 2025-05-07T19:50:36.0484827Z Written: gen_embedding_backward_rowwise_adagrad_ssd_weighted_kernel_warp.cu 2025-05-07T19:50:36.0485421Z Written: gen_embedding_backward_rowwise_adagrad_split_weighted_kernel_warp.cu 2025-05-07T19:50:36.0486022Z Written: gen_embedding_backward_rowwise_adagrad_ssd_unweighted_nobag_kernel_warp.cu 2025-05-07T19:50:36.0486681Z Written: gen_embedding_backward_rowwise_adagrad_split_unweighted_nobag_kernel_warp.cu 2025-05-07T19:50:36.0487333Z Written: gen_embedding_backward_rowwise_adagrad_ssd_unweighted_vbe_kernel_warp.cu 2025-05-07T19:50:36.0487937Z Written: gen_embedding_backward_rowwise_adagrad_split_unweighted_vbe_kernel_warp.cu 2025-05-07T19:50:36.0488569Z Written: gen_embedding_backward_rowwise_adagrad_ssd_unweighted_kernel_warp.cu 2025-05-07T19:50:36.0489156Z Written: gen_embedding_backward_rowwise_adagrad_split_unweighted_kernel_warp.cu 2025-05-07T19:50:36.0489798Z Written: gen_embedding_backward_rowwise_adagrad_split_weighted_vbe_gwd_kernel_cta.cu 2025-05-07T19:50:36.0490413Z Written: gen_embedding_backward_rowwise_adagrad_split_weighted_gwd_kernel_cta.cu 2025-05-07T19:50:36.0491061Z Written: gen_embedding_backward_rowwise_adagrad_split_unweighted_vbe_gwd_kernel_cta.cu 2025-05-07T19:50:36.0491711Z Written: gen_embedding_backward_rowwise_adagrad_split_unweighted_gwd_kernel_cta.cu 2025-05-07T19:50:36.0492407Z Written: gen_embedding_backward_rowwise_adagrad_split_weighted_vbe_gwd_kernel_warp.cu 2025-05-07T19:50:36.0493044Z Written: gen_embedding_backward_rowwise_adagrad_split_weighted_gwd_kernel_warp.cu 2025-05-07T19:50:36.0493670Z Written: gen_embedding_backward_rowwise_adagrad_split_unweighted_vbe_gwd_kernel_warp.cu 2025-05-07T19:50:36.0494416Z Written: gen_embedding_backward_rowwise_adagrad_split_unweighted_gwd_kernel_warp.cu 2025-05-07T19:50:36.0495045Z Written: gen_embedding_backward_rowwise_adagrad_split_weighted_vbe_gwd_cuda.cu 2025-05-07T19:50:36.0495601Z Written: gen_embedding_backward_rowwise_adagrad_split_weighted_gwd_cuda.cu 2025-05-07T19:50:36.0496197Z Written: gen_embedding_backward_rowwise_adagrad_split_unweighted_vbe_gwd_cuda.cu 2025-05-07T19:50:36.0496773Z Written: gen_embedding_backward_rowwise_adagrad_split_unweighted_gwd_cuda.cu 2025-05-07T19:50:36.0497342Z Written: gen_embedding_optimizer_rowwise_adagrad_ssd_device_kernel.cuh 2025-05-07T19:50:36.0497877Z Written: gen_embedding_optimizer_rowwise_adagrad_split_device_kernel.cuh 2025-05-07T19:50:36.0498393Z Written: gen_embedding_backward_ssd_rowwise_adagrad.cpp 2025-05-07T19:50:36.0498855Z Written: gen_embedding_ssd_rowwise_adagrad_pt2_autograd.cpp 2025-05-07T19:50:36.0499350Z Written: gen_embedding_backward_ssd_rowwise_adagrad_pt2_cuda_wrapper.cpp 2025-05-07T19:50:36.0499828Z Written: lookup_rowwise_adagrad_ssd.py 2025-05-07T19:50:36.0500210Z Written: gen_embedding_backward_split_rowwise_adagrad.cpp 2025-05-07T19:50:36.0500649Z Written: gen_embedding_split_rowwise_adagrad_pt2_autograd.cpp 2025-05-07T19:50:36.0501169Z Written: gen_embedding_backward_split_rowwise_adagrad_pt2_cuda_wrapper.cpp 2025-05-07T19:50:36.0501638Z Written: lookup_rowwise_adagrad.py 2025-05-07T19:50:36.0502007Z Written: gen_embedding_backward_rowwise_adagrad_split_cpu.cpp 2025-05-07T19:50:36.0502489Z Written: gen_embedding_backward_split_rowwise_adagrad_cpu.cpp 2025-05-07T19:50:36.0502986Z Written: gen_embedding_backward_split_rowwise_adagrad_pt2_cpu_wrapper.cpp 2025-05-07T19:50:36.0503580Z Written: gen_embedding_optimizer_approx_rowwise_adagrad_split_device_kernel.cuh 2025-05-07T19:50:36.0504120Z Written: gen_embedding_backward_split_approx_rowwise_adagrad.cpp 2025-05-07T19:50:36.0504636Z Written: gen_embedding_split_approx_rowwise_adagrad_pt2_autograd.cpp 2025-05-07T19:50:36.0505212Z Written: gen_embedding_backward_split_approx_rowwise_adagrad_pt2_cuda_wrapper.cpp 2025-05-07T19:50:36.0505773Z Written: gen_embedding_backward_split_approx_rowwise_adagrad_cpu.cpp 2025-05-07T19:50:36.0506348Z Written: gen_embedding_backward_split_approx_rowwise_adagrad_pt2_cpu_wrapper.cpp 2025-05-07T19:50:36.0506976Z Written: gen_embedding_optimizer_rowwise_adagrad_with_weight_decay_split_device_kernel.cuh 2025-05-07T19:50:36.0507620Z Written: gen_embedding_backward_split_rowwise_adagrad_with_weight_decay.cpp 2025-05-07T19:50:36.0508189Z Written: gen_embedding_split_rowwise_adagrad_with_weight_decay_pt2_autograd.cpp 2025-05-07T19:50:36.0508845Z Written: gen_embedding_backward_split_rowwise_adagrad_with_weight_decay_pt2_cuda_wrapper.cpp 2025-05-07T19:50:36.0509516Z Written: gen_embedding_backward_split_rowwise_adagrad_with_weight_decay_cpu.cpp 2025-05-07T19:50:36.0510145Z Written: gen_embedding_backward_split_rowwise_adagrad_with_weight_decay_pt2_cpu_wrapper.cpp 2025-05-07T19:50:36.0510864Z Written: gen_embedding_optimizer_approx_rowwise_adagrad_with_weight_decay_split_device_kernel.cuh 2025-05-07T19:50:36.0511628Z Written: gen_embedding_backward_split_approx_rowwise_adagrad_with_weight_decay.cpp 2025-05-07T19:50:36.0512604Z Written: gen_embedding_split_approx_rowwise_adagrad_with_weight_decay_pt2_autograd.cpp 2025-05-07T19:50:36.0513369Z Written: gen_embedding_backward_split_approx_rowwise_adagrad_with_weight_decay_pt2_cuda_wrapper.cpp 2025-05-07T19:50:36.1538616Z Written: gen_embedding_backward_split_approx_rowwise_adagrad_with_weight_decay_cpu.cpp 2025-05-07T19:50:36.1540082Z Written: gen_embedding_backward_split_approx_rowwise_adagrad_with_weight_decay_pt2_cpu_wrapper.cpp 2025-05-07T19:50:36.1541104Z Written: gen_embedding_backward_rowwise_adagrad_with_counter_split_weighted_vbe_cuda.cu 2025-05-07T19:50:36.1541734Z Written: gen_embedding_backward_rowwise_adagrad_with_counter_split_weighted_cuda.cu 2025-05-07T19:50:36.1542408Z Written: gen_embedding_backward_rowwise_adagrad_with_counter_split_unweighted_nobag_cuda.cu 2025-05-07T19:50:36.1543181Z Written: gen_embedding_backward_rowwise_adagrad_with_counter_split_unweighted_vbe_cuda.cu 2025-05-07T19:50:36.1543856Z Written: gen_embedding_backward_rowwise_adagrad_with_counter_split_unweighted_cuda.cu 2025-05-07T19:50:36.1544497Z Written: gen_embedding_backward_rowwise_adagrad_with_counter_split_weighted_vbe_meta.cpp 2025-05-07T19:50:36.1545170Z Written: gen_embedding_backward_rowwise_adagrad_with_counter_split_weighted_meta.cpp 2025-05-07T19:50:36.1545852Z Written: gen_embedding_backward_rowwise_adagrad_with_counter_split_unweighted_nobag_meta.cpp 2025-05-07T19:50:36.1546526Z Written: gen_embedding_backward_rowwise_adagrad_with_counter_split_unweighted_vbe_meta.cpp 2025-05-07T19:50:36.1547214Z Written: gen_embedding_backward_rowwise_adagrad_with_counter_split_unweighted_meta.cpp 2025-05-07T19:50:36.1547879Z Written: gen_embedding_backward_rowwise_adagrad_with_counter_split_weighted_vbe_kernel_cta.cu 2025-05-07T19:50:36.1548577Z Written: gen_embedding_backward_rowwise_adagrad_with_counter_split_weighted_kernel_cta.cu 2025-05-07T19:50:36.1549297Z Written: gen_embedding_backward_rowwise_adagrad_with_counter_split_unweighted_nobag_kernel_cta.cu 2025-05-07T19:50:36.1550007Z Written: gen_embedding_backward_rowwise_adagrad_with_counter_split_unweighted_vbe_kernel_cta.cu 2025-05-07T19:50:36.1550722Z Written: gen_embedding_backward_rowwise_adagrad_with_counter_split_unweighted_kernel_cta.cu 2025-05-07T19:50:36.1551532Z Written: gen_embedding_backward_rowwise_adagrad_with_counter_split_weighted_vbe_kernel_warp.cu 2025-05-07T19:50:36.1552492Z Written: gen_embedding_backward_rowwise_adagrad_with_counter_split_weighted_kernel_warp.cu 2025-05-07T19:50:36.1553281Z Written: gen_embedding_backward_rowwise_adagrad_with_counter_split_unweighted_nobag_kernel_warp.cu 2025-05-07T19:50:36.1554055Z Written: gen_embedding_backward_rowwise_adagrad_with_counter_split_unweighted_vbe_kernel_warp.cu 2025-05-07T19:50:36.1554836Z Written: gen_embedding_backward_rowwise_adagrad_with_counter_split_unweighted_kernel_warp.cu 2025-05-07T19:50:36.1555548Z Written: gen_embedding_optimizer_rowwise_adagrad_with_counter_split_device_kernel.cuh 2025-05-07T19:50:36.1556197Z Written: gen_embedding_backward_split_rowwise_adagrad_with_counter.cpp 2025-05-07T19:50:36.1556772Z Written: gen_embedding_split_rowwise_adagrad_with_counter_pt2_autograd.cpp 2025-05-07T19:50:36.1557439Z Written: gen_embedding_backward_split_rowwise_adagrad_with_counter_pt2_cuda_wrapper.cpp 2025-05-07T19:50:36.1558131Z Written: lookup_rowwise_adagrad_with_counter.py 2025-05-07T19:50:36.1558582Z Written: gen_embedding_backward_split_rowwise_adagrad_with_counter_cpu.cpp 2025-05-07T19:50:36.1559197Z Written: gen_embedding_backward_split_rowwise_adagrad_with_counter_pt2_cpu_wrapper.cpp 2025-05-07T19:50:36.1559863Z Written: gen_embedding_optimizer_approx_rowwise_adagrad_with_counter_split_device_kernel.cuh 2025-05-07T19:50:36.1560514Z Written: gen_embedding_backward_split_approx_rowwise_adagrad_with_counter.cpp 2025-05-07T19:50:36.1561121Z Written: gen_embedding_split_approx_rowwise_adagrad_with_counter_pt2_autograd.cpp 2025-05-07T19:50:36.1561763Z Written: gen_embedding_backward_split_approx_rowwise_adagrad_with_counter_pt2_cuda_wrapper.cpp 2025-05-07T19:50:36.1562434Z Written: gen_embedding_backward_split_approx_rowwise_adagrad_with_counter_cpu.cpp 2025-05-07T19:50:36.1563074Z Written: gen_embedding_backward_split_approx_rowwise_adagrad_with_counter_pt2_cpu_wrapper.cpp 2025-05-07T19:50:36.1563687Z Written: gen_embedding_optimizer_rowwise_weighted_adagrad_split_device_kernel.cuh 2025-05-07T19:50:36.1564211Z Written: gen_embedding_backward_split_rowwise_weighted_adagrad.cpp 2025-05-07T19:50:36.1564876Z Written: gen_embedding_split_rowwise_weighted_adagrad_pt2_autograd.cpp 2025-05-07T19:50:36.1565947Z Written: gen_embedding_backward_split_rowwise_weighted_adagrad_pt2_cuda_wrapper.cpp 2025-05-07T19:50:36.1566528Z Written: gen_embedding_backward_split_rowwise_weighted_adagrad_cpu.cpp 2025-05-07T19:50:36.1567113Z Written: gen_embedding_backward_split_rowwise_weighted_adagrad_pt2_cpu_wrapper.cpp 2025-05-07T19:50:36.1567748Z Written: gen_embedding_backward_sgd_split_weighted_vbe_cuda.cu 2025-05-07T19:50:36.1568196Z Written: gen_embedding_backward_sgd_split_weighted_cuda.cu 2025-05-07T19:50:36.1568659Z Written: gen_embedding_backward_sgd_split_unweighted_nobag_cuda.cu 2025-05-07T19:50:36.1569140Z Written: gen_embedding_backward_sgd_split_unweighted_vbe_cuda.cu 2025-05-07T19:50:36.1569610Z Written: gen_embedding_backward_sgd_split_unweighted_cuda.cu 2025-05-07T19:50:36.1570064Z Written: gen_embedding_backward_sgd_split_weighted_vbe_meta.cpp 2025-05-07T19:50:36.1570526Z Written: gen_embedding_backward_sgd_split_weighted_meta.cpp 2025-05-07T19:50:36.1571143Z Written: gen_embedding_backward_sgd_split_unweighted_nobag_meta.cpp 2025-05-07T19:50:36.1571730Z Written: gen_embedding_backward_sgd_split_unweighted_vbe_meta.cpp 2025-05-07T19:50:36.1572184Z Written: gen_embedding_backward_sgd_split_unweighted_meta.cpp 2025-05-07T19:50:36.1572648Z Written: gen_embedding_backward_sgd_split_weighted_vbe_kernel_cta.cu 2025-05-07T19:50:36.1573126Z Written: gen_embedding_backward_sgd_split_weighted_kernel_cta.cu 2025-05-07T19:50:36.1573596Z Written: gen_embedding_backward_sgd_split_unweighted_nobag_kernel_cta.cu 2025-05-07T19:50:36.1574111Z Written: gen_embedding_backward_sgd_split_unweighted_vbe_kernel_cta.cu 2025-05-07T19:50:36.1574618Z Written: gen_embedding_backward_sgd_split_unweighted_kernel_cta.cu 2025-05-07T19:50:36.1575099Z Written: gen_embedding_backward_sgd_split_weighted_vbe_kernel_warp.cu 2025-05-07T19:50:36.1575594Z Written: gen_embedding_backward_sgd_split_weighted_kernel_warp.cu 2025-05-07T19:50:36.1576090Z Written: gen_embedding_backward_sgd_split_unweighted_nobag_kernel_warp.cu 2025-05-07T19:50:36.1576633Z Written: gen_embedding_backward_sgd_split_unweighted_vbe_kernel_warp.cu 2025-05-07T19:50:36.1577116Z Written: gen_embedding_backward_sgd_split_unweighted_kernel_warp.cu 2025-05-07T19:50:36.1577575Z Written: gen_embedding_optimizer_sgd_split_device_kernel.cuh 2025-05-07T19:50:36.1577969Z Written: gen_embedding_backward_split_sgd.cpp 2025-05-07T19:50:36.1578309Z Written: gen_embedding_split_sgd_pt2_autograd.cpp 2025-05-07T19:50:36.1578711Z Written: gen_embedding_backward_split_sgd_pt2_cuda_wrapper.cpp 2025-05-07T19:50:36.1579063Z Written: lookup_sgd.py 2025-05-07T19:50:36.1579332Z Written: gen_embedding_backward_sgd_split_cpu.cpp 2025-05-07T19:50:36.1579679Z Written: gen_embedding_backward_split_sgd_cpu.cpp 2025-05-07T19:50:36.1580078Z Written: gen_embedding_backward_split_sgd_pt2_cpu_wrapper.cpp 2025-05-07T19:50:36.1580533Z Written: gen_embedding_optimizer_approx_sgd_split_device_kernel.cuh 2025-05-07T19:50:36.1580966Z Written: gen_embedding_backward_split_approx_sgd.cpp 2025-05-07T19:50:36.1581358Z Written: gen_embedding_split_approx_sgd_pt2_autograd.cpp 2025-05-07T19:50:36.1581799Z Written: gen_embedding_backward_split_approx_sgd_pt2_cuda_wrapper.cpp 2025-05-07T19:50:36.1582254Z Written: gen_embedding_backward_split_approx_sgd_cpu.cpp 2025-05-07T19:50:36.1582694Z Written: gen_embedding_backward_split_approx_sgd_pt2_cpu_wrapper.cpp 2025-05-07T19:50:36.1583156Z Written: gen_embedding_backward_none_split_weighted_cuda.cu 2025-05-07T19:50:36.1583604Z Written: gen_embedding_backward_none_split_unweighted_nobag_cuda.cu 2025-05-07T19:50:36.1584054Z Written: gen_embedding_backward_none_split_unweighted_cuda.cu 2025-05-07T19:50:36.1584480Z Written: gen_embedding_backward_none_split_weighted_meta.cpp 2025-05-07T19:50:36.1584932Z Written: gen_embedding_backward_none_split_unweighted_nobag_meta.cpp 2025-05-07T19:50:36.1585408Z Written: gen_embedding_backward_none_split_unweighted_meta.cpp 2025-05-07T19:50:36.1585860Z Written: gen_embedding_backward_none_split_weighted_kernel_cta.cu 2025-05-07T19:50:36.1586454Z Written: gen_embedding_backward_none_split_unweighted_nobag_kernel_cta.cu 2025-05-07T19:50:36.1586961Z Written: gen_embedding_backward_none_split_unweighted_kernel_cta.cu 2025-05-07T19:50:36.1587447Z Written: gen_embedding_backward_none_split_weighted_kernel_warp.cu 2025-05-07T19:50:36.1587950Z Written: gen_embedding_backward_none_split_unweighted_nobag_kernel_warp.cu 2025-05-07T19:50:36.1589308Z Written: gen_embedding_backward_none_split_unweighted_kernel_warp.cu 2025-05-07T19:50:36.1589794Z Written: gen_embedding_optimizer_none_split_device_kernel.cuh 2025-05-07T19:50:36.1590175Z Written: gen_embedding_backward_split_none.cpp 2025-05-07T19:50:36.1590533Z Written: gen_embedding_split_none_pt2_autograd.cpp 2025-05-07T19:50:36.1590938Z Written: gen_embedding_backward_split_none_pt2_cuda_wrapper.cpp 2025-05-07T19:50:36.1591411Z Written: lookup_none.py 2025-05-07T19:50:36.1591885Z Written: gen_embedding_backward_split_none_cpu.cpp 2025-05-07T19:50:36.1592328Z Written: gen_embedding_backward_split_none_pt2_cpu_wrapper.cpp 2025-05-07T19:50:36.1592852Z Written: gen_embedding_backward_split_weighted_device_kernel_hip.hip 2025-05-07T19:50:36.1593411Z Written: gen_embedding_backward_split_unweighted_nobag_device_kernel_hip.hip 2025-05-07T19:50:36.1593991Z Written: gen_embedding_backward_split_unweighted_device_kernel_hip.hip 2025-05-07T19:50:36.1594525Z Written: gen_embedding_backward_ssd_weighted_vbe_device_kernel.cuh 2025-05-07T19:50:36.1595050Z Written: gen_embedding_backward_split_weighted_vbe_device_kernel.cuh 2025-05-07T19:50:36.1595553Z Written: gen_embedding_backward_ssd_weighted_device_kernel.cuh 2025-05-07T19:50:36.1596033Z Written: gen_embedding_backward_split_weighted_device_kernel.cuh 2025-05-07T19:50:36.1596551Z Written: gen_embedding_backward_ssd_unweighted_nobag_device_kernel.cuh 2025-05-07T19:50:36.1597088Z Written: gen_embedding_backward_split_unweighted_nobag_device_kernel.cuh 2025-05-07T19:50:36.1597629Z Written: gen_embedding_backward_ssd_unweighted_vbe_device_kernel.cuh 2025-05-07T19:50:36.1598153Z Written: gen_embedding_backward_split_unweighted_vbe_device_kernel.cuh 2025-05-07T19:50:36.1598674Z Written: gen_embedding_backward_ssd_unweighted_device_kernel.cuh 2025-05-07T19:50:36.1599172Z Written: gen_embedding_backward_split_unweighted_device_kernel.cuh 2025-05-07T19:50:36.1599662Z Written: gen_embedding_backward_split_common_device_kernel.cuh 2025-05-07T19:50:36.1600156Z Written: gen_embedding_backward_split_grad_embedding_ops.cu 2025-05-07T19:50:36.1600642Z Written: gen_embedding_backward_dense_indice_weights_codegen_cuda.cu 2025-05-07T19:50:36.1601170Z Written: gen_embedding_backward_ssd_indice_weights_codegen_cuda.cu 2025-05-07T19:50:36.1601685Z Written: gen_embedding_backward_split_indice_weights_codegen_cuda.cu 2025-05-07T19:50:36.1602118Z Written: pt2_arg_utils.h 2025-05-07T19:50:36.1602367Z Written: __init__.py 2025-05-07T19:50:36.1602627Z Written: lookup_args_ssd.py 2025-05-07T19:50:36.1602900Z Written: lookup_args.py 2025-05-07T19:50:36.1670689Z 2025-05-07T19:50:36.1670748Z 2025-05-07T19:50:36.1671038Z ================================================================================ 2025-05-07T19:50:36.1671556Z Running code generation script ... 2025-05-07T19:50:36.1672351Z /github/home/miniconda/envs/build_binary/bin/python /__w/FBGEMM/FBGEMM/fbgemm_gpu/codegen/genscript/generate_embedding_optimizer.py --opensource 2025-05-07T19:50:36.1673193Z ================================================================================ 2025-05-07T19:50:36.1673429Z 2025-05-07T19:50:36.2748353Z [ARGS PARSE] Parsed arguments: Namespace(install_dir='.', is_fbcode=False, is_rocm=False) 2025-05-07T19:50:36.2750155Z [GENERATE OPTIMIZERS]: ['/__w/FBGEMM/FBGEMM/fbgemm_gpu/codegen/genscript/generate_embedding_optimizer.py', '--opensource'] 2025-05-07T19:50:36.2750839Z Written: gen_embedding_optimizer_rowwise_adagrad_split_cuda.cu 2025-05-07T19:50:36.2751398Z Written: gen_embedding_optimizer_rowwise_adagrad_split_kernel.cu 2025-05-07T19:50:36.2752344Z Written: gen_embedding_optimizer_rowwise_adagrad_split.cpp 2025-05-07T19:50:36.2752845Z Written: gen_embedding_optimizer_rowwise_adagrad_split_device_kernel.cuh 2025-05-07T19:50:36.2753327Z Written: split_embedding_optimizer_rowwise_adagrad.py 2025-05-07T19:50:36.2753729Z Written: optimizer_args.py 2025-05-07T19:50:36.2851925Z 2025-05-07T19:50:36.2852038Z 2025-05-07T19:50:36.2853029Z ================================================================================ 2025-05-07T19:50:36.2854149Z Running code generation script ... 2025-05-07T19:50:36.2856486Z /github/home/miniconda/envs/build_binary/bin/python /__w/FBGEMM/FBGEMM/fbgemm_gpu/codegen/genscript/generate_forward_quantized.py --opensource 2025-05-07T19:50:36.2858830Z ================================================================================ 2025-05-07T19:50:36.2859534Z 2025-05-07T19:50:36.4129275Z [ARGS PARSE] Parsed arguments: Namespace(install_dir='.', is_fbcode=False, is_rocm=False) 2025-05-07T19:50:36.4130212Z [GENERATE FORWARD QUANTIZED]: ['/__w/FBGEMM/FBGEMM/fbgemm_gpu/codegen/genscript/generate_forward_quantized.py', '--opensource'] 2025-05-07T19:50:36.4131152Z Written: gen_embedding_forward_quantized_split_nbit_kernel_weighted_fp32_codegen_cuda.cu 2025-05-07T19:50:36.4131856Z Written: gen_embedding_forward_quantized_split_nbit_kernel_weighted_fp16_codegen_cuda.cu 2025-05-07T19:50:36.4132649Z Written: gen_embedding_forward_quantized_split_nbit_kernel_weighted_fp8_codegen_cuda.cu 2025-05-07T19:50:36.4133329Z Written: gen_embedding_forward_quantized_split_nbit_kernel_weighted_int8_codegen_cuda.cu 2025-05-07T19:50:36.4133972Z Written: gen_embedding_forward_quantized_split_nbit_kernel_weighted_int4_codegen_cuda.cu 2025-05-07T19:50:36.4134628Z Written: gen_embedding_forward_quantized_split_nbit_kernel_weighted_int2_codegen_cuda.cu 2025-05-07T19:50:36.4135320Z Written: gen_embedding_forward_quantized_split_nbit_kernel_unweighted_nobag_fp32_codegen_cuda.cu 2025-05-07T19:50:36.4136021Z Written: gen_embedding_forward_quantized_split_nbit_kernel_unweighted_nobag_fp16_codegen_cuda.cu 2025-05-07T19:50:36.4136749Z Written: gen_embedding_forward_quantized_split_nbit_kernel_unweighted_nobag_fp8_codegen_cuda.cu 2025-05-07T19:50:36.4137442Z Written: gen_embedding_forward_quantized_split_nbit_kernel_unweighted_nobag_int8_codegen_cuda.cu 2025-05-07T19:50:36.4138155Z Written: gen_embedding_forward_quantized_split_nbit_kernel_unweighted_nobag_int4_codegen_cuda.cu 2025-05-07T19:50:36.4138879Z Written: gen_embedding_forward_quantized_split_nbit_kernel_unweighted_nobag_int2_codegen_cuda.cu 2025-05-07T19:50:36.4139555Z Written: gen_embedding_forward_quantized_split_nbit_kernel_unweighted_fp32_codegen_cuda.cu 2025-05-07T19:50:36.4140235Z Written: gen_embedding_forward_quantized_split_nbit_kernel_unweighted_fp16_codegen_cuda.cu 2025-05-07T19:50:36.4140893Z Written: gen_embedding_forward_quantized_split_nbit_kernel_unweighted_fp8_codegen_cuda.cu 2025-05-07T19:50:36.4141571Z Written: gen_embedding_forward_quantized_split_nbit_kernel_unweighted_int8_codegen_cuda.cu 2025-05-07T19:50:36.4142251Z Written: gen_embedding_forward_quantized_split_nbit_kernel_unweighted_int4_codegen_cuda.cu 2025-05-07T19:50:36.4142908Z Written: gen_embedding_forward_quantized_split_nbit_kernel_unweighted_int2_codegen_cuda.cu 2025-05-07T19:50:36.4143558Z Written: gen_embedding_forward_quantized_split_nbit_host_weighted_codegen_cuda.cu 2025-05-07T19:50:36.4144180Z Written: gen_embedding_forward_quantized_split_nbit_host_unweighted_nobag_codegen_cuda.cu 2025-05-07T19:50:36.4144851Z Written: gen_embedding_forward_quantized_split_nbit_host_unweighted_codegen_cuda.cu 2025-05-07T19:50:36.4145402Z Written: gen_embedding_forward_quantized_weighted_codegen_cpu.cpp 2025-05-07T19:50:36.4145910Z Written: gen_embedding_forward_quantized_unweighted_codegen_cpu.cpp 2025-05-07T19:50:36.4229962Z 2025-05-07T19:50:36.4229982Z 2025-05-07T19:50:36.4230726Z ================================================================================ 2025-05-07T19:50:36.4231998Z Running code generation script ... 2025-05-07T19:50:36.4234291Z /github/home/miniconda/envs/build_binary/bin/python /__w/FBGEMM/FBGEMM/fbgemm_gpu/codegen/genscript/generate_forward_split.py --opensource 2025-05-07T19:50:36.4235105Z ================================================================================ 2025-05-07T19:50:36.4235346Z 2025-05-07T19:50:36.7644806Z [ARGS PARSE] Parsed arguments: Namespace(install_dir='.', is_fbcode=False, is_rocm=False) 2025-05-07T19:50:36.7647007Z [GENERATE FORWARD SPLIT]: ['/__w/FBGEMM/FBGEMM/fbgemm_gpu/codegen/genscript/generate_forward_split.py', '--opensource'] 2025-05-07T19:50:36.7647797Z Written: gen_embedding_forward_dense_weighted_vbe_codegen_cuda.cu 2025-05-07T19:50:36.7648312Z Written: gen_embedding_forward_dense_weighted_codegen_cuda.cu 2025-05-07T19:50:36.7648914Z Written: gen_embedding_forward_dense_unweighted_vbe_codegen_cuda.cu 2025-05-07T19:50:36.7649416Z Written: gen_embedding_forward_dense_unweighted_codegen_cuda.cu 2025-05-07T19:50:36.7649878Z Written: gen_embedding_forward_ssd_weighted_vbe_codegen_cuda.cu 2025-05-07T19:50:36.7650370Z Written: gen_embedding_forward_split_weighted_vbe_codegen_cuda.cu 2025-05-07T19:50:36.7650840Z Written: gen_embedding_forward_ssd_weighted_codegen_cuda.cu 2025-05-07T19:50:36.7651276Z Written: gen_embedding_forward_split_weighted_codegen_cuda.cu 2025-05-07T19:50:36.7651749Z Written: gen_embedding_forward_ssd_unweighted_vbe_codegen_cuda.cu 2025-05-07T19:50:36.7652233Z Written: gen_embedding_forward_split_unweighted_vbe_codegen_cuda.cu 2025-05-07T19:50:36.7652714Z Written: gen_embedding_forward_ssd_unweighted_codegen_cuda.cu 2025-05-07T19:50:36.7653161Z Written: gen_embedding_forward_split_unweighted_codegen_cuda.cu 2025-05-07T19:50:36.7653657Z Written: gen_embedding_forward_split_weighted_vbe_gwd_codegen_cuda.cu 2025-05-07T19:50:36.7654157Z Written: gen_embedding_forward_split_weighted_gwd_codegen_cuda.cu 2025-05-07T19:50:36.7654649Z Written: gen_embedding_forward_split_unweighted_vbe_gwd_codegen_cuda.cu 2025-05-07T19:50:36.7655165Z Written: gen_embedding_forward_split_unweighted_gwd_codegen_cuda.cu 2025-05-07T19:50:36.7655648Z Written: gen_embedding_forward_dense_weighted_vbe_codegen_meta.cpp 2025-05-07T19:50:36.7656128Z Written: gen_embedding_forward_dense_weighted_codegen_meta.cpp 2025-05-07T19:50:36.7656596Z Written: gen_embedding_forward_dense_unweighted_vbe_codegen_meta.cpp 2025-05-07T19:50:36.7657095Z Written: gen_embedding_forward_dense_unweighted_codegen_meta.cpp 2025-05-07T19:50:36.7657578Z Written: gen_embedding_forward_ssd_weighted_vbe_codegen_meta.cpp 2025-05-07T19:50:36.7658043Z Written: gen_embedding_forward_split_weighted_vbe_codegen_meta.cpp 2025-05-07T19:50:36.7658514Z Written: gen_embedding_forward_ssd_weighted_codegen_meta.cpp 2025-05-07T19:50:36.7658951Z Written: gen_embedding_forward_split_weighted_codegen_meta.cpp 2025-05-07T19:50:36.7659433Z Written: gen_embedding_forward_ssd_unweighted_vbe_codegen_meta.cpp 2025-05-07T19:50:36.7659914Z Written: gen_embedding_forward_split_unweighted_vbe_codegen_meta.cpp 2025-05-07T19:50:36.7660402Z Written: gen_embedding_forward_ssd_unweighted_codegen_meta.cpp 2025-05-07T19:50:36.7660881Z Written: gen_embedding_forward_split_unweighted_codegen_meta.cpp 2025-05-07T19:50:36.7661331Z Written: gen_embedding_forward_dense_weighted_vbe_kernel.cu 2025-05-07T19:50:36.7661771Z Written: gen_embedding_forward_dense_weighted_kernel.cu 2025-05-07T19:50:36.7662199Z Written: gen_embedding_forward_dense_unweighted_nobag_kernel.cu 2025-05-07T19:50:36.7662673Z Written: gen_embedding_forward_dense_unweighted_vbe_kernel.cu 2025-05-07T19:50:36.7663101Z Written: gen_embedding_forward_dense_unweighted_kernel.cu 2025-05-07T19:50:36.7663526Z Written: gen_embedding_forward_ssd_weighted_vbe_kernel.cu 2025-05-07T19:50:36.7663957Z Written: gen_embedding_forward_split_weighted_vbe_kernel.cu 2025-05-07T19:50:36.7664368Z Written: gen_embedding_forward_ssd_weighted_kernel.cu 2025-05-07T19:50:36.7665175Z Written: gen_embedding_forward_split_weighted_kernel.cu 2025-05-07T19:50:36.7665636Z Written: gen_embedding_forward_ssd_unweighted_nobag_kernel.cu 2025-05-07T19:50:36.7666288Z Written: gen_embedding_forward_split_unweighted_nobag_kernel.cu 2025-05-07T19:50:36.7666776Z Written: gen_embedding_forward_ssd_unweighted_vbe_kernel.cu 2025-05-07T19:50:36.7667274Z Written: gen_embedding_forward_split_unweighted_vbe_kernel.cu 2025-05-07T19:50:36.7667731Z Written: gen_embedding_forward_ssd_unweighted_kernel.cu 2025-05-07T19:50:36.7668196Z Written: gen_embedding_forward_split_unweighted_kernel.cu 2025-05-07T19:50:36.7668788Z Written: gen_embedding_forward_split_weighted_vbe_gwd_kernel.cu 2025-05-07T19:50:36.7669265Z Written: gen_embedding_forward_split_weighted_gwd_kernel.cu 2025-05-07T19:50:36.7669764Z Written: gen_embedding_forward_split_unweighted_vbe_gwd_kernel.cu 2025-05-07T19:50:36.7670253Z Written: gen_embedding_forward_split_unweighted_gwd_kernel.cu 2025-05-07T19:50:36.7670746Z Written: gen_embedding_forward_split_weighted_v2_kernel.cu 2025-05-07T19:50:36.7671207Z Written: gen_embedding_forward_split_unweighted_v2_kernel.cu 2025-05-07T19:50:36.7671830Z Written: gen_embedding_forward_dense_unweighted_nobag_kernel_small.cu 2025-05-07T19:50:36.7672480Z Written: gen_embedding_forward_dense_unweighted_nobag_kernel_small.cu 2025-05-07T19:50:36.7673012Z Written: gen_embedding_forward_ssd_unweighted_nobag_kernel_small.cu 2025-05-07T19:50:36.7673566Z Written: gen_embedding_forward_split_unweighted_nobag_kernel_small.cu 2025-05-07T19:50:36.7674069Z Written: gen_embedding_forward_split_pt2_cuda_wrapper.cpp 2025-05-07T19:50:36.7674546Z Written: gen_embedding_forward_split_pt2_cpu_wrapper.cpp 2025-05-07T19:50:36.7674983Z Written: gen_embedding_forward_ssd_pt2_cuda_wrapper.cpp 2025-05-07T19:50:36.7762918Z 2025-05-07T19:50:36.7763157Z 2025-05-07T19:50:36.7763657Z ================================================================================ 2025-05-07T19:50:36.7765208Z Running code generation script ... 2025-05-07T19:50:36.7766942Z /github/home/miniconda/envs/build_binary/bin/python /__w/FBGEMM/FBGEMM/fbgemm_gpu/codegen/genscript/generate_index_select.py --opensource 2025-05-07T19:50:36.7767763Z ================================================================================ 2025-05-07T19:50:36.7767999Z 2025-05-07T19:50:37.0495516Z [ARGS PARSE] Parsed arguments: Namespace(install_dir='.', is_fbcode=False, is_rocm=False) 2025-05-07T19:50:37.0497368Z [INDEX SELECT GENERATOR]: ['/__w/FBGEMM/FBGEMM/fbgemm_gpu/codegen/genscript/generate_index_select.py', '--opensource'] 2025-05-07T19:50:37.0498210Z Written: gen_batch_index_select_dim0_forward_codegen_cuda.cu 2025-05-07T19:50:37.0498691Z Written: gen_batch_index_select_dim0_forward_kernel.cu 2025-05-07T19:50:37.0499122Z Written: gen_batch_index_select_dim0_forward_kernel_small.cu 2025-05-07T19:50:37.0499705Z Written: gen_batch_index_select_dim0_backward_codegen_cuda.cu 2025-05-07T19:50:37.0500145Z Written: gen_batch_index_select_dim0_backward_kernel_cta.cu 2025-05-07T19:50:37.0500597Z Written: gen_batch_index_select_dim0_backward_kernel_warp.cu 2025-05-07T19:50:37.0501105Z Written: gen_embedding_backward_split_batch_index_select_device_kernel.cuh 2025-05-07T19:50:37.0501600Z Written: gen_embedding_backward_split_grad_index_select.cu 2025-05-07T19:50:37.0502052Z Written: gen_embedding_backward_split_common_device_kernel.cuh 2025-05-07T19:50:37.0611128Z -- Adding merge_pooled_embeddings sources 2025-05-07T19:50:37.0626091Z 2025-05-07T19:50:37.0626145Z 2025-05-07T19:50:37.0626603Z ================================================================================ 2025-05-07T19:50:37.0627098Z GPU CPP Library Target: fbgemm_gpu_tbe_cache (SHARED) 2025-05-07T19:50:37.0627470Z 2025-05-07T19:50:37.0627752Z CPU_SRCS: 2025-05-07T19:50:37.0628173Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/split_embeddings_cache/lfu_cache_populate_byte.cpp 2025-05-07T19:50:37.0628875Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/split_embeddings_cache/linearize_cache_indices.cpp 2025-05-07T19:50:37.0629570Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/split_embeddings_cache/lru_cache_populate_byte.cpp 2025-05-07T19:50:37.0630185Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/split_embeddings_cache/lxu_cache.cpp 2025-05-07T19:50:37.0632313Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/split_embeddings_cache/split_embeddings_cache_ops.cpp 2025-05-07T19:50:37.0632806Z 2025-05-07T19:50:37.0633029Z GPU_SRCS: 2025-05-07T19:50:37.0633391Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/split_embeddings_cache/lfu_cache_find.cu 2025-05-07T19:50:37.0634016Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/split_embeddings_cache/lfu_cache_populate.cu 2025-05-07T19:50:37.0634789Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/split_embeddings_cache/lfu_cache_populate_byte.cu 2025-05-07T19:50:37.0635563Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/split_embeddings_cache/linearize_cache_indices.cu 2025-05-07T19:50:37.0636180Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/split_embeddings_cache/lru_cache_find.cu 2025-05-07T19:50:37.0636862Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/split_embeddings_cache/lru_cache_populate.cu 2025-05-07T19:50:37.0637464Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/split_embeddings_cache/lru_cache_populate_byte.cu 2025-05-07T19:50:37.0638038Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/split_embeddings_cache/lxu_cache.cu 2025-05-07T19:50:37.0638583Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/split_embeddings_cache/reset_weight_momentum.cu 2025-05-07T19:50:37.0639203Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/split_embeddings_cache/split_embeddings_cache_ops.cu 2025-05-07T19:50:37.0639645Z 2025-05-07T19:50:37.0639847Z CUDA_SPECIFIC_SRCS: 2025-05-07T19:50:37.0639984Z 2025-05-07T19:50:37.0640068Z 2025-05-07T19:50:37.0640292Z HIP_SPECIFIC_SRCS: 2025-05-07T19:50:37.0640434Z 2025-05-07T19:50:37.0640522Z 2025-05-07T19:50:37.0640748Z OTHER_SRCS: 2025-05-07T19:50:37.0640875Z 2025-05-07T19:50:37.0640984Z 2025-05-07T19:50:37.0641185Z CC_FLAGS: 2025-05-07T19:50:37.0641305Z 2025-05-07T19:50:37.0641416Z 2025-05-07T19:50:37.0641613Z NVCC_FLAGS: 2025-05-07T19:50:37.0641846Z --expt-relaxed-constexpr 2025-05-07T19:50:37.0642108Z -D__CUDA_NO_HALF_OPERATORS__ 2025-05-07T19:50:37.0642391Z -D__CUDA_NO_BFLOAT16_CONVERSIONS__ 2025-05-07T19:50:37.0642673Z -D__CUDA_NO_HALF2_OPERATORS__ 2025-05-07T19:50:37.0642928Z 2025-05-07T19:50:37.0643113Z HIPCC_FLAGS: 2025-05-07T19:50:37.0643256Z 2025-05-07T19:50:37.0643332Z 2025-05-07T19:50:37.0643688Z INCLUDE_DIRS: 2025-05-07T19:50:37.0643940Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../include 2025-05-07T19:50:37.0644270Z /__w/FBGEMM/FBGEMM/fbgemm_gpu 2025-05-07T19:50:37.0644551Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include 2025-05-07T19:50:37.0644878Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../include 2025-05-07T19:50:37.0645375Z /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include 2025-05-07T19:50:37.0646176Z /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include/torch/csrc/api/include 2025-05-07T19:50:37.0646826Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src 2025-05-07T19:50:37.0647261Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include 2025-05-07T19:50:37.0647700Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include 2025-05-07T19:50:37.0648164Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include 2025-05-07T19:50:37.0648696Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include 2025-05-07T19:50:37.0649154Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include 2025-05-07T19:50:37.0649831Z /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/nvidia/nccl/include 2025-05-07T19:50:37.0650310Z 2025-05-07T19:50:37.0650515Z Selected Source Files: 2025-05-07T19:50:37.0650923Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/split_embeddings_cache/lfu_cache_populate_byte.cpp 2025-05-07T19:50:37.0651554Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/split_embeddings_cache/linearize_cache_indices.cpp 2025-05-07T19:50:37.0652181Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/split_embeddings_cache/lru_cache_populate_byte.cpp 2025-05-07T19:50:37.0652747Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/split_embeddings_cache/lxu_cache.cpp 2025-05-07T19:50:37.0653361Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/split_embeddings_cache/split_embeddings_cache_ops.cpp 2025-05-07T19:50:37.0653955Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/split_embeddings_cache/lfu_cache_find.cu 2025-05-07T19:50:37.0654601Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/split_embeddings_cache/lfu_cache_populate.cu 2025-05-07T19:50:37.0655205Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/split_embeddings_cache/lfu_cache_populate_byte.cu 2025-05-07T19:50:37.0655806Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/split_embeddings_cache/linearize_cache_indices.cu 2025-05-07T19:50:37.0656398Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/split_embeddings_cache/lru_cache_find.cu 2025-05-07T19:50:37.0657034Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/split_embeddings_cache/lru_cache_populate.cu 2025-05-07T19:50:37.0657640Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/split_embeddings_cache/lru_cache_populate_byte.cu 2025-05-07T19:50:37.0658213Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/split_embeddings_cache/lxu_cache.cu 2025-05-07T19:50:37.0658755Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/split_embeddings_cache/reset_weight_momentum.cu 2025-05-07T19:50:37.0659386Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/split_embeddings_cache/split_embeddings_cache_ops.cu 2025-05-07T19:50:37.0659829Z 2025-05-07T19:50:37.0660045Z HIPified Source Files: 2025-05-07T19:50:37.0660195Z 2025-05-07T19:50:37.0660270Z 2025-05-07T19:50:37.0660480Z Library Dependencies: 2025-05-07T19:50:37.0660700Z torch 2025-05-07T19:50:37.0660905Z torch_library 2025-05-07T19:50:37.0661335Z /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/lib/libc10.so 2025-05-07T19:50:37.0661973Z /github/home/miniconda/envs/build_binary/targets/x86_64-linux/lib/libnvrtc.so 2025-05-07T19:50:37.0662655Z /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/lib/libc10_cuda.so 2025-05-07T19:50:37.0663409Z /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/nvidia/nccl/lib/libnccl.so.2 2025-05-07T19:50:37.0664121Z /github/home/miniconda/envs/build_binary/targets/x86_64-linux/lib/stubs/libcuda.so 2025-05-07T19:50:37.0664885Z /github/home/miniconda/envs/build_binary/lib/stubs/libnvidia-ml.so 2025-05-07T19:50:37.0665480Z 2025-05-07T19:50:37.0665774Z Output Library: 2025-05-07T19:50:37.0666006Z fbgemm_gpu_tbe_cache 2025-05-07T19:50:37.0666260Z 2025-05-07T19:50:37.0666464Z Destination Directory: 2025-05-07T19:50:37.0666730Z fbgemm_gpu 2025-05-07T19:50:37.0666971Z ================================================================================ 2025-05-07T19:50:37.0667224Z 2025-05-07T19:50:37.1140473Z 2025-05-07T19:50:37.1140634Z 2025-05-07T19:50:37.1141158Z ================================================================================ 2025-05-07T19:50:37.1142442Z GPU CPP Library Target: fbgemm_gpu_tbe_inference (SHARED) 2025-05-07T19:50:37.1143482Z 2025-05-07T19:50:37.1143991Z CPU_SRCS: 2025-05-07T19:50:37.1144865Z codegen/inference/embedding_forward_quantized_host_cpu.cpp 2025-05-07T19:50:37.1146206Z gen_embedding_forward_quantized_unweighted_codegen_cpu.cpp 2025-05-07T19:50:37.1147516Z gen_embedding_forward_quantized_weighted_codegen_cpu.cpp 2025-05-07T19:50:37.1148119Z 2025-05-07T19:50:37.1148330Z GPU_SRCS: 2025-05-07T19:50:37.1148635Z codegen/inference/embedding_forward_quantized_host.cpp 2025-05-07T19:50:37.1149113Z codegen/inference/embedding_forward_quantized_split_lookup.cu 2025-05-07T19:50:37.1149695Z gen_embedding_forward_quantized_split_nbit_kernel_weighted_fp32_codegen_cuda.cu 2025-05-07T19:50:37.1150323Z gen_embedding_forward_quantized_split_nbit_kernel_weighted_fp16_codegen_cuda.cu 2025-05-07T19:50:37.1150960Z gen_embedding_forward_quantized_split_nbit_kernel_weighted_fp8_codegen_cuda.cu 2025-05-07T19:50:37.1151800Z gen_embedding_forward_quantized_split_nbit_kernel_weighted_int8_codegen_cuda.cu 2025-05-07T19:50:37.1152427Z gen_embedding_forward_quantized_split_nbit_kernel_weighted_int4_codegen_cuda.cu 2025-05-07T19:50:37.1153065Z gen_embedding_forward_quantized_split_nbit_kernel_weighted_int2_codegen_cuda.cu 2025-05-07T19:50:37.1153719Z gen_embedding_forward_quantized_split_nbit_kernel_unweighted_nobag_fp32_codegen_cuda.cu 2025-05-07T19:50:37.1154419Z gen_embedding_forward_quantized_split_nbit_kernel_unweighted_nobag_fp16_codegen_cuda.cu 2025-05-07T19:50:37.1155111Z gen_embedding_forward_quantized_split_nbit_kernel_unweighted_nobag_fp8_codegen_cuda.cu 2025-05-07T19:50:37.1155993Z gen_embedding_forward_quantized_split_nbit_kernel_unweighted_nobag_int8_codegen_cuda.cu 2025-05-07T19:50:37.1156693Z gen_embedding_forward_quantized_split_nbit_kernel_unweighted_nobag_int4_codegen_cuda.cu 2025-05-07T19:50:37.1157370Z gen_embedding_forward_quantized_split_nbit_kernel_unweighted_nobag_int2_codegen_cuda.cu 2025-05-07T19:50:37.1158346Z gen_embedding_forward_quantized_split_nbit_kernel_unweighted_fp32_codegen_cuda.cu 2025-05-07T19:50:37.1158960Z gen_embedding_forward_quantized_split_nbit_kernel_unweighted_fp16_codegen_cuda.cu 2025-05-07T19:50:37.1159549Z gen_embedding_forward_quantized_split_nbit_kernel_unweighted_fp8_codegen_cuda.cu 2025-05-07T19:50:37.1160160Z gen_embedding_forward_quantized_split_nbit_kernel_unweighted_int8_codegen_cuda.cu 2025-05-07T19:50:37.1160751Z gen_embedding_forward_quantized_split_nbit_kernel_unweighted_int4_codegen_cuda.cu 2025-05-07T19:50:37.1161357Z gen_embedding_forward_quantized_split_nbit_kernel_unweighted_int2_codegen_cuda.cu 2025-05-07T19:50:37.1161924Z gen_embedding_forward_quantized_split_nbit_host_weighted_codegen_cuda.cu 2025-05-07T19:50:37.1162497Z gen_embedding_forward_quantized_split_nbit_host_unweighted_nobag_codegen_cuda.cu 2025-05-07T19:50:37.1163080Z gen_embedding_forward_quantized_split_nbit_host_unweighted_codegen_cuda.cu 2025-05-07T19:50:37.1163478Z 2025-05-07T19:50:37.1163685Z CUDA_SPECIFIC_SRCS: 2025-05-07T19:50:37.1163829Z 2025-05-07T19:50:37.1163905Z 2025-05-07T19:50:37.1164113Z HIP_SPECIFIC_SRCS: 2025-05-07T19:50:37.1164248Z 2025-05-07T19:50:37.1164327Z 2025-05-07T19:50:37.1164528Z OTHER_SRCS: 2025-05-07T19:50:37.1164642Z 2025-05-07T19:50:37.1164920Z 2025-05-07T19:50:37.1165313Z CC_FLAGS: 2025-05-07T19:50:37.1165548Z 2025-05-07T19:50:37.1165649Z 2025-05-07T19:50:37.1165837Z NVCC_FLAGS: 2025-05-07T19:50:37.1166080Z --expt-relaxed-constexpr 2025-05-07T19:50:37.1166357Z -D__CUDA_NO_HALF_OPERATORS__ 2025-05-07T19:50:37.1166662Z -D__CUDA_NO_BFLOAT16_CONVERSIONS__ 2025-05-07T19:50:37.1166967Z -D__CUDA_NO_HALF2_OPERATORS__ 2025-05-07T19:50:37.1167241Z 2025-05-07T19:50:37.1167432Z HIPCC_FLAGS: 2025-05-07T19:50:37.1167580Z 2025-05-07T19:50:37.1167663Z 2025-05-07T19:50:37.1167852Z INCLUDE_DIRS: 2025-05-07T19:50:37.1168109Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../include 2025-05-07T19:50:37.1168445Z /__w/FBGEMM/FBGEMM/fbgemm_gpu 2025-05-07T19:50:37.1168736Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include 2025-05-07T19:50:37.1169071Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../include 2025-05-07T19:50:37.1169574Z /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include 2025-05-07T19:50:37.1170392Z /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include/torch/csrc/api/include 2025-05-07T19:50:37.1171050Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src 2025-05-07T19:50:37.1171602Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include 2025-05-07T19:50:37.1172013Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include 2025-05-07T19:50:37.1172481Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include 2025-05-07T19:50:37.1172990Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include 2025-05-07T19:50:37.1173421Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include 2025-05-07T19:50:37.1173961Z /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/nvidia/nccl/include 2025-05-07T19:50:37.1174433Z 2025-05-07T19:50:37.1174644Z Selected Source Files: 2025-05-07T19:50:37.1174961Z codegen/inference/embedding_forward_quantized_host_cpu.cpp 2025-05-07T19:50:37.1175410Z gen_embedding_forward_quantized_unweighted_codegen_cpu.cpp 2025-05-07T19:50:37.1175851Z gen_embedding_forward_quantized_weighted_codegen_cpu.cpp 2025-05-07T19:50:37.1176260Z codegen/inference/embedding_forward_quantized_host.cpp 2025-05-07T19:50:37.1176714Z codegen/inference/embedding_forward_quantized_split_lookup.cu 2025-05-07T19:50:37.1177241Z gen_embedding_forward_quantized_split_nbit_kernel_weighted_fp32_codegen_cuda.cu 2025-05-07T19:50:37.1177954Z gen_embedding_forward_quantized_split_nbit_kernel_weighted_fp16_codegen_cuda.cu 2025-05-07T19:50:37.1178532Z gen_embedding_forward_quantized_split_nbit_kernel_weighted_fp8_codegen_cuda.cu 2025-05-07T19:50:37.1179125Z gen_embedding_forward_quantized_split_nbit_kernel_weighted_int8_codegen_cuda.cu 2025-05-07T19:50:37.1179718Z gen_embedding_forward_quantized_split_nbit_kernel_weighted_int4_codegen_cuda.cu 2025-05-07T19:50:37.1180364Z gen_embedding_forward_quantized_split_nbit_kernel_weighted_int2_codegen_cuda.cu 2025-05-07T19:50:37.1180980Z gen_embedding_forward_quantized_split_nbit_kernel_unweighted_nobag_fp32_codegen_cuda.cu 2025-05-07T19:50:37.1181609Z gen_embedding_forward_quantized_split_nbit_kernel_unweighted_nobag_fp16_codegen_cuda.cu 2025-05-07T19:50:37.1182254Z gen_embedding_forward_quantized_split_nbit_kernel_unweighted_nobag_fp8_codegen_cuda.cu 2025-05-07T19:50:37.1182879Z gen_embedding_forward_quantized_split_nbit_kernel_unweighted_nobag_int8_codegen_cuda.cu 2025-05-07T19:50:37.1183526Z gen_embedding_forward_quantized_split_nbit_kernel_unweighted_nobag_int4_codegen_cuda.cu 2025-05-07T19:50:37.1184167Z gen_embedding_forward_quantized_split_nbit_kernel_unweighted_nobag_int2_codegen_cuda.cu 2025-05-07T19:50:37.1184772Z gen_embedding_forward_quantized_split_nbit_kernel_unweighted_fp32_codegen_cuda.cu 2025-05-07T19:50:37.1185375Z gen_embedding_forward_quantized_split_nbit_kernel_unweighted_fp16_codegen_cuda.cu 2025-05-07T19:50:37.1185963Z gen_embedding_forward_quantized_split_nbit_kernel_unweighted_fp8_codegen_cuda.cu 2025-05-07T19:50:37.1186564Z gen_embedding_forward_quantized_split_nbit_kernel_unweighted_int8_codegen_cuda.cu 2025-05-07T19:50:37.1187163Z gen_embedding_forward_quantized_split_nbit_kernel_unweighted_int4_codegen_cuda.cu 2025-05-07T19:50:37.1187748Z gen_embedding_forward_quantized_split_nbit_kernel_unweighted_int2_codegen_cuda.cu 2025-05-07T19:50:37.1188328Z gen_embedding_forward_quantized_split_nbit_host_weighted_codegen_cuda.cu 2025-05-07T19:50:37.1188881Z gen_embedding_forward_quantized_split_nbit_host_unweighted_nobag_codegen_cuda.cu 2025-05-07T19:50:37.1189458Z gen_embedding_forward_quantized_split_nbit_host_unweighted_codegen_cuda.cu 2025-05-07T19:50:37.1189855Z 2025-05-07T19:50:37.1190060Z HIPified Source Files: 2025-05-07T19:50:37.1190238Z 2025-05-07T19:50:37.1190314Z 2025-05-07T19:50:37.1190503Z Library Dependencies: 2025-05-07T19:50:37.1190749Z torch 2025-05-07T19:50:37.1190933Z torch_library 2025-05-07T19:50:37.1191459Z /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/lib/libc10.so 2025-05-07T19:50:37.1192338Z /github/home/miniconda/envs/build_binary/targets/x86_64-linux/lib/libnvrtc.so 2025-05-07T19:50:37.1193049Z /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/lib/libc10_cuda.so 2025-05-07T19:50:37.1193878Z /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/nvidia/nccl/lib/libnccl.so.2 2025-05-07T19:50:37.1194632Z /github/home/miniconda/envs/build_binary/targets/x86_64-linux/lib/stubs/libcuda.so 2025-05-07T19:50:37.1195133Z asmjit 2025-05-07T19:50:37.1195332Z fbgemm 2025-05-07T19:50:37.1195567Z fbgemm_gpu_tbe_cache 2025-05-07T19:50:37.1195815Z fbgemm_gpu_config 2025-05-07T19:50:37.1196198Z /github/home/miniconda/envs/build_binary/lib/stubs/libnvidia-ml.so 2025-05-07T19:50:37.1196632Z 2025-05-07T19:50:37.1196836Z Output Library: 2025-05-07T19:50:37.1197107Z fbgemm_gpu_tbe_inference 2025-05-07T19:50:37.1197361Z 2025-05-07T19:50:37.1197601Z Destination Directory: 2025-05-07T19:50:37.1197851Z fbgemm_gpu 2025-05-07T19:50:37.1198108Z ================================================================================ 2025-05-07T19:50:37.1198347Z 2025-05-07T19:50:37.3568626Z 2025-05-07T19:50:37.3568645Z 2025-05-07T19:50:37.3569426Z ================================================================================ 2025-05-07T19:50:37.3570668Z GPU CPP Library Target: fbgemm_gpu_config (SHARED) 2025-05-07T19:50:37.3571632Z 2025-05-07T19:50:37.3572147Z CPU_SRCS: 2025-05-07T19:50:37.3573174Z src/config/feature_gates.cpp 2025-05-07T19:50:37.3573433Z 2025-05-07T19:50:37.3573642Z GPU_SRCS: 2025-05-07T19:50:37.3573875Z 2025-05-07T19:50:37.3573973Z 2025-05-07T19:50:37.3574168Z CUDA_SPECIFIC_SRCS: 2025-05-07T19:50:37.3574310Z 2025-05-07T19:50:37.3574406Z 2025-05-07T19:50:37.3574593Z HIP_SPECIFIC_SRCS: 2025-05-07T19:50:37.3574844Z 2025-05-07T19:50:37.3574941Z 2025-05-07T19:50:37.3575117Z OTHER_SRCS: 2025-05-07T19:50:37.3575246Z 2025-05-07T19:50:37.3575415Z 2025-05-07T19:50:37.3575595Z CC_FLAGS: 2025-05-07T19:50:37.3575724Z 2025-05-07T19:50:37.3575799Z 2025-05-07T19:50:37.3575976Z NVCC_FLAGS: 2025-05-07T19:50:37.3576110Z 2025-05-07T19:50:37.3576186Z 2025-05-07T19:50:37.3576555Z HIPCC_FLAGS: 2025-05-07T19:50:37.3576679Z 2025-05-07T19:50:37.3576759Z 2025-05-07T19:50:37.3576964Z INCLUDE_DIRS: 2025-05-07T19:50:37.3577198Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../include 2025-05-07T19:50:37.3577715Z /__w/FBGEMM/FBGEMM/fbgemm_gpu 2025-05-07T19:50:37.3578004Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include 2025-05-07T19:50:37.3578342Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../include 2025-05-07T19:50:37.3578846Z /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include 2025-05-07T19:50:37.3579669Z /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include/torch/csrc/api/include 2025-05-07T19:50:37.3580350Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src 2025-05-07T19:50:37.3580780Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include 2025-05-07T19:50:37.3581233Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include 2025-05-07T19:50:37.3581715Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include 2025-05-07T19:50:37.3582259Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include 2025-05-07T19:50:37.3582722Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include 2025-05-07T19:50:37.3583309Z /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/nvidia/nccl/include 2025-05-07T19:50:37.3583835Z 2025-05-07T19:50:37.3584044Z Selected Source Files: 2025-05-07T19:50:37.3584319Z src/config/feature_gates.cpp 2025-05-07T19:50:37.3584574Z 2025-05-07T19:50:37.3584794Z HIPified Source Files: 2025-05-07T19:50:37.3584950Z 2025-05-07T19:50:37.3585032Z 2025-05-07T19:50:37.3585252Z Library Dependencies: 2025-05-07T19:50:37.3585490Z torch 2025-05-07T19:50:37.3585706Z torch_library 2025-05-07T19:50:37.3586151Z /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/lib/libc10.so 2025-05-07T19:50:37.3586961Z /github/home/miniconda/envs/build_binary/targets/x86_64-linux/lib/libnvrtc.so 2025-05-07T19:50:37.3587662Z /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/lib/libc10_cuda.so 2025-05-07T19:50:37.3588632Z /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/nvidia/nccl/lib/libnccl.so.2 2025-05-07T19:50:37.3589395Z /github/home/miniconda/envs/build_binary/targets/x86_64-linux/lib/stubs/libcuda.so 2025-05-07T19:50:37.3590005Z /github/home/miniconda/envs/build_binary/lib/stubs/libnvidia-ml.so 2025-05-07T19:50:37.3590433Z 2025-05-07T19:50:37.3590645Z Output Library: 2025-05-07T19:50:37.3590885Z fbgemm_gpu_config 2025-05-07T19:50:37.3591100Z 2025-05-07T19:50:37.3591448Z Destination Directory: 2025-05-07T19:50:37.3591714Z fbgemm_gpu 2025-05-07T19:50:37.3591946Z ================================================================================ 2025-05-07T19:50:37.3592257Z 2025-05-07T19:50:37.3592342Z 2025-05-07T19:50:37.3592351Z 2025-05-07T19:50:37.3592465Z ================================================================================ 2025-05-07T19:50:37.3592862Z GPU CPP Library Target: fbgemm_gpu_tbe_utils (SHARED) 2025-05-07T19:50:37.3593199Z 2025-05-07T19:50:37.3593405Z CPU_SRCS: 2025-05-07T19:50:37.3593716Z src/split_embeddings_utils/split_embeddings_utils_cpu.cpp 2025-05-07T19:50:37.3594179Z src/split_embeddings_utils/split_embeddings_utils_meta.cpp 2025-05-07T19:50:37.3594561Z 2025-05-07T19:50:37.3594751Z GPU_SRCS: 2025-05-07T19:50:37.3595045Z src/split_embeddings_utils/split_embeddings_utils.cpp 2025-05-07T19:50:37.3595574Z src/split_embeddings_utils/generate_vbe_metadata.cu 2025-05-07T19:50:37.3595996Z src/split_embeddings_utils/get_infos_metadata.cu 2025-05-07T19:50:37.3596376Z src/split_embeddings_utils/radix_sort_pairs.cu 2025-05-07T19:50:37.3596799Z src/split_embeddings_utils/transpose_embedding_input.cu 2025-05-07T19:50:37.3597165Z 2025-05-07T19:50:37.3597364Z CUDA_SPECIFIC_SRCS: 2025-05-07T19:50:37.3597674Z 2025-05-07T19:50:37.3597768Z 2025-05-07T19:50:37.3598125Z HIP_SPECIFIC_SRCS: 2025-05-07T19:50:37.3598267Z 2025-05-07T19:50:37.3598366Z 2025-05-07T19:50:37.3598552Z OTHER_SRCS: 2025-05-07T19:50:37.3598690Z 2025-05-07T19:50:37.3598771Z 2025-05-07T19:50:37.3598961Z CC_FLAGS: 2025-05-07T19:50:37.3599101Z 2025-05-07T19:50:37.3599180Z 2025-05-07T19:50:37.3599368Z NVCC_FLAGS: 2025-05-07T19:50:37.3599525Z 2025-05-07T19:50:37.3599783Z 2025-05-07T19:50:37.3600024Z HIPCC_FLAGS: 2025-05-07T19:50:37.3600164Z 2025-05-07T19:50:37.3600257Z 2025-05-07T19:50:37.3600500Z INCLUDE_DIRS: 2025-05-07T19:50:37.3600765Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../include 2025-05-07T19:50:37.3601142Z /__w/FBGEMM/FBGEMM/fbgemm_gpu 2025-05-07T19:50:37.3601459Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include 2025-05-07T19:50:37.3601833Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../include 2025-05-07T19:50:37.3602359Z /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include 2025-05-07T19:50:37.3603215Z /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include/torch/csrc/api/include 2025-05-07T19:50:37.3603924Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src 2025-05-07T19:50:37.3604365Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include 2025-05-07T19:50:37.3604857Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include 2025-05-07T19:50:37.3605346Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include 2025-05-07T19:50:37.3605895Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include 2025-05-07T19:50:37.3606365Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include 2025-05-07T19:50:37.3606958Z /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/nvidia/nccl/include 2025-05-07T19:50:37.3607492Z 2025-05-07T19:50:37.3607700Z Selected Source Files: 2025-05-07T19:50:37.3608054Z src/split_embeddings_utils/split_embeddings_utils_cpu.cpp 2025-05-07T19:50:37.3608520Z src/split_embeddings_utils/split_embeddings_utils_meta.cpp 2025-05-07T19:50:37.3608993Z src/split_embeddings_utils/split_embeddings_utils.cpp 2025-05-07T19:50:37.3609401Z src/split_embeddings_utils/generate_vbe_metadata.cu 2025-05-07T19:50:37.3609817Z src/split_embeddings_utils/get_infos_metadata.cu 2025-05-07T19:50:37.3610197Z src/split_embeddings_utils/radix_sort_pairs.cu 2025-05-07T19:50:37.3610640Z src/split_embeddings_utils/transpose_embedding_input.cu 2025-05-07T19:50:37.3611034Z 2025-05-07T19:50:37.3611252Z HIPified Source Files: 2025-05-07T19:50:37.3611421Z 2025-05-07T19:50:37.3611533Z 2025-05-07T19:50:37.3611749Z Library Dependencies: 2025-05-07T19:50:37.3612022Z torch 2025-05-07T19:50:37.3612241Z torch_library 2025-05-07T19:50:37.3612729Z /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/lib/libc10.so 2025-05-07T19:50:37.3613434Z /github/home/miniconda/envs/build_binary/targets/x86_64-linux/lib/libnvrtc.so 2025-05-07T19:50:37.3614182Z /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/lib/libc10_cuda.so 2025-05-07T19:50:37.3615037Z /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/nvidia/nccl/lib/libnccl.so.2 2025-05-07T19:50:37.3615806Z /github/home/miniconda/envs/build_binary/targets/x86_64-linux/lib/stubs/libcuda.so 2025-05-07T19:50:37.3616459Z /github/home/miniconda/envs/build_binary/lib/stubs/libnvidia-ml.so 2025-05-07T19:50:37.3616883Z 2025-05-07T19:50:37.3617120Z Output Library: 2025-05-07T19:50:37.3617365Z fbgemm_gpu_tbe_utils 2025-05-07T19:50:37.3617630Z 2025-05-07T19:50:37.3617847Z Destination Directory: 2025-05-07T19:50:37.3618121Z fbgemm_gpu 2025-05-07T19:50:37.3618434Z ================================================================================ 2025-05-07T19:50:37.3618689Z 2025-05-07T19:50:37.3618693Z 2025-05-07T19:50:37.3618697Z 2025-05-07T19:50:37.3618813Z ================================================================================ 2025-05-07T19:50:37.3619256Z GPU CPP Library Target: fbgemm_gpu_sparse_async_cumsum (SHARED) 2025-05-07T19:50:37.3619639Z 2025-05-07T19:50:37.3619910Z CPU_SRCS: 2025-05-07T19:50:37.3620150Z src/sparse_ops/sparse_async_cumsum.cpp 2025-05-07T19:50:37.3620459Z 2025-05-07T19:50:37.3620652Z GPU_SRCS: 2025-05-07T19:50:37.3620908Z src/sparse_ops/sparse_async_cumsum.cu 2025-05-07T19:50:37.3621215Z 2025-05-07T19:50:37.3621415Z CUDA_SPECIFIC_SRCS: 2025-05-07T19:50:37.3621564Z 2025-05-07T19:50:37.3621669Z 2025-05-07T19:50:37.3621868Z HIP_SPECIFIC_SRCS: 2025-05-07T19:50:37.3622010Z 2025-05-07T19:50:37.3622107Z 2025-05-07T19:50:37.3622298Z OTHER_SRCS: 2025-05-07T19:50:37.3622439Z 2025-05-07T19:50:37.3622524Z 2025-05-07T19:50:37.3622713Z CC_FLAGS: 2025-05-07T19:50:37.3622851Z 2025-05-07T19:50:37.3622930Z 2025-05-07T19:50:37.3623121Z NVCC_FLAGS: 2025-05-07T19:50:37.3623260Z 2025-05-07T19:50:37.3623340Z 2025-05-07T19:50:37.3623545Z HIPCC_FLAGS: 2025-05-07T19:50:37.3623674Z 2025-05-07T19:50:37.3623753Z 2025-05-07T19:50:37.3623962Z INCLUDE_DIRS: 2025-05-07T19:50:37.3624206Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../include 2025-05-07T19:50:37.3624547Z /__w/FBGEMM/FBGEMM/fbgemm_gpu 2025-05-07T19:50:37.3624835Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include 2025-05-07T19:50:37.3625284Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../include 2025-05-07T19:50:37.3625743Z /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include 2025-05-07T19:50:37.3626491Z /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include/torch/csrc/api/include 2025-05-07T19:50:37.3627114Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src 2025-05-07T19:50:37.3627502Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include 2025-05-07T19:50:37.3627923Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include 2025-05-07T19:50:37.3628365Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include 2025-05-07T19:50:37.3628864Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include 2025-05-07T19:50:37.3629291Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include 2025-05-07T19:50:37.3630021Z /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/nvidia/nccl/include 2025-05-07T19:50:37.3630528Z 2025-05-07T19:50:37.3630726Z Selected Source Files: 2025-05-07T19:50:37.3631188Z src/sparse_ops/sparse_async_cumsum.cpp 2025-05-07T19:50:37.3631616Z src/sparse_ops/sparse_async_cumsum.cu 2025-05-07T19:50:37.3631923Z 2025-05-07T19:50:37.3632127Z HIPified Source Files: 2025-05-07T19:50:37.3632306Z 2025-05-07T19:50:37.3632394Z 2025-05-07T19:50:37.3632660Z Library Dependencies: 2025-05-07T19:50:37.3632916Z torch 2025-05-07T19:50:37.3633122Z torch_library 2025-05-07T19:50:37.3633582Z /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/lib/libc10.so 2025-05-07T19:50:37.3634282Z /github/home/miniconda/envs/build_binary/targets/x86_64-linux/lib/libnvrtc.so 2025-05-07T19:50:37.3634981Z /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/lib/libc10_cuda.so 2025-05-07T19:50:37.3635821Z /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/nvidia/nccl/lib/libnccl.so.2 2025-05-07T19:50:37.3636586Z /github/home/miniconda/envs/build_binary/targets/x86_64-linux/lib/stubs/libcuda.so 2025-05-07T19:50:37.3637086Z fbgemm_gpu_tbe_utils 2025-05-07T19:50:37.3637465Z /github/home/miniconda/envs/build_binary/lib/stubs/libnvidia-ml.so 2025-05-07T19:50:37.3637890Z 2025-05-07T19:50:37.3638115Z Output Library: 2025-05-07T19:50:37.3638361Z fbgemm_gpu_sparse_async_cumsum 2025-05-07T19:50:37.3638651Z 2025-05-07T19:50:37.3638867Z Destination Directory: 2025-05-07T19:50:37.3639135Z fbgemm_gpu 2025-05-07T19:50:37.3639372Z ================================================================================ 2025-05-07T19:50:37.3639718Z 2025-05-07T19:50:37.3639850Z 2025-05-07T19:50:37.3639854Z 2025-05-07T19:50:37.3639974Z ================================================================================ 2025-05-07T19:50:37.3640384Z GPU CPP Library Target: fbgemm_gpu_tbe_common (SHARED) 2025-05-07T19:50:37.3640731Z 2025-05-07T19:50:37.3640947Z CPU_SRCS: 2025-05-07T19:50:37.3641270Z codegen/utils/embedding_bounds_check_host_cpu.cpp 2025-05-07T19:50:37.3641725Z codegen/training/forward/embedding_forward_split_cpu.cpp 2025-05-07T19:50:37.3642145Z codegen/training/pt2/pt2_autograd_utils.cpp 2025-05-07T19:50:37.3642479Z 2025-05-07T19:50:37.3642673Z GPU_SRCS: 2025-05-07T19:50:37.3642938Z codegen/utils/embedding_bounds_check_v1.cu 2025-05-07T19:50:37.3643291Z codegen/utils/embedding_bounds_check_v2.cu 2025-05-07T19:50:37.3643670Z codegen/utils/embedding_bounds_check_host.cpp 2025-05-07T19:50:37.3644010Z 2025-05-07T19:50:37.3644215Z CUDA_SPECIFIC_SRCS: 2025-05-07T19:50:37.3644367Z 2025-05-07T19:50:37.3644471Z 2025-05-07T19:50:37.3644665Z HIP_SPECIFIC_SRCS: 2025-05-07T19:50:37.3644809Z 2025-05-07T19:50:37.3644910Z 2025-05-07T19:50:37.3645100Z OTHER_SRCS: 2025-05-07T19:50:37.3645241Z 2025-05-07T19:50:37.3645320Z 2025-05-07T19:50:37.3645504Z CC_FLAGS: 2025-05-07T19:50:37.3645636Z 2025-05-07T19:50:37.3645714Z 2025-05-07T19:50:37.3645901Z NVCC_FLAGS: 2025-05-07T19:50:37.3646136Z --expt-relaxed-constexpr 2025-05-07T19:50:37.3646427Z -D__CUDA_NO_HALF_OPERATORS__ 2025-05-07T19:50:37.3646712Z -D__CUDA_NO_BFLOAT16_CONVERSIONS__ 2025-05-07T19:50:37.3647024Z -D__CUDA_NO_HALF2_OPERATORS__ 2025-05-07T19:50:37.3647280Z 2025-05-07T19:50:37.3647488Z HIPCC_FLAGS: 2025-05-07T19:50:37.3647615Z 2025-05-07T19:50:37.3647702Z 2025-05-07T19:50:37.3647915Z INCLUDE_DIRS: 2025-05-07T19:50:37.3648160Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../include 2025-05-07T19:50:37.3648494Z /__w/FBGEMM/FBGEMM/fbgemm_gpu 2025-05-07T19:50:37.3648780Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include 2025-05-07T19:50:37.3649114Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../include 2025-05-07T19:50:37.3649625Z /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include 2025-05-07T19:50:37.3650422Z /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include/torch/csrc/api/include 2025-05-07T19:50:37.3651088Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src 2025-05-07T19:50:37.3651512Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include 2025-05-07T19:50:37.3651961Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include 2025-05-07T19:50:37.3652551Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include 2025-05-07T19:50:37.3653081Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include 2025-05-07T19:50:37.3653552Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include 2025-05-07T19:50:37.3654102Z /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/nvidia/nccl/include 2025-05-07T19:50:37.3654614Z 2025-05-07T19:50:37.3654813Z Selected Source Files: 2025-05-07T19:50:37.3655133Z codegen/utils/embedding_bounds_check_host_cpu.cpp 2025-05-07T19:50:37.3655550Z codegen/training/forward/embedding_forward_split_cpu.cpp 2025-05-07T19:50:37.3655966Z codegen/training/pt2/pt2_autograd_utils.cpp 2025-05-07T19:50:37.3656316Z codegen/utils/embedding_bounds_check_host.cpp 2025-05-07T19:50:37.3656681Z codegen/utils/embedding_bounds_check_v1.cu 2025-05-07T19:50:37.3657227Z codegen/utils/embedding_bounds_check_v2.cu 2025-05-07T19:50:37.3657545Z 2025-05-07T19:50:37.3657768Z HIPified Source Files: 2025-05-07T19:50:37.3657928Z 2025-05-07T19:50:37.3658013Z 2025-05-07T19:50:37.3658229Z Library Dependencies: 2025-05-07T19:50:37.3658463Z torch 2025-05-07T19:50:37.3658680Z torch_library 2025-05-07T19:50:37.3659128Z /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/lib/libc10.so 2025-05-07T19:50:37.3659834Z /github/home/miniconda/envs/build_binary/targets/x86_64-linux/lib/libnvrtc.so 2025-05-07T19:50:37.3660570Z /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/lib/libc10_cuda.so 2025-05-07T19:50:37.3661450Z /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/nvidia/nccl/lib/libnccl.so.2 2025-05-07T19:50:37.3662220Z /github/home/miniconda/envs/build_binary/targets/x86_64-linux/lib/stubs/libcuda.so 2025-05-07T19:50:37.3662695Z fbgemm 2025-05-07T19:50:37.3662918Z fbgemm_gpu_config 2025-05-07T19:50:37.3663337Z /github/home/miniconda/envs/build_binary/lib/stubs/libnvidia-ml.so 2025-05-07T19:50:37.3663764Z 2025-05-07T19:50:37.3663962Z Output Library: 2025-05-07T19:50:37.3664206Z fbgemm_gpu_tbe_common 2025-05-07T19:50:37.3664468Z 2025-05-07T19:50:37.3664885Z Destination Directory: 2025-05-07T19:50:37.3665150Z fbgemm_gpu 2025-05-07T19:50:37.3665382Z ================================================================================ 2025-05-07T19:50:37.3665616Z 2025-05-07T19:50:37.3665620Z 2025-05-07T19:50:37.3665641Z 2025-05-07T19:50:37.3665754Z ================================================================================ 2025-05-07T19:50:37.3666155Z GPU CPP Library Target: fbgemm_gpu_tbe_optimizers (SHARED) 2025-05-07T19:50:37.3666531Z 2025-05-07T19:50:37.3666725Z CPU_SRCS: 2025-05-07T19:50:37.3666869Z 2025-05-07T19:50:37.3666950Z 2025-05-07T19:50:37.3667168Z GPU_SRCS: 2025-05-07T19:50:37.3667440Z gen_embedding_optimizer_rowwise_adagrad_split.cpp 2025-05-07T19:50:37.3667867Z gen_embedding_optimizer_rowwise_adagrad_split_cuda.cu 2025-05-07T19:50:37.3668298Z gen_embedding_optimizer_rowwise_adagrad_split_kernel.cu 2025-05-07T19:50:37.3668659Z 2025-05-07T19:50:37.3680614Z CUDA_SPECIFIC_SRCS: 2025-05-07T19:50:37.3680796Z 2025-05-07T19:50:37.3680904Z 2025-05-07T19:50:37.3681110Z HIP_SPECIFIC_SRCS: 2025-05-07T19:50:37.3681278Z 2025-05-07T19:50:37.3681368Z 2025-05-07T19:50:37.3681564Z OTHER_SRCS: 2025-05-07T19:50:37.3681710Z 2025-05-07T19:50:37.3681796Z 2025-05-07T19:50:37.3681988Z CC_FLAGS: 2025-05-07T19:50:37.3682127Z 2025-05-07T19:50:37.3682208Z 2025-05-07T19:50:37.3682422Z NVCC_FLAGS: 2025-05-07T19:50:37.3682673Z --expt-relaxed-constexpr 2025-05-07T19:50:37.3682979Z -D__CUDA_NO_HALF_OPERATORS__ 2025-05-07T19:50:37.3683273Z -D__CUDA_NO_BFLOAT16_CONVERSIONS__ 2025-05-07T19:50:37.3683598Z -D__CUDA_NO_HALF2_OPERATORS__ 2025-05-07T19:50:37.3683863Z 2025-05-07T19:50:37.3684084Z HIPCC_FLAGS: 2025-05-07T19:50:37.3684218Z 2025-05-07T19:50:37.3684300Z 2025-05-07T19:50:37.3684520Z INCLUDE_DIRS: 2025-05-07T19:50:37.3684767Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../include 2025-05-07T19:50:37.3685105Z /__w/FBGEMM/FBGEMM/fbgemm_gpu 2025-05-07T19:50:37.3685397Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include 2025-05-07T19:50:37.3685735Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../include 2025-05-07T19:50:37.3686257Z /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include 2025-05-07T19:50:37.3687055Z /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include/torch/csrc/api/include 2025-05-07T19:50:37.3687734Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src 2025-05-07T19:50:37.3688157Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include 2025-05-07T19:50:37.3688609Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include 2025-05-07T19:50:37.3689087Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include 2025-05-07T19:50:37.3689632Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include 2025-05-07T19:50:37.3690118Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include 2025-05-07T19:50:37.3690692Z /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/nvidia/nccl/include 2025-05-07T19:50:37.3691220Z 2025-05-07T19:50:37.3691425Z Selected Source Files: 2025-05-07T19:50:37.3691754Z gen_embedding_optimizer_rowwise_adagrad_split.cpp 2025-05-07T19:50:37.3692282Z gen_embedding_optimizer_rowwise_adagrad_split_cuda.cu 2025-05-07T19:50:37.3692715Z gen_embedding_optimizer_rowwise_adagrad_split_kernel.cu 2025-05-07T19:50:37.3693168Z 2025-05-07T19:50:37.3693377Z HIPified Source Files: 2025-05-07T19:50:37.3693527Z 2025-05-07T19:50:37.3693847Z 2025-05-07T19:50:37.3694043Z Library Dependencies: 2025-05-07T19:50:37.3694290Z torch 2025-05-07T19:50:37.3694484Z torch_library 2025-05-07T19:50:37.3694920Z /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/lib/libc10.so 2025-05-07T19:50:37.3695559Z /github/home/miniconda/envs/build_binary/targets/x86_64-linux/lib/libnvrtc.so 2025-05-07T19:50:37.3696307Z /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/lib/libc10_cuda.so 2025-05-07T19:50:37.3697051Z /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/nvidia/nccl/lib/libnccl.so.2 2025-05-07T19:50:37.3697765Z /github/home/miniconda/envs/build_binary/targets/x86_64-linux/lib/stubs/libcuda.so 2025-05-07T19:50:37.3698352Z /github/home/miniconda/envs/build_binary/lib/stubs/libnvidia-ml.so 2025-05-07T19:50:37.3698730Z 2025-05-07T19:50:37.3698939Z Output Library: 2025-05-07T19:50:37.3699173Z fbgemm_gpu_tbe_optimizers 2025-05-07T19:50:37.3699427Z 2025-05-07T19:50:37.3699624Z Destination Directory: 2025-05-07T19:50:37.3699879Z fbgemm_gpu 2025-05-07T19:50:37.3700103Z ================================================================================ 2025-05-07T19:50:37.3700339Z 2025-05-07T19:50:37.3700344Z 2025-05-07T19:50:37.3700348Z 2025-05-07T19:50:37.3700456Z ================================================================================ 2025-05-07T19:50:37.3700868Z GPU CPP Library Target: fbgemm_gpu_tbe_training_forward (SHARED) 2025-05-07T19:50:37.3701227Z 2025-05-07T19:50:37.3701428Z CPU_SRCS: 2025-05-07T19:50:37.3701674Z gen_embedding_forward_split_pt2_cpu_wrapper.cpp 2025-05-07T19:50:37.3702000Z 2025-05-07T19:50:37.3702181Z GPU_SRCS: 2025-05-07T19:50:37.3702438Z gen_embedding_forward_split_weighted_kernel.cu 2025-05-07T19:50:37.3702788Z gen_embedding_forward_dense_weighted_kernel.cu 2025-05-07T19:50:37.3703151Z gen_embedding_forward_ssd_weighted_kernel.cu 2025-05-07T19:50:37.3703536Z gen_embedding_forward_split_unweighted_nobag_kernel.cu 2025-05-07T19:50:37.3704129Z gen_embedding_forward_dense_unweighted_nobag_kernel.cu 2025-05-07T19:50:37.3704566Z gen_embedding_forward_ssd_unweighted_nobag_kernel.cu 2025-05-07T19:50:37.3704966Z gen_embedding_forward_split_unweighted_kernel.cu 2025-05-07T19:50:37.3705358Z gen_embedding_forward_dense_unweighted_kernel.cu 2025-05-07T19:50:37.3705722Z gen_embedding_forward_ssd_unweighted_kernel.cu 2025-05-07T19:50:37.3706117Z gen_embedding_forward_split_weighted_codegen_cuda.cu 2025-05-07T19:50:37.3706541Z gen_embedding_forward_split_weighted_codegen_meta.cpp 2025-05-07T19:50:37.3706955Z gen_embedding_forward_split_unweighted_codegen_cuda.cu 2025-05-07T19:50:37.3707548Z gen_embedding_forward_split_unweighted_codegen_meta.cpp 2025-05-07T19:50:37.3707952Z gen_embedding_forward_dense_weighted_codegen_cuda.cu 2025-05-07T19:50:37.3708378Z gen_embedding_forward_dense_weighted_codegen_meta.cpp 2025-05-07T19:50:37.3708789Z gen_embedding_forward_dense_unweighted_codegen_cuda.cu 2025-05-07T19:50:37.3709216Z gen_embedding_forward_dense_unweighted_codegen_meta.cpp 2025-05-07T19:50:37.3709805Z gen_embedding_forward_ssd_weighted_codegen_cuda.cu 2025-05-07T19:50:37.3710223Z gen_embedding_forward_ssd_weighted_codegen_meta.cpp 2025-05-07T19:50:37.3710649Z gen_embedding_forward_ssd_unweighted_codegen_cuda.cu 2025-05-07T19:50:37.3711075Z gen_embedding_forward_ssd_unweighted_codegen_meta.cpp 2025-05-07T19:50:37.3711653Z gen_embedding_forward_split_weighted_vbe_codegen_cuda.cu 2025-05-07T19:50:37.3712103Z gen_embedding_forward_split_weighted_vbe_codegen_meta.cpp 2025-05-07T19:50:37.3712614Z gen_embedding_forward_split_weighted_vbe_kernel.cu 2025-05-07T19:50:37.3713008Z gen_embedding_forward_split_weighted_v2_kernel.cu 2025-05-07T19:50:37.3713443Z gen_embedding_forward_split_weighted_gwd_codegen_cuda.cu 2025-05-07T19:50:37.3713918Z gen_embedding_forward_split_weighted_vbe_gwd_codegen_cuda.cu 2025-05-07T19:50:37.3714363Z gen_embedding_forward_split_weighted_gwd_kernel.cu 2025-05-07T19:50:37.3714789Z gen_embedding_forward_dense_weighted_vbe_codegen_cuda.cu 2025-05-07T19:50:37.3715292Z gen_embedding_forward_dense_weighted_vbe_kernel.cu 2025-05-07T19:50:37.3715719Z gen_embedding_forward_split_weighted_vbe_gwd_kernel.cu 2025-05-07T19:50:37.3716137Z gen_embedding_forward_ssd_weighted_vbe_codegen_cuda.cu 2025-05-07T19:50:37.3716581Z gen_embedding_forward_ssd_weighted_vbe_codegen_meta.cpp 2025-05-07T19:50:37.3716984Z gen_embedding_forward_ssd_weighted_vbe_kernel.cu 2025-05-07T19:50:37.3717466Z gen_embedding_forward_split_unweighted_vbe_codegen_cuda.cu 2025-05-07T19:50:37.3717948Z gen_embedding_forward_split_unweighted_vbe_codegen_meta.cpp 2025-05-07T19:50:37.3718398Z gen_embedding_forward_split_unweighted_vbe_kernel.cu 2025-05-07T19:50:37.3718823Z gen_embedding_forward_split_unweighted_v2_kernel.cu 2025-05-07T19:50:37.3719259Z gen_embedding_forward_split_unweighted_gwd_codegen_cuda.cu 2025-05-07T19:50:37.3719752Z gen_embedding_forward_split_unweighted_vbe_gwd_codegen_cuda.cu 2025-05-07T19:50:37.3720207Z gen_embedding_forward_split_unweighted_gwd_kernel.cu 2025-05-07T19:50:37.3720657Z gen_embedding_forward_dense_unweighted_vbe_codegen_cuda.cu 2025-05-07T19:50:37.3721125Z gen_embedding_forward_dense_unweighted_vbe_kernel.cu 2025-05-07T19:50:37.3721550Z gen_embedding_forward_split_unweighted_vbe_gwd_kernel.cu 2025-05-07T19:50:37.3722008Z gen_embedding_forward_ssd_unweighted_vbe_codegen_cuda.cu 2025-05-07T19:50:37.3722446Z gen_embedding_forward_ssd_unweighted_vbe_codegen_meta.cpp 2025-05-07T19:50:37.3722889Z gen_embedding_forward_ssd_unweighted_vbe_kernel.cu 2025-05-07T19:50:37.3723324Z gen_embedding_forward_split_unweighted_nobag_kernel_small.cu 2025-05-07T19:50:37.3723819Z gen_embedding_forward_dense_unweighted_nobag_kernel_small.cu 2025-05-07T19:50:37.3724291Z gen_embedding_forward_ssd_unweighted_nobag_kernel_small.cu 2025-05-07T19:50:37.3724731Z gen_embedding_forward_split_pt2_cuda_wrapper.cpp 2025-05-07T19:50:37.3725127Z gen_embedding_forward_ssd_pt2_cuda_wrapper.cpp 2025-05-07T19:50:37.3725443Z 2025-05-07T19:50:37.3725668Z CUDA_SPECIFIC_SRCS: 2025-05-07T19:50:37.3725820Z 2025-05-07T19:50:37.3725908Z 2025-05-07T19:50:37.3726120Z HIP_SPECIFIC_SRCS: 2025-05-07T19:50:37.3726261Z 2025-05-07T19:50:37.3726340Z 2025-05-07T19:50:37.3726545Z OTHER_SRCS: 2025-05-07T19:50:37.3726663Z 2025-05-07T19:50:37.3726743Z 2025-05-07T19:50:37.3726954Z CC_FLAGS: 2025-05-07T19:50:37.3727072Z 2025-05-07T19:50:37.3727174Z 2025-05-07T19:50:37.3727363Z NVCC_FLAGS: 2025-05-07T19:50:37.3727604Z --expt-relaxed-constexpr 2025-05-07T19:50:37.3727882Z -D__CUDA_NO_HALF_OPERATORS__ 2025-05-07T19:50:37.3728191Z -D__CUDA_NO_BFLOAT16_CONVERSIONS__ 2025-05-07T19:50:37.3728493Z -D__CUDA_NO_HALF2_OPERATORS__ 2025-05-07T19:50:37.3728774Z 2025-05-07T19:50:37.3728971Z HIPCC_FLAGS: 2025-05-07T19:50:37.3729116Z 2025-05-07T19:50:37.3729189Z 2025-05-07T19:50:37.3729406Z INCLUDE_DIRS: 2025-05-07T19:50:37.3729646Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../include 2025-05-07T19:50:37.3729975Z /__w/FBGEMM/FBGEMM/fbgemm_gpu 2025-05-07T19:50:37.3730279Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include 2025-05-07T19:50:37.3730590Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../include 2025-05-07T19:50:37.3731103Z /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include 2025-05-07T19:50:37.3731899Z /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include/torch/csrc/api/include 2025-05-07T19:50:37.3732572Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src 2025-05-07T19:50:37.3732987Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include 2025-05-07T19:50:37.3733441Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include 2025-05-07T19:50:37.3733931Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include 2025-05-07T19:50:37.3734452Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include 2025-05-07T19:50:37.3734933Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include 2025-05-07T19:50:37.3735491Z /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/nvidia/nccl/include 2025-05-07T19:50:37.3736015Z 2025-05-07T19:50:37.3736219Z Selected Source Files: 2025-05-07T19:50:37.3736618Z gen_embedding_forward_split_pt2_cpu_wrapper.cpp 2025-05-07T19:50:37.3737023Z gen_embedding_forward_split_weighted_codegen_meta.cpp 2025-05-07T19:50:37.3737457Z gen_embedding_forward_split_unweighted_codegen_meta.cpp 2025-05-07T19:50:37.3737894Z gen_embedding_forward_dense_weighted_codegen_meta.cpp 2025-05-07T19:50:37.3738316Z gen_embedding_forward_dense_unweighted_codegen_meta.cpp 2025-05-07T19:50:37.3738805Z gen_embedding_forward_ssd_weighted_codegen_meta.cpp 2025-05-07T19:50:37.3739328Z gen_embedding_forward_ssd_unweighted_codegen_meta.cpp 2025-05-07T19:50:37.3739760Z gen_embedding_forward_split_weighted_vbe_codegen_meta.cpp 2025-05-07T19:50:37.3740194Z gen_embedding_forward_ssd_weighted_vbe_codegen_meta.cpp 2025-05-07T19:50:37.3740649Z gen_embedding_forward_split_unweighted_vbe_codegen_meta.cpp 2025-05-07T19:50:37.3741113Z gen_embedding_forward_ssd_unweighted_vbe_codegen_meta.cpp 2025-05-07T19:50:37.3741697Z gen_embedding_forward_split_pt2_cuda_wrapper.cpp 2025-05-07T19:50:37.3742099Z gen_embedding_forward_ssd_pt2_cuda_wrapper.cpp 2025-05-07T19:50:37.3742470Z gen_embedding_forward_split_weighted_kernel.cu 2025-05-07T19:50:37.3742850Z gen_embedding_forward_dense_weighted_kernel.cu 2025-05-07T19:50:37.3743213Z gen_embedding_forward_ssd_weighted_kernel.cu 2025-05-07T19:50:37.3743619Z gen_embedding_forward_split_unweighted_nobag_kernel.cu 2025-05-07T19:50:37.3744052Z gen_embedding_forward_dense_unweighted_nobag_kernel.cu 2025-05-07T19:50:37.3744490Z gen_embedding_forward_ssd_unweighted_nobag_kernel.cu 2025-05-07T19:50:37.3744901Z gen_embedding_forward_split_unweighted_kernel.cu 2025-05-07T19:50:37.3745276Z gen_embedding_forward_dense_unweighted_kernel.cu 2025-05-07T19:50:37.3745666Z gen_embedding_forward_ssd_unweighted_kernel.cu 2025-05-07T19:50:37.3746051Z gen_embedding_forward_split_weighted_codegen_cuda.cu 2025-05-07T19:50:37.3746482Z gen_embedding_forward_split_unweighted_codegen_cuda.cu 2025-05-07T19:50:37.3746893Z gen_embedding_forward_dense_weighted_codegen_cuda.cu 2025-05-07T19:50:37.3747320Z gen_embedding_forward_dense_unweighted_codegen_cuda.cu 2025-05-07T19:50:37.3747719Z gen_embedding_forward_ssd_weighted_codegen_cuda.cu 2025-05-07T19:50:37.3748129Z gen_embedding_forward_ssd_unweighted_codegen_cuda.cu 2025-05-07T19:50:37.3748561Z gen_embedding_forward_split_weighted_vbe_codegen_cuda.cu 2025-05-07T19:50:37.3748972Z gen_embedding_forward_split_weighted_vbe_kernel.cu 2025-05-07T19:50:37.3749373Z gen_embedding_forward_split_weighted_v2_kernel.cu 2025-05-07T19:50:37.3749786Z gen_embedding_forward_split_weighted_gwd_codegen_cuda.cu 2025-05-07T19:50:37.3750248Z gen_embedding_forward_split_weighted_vbe_gwd_codegen_cuda.cu 2025-05-07T19:50:37.3750685Z gen_embedding_forward_split_weighted_gwd_kernel.cu 2025-05-07T19:50:37.3751116Z gen_embedding_forward_dense_weighted_vbe_codegen_cuda.cu 2025-05-07T19:50:37.3751646Z gen_embedding_forward_dense_weighted_vbe_kernel.cu 2025-05-07T19:50:37.3752055Z gen_embedding_forward_split_weighted_vbe_gwd_kernel.cu 2025-05-07T19:50:37.3752498Z gen_embedding_forward_ssd_weighted_vbe_codegen_cuda.cu 2025-05-07T19:50:37.3752894Z gen_embedding_forward_ssd_weighted_vbe_kernel.cu 2025-05-07T19:50:37.3753320Z gen_embedding_forward_split_unweighted_vbe_codegen_cuda.cu 2025-05-07T19:50:37.3753750Z gen_embedding_forward_split_unweighted_vbe_kernel.cu 2025-05-07T19:50:37.3754168Z gen_embedding_forward_split_unweighted_v2_kernel.cu 2025-05-07T19:50:37.3754589Z gen_embedding_forward_split_unweighted_gwd_codegen_cuda.cu 2025-05-07T19:50:37.3755076Z gen_embedding_forward_split_unweighted_vbe_gwd_codegen_cuda.cu 2025-05-07T19:50:37.3755542Z gen_embedding_forward_split_unweighted_gwd_kernel.cu 2025-05-07T19:50:37.3755971Z gen_embedding_forward_dense_unweighted_vbe_codegen_cuda.cu 2025-05-07T19:50:37.3756416Z gen_embedding_forward_dense_unweighted_vbe_kernel.cu 2025-05-07T19:50:37.3756835Z gen_embedding_forward_split_unweighted_vbe_gwd_kernel.cu 2025-05-07T19:50:37.3757274Z gen_embedding_forward_ssd_unweighted_vbe_codegen_cuda.cu 2025-05-07T19:50:37.3757678Z gen_embedding_forward_ssd_unweighted_vbe_kernel.cu 2025-05-07T19:50:37.3758181Z gen_embedding_forward_split_unweighted_nobag_kernel_small.cu 2025-05-07T19:50:37.3758643Z gen_embedding_forward_dense_unweighted_nobag_kernel_small.cu 2025-05-07T19:50:37.3759097Z gen_embedding_forward_ssd_unweighted_nobag_kernel_small.cu 2025-05-07T19:50:37.3759446Z 2025-05-07T19:50:37.3759650Z HIPified Source Files: 2025-05-07T19:50:37.3759801Z 2025-05-07T19:50:37.3759878Z 2025-05-07T19:50:37.3761121Z Library Dependencies: 2025-05-07T19:50:37.3761365Z torch 2025-05-07T19:50:37.3761559Z torch_library 2025-05-07T19:50:37.3761990Z /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/lib/libc10.so 2025-05-07T19:50:37.3762680Z /github/home/miniconda/envs/build_binary/targets/x86_64-linux/lib/libnvrtc.so 2025-05-07T19:50:37.3763378Z /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/lib/libc10_cuda.so 2025-05-07T19:50:37.3764177Z /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/nvidia/nccl/lib/libnccl.so.2 2025-05-07T19:50:37.3765122Z /github/home/miniconda/envs/build_binary/targets/x86_64-linux/lib/stubs/libcuda.so 2025-05-07T19:50:37.3765601Z fbgemm_gpu_tbe_common 2025-05-07T19:50:37.3765972Z /github/home/miniconda/envs/build_binary/lib/stubs/libnvidia-ml.so 2025-05-07T19:50:37.3766376Z 2025-05-07T19:50:37.3766559Z Output Library: 2025-05-07T19:50:37.3766805Z fbgemm_gpu_tbe_training_forward 2025-05-07T19:50:37.3767055Z 2025-05-07T19:50:37.3767265Z Destination Directory: 2025-05-07T19:50:37.3767494Z fbgemm_gpu 2025-05-07T19:50:37.3767917Z ================================================================================ 2025-05-07T19:50:37.3768142Z 2025-05-07T19:50:37.3768146Z 2025-05-07T19:50:37.3768151Z 2025-05-07T19:50:37.3768260Z ================================================================================ 2025-05-07T19:50:37.3768698Z GPU CPP Library Target: fbgemm_gpu_tbe_training_backward_pt2 (SHARED) 2025-05-07T19:50:37.3769091Z 2025-05-07T19:50:37.3769273Z CPU_SRCS: 2025-05-07T19:50:37.3769515Z gen_embedding_split_adagrad_pt2_autograd.cpp 2025-05-07T19:50:37.3769932Z gen_embedding_split_rowwise_adagrad_pt2_autograd.cpp 2025-05-07T19:50:37.3770508Z gen_embedding_split_sgd_pt2_autograd.cpp 2025-05-07T19:50:37.3770832Z gen_embedding_split_adam_pt2_autograd.cpp 2025-05-07T19:50:37.3771189Z gen_embedding_split_lamb_pt2_autograd.cpp 2025-05-07T19:50:37.3771526Z gen_embedding_split_lars_sgd_pt2_autograd.cpp 2025-05-07T19:50:37.3771933Z gen_embedding_split_partial_rowwise_adam_pt2_autograd.cpp 2025-05-07T19:50:37.3772373Z gen_embedding_split_partial_rowwise_lamb_pt2_autograd.cpp 2025-05-07T19:50:37.3772763Z gen_embedding_split_none_pt2_autograd.cpp 2025-05-07T19:50:37.3773173Z gen_embedding_split_rowwise_adagrad_with_counter_pt2_autograd.cpp 2025-05-07T19:50:37.3773600Z gen_embedding_split_approx_sgd_pt2_autograd.cpp 2025-05-07T19:50:37.3774023Z gen_embedding_split_approx_rowwise_adagrad_pt2_autograd.cpp 2025-05-07T19:50:37.3774515Z gen_embedding_split_approx_rowwise_adagrad_with_counter_pt2_autograd.cpp 2025-05-07T19:50:37.3775097Z gen_embedding_split_approx_rowwise_adagrad_with_weight_decay_pt2_autograd.cpp 2025-05-07T19:50:37.3775675Z gen_embedding_split_rowwise_adagrad_with_weight_decay_pt2_autograd.cpp 2025-05-07T19:50:37.3776176Z gen_embedding_split_rowwise_weighted_adagrad_pt2_autograd.cpp 2025-05-07T19:50:37.3776610Z gen_embedding_ssd_rowwise_adagrad_pt2_autograd.cpp 2025-05-07T19:50:37.3777016Z gen_embedding_backward_split_adagrad_pt2_cpu_wrapper.cpp 2025-05-07T19:50:37.3777484Z gen_embedding_backward_split_rowwise_adagrad_pt2_cpu_wrapper.cpp 2025-05-07T19:50:37.3777932Z gen_embedding_backward_split_sgd_pt2_cpu_wrapper.cpp 2025-05-07T19:50:37.3778344Z gen_embedding_backward_split_adam_pt2_cpu_wrapper.cpp 2025-05-07T19:50:37.3778748Z gen_embedding_backward_split_lamb_pt2_cpu_wrapper.cpp 2025-05-07T19:50:37.3779175Z gen_embedding_backward_split_lars_sgd_pt2_cpu_wrapper.cpp 2025-05-07T19:50:37.3779801Z gen_embedding_backward_split_partial_rowwise_adam_pt2_cpu_wrapper.cpp 2025-05-07T19:50:37.3780489Z gen_embedding_backward_split_partial_rowwise_lamb_pt2_cpu_wrapper.cpp 2025-05-07T19:50:37.3780977Z gen_embedding_backward_split_none_pt2_cpu_wrapper.cpp 2025-05-07T19:50:37.3781470Z gen_embedding_backward_split_rowwise_adagrad_with_counter_pt2_cpu_wrapper.cpp 2025-05-07T19:50:37.3782012Z gen_embedding_backward_split_approx_sgd_pt2_cpu_wrapper.cpp 2025-05-07T19:50:37.3782513Z gen_embedding_backward_split_approx_rowwise_adagrad_pt2_cpu_wrapper.cpp 2025-05-07T19:50:37.3783262Z gen_embedding_backward_split_approx_rowwise_adagrad_with_counter_pt2_cpu_wrapper.cpp 2025-05-07T19:50:37.3783939Z gen_embedding_backward_split_approx_rowwise_adagrad_with_weight_decay_pt2_cpu_wrapper.cpp 2025-05-07T19:50:37.3784597Z gen_embedding_backward_split_rowwise_adagrad_with_weight_decay_pt2_cpu_wrapper.cpp 2025-05-07T19:50:37.3785203Z gen_embedding_backward_split_rowwise_weighted_adagrad_pt2_cpu_wrapper.cpp 2025-05-07T19:50:37.3785617Z 2025-05-07T19:50:37.3785809Z GPU_SRCS: 2025-05-07T19:50:37.3786086Z gen_embedding_backward_split_adagrad_pt2_cuda_wrapper.cpp 2025-05-07T19:50:37.3786574Z gen_embedding_backward_split_rowwise_adagrad_pt2_cuda_wrapper.cpp 2025-05-07T19:50:37.3787031Z gen_embedding_backward_split_sgd_pt2_cuda_wrapper.cpp 2025-05-07T19:50:37.3787447Z gen_embedding_backward_split_adam_pt2_cuda_wrapper.cpp 2025-05-07T19:50:37.3787855Z gen_embedding_backward_split_lamb_pt2_cuda_wrapper.cpp 2025-05-07T19:50:37.3788282Z gen_embedding_backward_split_lars_sgd_pt2_cuda_wrapper.cpp 2025-05-07T19:50:37.3788782Z gen_embedding_backward_split_partial_rowwise_adam_pt2_cuda_wrapper.cpp 2025-05-07T19:50:37.3789325Z gen_embedding_backward_split_partial_rowwise_lamb_pt2_cuda_wrapper.cpp 2025-05-07T19:50:37.3789808Z gen_embedding_backward_split_none_pt2_cuda_wrapper.cpp 2025-05-07T19:50:37.3790323Z gen_embedding_backward_split_rowwise_adagrad_with_counter_pt2_cuda_wrapper.cpp 2025-05-07T19:50:37.3790860Z gen_embedding_backward_split_approx_sgd_pt2_cuda_wrapper.cpp 2025-05-07T19:50:37.3791454Z gen_embedding_backward_split_approx_rowwise_adagrad_pt2_cuda_wrapper.cpp 2025-05-07T19:50:37.3792064Z gen_embedding_backward_split_approx_rowwise_adagrad_with_counter_pt2_cuda_wrapper.cpp 2025-05-07T19:50:37.3792749Z gen_embedding_backward_split_approx_rowwise_adagrad_with_weight_decay_pt2_cuda_wrapper.cpp 2025-05-07T19:50:37.3793412Z gen_embedding_backward_split_rowwise_adagrad_with_weight_decay_pt2_cuda_wrapper.cpp 2025-05-07T19:50:37.3794030Z gen_embedding_backward_split_rowwise_weighted_adagrad_pt2_cuda_wrapper.cpp 2025-05-07T19:50:37.3794563Z gen_embedding_backward_ssd_rowwise_adagrad_pt2_cuda_wrapper.cpp 2025-05-07T19:50:37.3794927Z 2025-05-07T19:50:37.3795117Z CUDA_SPECIFIC_SRCS: 2025-05-07T19:50:37.3795257Z 2025-05-07T19:50:37.3795331Z 2025-05-07T19:50:37.3795518Z HIP_SPECIFIC_SRCS: 2025-05-07T19:50:37.3795653Z 2025-05-07T19:50:37.3795727Z 2025-05-07T19:50:37.3795916Z OTHER_SRCS: 2025-05-07T19:50:37.3796033Z 2025-05-07T19:50:37.3796103Z 2025-05-07T19:50:37.3796280Z CC_FLAGS: 2025-05-07T19:50:37.3796389Z 2025-05-07T19:50:37.3796472Z 2025-05-07T19:50:37.3796648Z NVCC_FLAGS: 2025-05-07T19:50:37.3796885Z --expt-relaxed-constexpr 2025-05-07T19:50:37.3797158Z -D__CUDA_NO_HALF_OPERATORS__ 2025-05-07T19:50:37.3797461Z -D__CUDA_NO_BFLOAT16_CONVERSIONS__ 2025-05-07T19:50:37.3797759Z -D__CUDA_NO_HALF2_OPERATORS__ 2025-05-07T19:50:37.3798026Z 2025-05-07T19:50:37.3798215Z HIPCC_FLAGS: 2025-05-07T19:50:37.3798362Z 2025-05-07T19:50:37.3798443Z 2025-05-07T19:50:37.3798634Z INCLUDE_DIRS: 2025-05-07T19:50:37.3798891Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../include 2025-05-07T19:50:37.3799208Z /__w/FBGEMM/FBGEMM/fbgemm_gpu 2025-05-07T19:50:37.3799515Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include 2025-05-07T19:50:37.3799847Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../include 2025-05-07T19:50:37.3800343Z /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include 2025-05-07T19:50:37.3801155Z /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include/torch/csrc/api/include 2025-05-07T19:50:37.3801814Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src 2025-05-07T19:50:37.3802328Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include 2025-05-07T19:50:37.3802771Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include 2025-05-07T19:50:37.3803270Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include 2025-05-07T19:50:37.3803925Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include 2025-05-07T19:50:37.3804424Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include 2025-05-07T19:50:37.3804997Z /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/nvidia/nccl/include 2025-05-07T19:50:37.3805496Z 2025-05-07T19:50:37.3805714Z Selected Source Files: 2025-05-07T19:50:37.3805995Z gen_embedding_split_adagrad_pt2_autograd.cpp 2025-05-07T19:50:37.3806396Z gen_embedding_split_rowwise_adagrad_pt2_autograd.cpp 2025-05-07T19:50:37.3806764Z gen_embedding_split_sgd_pt2_autograd.cpp 2025-05-07T19:50:37.3807115Z gen_embedding_split_adam_pt2_autograd.cpp 2025-05-07T19:50:37.3807467Z gen_embedding_split_lamb_pt2_autograd.cpp 2025-05-07T19:50:37.3807977Z gen_embedding_split_lars_sgd_pt2_autograd.cpp 2025-05-07T19:50:37.3808382Z gen_embedding_split_partial_rowwise_adam_pt2_autograd.cpp 2025-05-07T19:50:37.3808812Z gen_embedding_split_partial_rowwise_lamb_pt2_autograd.cpp 2025-05-07T19:50:37.3809207Z gen_embedding_split_none_pt2_autograd.cpp 2025-05-07T19:50:37.3809621Z gen_embedding_split_rowwise_adagrad_with_counter_pt2_autograd.cpp 2025-05-07T19:50:37.3810067Z gen_embedding_split_approx_sgd_pt2_autograd.cpp 2025-05-07T19:50:37.3810493Z gen_embedding_split_approx_rowwise_adagrad_pt2_autograd.cpp 2025-05-07T19:50:37.3810985Z gen_embedding_split_approx_rowwise_adagrad_with_counter_pt2_autograd.cpp 2025-05-07T19:50:37.3811562Z gen_embedding_split_approx_rowwise_adagrad_with_weight_decay_pt2_autograd.cpp 2025-05-07T19:50:37.3812122Z gen_embedding_split_rowwise_adagrad_with_weight_decay_pt2_autograd.cpp 2025-05-07T19:50:37.3812635Z gen_embedding_split_rowwise_weighted_adagrad_pt2_autograd.cpp 2025-05-07T19:50:37.3813063Z gen_embedding_ssd_rowwise_adagrad_pt2_autograd.cpp 2025-05-07T19:50:37.3813489Z gen_embedding_backward_split_adagrad_pt2_cpu_wrapper.cpp 2025-05-07T19:50:37.3813959Z gen_embedding_backward_split_rowwise_adagrad_pt2_cpu_wrapper.cpp 2025-05-07T19:50:37.3814407Z gen_embedding_backward_split_sgd_pt2_cpu_wrapper.cpp 2025-05-07T19:50:37.3814827Z gen_embedding_backward_split_adam_pt2_cpu_wrapper.cpp 2025-05-07T19:50:37.3815238Z gen_embedding_backward_split_lamb_pt2_cpu_wrapper.cpp 2025-05-07T19:50:37.3815669Z gen_embedding_backward_split_lars_sgd_pt2_cpu_wrapper.cpp 2025-05-07T19:50:37.3816148Z gen_embedding_backward_split_partial_rowwise_adam_pt2_cpu_wrapper.cpp 2025-05-07T19:50:37.3816690Z gen_embedding_backward_split_partial_rowwise_lamb_pt2_cpu_wrapper.cpp 2025-05-07T19:50:37.3817165Z gen_embedding_backward_split_none_pt2_cpu_wrapper.cpp 2025-05-07T19:50:37.3817668Z gen_embedding_backward_split_rowwise_adagrad_with_counter_pt2_cpu_wrapper.cpp 2025-05-07T19:50:37.3818205Z gen_embedding_backward_split_approx_sgd_pt2_cpu_wrapper.cpp 2025-05-07T19:50:37.3818705Z gen_embedding_backward_split_approx_rowwise_adagrad_pt2_cpu_wrapper.cpp 2025-05-07T19:50:37.3819314Z gen_embedding_backward_split_approx_rowwise_adagrad_with_counter_pt2_cpu_wrapper.cpp 2025-05-07T19:50:37.3819973Z gen_embedding_backward_split_approx_rowwise_adagrad_with_weight_decay_pt2_cpu_wrapper.cpp 2025-05-07T19:50:37.3820645Z gen_embedding_backward_split_rowwise_adagrad_with_weight_decay_pt2_cpu_wrapper.cpp 2025-05-07T19:50:37.3821252Z gen_embedding_backward_split_rowwise_weighted_adagrad_pt2_cpu_wrapper.cpp 2025-05-07T19:50:37.3821750Z gen_embedding_backward_split_adagrad_pt2_cuda_wrapper.cpp 2025-05-07T19:50:37.3822229Z gen_embedding_backward_split_rowwise_adagrad_pt2_cuda_wrapper.cpp 2025-05-07T19:50:37.3822680Z gen_embedding_backward_split_sgd_pt2_cuda_wrapper.cpp 2025-05-07T19:50:37.3823098Z gen_embedding_backward_split_adam_pt2_cuda_wrapper.cpp 2025-05-07T19:50:37.3823502Z gen_embedding_backward_split_lamb_pt2_cuda_wrapper.cpp 2025-05-07T19:50:37.3824011Z gen_embedding_backward_split_lars_sgd_pt2_cuda_wrapper.cpp 2025-05-07T19:50:37.3824500Z gen_embedding_backward_split_partial_rowwise_adam_pt2_cuda_wrapper.cpp 2025-05-07T19:50:37.3825056Z gen_embedding_backward_split_partial_rowwise_lamb_pt2_cuda_wrapper.cpp 2025-05-07T19:50:37.3825547Z gen_embedding_backward_split_none_pt2_cuda_wrapper.cpp 2025-05-07T19:50:37.3826085Z gen_embedding_backward_split_rowwise_adagrad_with_counter_pt2_cuda_wrapper.cpp 2025-05-07T19:50:37.3826630Z gen_embedding_backward_split_approx_sgd_pt2_cuda_wrapper.cpp 2025-05-07T19:50:37.3827134Z gen_embedding_backward_split_approx_rowwise_adagrad_pt2_cuda_wrapper.cpp 2025-05-07T19:50:37.3827743Z gen_embedding_backward_split_approx_rowwise_adagrad_with_counter_pt2_cuda_wrapper.cpp 2025-05-07T19:50:37.3828412Z gen_embedding_backward_split_approx_rowwise_adagrad_with_weight_decay_pt2_cuda_wrapper.cpp 2025-05-07T19:50:37.3829086Z gen_embedding_backward_split_rowwise_adagrad_with_weight_decay_pt2_cuda_wrapper.cpp 2025-05-07T19:50:37.3829722Z gen_embedding_backward_split_rowwise_weighted_adagrad_pt2_cuda_wrapper.cpp 2025-05-07T19:50:37.3830254Z gen_embedding_backward_ssd_rowwise_adagrad_pt2_cuda_wrapper.cpp 2025-05-07T19:50:37.3830649Z 2025-05-07T19:50:37.3830851Z HIPified Source Files: 2025-05-07T19:50:37.3831022Z 2025-05-07T19:50:37.3831102Z 2025-05-07T19:50:37.3831381Z Library Dependencies: 2025-05-07T19:50:37.3831802Z torch 2025-05-07T19:50:37.3832003Z torch_library 2025-05-07T19:50:37.3832470Z /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/lib/libc10.so 2025-05-07T19:50:37.3833172Z /github/home/miniconda/envs/build_binary/targets/x86_64-linux/lib/libnvrtc.so 2025-05-07T19:50:37.3833890Z /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/lib/libc10_cuda.so 2025-05-07T19:50:37.3834712Z /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/nvidia/nccl/lib/libnccl.so.2 2025-05-07T19:50:37.3835455Z /github/home/miniconda/envs/build_binary/targets/x86_64-linux/lib/stubs/libcuda.so 2025-05-07T19:50:37.3835940Z fbgemm 2025-05-07T19:50:37.3836140Z fbgemm_gpu_config 2025-05-07T19:50:37.3836385Z fbgemm_gpu_tbe_cache 2025-05-07T19:50:37.3836641Z fbgemm_gpu_tbe_common 2025-05-07T19:50:37.3836881Z fbgemm_gpu_tbe_utils 2025-05-07T19:50:37.3837150Z fbgemm_gpu_sparse_async_cumsum 2025-05-07T19:50:37.3837547Z /github/home/miniconda/envs/build_binary/lib/stubs/libnvidia-ml.so 2025-05-07T19:50:37.3837966Z 2025-05-07T19:50:37.3838170Z Output Library: 2025-05-07T19:50:37.3838433Z fbgemm_gpu_tbe_training_backward_pt2 2025-05-07T19:50:37.3838717Z 2025-05-07T19:50:37.3838934Z Destination Directory: 2025-05-07T19:50:37.3839171Z fbgemm_gpu 2025-05-07T19:50:37.3839412Z ================================================================================ 2025-05-07T19:50:37.3839645Z 2025-05-07T19:50:37.3839649Z 2025-05-07T19:50:37.3839653Z 2025-05-07T19:50:37.3839780Z ================================================================================ 2025-05-07T19:50:37.3840208Z GPU CPP Library Target: fbgemm_gpu_tbe_training_backward (SHARED) 2025-05-07T19:50:37.3840603Z 2025-05-07T19:50:37.3840789Z CPU_SRCS: 2025-05-07T19:50:37.3841127Z codegen/training/backward/embedding_backward_dense_host_cpu.cpp 2025-05-07T19:50:37.3841555Z gen_embedding_backward_dense_split_cpu.cpp 2025-05-07T19:50:37.3841897Z gen_embedding_backward_split_adagrad_cpu.cpp 2025-05-07T19:50:37.3842269Z gen_embedding_backward_split_rowwise_adagrad_cpu.cpp 2025-05-07T19:50:37.3842639Z gen_embedding_backward_split_sgd_cpu.cpp 2025-05-07T19:50:37.3842977Z gen_embedding_backward_split_adam_cpu.cpp 2025-05-07T19:50:37.3843300Z gen_embedding_backward_split_lamb_cpu.cpp 2025-05-07T19:50:37.3843639Z gen_embedding_backward_split_lars_sgd_cpu.cpp 2025-05-07T19:50:37.3844160Z gen_embedding_backward_split_partial_rowwise_adam_cpu.cpp 2025-05-07T19:50:37.3844770Z gen_embedding_backward_split_partial_rowwise_lamb_cpu.cpp 2025-05-07T19:50:37.3845160Z gen_embedding_backward_split_none_cpu.cpp 2025-05-07T19:50:37.3845578Z gen_embedding_backward_split_rowwise_adagrad_with_counter_cpu.cpp 2025-05-07T19:50:37.3846104Z gen_embedding_backward_split_approx_sgd_cpu.cpp 2025-05-07T19:50:37.3846520Z gen_embedding_backward_split_approx_rowwise_adagrad_cpu.cpp 2025-05-07T19:50:37.3847035Z gen_embedding_backward_split_approx_rowwise_adagrad_with_counter_cpu.cpp 2025-05-07T19:50:37.3847611Z gen_embedding_backward_split_approx_rowwise_adagrad_with_weight_decay_cpu.cpp 2025-05-07T19:50:37.3848266Z gen_embedding_backward_split_rowwise_adagrad_with_weight_decay_cpu.cpp 2025-05-07T19:50:37.3848770Z gen_embedding_backward_split_rowwise_weighted_adagrad_cpu.cpp 2025-05-07T19:50:37.3849194Z gen_embedding_backward_adagrad_split_cpu.cpp 2025-05-07T19:50:37.3849576Z gen_embedding_backward_rowwise_adagrad_split_cpu.cpp 2025-05-07T19:50:37.3849941Z gen_embedding_backward_sgd_split_cpu.cpp 2025-05-07T19:50:37.3850235Z 2025-05-07T19:50:37.3850407Z GPU_SRCS: 2025-05-07T19:50:37.3850660Z gen_embedding_backward_split_grad_embedding_ops.cu 2025-05-07T19:50:37.3851077Z gen_embedding_backward_split_indice_weights_codegen_cuda.cu 2025-05-07T19:50:37.3851531Z gen_embedding_backward_dense_indice_weights_codegen_cuda.cu 2025-05-07T19:50:37.3851966Z gen_embedding_backward_ssd_indice_weights_codegen_cuda.cu 2025-05-07T19:50:37.3852405Z gen_embedding_backward_rowwise_adagrad_ssd_weighted_cuda.cu 2025-05-07T19:50:37.3852881Z gen_embedding_backward_rowwise_adagrad_ssd_weighted_kernel_cta.cu 2025-05-07T19:50:37.3853380Z gen_embedding_backward_rowwise_adagrad_ssd_weighted_kernel_warp.cu 2025-05-07T19:50:37.3853891Z gen_embedding_backward_rowwise_adagrad_ssd_unweighted_nobag_cuda.cu 2025-05-07T19:50:37.3854427Z gen_embedding_backward_rowwise_adagrad_ssd_unweighted_nobag_kernel_cta.cu 2025-05-07T19:50:37.3854997Z gen_embedding_backward_rowwise_adagrad_ssd_unweighted_nobag_kernel_warp.cu 2025-05-07T19:50:37.3855517Z gen_embedding_backward_rowwise_adagrad_ssd_unweighted_cuda.cu 2025-05-07T19:50:37.3856016Z gen_embedding_backward_rowwise_adagrad_ssd_unweighted_kernel_cta.cu 2025-05-07T19:50:37.3856548Z gen_embedding_backward_rowwise_adagrad_ssd_unweighted_kernel_warp.cu 2025-05-07T19:50:37.3857012Z gen_embedding_backward_adagrad_split_weighted_cuda.cu 2025-05-07T19:50:37.3857439Z gen_embedding_backward_adagrad_split_weighted_kernel_cta.cu 2025-05-07T19:50:37.3857887Z gen_embedding_backward_adagrad_split_weighted_kernel_warp.cu 2025-05-07T19:50:37.3858357Z gen_embedding_backward_adagrad_split_unweighted_nobag_cuda.cu 2025-05-07T19:50:37.3858849Z gen_embedding_backward_adagrad_split_unweighted_nobag_kernel_cta.cu 2025-05-07T19:50:37.3859369Z gen_embedding_backward_adagrad_split_unweighted_nobag_kernel_warp.cu 2025-05-07T19:50:37.3859850Z gen_embedding_backward_adagrad_split_unweighted_cuda.cu 2025-05-07T19:50:37.3860288Z gen_embedding_backward_adagrad_split_unweighted_kernel_cta.cu 2025-05-07T19:50:37.3860764Z gen_embedding_backward_adagrad_split_unweighted_kernel_warp.cu 2025-05-07T19:50:37.3861340Z gen_embedding_backward_rowwise_adagrad_split_weighted_cuda.cu 2025-05-07T19:50:37.3861826Z gen_embedding_backward_rowwise_adagrad_split_weighted_kernel_cta.cu 2025-05-07T19:50:37.3862336Z gen_embedding_backward_rowwise_adagrad_split_weighted_kernel_warp.cu 2025-05-07T19:50:37.3862850Z gen_embedding_backward_rowwise_adagrad_split_unweighted_nobag_cuda.cu 2025-05-07T19:50:37.3863394Z gen_embedding_backward_rowwise_adagrad_split_unweighted_nobag_kernel_cta.cu 2025-05-07T19:50:37.3863959Z gen_embedding_backward_rowwise_adagrad_split_unweighted_nobag_kernel_warp.cu 2025-05-07T19:50:37.3864500Z gen_embedding_backward_rowwise_adagrad_split_unweighted_cuda.cu 2025-05-07T19:50:37.3865345Z gen_embedding_backward_rowwise_adagrad_split_unweighted_kernel_cta.cu 2025-05-07T19:50:37.3865892Z gen_embedding_backward_rowwise_adagrad_split_unweighted_kernel_warp.cu 2025-05-07T19:50:37.3866357Z gen_embedding_backward_sgd_split_weighted_cuda.cu 2025-05-07T19:50:37.3866770Z gen_embedding_backward_sgd_split_weighted_kernel_cta.cu 2025-05-07T19:50:37.3867201Z gen_embedding_backward_sgd_split_weighted_kernel_warp.cu 2025-05-07T19:50:37.3867726Z gen_embedding_backward_sgd_split_unweighted_nobag_cuda.cu 2025-05-07T19:50:37.3868199Z gen_embedding_backward_sgd_split_unweighted_nobag_kernel_cta.cu 2025-05-07T19:50:37.3868695Z gen_embedding_backward_sgd_split_unweighted_nobag_kernel_warp.cu 2025-05-07T19:50:37.3869151Z gen_embedding_backward_sgd_split_unweighted_cuda.cu 2025-05-07T19:50:37.3869561Z gen_embedding_backward_sgd_split_unweighted_kernel_cta.cu 2025-05-07T19:50:37.3870064Z gen_embedding_backward_sgd_split_unweighted_kernel_warp.cu 2025-05-07T19:50:37.3870482Z gen_embedding_backward_adam_split_weighted_cuda.cu 2025-05-07T19:50:37.3870895Z gen_embedding_backward_adam_split_weighted_kernel_cta.cu 2025-05-07T19:50:37.3871413Z gen_embedding_backward_adam_split_weighted_kernel_warp.cu 2025-05-07T19:50:37.3871849Z gen_embedding_backward_adam_split_unweighted_nobag_cuda.cu 2025-05-07T19:50:37.3872324Z gen_embedding_backward_adam_split_unweighted_nobag_kernel_cta.cu 2025-05-07T19:50:37.3872818Z gen_embedding_backward_adam_split_unweighted_nobag_kernel_warp.cu 2025-05-07T19:50:37.3873285Z gen_embedding_backward_adam_split_unweighted_cuda.cu 2025-05-07T19:50:37.3873706Z gen_embedding_backward_adam_split_unweighted_kernel_cta.cu 2025-05-07T19:50:37.3874166Z gen_embedding_backward_adam_split_unweighted_kernel_warp.cu 2025-05-07T19:50:37.3874588Z gen_embedding_backward_lamb_split_weighted_cuda.cu 2025-05-07T19:50:37.3874989Z gen_embedding_backward_lamb_split_weighted_kernel_cta.cu 2025-05-07T19:50:37.3875429Z gen_embedding_backward_lamb_split_weighted_kernel_warp.cu 2025-05-07T19:50:37.3875855Z gen_embedding_backward_lamb_split_unweighted_nobag_cuda.cu 2025-05-07T19:50:37.3876328Z gen_embedding_backward_lamb_split_unweighted_nobag_kernel_cta.cu 2025-05-07T19:50:37.3876813Z gen_embedding_backward_lamb_split_unweighted_nobag_kernel_warp.cu 2025-05-07T19:50:37.3877266Z gen_embedding_backward_lamb_split_unweighted_cuda.cu 2025-05-07T19:50:37.3877696Z gen_embedding_backward_lamb_split_unweighted_kernel_cta.cu 2025-05-07T19:50:37.3878148Z gen_embedding_backward_lamb_split_unweighted_kernel_warp.cu 2025-05-07T19:50:37.3878587Z gen_embedding_backward_lars_sgd_split_weighted_cuda.cu 2025-05-07T19:50:37.3879010Z gen_embedding_backward_lars_sgd_split_weighted_kernel_cta.cu 2025-05-07T19:50:37.3879477Z gen_embedding_backward_lars_sgd_split_weighted_kernel_warp.cu 2025-05-07T19:50:37.3879945Z gen_embedding_backward_lars_sgd_split_unweighted_nobag_cuda.cu 2025-05-07T19:50:37.3880459Z gen_embedding_backward_lars_sgd_split_unweighted_nobag_kernel_cta.cu 2025-05-07T19:50:37.3880995Z gen_embedding_backward_lars_sgd_split_unweighted_nobag_kernel_warp.cu 2025-05-07T19:50:37.3881480Z gen_embedding_backward_lars_sgd_split_unweighted_cuda.cu 2025-05-07T19:50:37.3881935Z gen_embedding_backward_lars_sgd_split_unweighted_kernel_cta.cu 2025-05-07T19:50:37.3882413Z gen_embedding_backward_lars_sgd_split_unweighted_kernel_warp.cu 2025-05-07T19:50:37.3882912Z gen_embedding_backward_partial_rowwise_adam_split_weighted_cuda.cu 2025-05-07T19:50:37.3883435Z gen_embedding_backward_partial_rowwise_adam_split_weighted_kernel_cta.cu 2025-05-07T19:50:37.3884000Z gen_embedding_backward_partial_rowwise_adam_split_weighted_kernel_warp.cu 2025-05-07T19:50:37.3884564Z gen_embedding_backward_partial_rowwise_adam_split_unweighted_nobag_cuda.cu 2025-05-07T19:50:37.3885182Z gen_embedding_backward_partial_rowwise_adam_split_unweighted_nobag_kernel_cta.cu 2025-05-07T19:50:37.3885821Z gen_embedding_backward_partial_rowwise_adam_split_unweighted_nobag_kernel_warp.cu 2025-05-07T19:50:37.3886400Z gen_embedding_backward_partial_rowwise_adam_split_unweighted_cuda.cu 2025-05-07T19:50:37.3886950Z gen_embedding_backward_partial_rowwise_adam_split_unweighted_kernel_cta.cu 2025-05-07T19:50:37.3887528Z gen_embedding_backward_partial_rowwise_adam_split_unweighted_kernel_warp.cu 2025-05-07T19:50:37.3888087Z gen_embedding_backward_partial_rowwise_lamb_split_weighted_cuda.cu 2025-05-07T19:50:37.3888729Z gen_embedding_backward_partial_rowwise_lamb_split_weighted_kernel_cta.cu 2025-05-07T19:50:37.3889298Z gen_embedding_backward_partial_rowwise_lamb_split_weighted_kernel_warp.cu 2025-05-07T19:50:37.3889930Z gen_embedding_backward_partial_rowwise_lamb_split_unweighted_nobag_cuda.cu 2025-05-07T19:50:37.3890519Z gen_embedding_backward_partial_rowwise_lamb_split_unweighted_nobag_kernel_cta.cu 2025-05-07T19:50:37.3891141Z gen_embedding_backward_partial_rowwise_lamb_split_unweighted_nobag_kernel_warp.cu 2025-05-07T19:50:37.3891770Z gen_embedding_backward_partial_rowwise_lamb_split_unweighted_cuda.cu 2025-05-07T19:50:37.3892324Z gen_embedding_backward_partial_rowwise_lamb_split_unweighted_kernel_cta.cu 2025-05-07T19:50:37.3892914Z gen_embedding_backward_partial_rowwise_lamb_split_unweighted_kernel_warp.cu 2025-05-07T19:50:37.3893403Z gen_embedding_backward_none_split_weighted_cuda.cu 2025-05-07T19:50:37.3893817Z gen_embedding_backward_none_split_weighted_kernel_cta.cu 2025-05-07T19:50:37.3894246Z gen_embedding_backward_none_split_weighted_kernel_warp.cu 2025-05-07T19:50:37.3894694Z gen_embedding_backward_none_split_unweighted_nobag_cuda.cu 2025-05-07T19:50:37.3895164Z gen_embedding_backward_none_split_unweighted_nobag_kernel_cta.cu 2025-05-07T19:50:37.3895664Z gen_embedding_backward_none_split_unweighted_nobag_kernel_warp.cu 2025-05-07T19:50:37.3896108Z gen_embedding_backward_none_split_unweighted_cuda.cu 2025-05-07T19:50:37.3896519Z gen_embedding_backward_none_split_unweighted_kernel_cta.cu 2025-05-07T19:50:37.3896959Z gen_embedding_backward_none_split_unweighted_kernel_warp.cu 2025-05-07T19:50:37.3897461Z gen_embedding_backward_rowwise_adagrad_with_counter_split_weighted_cuda.cu 2025-05-07T19:50:37.3898044Z gen_embedding_backward_rowwise_adagrad_with_counter_split_weighted_kernel_cta.cu 2025-05-07T19:50:37.3898652Z gen_embedding_backward_rowwise_adagrad_with_counter_split_weighted_kernel_warp.cu 2025-05-07T19:50:37.3899295Z gen_embedding_backward_rowwise_adagrad_with_counter_split_unweighted_nobag_cuda.cu 2025-05-07T19:50:37.3899935Z gen_embedding_backward_rowwise_adagrad_with_counter_split_unweighted_nobag_kernel_cta.cu 2025-05-07T19:50:37.3900603Z gen_embedding_backward_rowwise_adagrad_with_counter_split_unweighted_nobag_kernel_warp.cu 2025-05-07T19:50:37.3901231Z gen_embedding_backward_rowwise_adagrad_with_counter_split_unweighted_cuda.cu 2025-05-07T19:50:37.3901819Z gen_embedding_backward_rowwise_adagrad_with_counter_split_unweighted_kernel_cta.cu 2025-05-07T19:50:37.3902464Z gen_embedding_backward_rowwise_adagrad_with_counter_split_unweighted_kernel_warp.cu 2025-05-07T19:50:37.3903069Z 2025-05-07T19:50:37.3903274Z CUDA_SPECIFIC_SRCS: 2025-05-07T19:50:37.3903419Z 2025-05-07T19:50:37.3903511Z 2025-05-07T19:50:37.3903698Z HIP_SPECIFIC_SRCS: 2025-05-07T19:50:37.3904046Z gen_embedding_backward_split_unweighted_nobag_device_kernel_hip.hip 2025-05-07T19:50:37.3904533Z gen_embedding_backward_split_weighted_device_kernel_hip.hip 2025-05-07T19:50:37.3904990Z gen_embedding_backward_split_unweighted_device_kernel_hip.hip 2025-05-07T19:50:37.3905355Z 2025-05-07T19:50:37.3905543Z OTHER_SRCS: 2025-05-07T19:50:37.3905666Z 2025-05-07T19:50:37.3905752Z 2025-05-07T19:50:37.3905938Z CC_FLAGS: 2025-05-07T19:50:37.3906046Z 2025-05-07T19:50:37.3906130Z 2025-05-07T19:50:37.3906310Z NVCC_FLAGS: 2025-05-07T19:50:37.3906535Z --expt-relaxed-constexpr 2025-05-07T19:50:37.3906802Z -D__CUDA_NO_HALF_OPERATORS__ 2025-05-07T19:50:37.3907079Z -D__CUDA_NO_BFLOAT16_CONVERSIONS__ 2025-05-07T19:50:37.3907363Z -D__CUDA_NO_HALF2_OPERATORS__ 2025-05-07T19:50:37.3907608Z 2025-05-07T19:50:37.3907786Z HIPCC_FLAGS: 2025-05-07T19:50:37.3907911Z 2025-05-07T19:50:37.3907981Z 2025-05-07T19:50:37.3908148Z INCLUDE_DIRS: 2025-05-07T19:50:37.3908371Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../include 2025-05-07T19:50:37.3908665Z /__w/FBGEMM/FBGEMM/fbgemm_gpu 2025-05-07T19:50:37.3908939Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include 2025-05-07T19:50:37.3909237Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../include 2025-05-07T19:50:37.3909704Z /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include 2025-05-07T19:50:37.3910470Z /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include/torch/csrc/api/include 2025-05-07T19:50:37.3911149Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src 2025-05-07T19:50:37.3911802Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include 2025-05-07T19:50:37.3912223Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include 2025-05-07T19:50:37.3912696Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include 2025-05-07T19:50:37.3913268Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include 2025-05-07T19:50:37.3913724Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include 2025-05-07T19:50:37.3914281Z /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/nvidia/nccl/include 2025-05-07T19:50:37.3914774Z 2025-05-07T19:50:37.3914975Z Selected Source Files: 2025-05-07T19:50:37.3915326Z codegen/training/backward/embedding_backward_dense_host_cpu.cpp 2025-05-07T19:50:37.3915764Z gen_embedding_backward_dense_split_cpu.cpp 2025-05-07T19:50:37.3916103Z gen_embedding_backward_split_adagrad_cpu.cpp 2025-05-07T19:50:37.3916485Z gen_embedding_backward_split_rowwise_adagrad_cpu.cpp 2025-05-07T19:50:37.3916857Z gen_embedding_backward_split_sgd_cpu.cpp 2025-05-07T19:50:37.3917182Z gen_embedding_backward_split_adam_cpu.cpp 2025-05-07T19:50:37.3917514Z gen_embedding_backward_split_lamb_cpu.cpp 2025-05-07T19:50:37.3917855Z gen_embedding_backward_split_lars_sgd_cpu.cpp 2025-05-07T19:50:37.3918257Z gen_embedding_backward_split_partial_rowwise_adam_cpu.cpp 2025-05-07T19:50:37.3918686Z gen_embedding_backward_split_partial_rowwise_lamb_cpu.cpp 2025-05-07T19:50:37.3919071Z gen_embedding_backward_split_none_cpu.cpp 2025-05-07T19:50:37.3919479Z gen_embedding_backward_split_rowwise_adagrad_with_counter_cpu.cpp 2025-05-07T19:50:37.3919915Z gen_embedding_backward_split_approx_sgd_cpu.cpp 2025-05-07T19:50:37.3920328Z gen_embedding_backward_split_approx_rowwise_adagrad_cpu.cpp 2025-05-07T19:50:37.3920831Z gen_embedding_backward_split_approx_rowwise_adagrad_with_counter_cpu.cpp 2025-05-07T19:50:37.3921415Z gen_embedding_backward_split_approx_rowwise_adagrad_with_weight_decay_cpu.cpp 2025-05-07T19:50:37.3921986Z gen_embedding_backward_split_rowwise_adagrad_with_weight_decay_cpu.cpp 2025-05-07T19:50:37.3922496Z gen_embedding_backward_split_rowwise_weighted_adagrad_cpu.cpp 2025-05-07T19:50:37.3922907Z gen_embedding_backward_adagrad_split_cpu.cpp 2025-05-07T19:50:37.3923287Z gen_embedding_backward_rowwise_adagrad_split_cpu.cpp 2025-05-07T19:50:37.3923658Z gen_embedding_backward_sgd_split_cpu.cpp 2025-05-07T19:50:37.3924106Z gen_embedding_backward_split_grad_embedding_ops.cu 2025-05-07T19:50:37.3924495Z gen_embedding_backward_split_indice_weights_codegen_cuda.cu 2025-05-07T19:50:37.3924902Z gen_embedding_backward_dense_indice_weights_codegen_cuda.cu 2025-05-07T19:50:37.3925315Z gen_embedding_backward_ssd_indice_weights_codegen_cuda.cu 2025-05-07T19:50:37.3925711Z gen_embedding_backward_rowwise_adagrad_ssd_weighted_cuda.cu 2025-05-07T19:50:37.3926153Z gen_embedding_backward_rowwise_adagrad_ssd_weighted_kernel_cta.cu 2025-05-07T19:50:37.3926627Z gen_embedding_backward_rowwise_adagrad_ssd_weighted_kernel_warp.cu 2025-05-07T19:50:37.3927094Z gen_embedding_backward_rowwise_adagrad_ssd_unweighted_nobag_cuda.cu 2025-05-07T19:50:37.3927594Z gen_embedding_backward_rowwise_adagrad_ssd_unweighted_nobag_kernel_cta.cu 2025-05-07T19:50:37.3928117Z gen_embedding_backward_rowwise_adagrad_ssd_unweighted_nobag_kernel_warp.cu 2025-05-07T19:50:37.3928616Z gen_embedding_backward_rowwise_adagrad_ssd_unweighted_cuda.cu 2025-05-07T19:50:37.3929069Z gen_embedding_backward_rowwise_adagrad_ssd_unweighted_kernel_cta.cu 2025-05-07T19:50:37.3929555Z gen_embedding_backward_rowwise_adagrad_ssd_unweighted_kernel_warp.cu 2025-05-07T19:50:37.3930008Z gen_embedding_backward_adagrad_split_weighted_cuda.cu 2025-05-07T19:50:37.3930404Z gen_embedding_backward_adagrad_split_weighted_kernel_cta.cu 2025-05-07T19:50:37.3930840Z gen_embedding_backward_adagrad_split_weighted_kernel_warp.cu 2025-05-07T19:50:37.3931272Z gen_embedding_backward_adagrad_split_unweighted_nobag_cuda.cu 2025-05-07T19:50:37.3931813Z gen_embedding_backward_adagrad_split_unweighted_nobag_kernel_cta.cu 2025-05-07T19:50:37.3932295Z gen_embedding_backward_adagrad_split_unweighted_nobag_kernel_warp.cu 2025-05-07T19:50:37.3932751Z gen_embedding_backward_adagrad_split_unweighted_cuda.cu 2025-05-07T19:50:37.3933173Z gen_embedding_backward_adagrad_split_unweighted_kernel_cta.cu 2025-05-07T19:50:37.3933660Z gen_embedding_backward_adagrad_split_unweighted_kernel_warp.cu 2025-05-07T19:50:37.3934117Z gen_embedding_backward_rowwise_adagrad_split_weighted_cuda.cu 2025-05-07T19:50:37.3934576Z gen_embedding_backward_rowwise_adagrad_split_weighted_kernel_cta.cu 2025-05-07T19:50:37.3935067Z gen_embedding_backward_rowwise_adagrad_split_weighted_kernel_warp.cu 2025-05-07T19:50:37.3935553Z gen_embedding_backward_rowwise_adagrad_split_unweighted_nobag_cuda.cu 2025-05-07T19:50:37.3936068Z gen_embedding_backward_rowwise_adagrad_split_unweighted_nobag_kernel_cta.cu 2025-05-07T19:50:37.3936613Z gen_embedding_backward_rowwise_adagrad_split_unweighted_nobag_kernel_warp.cu 2025-05-07T19:50:37.3937128Z gen_embedding_backward_rowwise_adagrad_split_unweighted_cuda.cu 2025-05-07T19:50:37.3937611Z gen_embedding_backward_rowwise_adagrad_split_unweighted_kernel_cta.cu 2025-05-07T19:50:37.3938110Z gen_embedding_backward_rowwise_adagrad_split_unweighted_kernel_warp.cu 2025-05-07T19:50:37.3938547Z gen_embedding_backward_sgd_split_weighted_cuda.cu 2025-05-07T19:50:37.3938921Z gen_embedding_backward_sgd_split_weighted_kernel_cta.cu 2025-05-07T19:50:37.3939316Z gen_embedding_backward_sgd_split_weighted_kernel_warp.cu 2025-05-07T19:50:37.3939713Z gen_embedding_backward_sgd_split_unweighted_nobag_cuda.cu 2025-05-07T19:50:37.3940141Z gen_embedding_backward_sgd_split_unweighted_nobag_kernel_cta.cu 2025-05-07T19:50:37.3940599Z gen_embedding_backward_sgd_split_unweighted_nobag_kernel_warp.cu 2025-05-07T19:50:37.3941009Z gen_embedding_backward_sgd_split_unweighted_cuda.cu 2025-05-07T19:50:37.3941398Z gen_embedding_backward_sgd_split_unweighted_kernel_cta.cu 2025-05-07T19:50:37.3941801Z gen_embedding_backward_sgd_split_unweighted_kernel_warp.cu 2025-05-07T19:50:37.3942187Z gen_embedding_backward_adam_split_weighted_cuda.cu 2025-05-07T19:50:37.3942558Z gen_embedding_backward_adam_split_weighted_kernel_cta.cu 2025-05-07T19:50:37.3942948Z gen_embedding_backward_adam_split_weighted_kernel_warp.cu 2025-05-07T19:50:37.3943353Z gen_embedding_backward_adam_split_unweighted_nobag_cuda.cu 2025-05-07T19:50:37.3943784Z gen_embedding_backward_adam_split_unweighted_nobag_kernel_cta.cu 2025-05-07T19:50:37.3944254Z gen_embedding_backward_adam_split_unweighted_nobag_kernel_warp.cu 2025-05-07T19:50:37.3944671Z gen_embedding_backward_adam_split_unweighted_cuda.cu 2025-05-07T19:50:37.3945065Z gen_embedding_backward_adam_split_unweighted_kernel_cta.cu 2025-05-07T19:50:37.3945491Z gen_embedding_backward_adam_split_unweighted_kernel_warp.cu 2025-05-07T19:50:37.3945903Z gen_embedding_backward_lamb_split_weighted_cuda.cu 2025-05-07T19:50:37.3946301Z gen_embedding_backward_lamb_split_weighted_kernel_cta.cu 2025-05-07T19:50:37.3946710Z gen_embedding_backward_lamb_split_weighted_kernel_warp.cu 2025-05-07T19:50:37.3947147Z gen_embedding_backward_lamb_split_unweighted_nobag_cuda.cu 2025-05-07T19:50:37.3947599Z gen_embedding_backward_lamb_split_unweighted_nobag_kernel_cta.cu 2025-05-07T19:50:37.3948083Z gen_embedding_backward_lamb_split_unweighted_nobag_kernel_warp.cu 2025-05-07T19:50:37.3948512Z gen_embedding_backward_lamb_split_unweighted_cuda.cu 2025-05-07T19:50:37.3948907Z gen_embedding_backward_lamb_split_unweighted_kernel_cta.cu 2025-05-07T19:50:37.3949324Z gen_embedding_backward_lamb_split_unweighted_kernel_warp.cu 2025-05-07T19:50:37.3949732Z gen_embedding_backward_lars_sgd_split_weighted_cuda.cu 2025-05-07T19:50:37.3950140Z gen_embedding_backward_lars_sgd_split_weighted_kernel_cta.cu 2025-05-07T19:50:37.3950567Z gen_embedding_backward_lars_sgd_split_weighted_kernel_warp.cu 2025-05-07T19:50:37.3951014Z gen_embedding_backward_lars_sgd_split_unweighted_nobag_cuda.cu 2025-05-07T19:50:37.3951622Z gen_embedding_backward_lars_sgd_split_unweighted_nobag_kernel_cta.cu 2025-05-07T19:50:37.3952389Z gen_embedding_backward_lars_sgd_split_unweighted_nobag_kernel_warp.cu 2025-05-07T19:50:37.3952880Z gen_embedding_backward_lars_sgd_split_unweighted_cuda.cu 2025-05-07T19:50:37.3953348Z gen_embedding_backward_lars_sgd_split_unweighted_kernel_cta.cu 2025-05-07T19:50:37.3953850Z gen_embedding_backward_lars_sgd_split_unweighted_kernel_warp.cu 2025-05-07T19:50:37.3954400Z gen_embedding_backward_partial_rowwise_adam_split_weighted_cuda.cu 2025-05-07T19:50:37.3954950Z gen_embedding_backward_partial_rowwise_adam_split_weighted_kernel_cta.cu 2025-05-07T19:50:37.3955513Z gen_embedding_backward_partial_rowwise_adam_split_weighted_kernel_warp.cu 2025-05-07T19:50:37.3956095Z gen_embedding_backward_partial_rowwise_adam_split_unweighted_nobag_cuda.cu 2025-05-07T19:50:37.3956688Z gen_embedding_backward_partial_rowwise_adam_split_unweighted_nobag_kernel_cta.cu 2025-05-07T19:50:37.3957319Z gen_embedding_backward_partial_rowwise_adam_split_unweighted_nobag_kernel_warp.cu 2025-05-07T19:50:37.3957913Z gen_embedding_backward_partial_rowwise_adam_split_unweighted_cuda.cu 2025-05-07T19:50:37.3958452Z gen_embedding_backward_partial_rowwise_adam_split_unweighted_kernel_cta.cu 2025-05-07T19:50:37.3959039Z gen_embedding_backward_partial_rowwise_adam_split_unweighted_kernel_warp.cu 2025-05-07T19:50:37.3959237Z gen_embedding_backward_partial_rowwise_lamb_split_weighted_cuda.cu 2025-05-07T19:50:37.3959469Z gen_embedding_backward_partial_rowwise_lamb_split_weighted_kernel_cta.cu 2025-05-07T19:50:37.3959708Z gen_embedding_backward_partial_rowwise_lamb_split_weighted_kernel_warp.cu 2025-05-07T19:50:37.3959935Z gen_embedding_backward_partial_rowwise_lamb_split_unweighted_nobag_cuda.cu 2025-05-07T19:50:37.3960188Z gen_embedding_backward_partial_rowwise_lamb_split_unweighted_nobag_kernel_cta.cu 2025-05-07T19:50:37.3960457Z gen_embedding_backward_partial_rowwise_lamb_split_unweighted_nobag_kernel_warp.cu 2025-05-07T19:50:37.3960663Z gen_embedding_backward_partial_rowwise_lamb_split_unweighted_cuda.cu 2025-05-07T19:50:37.3960891Z gen_embedding_backward_partial_rowwise_lamb_split_unweighted_kernel_cta.cu 2025-05-07T19:50:37.3961133Z gen_embedding_backward_partial_rowwise_lamb_split_unweighted_kernel_warp.cu 2025-05-07T19:50:37.3961274Z gen_embedding_backward_none_split_weighted_cuda.cu 2025-05-07T19:50:37.3961435Z gen_embedding_backward_none_split_weighted_kernel_cta.cu 2025-05-07T19:50:37.3961601Z gen_embedding_backward_none_split_weighted_kernel_warp.cu 2025-05-07T19:50:37.3961776Z gen_embedding_backward_none_split_unweighted_nobag_cuda.cu 2025-05-07T19:50:37.3961970Z gen_embedding_backward_none_split_unweighted_nobag_kernel_cta.cu 2025-05-07T19:50:37.3962164Z gen_embedding_backward_none_split_unweighted_nobag_kernel_warp.cu 2025-05-07T19:50:37.3962324Z gen_embedding_backward_none_split_unweighted_cuda.cu 2025-05-07T19:50:37.3962491Z gen_embedding_backward_none_split_unweighted_kernel_cta.cu 2025-05-07T19:50:37.3962661Z gen_embedding_backward_none_split_unweighted_kernel_warp.cu 2025-05-07T19:50:37.3962901Z gen_embedding_backward_rowwise_adagrad_with_counter_split_weighted_cuda.cu 2025-05-07T19:50:37.3963154Z gen_embedding_backward_rowwise_adagrad_with_counter_split_weighted_kernel_cta.cu 2025-05-07T19:50:37.3963406Z gen_embedding_backward_rowwise_adagrad_with_counter_split_weighted_kernel_warp.cu 2025-05-07T19:50:37.3963666Z gen_embedding_backward_rowwise_adagrad_with_counter_split_unweighted_nobag_cuda.cu 2025-05-07T19:50:37.3963959Z gen_embedding_backward_rowwise_adagrad_with_counter_split_unweighted_nobag_kernel_cta.cu 2025-05-07T19:50:37.3964241Z gen_embedding_backward_rowwise_adagrad_with_counter_split_unweighted_nobag_kernel_warp.cu 2025-05-07T19:50:37.3964477Z gen_embedding_backward_rowwise_adagrad_with_counter_split_unweighted_cuda.cu 2025-05-07T19:50:37.3964896Z gen_embedding_backward_rowwise_adagrad_with_counter_split_unweighted_kernel_cta.cu 2025-05-07T19:50:37.3965158Z gen_embedding_backward_rowwise_adagrad_with_counter_split_unweighted_kernel_warp.cu 2025-05-07T19:50:37.3965412Z 2025-05-07T19:50:37.3965521Z HIPified Source Files: 2025-05-07T19:50:37.3965527Z 2025-05-07T19:50:37.3965602Z 2025-05-07T19:50:37.3965695Z Library Dependencies: 2025-05-07T19:50:37.3965784Z torch 2025-05-07T19:50:37.3965867Z torch_library 2025-05-07T19:50:37.3966180Z /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/lib/libc10.so 2025-05-07T19:50:37.3966496Z /github/home/miniconda/envs/build_binary/targets/x86_64-linux/lib/libnvrtc.so 2025-05-07T19:50:37.3966842Z /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/lib/libc10_cuda.so 2025-05-07T19:50:37.3967193Z /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/nvidia/nccl/lib/libnccl.so.2 2025-05-07T19:50:37.3967464Z /github/home/miniconda/envs/build_binary/targets/x86_64-linux/lib/stubs/libcuda.so 2025-05-07T19:50:37.3967549Z fbgemm 2025-05-07T19:50:37.3967639Z fbgemm_gpu_config 2025-05-07T19:50:37.3967728Z fbgemm_gpu_tbe_cache 2025-05-07T19:50:37.3967822Z fbgemm_gpu_tbe_common 2025-05-07T19:50:37.3967922Z fbgemm_gpu_tbe_utils 2025-05-07T19:50:37.3968029Z fbgemm_gpu_sparse_async_cumsum 2025-05-07T19:50:37.3968239Z /github/home/miniconda/envs/build_binary/lib/stubs/libnvidia-ml.so 2025-05-07T19:50:37.3968331Z 2025-05-07T19:50:37.3968418Z Output Library: 2025-05-07T19:50:37.3968526Z fbgemm_gpu_tbe_training_backward 2025-05-07T19:50:37.3968601Z 2025-05-07T19:50:37.3968707Z Destination Directory: 2025-05-07T19:50:37.3968793Z fbgemm_gpu 2025-05-07T19:50:37.3968904Z ================================================================================ 2025-05-07T19:50:37.3968909Z 2025-05-07T19:50:37.3968913Z 2025-05-07T19:50:37.3968918Z 2025-05-07T19:50:37.3969039Z ================================================================================ 2025-05-07T19:50:37.3969247Z GPU CPP Library Target: fbgemm_gpu_tbe_training_backward_gwd (SHARED) 2025-05-07T19:50:37.3969321Z 2025-05-07T19:50:37.3969410Z CPU_SRCS: 2025-05-07T19:50:37.3969415Z 2025-05-07T19:50:37.3969495Z 2025-05-07T19:50:37.3969577Z GPU_SRCS: 2025-05-07T19:50:37.3969781Z gen_embedding_backward_rowwise_adagrad_split_weighted_gwd_cuda.cu 2025-05-07T19:50:37.3970007Z gen_embedding_backward_rowwise_adagrad_split_weighted_gwd_kernel_cta.cu 2025-05-07T19:50:37.3970231Z gen_embedding_backward_rowwise_adagrad_split_weighted_gwd_kernel_warp.cu 2025-05-07T19:50:37.3970434Z gen_embedding_backward_rowwise_adagrad_split_unweighted_gwd_cuda.cu 2025-05-07T19:50:37.3970674Z gen_embedding_backward_rowwise_adagrad_split_unweighted_gwd_kernel_cta.cu 2025-05-07T19:50:37.3970899Z gen_embedding_backward_rowwise_adagrad_split_unweighted_gwd_kernel_warp.cu 2025-05-07T19:50:37.3971107Z gen_embedding_backward_rowwise_adagrad_split_weighted_vbe_gwd_cuda.cu 2025-05-07T19:50:37.3971345Z gen_embedding_backward_rowwise_adagrad_split_weighted_vbe_gwd_kernel_cta.cu 2025-05-07T19:50:37.3971576Z gen_embedding_backward_rowwise_adagrad_split_weighted_vbe_gwd_kernel_warp.cu 2025-05-07T19:50:37.3971796Z gen_embedding_backward_rowwise_adagrad_split_unweighted_vbe_gwd_cuda.cu 2025-05-07T19:50:37.3972054Z gen_embedding_backward_rowwise_adagrad_split_unweighted_vbe_gwd_kernel_cta.cu 2025-05-07T19:50:37.3972298Z gen_embedding_backward_rowwise_adagrad_split_unweighted_vbe_gwd_kernel_warp.cu 2025-05-07T19:50:37.3972375Z 2025-05-07T19:50:37.3972469Z CUDA_SPECIFIC_SRCS: 2025-05-07T19:50:37.3972486Z 2025-05-07T19:50:37.3972555Z 2025-05-07T19:50:37.3972640Z HIP_SPECIFIC_SRCS: 2025-05-07T19:50:37.3972648Z 2025-05-07T19:50:37.3972720Z 2025-05-07T19:50:37.3972813Z OTHER_SRCS: 2025-05-07T19:50:37.3972817Z 2025-05-07T19:50:37.3972894Z 2025-05-07T19:50:37.3972975Z CC_FLAGS: 2025-05-07T19:50:37.3972979Z 2025-05-07T19:50:37.3973062Z 2025-05-07T19:50:37.3973142Z NVCC_FLAGS: 2025-05-07T19:50:37.3973243Z --expt-relaxed-constexpr 2025-05-07T19:50:37.3973349Z -D__CUDA_NO_HALF_OPERATORS__ 2025-05-07T19:50:37.3973463Z -D__CUDA_NO_BFLOAT16_CONVERSIONS__ 2025-05-07T19:50:37.3973560Z -D__CUDA_NO_HALF2_OPERATORS__ 2025-05-07T19:50:37.3973634Z 2025-05-07T19:50:37.3973773Z HIPCC_FLAGS: 2025-05-07T19:50:37.3973778Z 2025-05-07T19:50:37.3973852Z 2025-05-07T19:50:37.3973935Z INCLUDE_DIRS: 2025-05-07T19:50:37.3974048Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../include 2025-05-07T19:50:37.3974152Z /__w/FBGEMM/FBGEMM/fbgemm_gpu 2025-05-07T19:50:37.3974256Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include 2025-05-07T19:50:37.3974362Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../include 2025-05-07T19:50:37.3974698Z /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include 2025-05-07T19:50:37.3975093Z /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include/torch/csrc/api/include 2025-05-07T19:50:37.3975239Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src 2025-05-07T19:50:37.3975416Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include 2025-05-07T19:50:37.3975576Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include 2025-05-07T19:50:37.3975779Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include 2025-05-07T19:50:37.3975987Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include 2025-05-07T19:50:37.3976149Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include 2025-05-07T19:50:37.3976569Z /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/nvidia/nccl/include 2025-05-07T19:50:37.3976648Z 2025-05-07T19:50:37.3976756Z Selected Source Files: 2025-05-07T19:50:37.3977061Z gen_embedding_backward_rowwise_adagrad_split_weighted_gwd_cuda.cu 2025-05-07T19:50:37.3977265Z gen_embedding_backward_rowwise_adagrad_split_weighted_gwd_kernel_cta.cu 2025-05-07T19:50:37.3977470Z gen_embedding_backward_rowwise_adagrad_split_weighted_gwd_kernel_warp.cu 2025-05-07T19:50:37.3977672Z gen_embedding_backward_rowwise_adagrad_split_unweighted_gwd_cuda.cu 2025-05-07T19:50:37.3977883Z gen_embedding_backward_rowwise_adagrad_split_unweighted_gwd_kernel_cta.cu 2025-05-07T19:50:37.3978092Z gen_embedding_backward_rowwise_adagrad_split_unweighted_gwd_kernel_warp.cu 2025-05-07T19:50:37.3978302Z gen_embedding_backward_rowwise_adagrad_split_weighted_vbe_gwd_cuda.cu 2025-05-07T19:50:37.3978518Z gen_embedding_backward_rowwise_adagrad_split_weighted_vbe_gwd_kernel_cta.cu 2025-05-07T19:50:37.3978732Z gen_embedding_backward_rowwise_adagrad_split_weighted_vbe_gwd_kernel_warp.cu 2025-05-07T19:50:37.3978946Z gen_embedding_backward_rowwise_adagrad_split_unweighted_vbe_gwd_cuda.cu 2025-05-07T19:50:37.3979169Z gen_embedding_backward_rowwise_adagrad_split_unweighted_vbe_gwd_kernel_cta.cu 2025-05-07T19:50:37.3979393Z gen_embedding_backward_rowwise_adagrad_split_unweighted_vbe_gwd_kernel_warp.cu 2025-05-07T19:50:37.3979477Z 2025-05-07T19:50:37.3979562Z HIPified Source Files: 2025-05-07T19:50:37.3979567Z 2025-05-07T19:50:37.3979636Z 2025-05-07T19:50:37.3979722Z Library Dependencies: 2025-05-07T19:50:37.3979799Z torch 2025-05-07T19:50:37.3979871Z torch_library 2025-05-07T19:50:37.3980157Z /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/lib/libc10.so 2025-05-07T19:50:37.3980411Z /github/home/miniconda/envs/build_binary/targets/x86_64-linux/lib/libnvrtc.so 2025-05-07T19:50:37.3980714Z /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/lib/libc10_cuda.so 2025-05-07T19:50:37.3981036Z /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/nvidia/nccl/lib/libnccl.so.2 2025-05-07T19:50:37.3981302Z /github/home/miniconda/envs/build_binary/targets/x86_64-linux/lib/stubs/libcuda.so 2025-05-07T19:50:37.3981399Z fbgemm_gpu_tbe_training_backward 2025-05-07T19:50:37.3981594Z /github/home/miniconda/envs/build_binary/lib/stubs/libnvidia-ml.so 2025-05-07T19:50:37.3981660Z 2025-05-07T19:50:37.3981750Z Output Library: 2025-05-07T19:50:37.3981848Z fbgemm_gpu_tbe_training_backward_gwd 2025-05-07T19:50:37.3981916Z 2025-05-07T19:50:37.3982013Z Destination Directory: 2025-05-07T19:50:37.3982083Z fbgemm_gpu 2025-05-07T19:50:37.3982182Z ================================================================================ 2025-05-07T19:50:37.3982187Z 2025-05-07T19:50:37.3982191Z 2025-05-07T19:50:37.3982250Z 2025-05-07T19:50:37.3982361Z ================================================================================ 2025-05-07T19:50:37.3982547Z GPU CPP Library Target: fbgemm_gpu_tbe_training_backward_vbe (SHARED) 2025-05-07T19:50:37.3982615Z 2025-05-07T19:50:37.3982689Z CPU_SRCS: 2025-05-07T19:50:37.3982693Z 2025-05-07T19:50:37.3982778Z 2025-05-07T19:50:37.3982847Z GPU_SRCS: 2025-05-07T19:50:37.3983666Z gen_embedding_backward_rowwise_adagrad_split_weighted_vbe_meta.cpp 2025-05-07T19:50:37.3983866Z gen_embedding_backward_rowwise_adagrad_ssd_weighted_vbe_meta.cpp 2025-05-07T19:50:37.3984056Z gen_embedding_backward_rowwise_adagrad_split_unweighted_vbe_meta.cpp 2025-05-07T19:50:37.3984238Z gen_embedding_backward_rowwise_adagrad_ssd_unweighted_vbe_meta.cpp 2025-05-07T19:50:37.3984479Z gen_embedding_backward_rowwise_adagrad_with_counter_split_weighted_vbe_meta.cpp 2025-05-07T19:50:37.3984718Z gen_embedding_backward_rowwise_adagrad_with_counter_split_unweighted_vbe_meta.cpp 2025-05-07T19:50:37.3984865Z gen_embedding_backward_sgd_split_weighted_vbe_meta.cpp 2025-05-07T19:50:37.3985016Z gen_embedding_backward_sgd_split_unweighted_vbe_meta.cpp 2025-05-07T19:50:37.3985178Z gen_embedding_backward_dense_split_weighted_vbe_meta.cpp 2025-05-07T19:50:37.3985335Z gen_embedding_backward_dense_split_unweighted_vbe_meta.cpp 2025-05-07T19:50:37.3985478Z gen_embedding_backward_adam_split_weighted_vbe_meta.cpp 2025-05-07T19:50:37.3985645Z gen_embedding_backward_adam_split_unweighted_vbe_meta.cpp 2025-05-07T19:50:37.3985824Z gen_embedding_backward_rowwise_adagrad_split_weighted_vbe_cuda.cu 2025-05-07T19:50:37.3986026Z gen_embedding_backward_rowwise_adagrad_split_weighted_vbe_kernel_cta.cu 2025-05-07T19:50:37.3986242Z gen_embedding_backward_rowwise_adagrad_split_weighted_vbe_kernel_warp.cu 2025-05-07T19:50:37.3986420Z gen_embedding_backward_rowwise_adagrad_ssd_weighted_vbe_cuda.cu 2025-05-07T19:50:37.3986614Z gen_embedding_backward_rowwise_adagrad_ssd_weighted_vbe_kernel_cta.cu 2025-05-07T19:50:37.3986811Z gen_embedding_backward_rowwise_adagrad_ssd_weighted_vbe_kernel_warp.cu 2025-05-07T19:50:37.3987012Z gen_embedding_backward_rowwise_adagrad_split_unweighted_vbe_cuda.cu 2025-05-07T19:50:37.3987218Z gen_embedding_backward_rowwise_adagrad_split_unweighted_vbe_kernel_cta.cu 2025-05-07T19:50:37.3987427Z gen_embedding_backward_rowwise_adagrad_split_unweighted_vbe_kernel_warp.cu 2025-05-07T19:50:37.3987619Z gen_embedding_backward_rowwise_adagrad_ssd_unweighted_vbe_cuda.cu 2025-05-07T19:50:37.3987825Z gen_embedding_backward_rowwise_adagrad_ssd_unweighted_vbe_kernel_cta.cu 2025-05-07T19:50:37.3988028Z gen_embedding_backward_rowwise_adagrad_ssd_unweighted_vbe_kernel_warp.cu 2025-05-07T19:50:37.3988261Z gen_embedding_backward_rowwise_adagrad_with_counter_split_weighted_vbe_cuda.cu 2025-05-07T19:50:37.3988506Z gen_embedding_backward_rowwise_adagrad_with_counter_split_weighted_vbe_kernel_cta.cu 2025-05-07T19:50:37.3988752Z gen_embedding_backward_rowwise_adagrad_with_counter_split_weighted_vbe_kernel_warp.cu 2025-05-07T19:50:37.3988993Z gen_embedding_backward_rowwise_adagrad_with_counter_split_unweighted_vbe_cuda.cu 2025-05-07T19:50:37.3989248Z gen_embedding_backward_rowwise_adagrad_with_counter_split_unweighted_vbe_kernel_cta.cu 2025-05-07T19:50:37.3989500Z gen_embedding_backward_rowwise_adagrad_with_counter_split_unweighted_vbe_kernel_warp.cu 2025-05-07T19:50:37.3989642Z gen_embedding_backward_sgd_split_weighted_vbe_cuda.cu 2025-05-07T19:50:37.3989817Z gen_embedding_backward_sgd_split_weighted_vbe_kernel_cta.cu 2025-05-07T19:50:37.3989981Z gen_embedding_backward_sgd_split_weighted_vbe_kernel_warp.cu 2025-05-07T19:50:37.3990126Z gen_embedding_backward_sgd_split_unweighted_vbe_cuda.cu 2025-05-07T19:50:37.3990305Z gen_embedding_backward_sgd_split_unweighted_vbe_kernel_cta.cu 2025-05-07T19:50:37.3990477Z gen_embedding_backward_sgd_split_unweighted_vbe_kernel_warp.cu 2025-05-07T19:50:37.3990620Z gen_embedding_backward_dense_split_weighted_vbe_cuda.cu 2025-05-07T19:50:37.3990798Z gen_embedding_backward_dense_split_weighted_vbe_kernel_cta.cu 2025-05-07T19:50:37.3991014Z gen_embedding_backward_dense_split_weighted_vbe_kernel_warp.cu 2025-05-07T19:50:37.3991164Z gen_embedding_backward_dense_split_unweighted_vbe_cuda.cu 2025-05-07T19:50:37.3991441Z gen_embedding_backward_dense_split_unweighted_vbe_kernel_cta.cu 2025-05-07T19:50:37.3991794Z gen_embedding_backward_dense_split_unweighted_vbe_kernel_warp.cu 2025-05-07T19:50:37.3991949Z gen_embedding_backward_adam_split_weighted_vbe_cuda.cu 2025-05-07T19:50:37.3992179Z gen_embedding_backward_adam_split_weighted_vbe_kernel_cta.cu 2025-05-07T19:50:37.3992381Z gen_embedding_backward_adam_split_weighted_vbe_kernel_warp.cu 2025-05-07T19:50:37.3992540Z gen_embedding_backward_adam_split_unweighted_vbe_cuda.cu 2025-05-07T19:50:37.3992723Z gen_embedding_backward_adam_split_unweighted_vbe_kernel_cta.cu 2025-05-07T19:50:37.3992926Z gen_embedding_backward_adam_split_unweighted_vbe_kernel_warp.cu 2025-05-07T19:50:37.3993007Z 2025-05-07T19:50:37.3993095Z CUDA_SPECIFIC_SRCS: 2025-05-07T19:50:37.3993100Z 2025-05-07T19:50:37.3993179Z 2025-05-07T19:50:37.3993283Z HIP_SPECIFIC_SRCS: 2025-05-07T19:50:37.3993287Z 2025-05-07T19:50:37.3993360Z 2025-05-07T19:50:37.3993441Z OTHER_SRCS: 2025-05-07T19:50:37.3993628Z 2025-05-07T19:50:37.3993715Z 2025-05-07T19:50:37.3993793Z CC_FLAGS: 2025-05-07T19:50:37.3993797Z 2025-05-07T19:50:37.3993870Z 2025-05-07T19:50:37.3993965Z NVCC_FLAGS: 2025-05-07T19:50:37.3994063Z --expt-relaxed-constexpr 2025-05-07T19:50:37.3994165Z -D__CUDA_NO_HALF_OPERATORS__ 2025-05-07T19:50:37.3994270Z -D__CUDA_NO_BFLOAT16_CONVERSIONS__ 2025-05-07T19:50:37.3994384Z -D__CUDA_NO_HALF2_OPERATORS__ 2025-05-07T19:50:37.3994458Z 2025-05-07T19:50:37.3994541Z HIPCC_FLAGS: 2025-05-07T19:50:37.3994545Z 2025-05-07T19:50:37.3994632Z 2025-05-07T19:50:37.3994713Z INCLUDE_DIRS: 2025-05-07T19:50:37.3994821Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../include 2025-05-07T19:50:37.3994916Z /__w/FBGEMM/FBGEMM/fbgemm_gpu 2025-05-07T19:50:37.3995034Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include 2025-05-07T19:50:37.3995140Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../include 2025-05-07T19:50:37.3995425Z /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include 2025-05-07T19:50:37.3995833Z /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include/torch/csrc/api/include 2025-05-07T19:50:37.3995975Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src 2025-05-07T19:50:37.3996137Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include 2025-05-07T19:50:37.3996304Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include 2025-05-07T19:50:37.3996511Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include 2025-05-07T19:50:37.3996709Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include 2025-05-07T19:50:37.3996856Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include 2025-05-07T19:50:37.3997183Z /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/nvidia/nccl/include 2025-05-07T19:50:37.3997255Z 2025-05-07T19:50:37.3997347Z Selected Source Files: 2025-05-07T19:50:37.3997563Z gen_embedding_backward_rowwise_adagrad_split_weighted_vbe_meta.cpp 2025-05-07T19:50:37.3997754Z gen_embedding_backward_rowwise_adagrad_ssd_weighted_vbe_meta.cpp 2025-05-07T19:50:37.3997957Z gen_embedding_backward_rowwise_adagrad_split_unweighted_vbe_meta.cpp 2025-05-07T19:50:37.3998166Z gen_embedding_backward_rowwise_adagrad_ssd_unweighted_vbe_meta.cpp 2025-05-07T19:50:37.3998417Z gen_embedding_backward_rowwise_adagrad_with_counter_split_weighted_vbe_meta.cpp 2025-05-07T19:50:37.3998671Z gen_embedding_backward_rowwise_adagrad_with_counter_split_unweighted_vbe_meta.cpp 2025-05-07T19:50:37.3998822Z gen_embedding_backward_sgd_split_weighted_vbe_meta.cpp 2025-05-07T19:50:37.3998992Z gen_embedding_backward_sgd_split_unweighted_vbe_meta.cpp 2025-05-07T19:50:37.3999152Z gen_embedding_backward_dense_split_weighted_vbe_meta.cpp 2025-05-07T19:50:37.3999321Z gen_embedding_backward_dense_split_unweighted_vbe_meta.cpp 2025-05-07T19:50:37.3999483Z gen_embedding_backward_adam_split_weighted_vbe_meta.cpp 2025-05-07T19:50:37.3999695Z gen_embedding_backward_adam_split_unweighted_vbe_meta.cpp 2025-05-07T19:50:37.3999887Z gen_embedding_backward_rowwise_adagrad_split_weighted_vbe_cuda.cu 2025-05-07T19:50:37.4000117Z gen_embedding_backward_rowwise_adagrad_split_weighted_vbe_kernel_cta.cu 2025-05-07T19:50:37.4000340Z gen_embedding_backward_rowwise_adagrad_split_weighted_vbe_kernel_warp.cu 2025-05-07T19:50:37.4000569Z gen_embedding_backward_rowwise_adagrad_ssd_weighted_vbe_cuda.cu 2025-05-07T19:50:37.4000796Z gen_embedding_backward_rowwise_adagrad_ssd_weighted_vbe_kernel_cta.cu 2025-05-07T19:50:37.4001010Z gen_embedding_backward_rowwise_adagrad_ssd_weighted_vbe_kernel_warp.cu 2025-05-07T19:50:37.4001206Z gen_embedding_backward_rowwise_adagrad_split_unweighted_vbe_cuda.cu 2025-05-07T19:50:37.4001430Z gen_embedding_backward_rowwise_adagrad_split_unweighted_vbe_kernel_cta.cu 2025-05-07T19:50:37.4001667Z gen_embedding_backward_rowwise_adagrad_split_unweighted_vbe_kernel_warp.cu 2025-05-07T19:50:37.4001857Z gen_embedding_backward_rowwise_adagrad_ssd_unweighted_vbe_cuda.cu 2025-05-07T19:50:37.4002077Z gen_embedding_backward_rowwise_adagrad_ssd_unweighted_vbe_kernel_cta.cu 2025-05-07T19:50:37.4002311Z gen_embedding_backward_rowwise_adagrad_ssd_unweighted_vbe_kernel_warp.cu 2025-05-07T19:50:37.4002555Z gen_embedding_backward_rowwise_adagrad_with_counter_split_weighted_vbe_cuda.cu 2025-05-07T19:50:37.4002822Z gen_embedding_backward_rowwise_adagrad_with_counter_split_weighted_vbe_kernel_cta.cu 2025-05-07T19:50:37.4003102Z gen_embedding_backward_rowwise_adagrad_with_counter_split_weighted_vbe_kernel_warp.cu 2025-05-07T19:50:37.4003353Z gen_embedding_backward_rowwise_adagrad_with_counter_split_unweighted_vbe_cuda.cu 2025-05-07T19:50:37.4003624Z gen_embedding_backward_rowwise_adagrad_with_counter_split_unweighted_vbe_kernel_cta.cu 2025-05-07T19:50:37.4004025Z gen_embedding_backward_rowwise_adagrad_with_counter_split_unweighted_vbe_kernel_warp.cu 2025-05-07T19:50:37.4004170Z gen_embedding_backward_sgd_split_weighted_vbe_cuda.cu 2025-05-07T19:50:37.4004332Z gen_embedding_backward_sgd_split_weighted_vbe_kernel_cta.cu 2025-05-07T19:50:37.4004494Z gen_embedding_backward_sgd_split_weighted_vbe_kernel_warp.cu 2025-05-07T19:50:37.4004798Z gen_embedding_backward_sgd_split_unweighted_vbe_cuda.cu 2025-05-07T19:50:37.4004967Z gen_embedding_backward_sgd_split_unweighted_vbe_kernel_cta.cu 2025-05-07T19:50:37.4005137Z gen_embedding_backward_sgd_split_unweighted_vbe_kernel_warp.cu 2025-05-07T19:50:37.4005292Z gen_embedding_backward_dense_split_weighted_vbe_cuda.cu 2025-05-07T19:50:37.4005460Z gen_embedding_backward_dense_split_weighted_vbe_kernel_cta.cu 2025-05-07T19:50:37.4005629Z gen_embedding_backward_dense_split_weighted_vbe_kernel_warp.cu 2025-05-07T19:50:37.4005788Z gen_embedding_backward_dense_split_unweighted_vbe_cuda.cu 2025-05-07T19:50:37.4005965Z gen_embedding_backward_dense_split_unweighted_vbe_kernel_cta.cu 2025-05-07T19:50:37.4006143Z gen_embedding_backward_dense_split_unweighted_vbe_kernel_warp.cu 2025-05-07T19:50:37.4006281Z gen_embedding_backward_adam_split_weighted_vbe_cuda.cu 2025-05-07T19:50:37.4006464Z gen_embedding_backward_adam_split_weighted_vbe_kernel_cta.cu 2025-05-07T19:50:37.4006627Z gen_embedding_backward_adam_split_weighted_vbe_kernel_warp.cu 2025-05-07T19:50:37.4006785Z gen_embedding_backward_adam_split_unweighted_vbe_cuda.cu 2025-05-07T19:50:37.4006974Z gen_embedding_backward_adam_split_unweighted_vbe_kernel_cta.cu 2025-05-07T19:50:37.4007155Z gen_embedding_backward_adam_split_unweighted_vbe_kernel_warp.cu 2025-05-07T19:50:37.4007233Z 2025-05-07T19:50:37.4007344Z HIPified Source Files: 2025-05-07T19:50:37.4007348Z 2025-05-07T19:50:37.4007424Z 2025-05-07T19:50:37.4007516Z Library Dependencies: 2025-05-07T19:50:37.4007595Z torch 2025-05-07T19:50:37.4007697Z torch_library 2025-05-07T19:50:37.4007989Z /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/lib/libc10.so 2025-05-07T19:50:37.4008228Z /github/home/miniconda/envs/build_binary/targets/x86_64-linux/lib/libnvrtc.so 2025-05-07T19:50:37.4008561Z /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/lib/libc10_cuda.so 2025-05-07T19:50:37.4008938Z /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/nvidia/nccl/lib/libnccl.so.2 2025-05-07T19:50:37.4009192Z /github/home/miniconda/envs/build_binary/targets/x86_64-linux/lib/stubs/libcuda.so 2025-05-07T19:50:37.4009313Z fbgemm_gpu_tbe_training_backward 2025-05-07T19:50:37.4009569Z /github/home/miniconda/envs/build_binary/lib/stubs/libnvidia-ml.so 2025-05-07T19:50:37.4009647Z 2025-05-07T19:50:37.4009733Z Output Library: 2025-05-07T19:50:37.4009857Z fbgemm_gpu_tbe_training_backward_vbe 2025-05-07T19:50:37.4009933Z 2025-05-07T19:50:37.4010025Z Destination Directory: 2025-05-07T19:50:37.4010125Z fbgemm_gpu 2025-05-07T19:50:37.4010233Z ================================================================================ 2025-05-07T19:50:37.4010237Z 2025-05-07T19:50:37.4010242Z 2025-05-07T19:50:37.4010246Z 2025-05-07T19:50:37.4010348Z ================================================================================ 2025-05-07T19:50:37.4010571Z GPU CPP Library Target: fbgemm_gpu_tbe_training_backward_dense (SHARED) 2025-05-07T19:50:37.4010648Z 2025-05-07T19:50:37.4010727Z CPU_SRCS: 2025-05-07T19:50:37.4010731Z 2025-05-07T19:50:37.4010805Z 2025-05-07T19:50:37.4010902Z GPU_SRCS: 2025-05-07T19:50:37.4011041Z gen_embedding_backward_dense_split_weighted_cuda.cu 2025-05-07T19:50:37.4011189Z gen_embedding_backward_dense_split_weighted_meta.cpp 2025-05-07T19:50:37.4011366Z gen_embedding_backward_dense_split_weighted_kernel_cta.cu 2025-05-07T19:50:37.4011527Z gen_embedding_backward_dense_split_weighted_kernel_warp.cu 2025-05-07T19:50:37.4011692Z gen_embedding_backward_dense_split_unweighted_nobag_cuda.cu 2025-05-07T19:50:37.4011862Z gen_embedding_backward_dense_split_unweighted_nobag_meta.cpp 2025-05-07T19:50:37.4012065Z gen_embedding_backward_dense_split_unweighted_nobag_kernel_cta.cu 2025-05-07T19:50:37.4012255Z gen_embedding_backward_dense_split_unweighted_nobag_kernel_warp.cu 2025-05-07T19:50:37.4012408Z gen_embedding_backward_dense_split_unweighted_cuda.cu 2025-05-07T19:50:37.4012573Z gen_embedding_backward_dense_split_unweighted_meta.cpp 2025-05-07T19:50:37.4012739Z gen_embedding_backward_dense_split_unweighted_kernel_cta.cu 2025-05-07T19:50:37.4012910Z gen_embedding_backward_dense_split_unweighted_kernel_warp.cu 2025-05-07T19:50:37.4013038Z gen_embedding_backward_split_dense.cpp 2025-05-07T19:50:37.4013115Z 2025-05-07T19:50:37.4013213Z CUDA_SPECIFIC_SRCS: 2025-05-07T19:50:37.4013216Z 2025-05-07T19:50:37.4013295Z 2025-05-07T19:50:37.4013400Z HIP_SPECIFIC_SRCS: 2025-05-07T19:50:37.4013404Z 2025-05-07T19:50:37.4013479Z 2025-05-07T19:50:37.4013563Z OTHER_SRCS: 2025-05-07T19:50:37.4013567Z 2025-05-07T19:50:37.4013661Z 2025-05-07T19:50:37.4013737Z CC_FLAGS: 2025-05-07T19:50:37.4013741Z 2025-05-07T19:50:37.4013811Z 2025-05-07T19:50:37.4013909Z NVCC_FLAGS: 2025-05-07T19:50:37.4014004Z --expt-relaxed-constexpr 2025-05-07T19:50:37.4014098Z -D__CUDA_NO_HALF_OPERATORS__ 2025-05-07T19:50:37.4014201Z -D__CUDA_NO_BFLOAT16_CONVERSIONS__ 2025-05-07T19:50:37.4014313Z -D__CUDA_NO_HALF2_OPERATORS__ 2025-05-07T19:50:37.4014385Z 2025-05-07T19:50:37.4014464Z HIPCC_FLAGS: 2025-05-07T19:50:37.4014468Z 2025-05-07T19:50:37.4014556Z 2025-05-07T19:50:37.4014636Z INCLUDE_DIRS: 2025-05-07T19:50:37.4014741Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../include 2025-05-07T19:50:37.4014836Z /__w/FBGEMM/FBGEMM/fbgemm_gpu 2025-05-07T19:50:37.4014957Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include 2025-05-07T19:50:37.4015060Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../include 2025-05-07T19:50:37.4015325Z /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include 2025-05-07T19:50:37.4015702Z /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include/torch/csrc/api/include 2025-05-07T19:50:37.4015838Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src 2025-05-07T19:50:37.4015990Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include 2025-05-07T19:50:37.4016154Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include 2025-05-07T19:50:37.4016392Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include 2025-05-07T19:50:37.4016581Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include 2025-05-07T19:50:37.4016720Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include 2025-05-07T19:50:37.4017021Z /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/nvidia/nccl/include 2025-05-07T19:50:37.4017096Z 2025-05-07T19:50:37.4017227Z Selected Source Files: 2025-05-07T19:50:37.4017387Z gen_embedding_backward_dense_split_weighted_meta.cpp 2025-05-07T19:50:37.4017559Z gen_embedding_backward_dense_split_unweighted_nobag_meta.cpp 2025-05-07T19:50:37.4017707Z gen_embedding_backward_dense_split_unweighted_meta.cpp 2025-05-07T19:50:37.4017819Z gen_embedding_backward_split_dense.cpp 2025-05-07T19:50:37.4017972Z gen_embedding_backward_dense_split_weighted_cuda.cu 2025-05-07T19:50:37.4018128Z gen_embedding_backward_dense_split_weighted_kernel_cta.cu 2025-05-07T19:50:37.4018292Z gen_embedding_backward_dense_split_weighted_kernel_warp.cu 2025-05-07T19:50:37.4018468Z gen_embedding_backward_dense_split_unweighted_nobag_cuda.cu 2025-05-07T19:50:37.4018653Z gen_embedding_backward_dense_split_unweighted_nobag_kernel_cta.cu 2025-05-07T19:50:37.4018840Z gen_embedding_backward_dense_split_unweighted_nobag_kernel_warp.cu 2025-05-07T19:50:37.4018996Z gen_embedding_backward_dense_split_unweighted_cuda.cu 2025-05-07T19:50:37.4019164Z gen_embedding_backward_dense_split_unweighted_kernel_cta.cu 2025-05-07T19:50:37.4019330Z gen_embedding_backward_dense_split_unweighted_kernel_warp.cu 2025-05-07T19:50:37.4019404Z 2025-05-07T19:50:37.4019508Z HIPified Source Files: 2025-05-07T19:50:37.4019513Z 2025-05-07T19:50:37.4019585Z 2025-05-07T19:50:37.4019675Z Library Dependencies: 2025-05-07T19:50:37.4019767Z torch 2025-05-07T19:50:37.4019847Z torch_library 2025-05-07T19:50:37.4020138Z /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/lib/libc10.so 2025-05-07T19:50:37.4020391Z /github/home/miniconda/envs/build_binary/targets/x86_64-linux/lib/libnvrtc.so 2025-05-07T19:50:37.4020700Z /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/lib/libc10_cuda.so 2025-05-07T19:50:37.4021030Z /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/nvidia/nccl/lib/libnccl.so.2 2025-05-07T19:50:37.4021284Z /github/home/miniconda/envs/build_binary/targets/x86_64-linux/lib/stubs/libcuda.so 2025-05-07T19:50:37.4021402Z fbgemm_gpu_tbe_training_backward 2025-05-07T19:50:37.4021603Z /github/home/miniconda/envs/build_binary/lib/stubs/libnvidia-ml.so 2025-05-07T19:50:37.4021676Z 2025-05-07T19:50:37.4021775Z Output Library: 2025-05-07T19:50:37.4021883Z fbgemm_gpu_tbe_training_backward_dense 2025-05-07T19:50:37.4021957Z 2025-05-07T19:50:37.4022047Z Destination Directory: 2025-05-07T19:50:37.4022141Z fbgemm_gpu 2025-05-07T19:50:37.4022248Z ================================================================================ 2025-05-07T19:50:37.4022252Z 2025-05-07T19:50:37.4022259Z 2025-05-07T19:50:37.4022263Z 2025-05-07T19:50:37.4022364Z ================================================================================ 2025-05-07T19:50:37.4022593Z GPU CPP Library Target: fbgemm_gpu_tbe_training_backward_split_host (SHARED) 2025-05-07T19:50:37.4022670Z 2025-05-07T19:50:37.4022750Z CPU_SRCS: 2025-05-07T19:50:37.4022753Z 2025-05-07T19:50:37.4022841Z 2025-05-07T19:50:37.4022919Z GPU_SRCS: 2025-05-07T19:50:37.4023035Z gen_embedding_backward_split_adagrad.cpp 2025-05-07T19:50:37.4023177Z gen_embedding_backward_split_rowwise_adagrad.cpp 2025-05-07T19:50:37.4023282Z gen_embedding_backward_split_sgd.cpp 2025-05-07T19:50:37.4023388Z gen_embedding_backward_split_adam.cpp 2025-05-07T19:50:37.4023491Z gen_embedding_backward_split_lamb.cpp 2025-05-07T19:50:37.4023618Z gen_embedding_backward_split_lars_sgd.cpp 2025-05-07T19:50:37.4023764Z gen_embedding_backward_split_partial_rowwise_adam.cpp 2025-05-07T19:50:37.4023908Z gen_embedding_backward_split_partial_rowwise_lamb.cpp 2025-05-07T19:50:37.4024074Z gen_embedding_backward_split_none.cpp 2025-05-07T19:50:37.4024246Z gen_embedding_backward_split_rowwise_adagrad_with_counter.cpp 2025-05-07T19:50:37.4024363Z gen_embedding_backward_split_approx_sgd.cpp 2025-05-07T19:50:37.4024513Z gen_embedding_backward_split_approx_rowwise_adagrad.cpp 2025-05-07T19:50:37.4024726Z gen_embedding_backward_split_approx_rowwise_adagrad_with_counter.cpp 2025-05-07T19:50:37.4025037Z gen_embedding_backward_split_approx_rowwise_adagrad_with_weight_decay.cpp 2025-05-07T19:50:37.4025228Z gen_embedding_backward_split_rowwise_adagrad_with_weight_decay.cpp 2025-05-07T19:50:37.4025404Z gen_embedding_backward_split_rowwise_weighted_adagrad.cpp 2025-05-07T19:50:37.4025528Z gen_embedding_backward_ssd_rowwise_adagrad.cpp 2025-05-07T19:50:37.4025677Z gen_embedding_backward_adagrad_split_weighted_meta.cpp 2025-05-07T19:50:37.4025848Z gen_embedding_backward_adagrad_split_unweighted_meta.cpp 2025-05-07T19:50:37.4026027Z gen_embedding_backward_rowwise_adagrad_split_weighted_meta.cpp 2025-05-07T19:50:37.4026215Z gen_embedding_backward_rowwise_adagrad_split_unweighted_meta.cpp 2025-05-07T19:50:37.4026349Z gen_embedding_backward_sgd_split_weighted_meta.cpp 2025-05-07T19:50:37.4026508Z gen_embedding_backward_sgd_split_unweighted_meta.cpp 2025-05-07T19:50:37.4026643Z gen_embedding_backward_adam_split_weighted_meta.cpp 2025-05-07T19:50:37.4026787Z gen_embedding_backward_adam_split_unweighted_meta.cpp 2025-05-07T19:50:37.4026941Z gen_embedding_backward_lamb_split_weighted_meta.cpp 2025-05-07T19:50:37.4027083Z gen_embedding_backward_lamb_split_unweighted_meta.cpp 2025-05-07T19:50:37.4027228Z gen_embedding_backward_lars_sgd_split_weighted_meta.cpp 2025-05-07T19:50:37.4027400Z gen_embedding_backward_lars_sgd_split_unweighted_meta.cpp 2025-05-07T19:50:37.4027593Z gen_embedding_backward_partial_rowwise_adam_split_weighted_meta.cpp 2025-05-07T19:50:37.4027792Z gen_embedding_backward_partial_rowwise_adam_split_unweighted_meta.cpp 2025-05-07T19:50:37.4027983Z gen_embedding_backward_partial_rowwise_lamb_split_weighted_meta.cpp 2025-05-07T19:50:37.4028198Z gen_embedding_backward_partial_rowwise_lamb_split_unweighted_meta.cpp 2025-05-07T19:50:37.4028331Z gen_embedding_backward_none_split_weighted_meta.cpp 2025-05-07T19:50:37.4028473Z gen_embedding_backward_none_split_unweighted_meta.cpp 2025-05-07T19:50:37.4028705Z gen_embedding_backward_rowwise_adagrad_with_counter_split_weighted_meta.cpp 2025-05-07T19:50:37.4028936Z gen_embedding_backward_rowwise_adagrad_with_counter_split_unweighted_meta.cpp 2025-05-07T19:50:37.4029011Z 2025-05-07T19:50:37.4029113Z CUDA_SPECIFIC_SRCS: 2025-05-07T19:50:37.4029117Z 2025-05-07T19:50:37.4029191Z 2025-05-07T19:50:37.4029275Z HIP_SPECIFIC_SRCS: 2025-05-07T19:50:37.4029279Z 2025-05-07T19:50:37.4029351Z 2025-05-07T19:50:37.4029445Z OTHER_SRCS: 2025-05-07T19:50:37.4029449Z 2025-05-07T19:50:37.4029520Z 2025-05-07T19:50:37.4029597Z CC_FLAGS: 2025-05-07T19:50:37.4029600Z 2025-05-07T19:50:37.4029685Z 2025-05-07T19:50:37.4029764Z NVCC_FLAGS: 2025-05-07T19:50:37.4029861Z --expt-relaxed-constexpr 2025-05-07T19:50:37.4029958Z -D__CUDA_NO_HALF_OPERATORS__ 2025-05-07T19:50:37.4030067Z -D__CUDA_NO_BFLOAT16_CONVERSIONS__ 2025-05-07T19:50:37.4030161Z -D__CUDA_NO_HALF2_OPERATORS__ 2025-05-07T19:50:37.4030235Z 2025-05-07T19:50:37.4030329Z HIPCC_FLAGS: 2025-05-07T19:50:37.4030332Z 2025-05-07T19:50:37.4030405Z 2025-05-07T19:50:37.4030485Z INCLUDE_DIRS: 2025-05-07T19:50:37.4030592Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../include 2025-05-07T19:50:37.4030705Z /__w/FBGEMM/FBGEMM/fbgemm_gpu 2025-05-07T19:50:37.4030807Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include 2025-05-07T19:50:37.4030915Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../include 2025-05-07T19:50:37.4031198Z /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include 2025-05-07T19:50:37.4031650Z /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include/torch/csrc/api/include 2025-05-07T19:50:37.4031970Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src 2025-05-07T19:50:37.4032154Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include 2025-05-07T19:50:37.4032370Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include 2025-05-07T19:50:37.4032583Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include 2025-05-07T19:50:37.4032789Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include 2025-05-07T19:50:37.4032955Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include 2025-05-07T19:50:37.4033309Z /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/nvidia/nccl/include 2025-05-07T19:50:37.4033390Z 2025-05-07T19:50:37.4033501Z Selected Source Files: 2025-05-07T19:50:37.4033621Z gen_embedding_backward_split_adagrad.cpp 2025-05-07T19:50:37.4042528Z gen_embedding_backward_split_rowwise_adagrad.cpp 2025-05-07T19:50:37.4042698Z gen_embedding_backward_split_sgd.cpp 2025-05-07T19:50:37.4042823Z gen_embedding_backward_split_adam.cpp 2025-05-07T19:50:37.4042932Z gen_embedding_backward_split_lamb.cpp 2025-05-07T19:50:37.4043048Z gen_embedding_backward_split_lars_sgd.cpp 2025-05-07T19:50:37.4043232Z gen_embedding_backward_split_partial_rowwise_adam.cpp 2025-05-07T19:50:37.4043384Z gen_embedding_backward_split_partial_rowwise_lamb.cpp 2025-05-07T19:50:37.4043490Z gen_embedding_backward_split_none.cpp 2025-05-07T19:50:37.4043674Z gen_embedding_backward_split_rowwise_adagrad_with_counter.cpp 2025-05-07T19:50:37.4043806Z gen_embedding_backward_split_approx_sgd.cpp 2025-05-07T19:50:37.4044081Z gen_embedding_backward_split_approx_rowwise_adagrad.cpp 2025-05-07T19:50:37.4044272Z gen_embedding_backward_split_approx_rowwise_adagrad_with_counter.cpp 2025-05-07T19:50:37.4044491Z gen_embedding_backward_split_approx_rowwise_adagrad_with_weight_decay.cpp 2025-05-07T19:50:37.4044678Z gen_embedding_backward_split_rowwise_adagrad_with_weight_decay.cpp 2025-05-07T19:50:37.4044830Z gen_embedding_backward_split_rowwise_weighted_adagrad.cpp 2025-05-07T19:50:37.4044961Z gen_embedding_backward_ssd_rowwise_adagrad.cpp 2025-05-07T19:50:37.4045103Z gen_embedding_backward_adagrad_split_weighted_meta.cpp 2025-05-07T19:50:37.4045259Z gen_embedding_backward_adagrad_split_unweighted_meta.cpp 2025-05-07T19:50:37.4045427Z gen_embedding_backward_rowwise_adagrad_split_weighted_meta.cpp 2025-05-07T19:50:37.4045612Z gen_embedding_backward_rowwise_adagrad_split_unweighted_meta.cpp 2025-05-07T19:50:37.4045740Z gen_embedding_backward_sgd_split_weighted_meta.cpp 2025-05-07T19:50:37.4045875Z gen_embedding_backward_sgd_split_unweighted_meta.cpp 2025-05-07T19:50:37.4046016Z gen_embedding_backward_adam_split_weighted_meta.cpp 2025-05-07T19:50:37.4046153Z gen_embedding_backward_adam_split_unweighted_meta.cpp 2025-05-07T19:50:37.4046283Z gen_embedding_backward_lamb_split_weighted_meta.cpp 2025-05-07T19:50:37.4046428Z gen_embedding_backward_lamb_split_unweighted_meta.cpp 2025-05-07T19:50:37.4046567Z gen_embedding_backward_lars_sgd_split_weighted_meta.cpp 2025-05-07T19:50:37.4046718Z gen_embedding_backward_lars_sgd_split_unweighted_meta.cpp 2025-05-07T19:50:37.4046907Z gen_embedding_backward_partial_rowwise_adam_split_weighted_meta.cpp 2025-05-07T19:50:37.4047116Z gen_embedding_backward_partial_rowwise_adam_split_unweighted_meta.cpp 2025-05-07T19:50:37.4047304Z gen_embedding_backward_partial_rowwise_lamb_split_weighted_meta.cpp 2025-05-07T19:50:37.4047499Z gen_embedding_backward_partial_rowwise_lamb_split_unweighted_meta.cpp 2025-05-07T19:50:37.4047641Z gen_embedding_backward_none_split_weighted_meta.cpp 2025-05-07T19:50:37.4047782Z gen_embedding_backward_none_split_unweighted_meta.cpp 2025-05-07T19:50:37.4048000Z gen_embedding_backward_rowwise_adagrad_with_counter_split_weighted_meta.cpp 2025-05-07T19:50:37.4048233Z gen_embedding_backward_rowwise_adagrad_with_counter_split_unweighted_meta.cpp 2025-05-07T19:50:37.4048303Z 2025-05-07T19:50:37.4048390Z HIPified Source Files: 2025-05-07T19:50:37.4048396Z 2025-05-07T19:50:37.4048464Z 2025-05-07T19:50:37.4048557Z Library Dependencies: 2025-05-07T19:50:37.4048625Z torch 2025-05-07T19:50:37.4048701Z torch_library 2025-05-07T19:50:37.4049006Z /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/lib/libc10.so 2025-05-07T19:50:37.4049367Z /github/home/miniconda/envs/build_binary/targets/x86_64-linux/lib/libnvrtc.so 2025-05-07T19:50:37.4049671Z /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/lib/libc10_cuda.so 2025-05-07T19:50:37.4050010Z /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/nvidia/nccl/lib/libnccl.so.2 2025-05-07T19:50:37.4050323Z /github/home/miniconda/envs/build_binary/targets/x86_64-linux/lib/stubs/libcuda.so 2025-05-07T19:50:37.4050404Z fbgemm_gpu_config 2025-05-07T19:50:37.4050484Z fbgemm_gpu_tbe_utils 2025-05-07T19:50:37.4050693Z /github/home/miniconda/envs/build_binary/lib/stubs/libnvidia-ml.so 2025-05-07T19:50:37.4050761Z 2025-05-07T19:50:37.4050841Z Output Library: 2025-05-07T19:50:37.4050968Z fbgemm_gpu_tbe_training_backward_split_host 2025-05-07T19:50:37.4051036Z 2025-05-07T19:50:37.4051122Z Destination Directory: 2025-05-07T19:50:37.4051196Z fbgemm_gpu 2025-05-07T19:50:37.4051315Z ================================================================================ 2025-05-07T19:50:37.4051320Z 2025-05-07T19:50:37.4051324Z 2025-05-07T19:50:37.4051327Z 2025-05-07T19:50:37.4051423Z ================================================================================ 2025-05-07T19:50:37.4051583Z GPU CPP Library Target: fbgemm_gpu_tbe_index_select (SHARED) 2025-05-07T19:50:37.4051667Z 2025-05-07T19:50:37.4051742Z CPU_SRCS: 2025-05-07T19:50:37.4051940Z codegen/training/index_select/batch_index_select_dim0_cpu_host.cpp 2025-05-07T19:50:37.4052128Z codegen/training/index_select/batch_index_select_dim0_ops.cpp 2025-05-07T19:50:37.4052197Z 2025-05-07T19:50:37.4052268Z GPU_SRCS: 2025-05-07T19:50:37.4052445Z codegen/training/index_select/batch_index_select_dim0_host.cpp 2025-05-07T19:50:37.4052582Z gen_batch_index_select_dim0_forward_codegen_cuda.cu 2025-05-07T19:50:37.4052698Z gen_batch_index_select_dim0_forward_kernel.cu 2025-05-07T19:50:37.4052822Z gen_batch_index_select_dim0_forward_kernel_small.cu 2025-05-07T19:50:37.4052966Z gen_batch_index_select_dim0_backward_codegen_cuda.cu 2025-05-07T19:50:37.4053088Z gen_batch_index_select_dim0_backward_kernel_cta.cu 2025-05-07T19:50:37.4053213Z gen_batch_index_select_dim0_backward_kernel_warp.cu 2025-05-07T19:50:37.4053346Z gen_embedding_backward_split_grad_index_select.cu 2025-05-07T19:50:37.4053413Z 2025-05-07T19:50:37.4053494Z CUDA_SPECIFIC_SRCS: 2025-05-07T19:50:37.4053499Z 2025-05-07T19:50:37.4053568Z 2025-05-07T19:50:37.4053656Z HIP_SPECIFIC_SRCS: 2025-05-07T19:50:37.4053659Z 2025-05-07T19:50:37.4053728Z 2025-05-07T19:50:37.4053804Z OTHER_SRCS: 2025-05-07T19:50:37.4053809Z 2025-05-07T19:50:37.4053883Z 2025-05-07T19:50:37.4053954Z CC_FLAGS: 2025-05-07T19:50:37.4053958Z 2025-05-07T19:50:37.4054024Z 2025-05-07T19:50:37.4054095Z NVCC_FLAGS: 2025-05-07T19:50:37.4054192Z --expt-relaxed-constexpr 2025-05-07T19:50:37.4054281Z -D__CUDA_NO_HALF_OPERATORS__ 2025-05-07T19:50:37.4054375Z -D__CUDA_NO_BFLOAT16_CONVERSIONS__ 2025-05-07T19:50:37.4054472Z -D__CUDA_NO_HALF2_OPERATORS__ 2025-05-07T19:50:37.4054541Z 2025-05-07T19:50:37.4054616Z HIPCC_FLAGS: 2025-05-07T19:50:37.4054620Z 2025-05-07T19:50:37.4054686Z 2025-05-07T19:50:37.4054773Z INCLUDE_DIRS: 2025-05-07T19:50:37.4054870Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../include 2025-05-07T19:50:37.4054960Z /__w/FBGEMM/FBGEMM/fbgemm_gpu 2025-05-07T19:50:37.4055063Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include 2025-05-07T19:50:37.4055161Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../include 2025-05-07T19:50:37.4055422Z /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include 2025-05-07T19:50:37.4055792Z /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include/torch/csrc/api/include 2025-05-07T19:50:37.4055924Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src 2025-05-07T19:50:37.4056077Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include 2025-05-07T19:50:37.4056221Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include 2025-05-07T19:50:37.4056416Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include 2025-05-07T19:50:37.4056651Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include 2025-05-07T19:50:37.4056787Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include 2025-05-07T19:50:37.4057080Z /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/nvidia/nccl/include 2025-05-07T19:50:37.4057149Z 2025-05-07T19:50:37.4057234Z Selected Source Files: 2025-05-07T19:50:37.4057479Z codegen/training/index_select/batch_index_select_dim0_cpu_host.cpp 2025-05-07T19:50:37.4057654Z codegen/training/index_select/batch_index_select_dim0_ops.cpp 2025-05-07T19:50:37.4057833Z codegen/training/index_select/batch_index_select_dim0_host.cpp 2025-05-07T19:50:37.4057960Z gen_batch_index_select_dim0_forward_codegen_cuda.cu 2025-05-07T19:50:37.4058085Z gen_batch_index_select_dim0_forward_kernel.cu 2025-05-07T19:50:37.4058212Z gen_batch_index_select_dim0_forward_kernel_small.cu 2025-05-07T19:50:37.4058345Z gen_batch_index_select_dim0_backward_codegen_cuda.cu 2025-05-07T19:50:37.4058482Z gen_batch_index_select_dim0_backward_kernel_cta.cu 2025-05-07T19:50:37.4058607Z gen_batch_index_select_dim0_backward_kernel_warp.cu 2025-05-07T19:50:37.4058729Z gen_embedding_backward_split_grad_index_select.cu 2025-05-07T19:50:37.4058800Z 2025-05-07T19:50:37.4058894Z HIPified Source Files: 2025-05-07T19:50:37.4058898Z 2025-05-07T19:50:37.4058967Z 2025-05-07T19:50:37.4059053Z Library Dependencies: 2025-05-07T19:50:37.4059137Z torch 2025-05-07T19:50:37.4059211Z torch_library 2025-05-07T19:50:37.4059497Z /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/lib/libc10.so 2025-05-07T19:50:37.4059744Z /github/home/miniconda/envs/build_binary/targets/x86_64-linux/lib/libnvrtc.so 2025-05-07T19:50:37.4060047Z /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/lib/libc10_cuda.so 2025-05-07T19:50:37.4060367Z /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/nvidia/nccl/lib/libnccl.so.2 2025-05-07T19:50:37.4060619Z /github/home/miniconda/envs/build_binary/targets/x86_64-linux/lib/stubs/libcuda.so 2025-05-07T19:50:37.4060729Z fbgemm_gpu_sparse_async_cumsum 2025-05-07T19:50:37.4060808Z fbgemm_gpu_tbe_utils 2025-05-07T19:50:37.4061002Z /github/home/miniconda/envs/build_binary/lib/stubs/libnvidia-ml.so 2025-05-07T19:50:37.4061082Z 2025-05-07T19:50:37.4061159Z Output Library: 2025-05-07T19:50:37.4061250Z fbgemm_gpu_tbe_index_select 2025-05-07T19:50:37.4061316Z 2025-05-07T19:50:37.4061414Z Destination Directory: 2025-05-07T19:50:37.4061493Z fbgemm_gpu 2025-05-07T19:50:37.4061593Z ================================================================================ 2025-05-07T19:50:37.4061597Z 2025-05-07T19:50:37.4061601Z 2025-05-07T19:50:37.4061604Z 2025-05-07T19:50:37.4061712Z ================================================================================ 2025-05-07T19:50:37.4061893Z GPU CPP Library Target: fbgemm_gpu_embedding_inplace_ops (SHARED) 2025-05-07T19:50:37.4061961Z 2025-05-07T19:50:37.4062046Z CPU_SRCS: 2025-05-07T19:50:37.4062214Z src/embedding_inplace_ops/embedding_inplace_update_cpu.cpp 2025-05-07T19:50:37.4062284Z 2025-05-07T19:50:37.4062353Z GPU_SRCS: 2025-05-07T19:50:37.4062521Z src/embedding_inplace_ops/embedding_inplace_update_gpu.cpp 2025-05-07T19:50:37.4062667Z src/embedding_inplace_ops/embedding_inplace_update.cu 2025-05-07T19:50:37.4062739Z 2025-05-07T19:50:37.4062827Z CUDA_SPECIFIC_SRCS: 2025-05-07T19:50:37.4062831Z 2025-05-07T19:50:37.4062903Z 2025-05-07T19:50:37.4062980Z HIP_SPECIFIC_SRCS: 2025-05-07T19:50:37.4062983Z 2025-05-07T19:50:37.4063062Z 2025-05-07T19:50:37.4063139Z OTHER_SRCS: 2025-05-07T19:50:37.4063143Z 2025-05-07T19:50:37.4063211Z 2025-05-07T19:50:37.4063285Z CC_FLAGS: 2025-05-07T19:50:37.4063289Z 2025-05-07T19:50:37.4063367Z 2025-05-07T19:50:37.4063440Z NVCC_FLAGS: 2025-05-07T19:50:37.4063531Z --expt-relaxed-constexpr 2025-05-07T19:50:37.4063631Z -D__CUDA_NO_HALF_OPERATORS__ 2025-05-07T19:50:37.4063726Z -D__CUDA_NO_BFLOAT16_CONVERSIONS__ 2025-05-07T19:50:37.4063867Z -D__CUDA_NO_HALF2_OPERATORS__ 2025-05-07T19:50:37.4063937Z 2025-05-07T19:50:37.4064021Z HIPCC_FLAGS: 2025-05-07T19:50:37.4064025Z 2025-05-07T19:50:37.4064090Z 2025-05-07T19:50:37.4064162Z INCLUDE_DIRS: 2025-05-07T19:50:37.4064271Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../include 2025-05-07T19:50:37.4064359Z /__w/FBGEMM/FBGEMM/fbgemm_gpu 2025-05-07T19:50:37.4064452Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include 2025-05-07T19:50:37.4064593Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../include 2025-05-07T19:50:37.4065219Z /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include 2025-05-07T19:50:37.4065613Z /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include/torch/csrc/api/include 2025-05-07T19:50:37.4065754Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src 2025-05-07T19:50:37.4065923Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include 2025-05-07T19:50:37.4066080Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include 2025-05-07T19:50:37.4066284Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include 2025-05-07T19:50:37.4066502Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include 2025-05-07T19:50:37.4066648Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include 2025-05-07T19:50:37.4066950Z /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/nvidia/nccl/include 2025-05-07T19:50:37.4067020Z 2025-05-07T19:50:37.4067118Z Selected Source Files: 2025-05-07T19:50:37.4067299Z src/embedding_inplace_ops/embedding_inplace_update_cpu.cpp 2025-05-07T19:50:37.4067472Z src/embedding_inplace_ops/embedding_inplace_update_gpu.cpp 2025-05-07T19:50:37.4067635Z src/embedding_inplace_ops/embedding_inplace_update.cu 2025-05-07T19:50:37.4067707Z 2025-05-07T19:50:37.4067799Z HIPified Source Files: 2025-05-07T19:50:37.4067804Z 2025-05-07T19:50:37.4067886Z 2025-05-07T19:50:37.4067976Z Library Dependencies: 2025-05-07T19:50:37.4068052Z torch 2025-05-07T19:50:37.4068133Z torch_library 2025-05-07T19:50:37.4068445Z /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/lib/libc10.so 2025-05-07T19:50:37.4068698Z /github/home/miniconda/envs/build_binary/targets/x86_64-linux/lib/libnvrtc.so 2025-05-07T19:50:37.4069023Z /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/lib/libc10_cuda.so 2025-05-07T19:50:37.4069377Z /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/nvidia/nccl/lib/libnccl.so.2 2025-05-07T19:50:37.4069649Z /github/home/miniconda/envs/build_binary/targets/x86_64-linux/lib/stubs/libcuda.so 2025-05-07T19:50:37.4069859Z /github/home/miniconda/envs/build_binary/lib/stubs/libnvidia-ml.so 2025-05-07T19:50:37.4069943Z 2025-05-07T19:50:37.4070024Z Output Library: 2025-05-07T19:50:37.4070132Z fbgemm_gpu_embedding_inplace_ops 2025-05-07T19:50:37.4070205Z 2025-05-07T19:50:37.4070306Z Destination Directory: 2025-05-07T19:50:37.4070382Z fbgemm_gpu 2025-05-07T19:50:37.4070489Z ================================================================================ 2025-05-07T19:50:37.4070497Z 2025-05-07T19:50:37.4070501Z 2025-05-07T19:50:37.4070505Z 2025-05-07T19:50:37.4070623Z ================================================================================ 2025-05-07T19:50:37.4070749Z GPU CPP Library Target: fbgemm_gpu_py (SHARED) 2025-05-07T19:50:37.4070822Z 2025-05-07T19:50:37.4070914Z CPU_SRCS: 2025-05-07T19:50:37.4071014Z src/memory_utils/memory_utils.cpp 2025-05-07T19:50:37.4071120Z src/memory_utils/memory_utils_ops.cpp 2025-05-07T19:50:37.4071425Z src/merge_pooled_embedding_ops/merge_pooled_embedding_ops_cpu.cpp 2025-05-07T19:50:37.4071650Z src/permute_multi_embedding_ops/permute_multi_embedding_function.cpp 2025-05-07T19:50:37.4071859Z src/permute_multi_embedding_ops/permute_multi_embedding_ops_cpu.cpp 2025-05-07T19:50:37.4072076Z src/permute_pooled_embedding_ops/permute_pooled_embedding_function.cpp 2025-05-07T19:50:37.4072327Z src/permute_pooled_embedding_ops/permute_pooled_embedding_ops_cpu.cpp 2025-05-07T19:50:37.4072567Z src/permute_pooled_embedding_ops/permute_pooled_embedding_ops_split_cpu.cpp 2025-05-07T19:50:37.4072822Z src/jagged_tensor_ops/jagged_tensor_ops_autograd.cpp 2025-05-07T19:50:37.4072964Z src/jagged_tensor_ops/jagged_tensor_ops_meta.cpp 2025-05-07T19:50:37.4073096Z src/jagged_tensor_ops/jagged_tensor_ops_cpu.cpp 2025-05-07T19:50:37.4073218Z src/input_combine_ops/input_combine_cpu.cpp 2025-05-07T19:50:37.4073369Z src/layout_transform_ops/layout_transform_ops_cpu.cpp 2025-05-07T19:50:37.4073552Z src/quantize_ops/quantize_ops_cpu.cpp 2025-05-07T19:50:37.4073664Z src/quantize_ops/quantize_ops_meta.cpp 2025-05-07T19:50:37.4073792Z src/sparse_ops/sparse_async_batched_cumsum.cpp 2025-05-07T19:50:37.4073907Z src/sparse_ops/sparse_ops_cpu.cpp 2025-05-07T19:50:37.4074011Z src/sparse_ops/sparse_ops_meta.cpp 2025-05-07T19:50:37.4074109Z src/tbe/eeg/eeg_models.cpp 2025-05-07T19:50:37.4074205Z src/tbe/eeg/eeg_utils.cpp 2025-05-07T19:50:37.4074319Z src/tbe/eeg/indices_estimator_ops.cpp 2025-05-07T19:50:37.4074422Z src/tbe/eeg/indices_estimator.cpp 2025-05-07T19:50:37.4074529Z src/tbe/eeg/indices_generator_ops.cpp 2025-05-07T19:50:37.4074639Z src/tbe/eeg/indices_generator.cpp 2025-05-07T19:50:37.4074881Z src/intraining_embedding_pruning_ops/intraining_embedding_pruning_gpu.cpp 2025-05-07T19:50:37.4075034Z src/layout_transform_ops/layout_transform_ops_gpu.cpp 2025-05-07T19:50:37.4075254Z src/permute_pooled_embedding_ops/permute_pooled_embedding_ops_gpu.cpp 2025-05-07T19:50:37.4075507Z src/permute_pooled_embedding_ops/permute_pooled_embedding_ops_split_gpu.cpp 2025-05-07T19:50:37.4075613Z src/quantize_ops/quantize_ops_gpu.cpp 2025-05-07T19:50:37.4075716Z src/sparse_ops/sparse_ops_gpu.cpp 2025-05-07T19:50:37.4075828Z src/metric_ops/metric_ops_host.cpp 2025-05-07T19:50:37.4075944Z src/input_combine_ops/input_combine_gpu.cpp 2025-05-07T19:50:37.4076141Z src/merge_pooled_embedding_ops/merge_pooled_embedding_ops_gpu.cpp 2025-05-07T19:50:37.4076244Z src/topology_utils.cpp 2025-05-07T19:50:37.4076315Z 2025-05-07T19:50:37.4076393Z GPU_SRCS: 2025-05-07T19:50:37.4076509Z src/histogram_binning_calibration_ops.cu 2025-05-07T19:50:37.4076634Z src/input_combine_ops/input_combine.cu 2025-05-07T19:50:37.4076847Z src/intraining_embedding_pruning_ops/intraining_embedding_pruning.cu 2025-05-07T19:50:37.4076947Z src/memory_utils/memory_utils.cu 2025-05-07T19:50:37.4077059Z src/memory_utils/memory_utils_ops.cu 2025-05-07T19:50:37.4077252Z src/jagged_tensor_ops/batched_dense_vec_jagged_2d_mul_backward.cu 2025-05-07T19:50:37.4077439Z src/jagged_tensor_ops/batched_dense_vec_jagged_2d_mul_forward.cu 2025-05-07T19:50:37.4077570Z src/jagged_tensor_ops/dense_to_jagged_forward.cu 2025-05-07T19:50:37.4077716Z src/jagged_tensor_ops/jagged_dense_bmm_forward.cu 2025-05-07T19:50:37.4077976Z src/jagged_tensor_ops/jagged_dense_dense_elementwise_add_jagged_output_forward.cu 2025-05-07T19:50:37.4078156Z src/jagged_tensor_ops/jagged_dense_elementwise_mul_backward.cu 2025-05-07T19:50:37.4078343Z src/jagged_tensor_ops/jagged_dense_elementwise_mul_forward.cu 2025-05-07T19:50:37.4078488Z src/jagged_tensor_ops/jagged_index_add_2d_forward.cu 2025-05-07T19:50:37.4078646Z src/jagged_tensor_ops/jagged_index_select_2d_forward.cu 2025-05-07T19:50:37.4078794Z src/jagged_tensor_ops/jagged_jagged_bmm_forward.cu 2025-05-07T19:50:37.4078926Z src/jagged_tensor_ops/jagged_softmax_backward.cu 2025-05-07T19:50:37.4079057Z src/jagged_tensor_ops/jagged_softmax_forward.cu 2025-05-07T19:50:37.4079173Z src/jagged_tensor_ops/jagged_tensor_ops.cu 2025-05-07T19:50:37.4079349Z src/jagged_tensor_ops/jagged_to_padded_dense_backward.cu 2025-05-07T19:50:37.4079505Z src/jagged_tensor_ops/jagged_to_padded_dense_forward.cu 2025-05-07T19:50:37.4079632Z src/jagged_tensor_ops/jagged_unique_indices.cu 2025-05-07T19:50:37.4079785Z src/jagged_tensor_ops/keyed_jagged_index_select_dim1.cu 2025-05-07T19:50:37.4079918Z src/layout_transform_ops/layout_transform_ops.cu 2025-05-07T19:50:37.4080027Z src/metric_ops/metric_ops.cu 2025-05-07T19:50:37.4080251Z src/permute_pooled_embedding_ops/permute_pooled_embedding_ops_split.cu 2025-05-07T19:50:37.4080446Z src/permute_pooled_embedding_ops/permute_pooled_embedding_ops.cu 2025-05-07T19:50:37.4080702Z src/permute_multi_embedding_ops/permute_multi_embedding_ops.cu 2025-05-07T19:50:37.4080807Z src/quantize_ops/quantize_bfloat16.cu 2025-05-07T19:50:37.4080922Z src/quantize_ops/quantize_fp8_rowwise.cu 2025-05-07T19:50:37.4081061Z src/quantize_ops/quantize_fused_8bit_rowwise.cu 2025-05-07T19:50:37.4081187Z src/quantize_ops/quantize_fused_nbit_rowwise.cu 2025-05-07T19:50:37.4081327Z src/quantize_ops/quantize_hfp8.cu 2025-05-07T19:50:37.4081431Z src/quantize_ops/quantize_msfp.cu 2025-05-07T19:50:37.4081570Z src/quantize_ops/quantize_padded_fp8_rowwise.cu 2025-05-07T19:50:37.4081669Z src/quantize_ops/quantize_mx.cu 2025-05-07T19:50:37.4081792Z src/sparse_ops/sparse_async_batched_cumsum.cu 2025-05-07T19:50:37.4081938Z src/sparse_ops/sparse_block_bucketize_features.cu 2025-05-07T19:50:37.4082055Z src/sparse_ops/sparse_bucketize_features.cu 2025-05-07T19:50:37.4082195Z src/sparse_ops/sparse_batched_unary_embeddings.cu 2025-05-07T19:50:37.4082345Z src/sparse_ops/sparse_compute_frequency_sequence.cu 2025-05-07T19:50:37.4082495Z src/sparse_ops/sparse_expand_into_jagged_permute.cu 2025-05-07T19:50:37.4082600Z src/sparse_ops/sparse_group_index.cu 2025-05-07T19:50:37.4082707Z src/sparse_ops/sparse_index_add.cu 2025-05-07T19:50:37.4082826Z src/sparse_ops/sparse_index_select.cu 2025-05-07T19:50:37.4082933Z src/sparse_ops/sparse_invert_permute.cu 2025-05-07T19:50:37.4083067Z src/sparse_ops/sparse_pack_segments_backward.cu 2025-05-07T19:50:37.4083206Z src/sparse_ops/sparse_pack_segments_forward.cu 2025-05-07T19:50:37.4083311Z src/sparse_ops/sparse_permute_1d.cu 2025-05-07T19:50:37.4083413Z src/sparse_ops/sparse_permute_2d.cu 2025-05-07T19:50:37.4083628Z src/sparse_ops/sparse_permute102.cu 2025-05-07T19:50:37.4083748Z src/sparse_ops/sparse_permute_embeddings.cu 2025-05-07T19:50:37.4083842Z src/sparse_ops/sparse_range.cu 2025-05-07T19:50:37.4083950Z src/sparse_ops/sparse_reorder_batched_ad.cu 2025-05-07T19:50:37.4084065Z src/sparse_ops/sparse_segment_sum_csr.cu 2025-05-07T19:50:37.4084159Z src/sparse_ops/sparse_zipf.cu 2025-05-07T19:50:37.4084225Z 2025-05-07T19:50:37.4084305Z CUDA_SPECIFIC_SRCS: 2025-05-07T19:50:37.4084310Z 2025-05-07T19:50:37.4084387Z 2025-05-07T19:50:37.4084465Z HIP_SPECIFIC_SRCS: 2025-05-07T19:50:37.4084469Z 2025-05-07T19:50:37.4084539Z 2025-05-07T19:50:37.4084623Z OTHER_SRCS: 2025-05-07T19:50:37.4084628Z 2025-05-07T19:50:37.4084693Z 2025-05-07T19:50:37.4084765Z CC_FLAGS: 2025-05-07T19:50:37.4084772Z 2025-05-07T19:50:37.4084845Z 2025-05-07T19:50:37.4084916Z NVCC_FLAGS: 2025-05-07T19:50:37.4085004Z --expt-relaxed-constexpr 2025-05-07T19:50:37.4085093Z -D__CUDA_NO_HALF_OPERATORS__ 2025-05-07T19:50:37.4085201Z -D__CUDA_NO_BFLOAT16_CONVERSIONS__ 2025-05-07T19:50:37.4085286Z -D__CUDA_NO_HALF2_OPERATORS__ 2025-05-07T19:50:37.4085352Z 2025-05-07T19:50:37.4085428Z HIPCC_FLAGS: 2025-05-07T19:50:37.4085444Z 2025-05-07T19:50:37.4085509Z 2025-05-07T19:50:37.4085583Z INCLUDE_DIRS: 2025-05-07T19:50:37.4085683Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../include 2025-05-07T19:50:37.4085783Z /__w/FBGEMM/FBGEMM/fbgemm_gpu 2025-05-07T19:50:37.4085880Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include 2025-05-07T19:50:37.4085976Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../include 2025-05-07T19:50:37.4086249Z /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include 2025-05-07T19:50:37.4086612Z /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include/torch/csrc/api/include 2025-05-07T19:50:37.4086745Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src 2025-05-07T19:50:37.4086896Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include 2025-05-07T19:50:37.4087048Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include 2025-05-07T19:50:37.4087235Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include 2025-05-07T19:50:37.4087422Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include 2025-05-07T19:50:37.4087564Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include 2025-05-07T19:50:37.4087894Z /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/nvidia/nccl/include 2025-05-07T19:50:37.4087964Z 2025-05-07T19:50:37.4088063Z Selected Source Files: 2025-05-07T19:50:37.4088155Z src/memory_utils/memory_utils.cpp 2025-05-07T19:50:37.4088255Z src/memory_utils/memory_utils_ops.cpp 2025-05-07T19:50:37.4088440Z src/merge_pooled_embedding_ops/merge_pooled_embedding_ops_cpu.cpp 2025-05-07T19:50:37.4088689Z src/permute_multi_embedding_ops/permute_multi_embedding_function.cpp 2025-05-07T19:50:37.4088881Z src/permute_multi_embedding_ops/permute_multi_embedding_ops_cpu.cpp 2025-05-07T19:50:37.4089086Z src/permute_pooled_embedding_ops/permute_pooled_embedding_function.cpp 2025-05-07T19:50:37.4089302Z src/permute_pooled_embedding_ops/permute_pooled_embedding_ops_cpu.cpp 2025-05-07T19:50:37.4089521Z src/permute_pooled_embedding_ops/permute_pooled_embedding_ops_split_cpu.cpp 2025-05-07T19:50:37.4089659Z src/jagged_tensor_ops/jagged_tensor_ops_autograd.cpp 2025-05-07T19:50:37.4089792Z src/jagged_tensor_ops/jagged_tensor_ops_meta.cpp 2025-05-07T19:50:37.4089917Z src/jagged_tensor_ops/jagged_tensor_ops_cpu.cpp 2025-05-07T19:50:37.4090026Z src/input_combine_ops/input_combine_cpu.cpp 2025-05-07T19:50:37.4090164Z src/layout_transform_ops/layout_transform_ops_cpu.cpp 2025-05-07T19:50:37.4090276Z src/quantize_ops/quantize_ops_cpu.cpp 2025-05-07T19:50:37.4090375Z src/quantize_ops/quantize_ops_meta.cpp 2025-05-07T19:50:37.4090497Z src/sparse_ops/sparse_async_batched_cumsum.cpp 2025-05-07T19:50:37.4090600Z src/sparse_ops/sparse_ops_cpu.cpp 2025-05-07T19:50:37.4090694Z src/sparse_ops/sparse_ops_meta.cpp 2025-05-07T19:50:37.4090780Z src/tbe/eeg/eeg_models.cpp 2025-05-07T19:50:37.4090864Z src/tbe/eeg/eeg_utils.cpp 2025-05-07T19:50:37.4090974Z src/tbe/eeg/indices_estimator_ops.cpp 2025-05-07T19:50:37.4091069Z src/tbe/eeg/indices_estimator.cpp 2025-05-07T19:50:37.4091164Z src/tbe/eeg/indices_generator_ops.cpp 2025-05-07T19:50:37.4091262Z src/tbe/eeg/indices_generator.cpp 2025-05-07T19:50:37.4091486Z src/intraining_embedding_pruning_ops/intraining_embedding_pruning_gpu.cpp 2025-05-07T19:50:37.4091632Z src/layout_transform_ops/layout_transform_ops_gpu.cpp 2025-05-07T19:50:37.4091844Z src/permute_pooled_embedding_ops/permute_pooled_embedding_ops_gpu.cpp 2025-05-07T19:50:37.4092066Z src/permute_pooled_embedding_ops/permute_pooled_embedding_ops_split_gpu.cpp 2025-05-07T19:50:37.4092168Z src/quantize_ops/quantize_ops_gpu.cpp 2025-05-07T19:50:37.4092266Z src/sparse_ops/sparse_ops_gpu.cpp 2025-05-07T19:50:37.4092373Z src/metric_ops/metric_ops_host.cpp 2025-05-07T19:50:37.4092484Z src/input_combine_ops/input_combine_gpu.cpp 2025-05-07T19:50:37.4092671Z src/merge_pooled_embedding_ops/merge_pooled_embedding_ops_gpu.cpp 2025-05-07T19:50:37.4092760Z src/topology_utils.cpp 2025-05-07T19:50:37.4092867Z src/histogram_binning_calibration_ops.cu 2025-05-07T19:50:37.4092966Z src/input_combine_ops/input_combine.cu 2025-05-07T19:50:37.4093162Z src/intraining_embedding_pruning_ops/intraining_embedding_pruning.cu 2025-05-07T19:50:37.4093264Z src/memory_utils/memory_utils.cu 2025-05-07T19:50:37.4093365Z src/memory_utils/memory_utils_ops.cu 2025-05-07T19:50:37.4093542Z src/jagged_tensor_ops/batched_dense_vec_jagged_2d_mul_backward.cu 2025-05-07T19:50:37.4093719Z src/jagged_tensor_ops/batched_dense_vec_jagged_2d_mul_forward.cu 2025-05-07T19:50:37.4093840Z src/jagged_tensor_ops/dense_to_jagged_forward.cu 2025-05-07T19:50:37.4093961Z src/jagged_tensor_ops/jagged_dense_bmm_forward.cu 2025-05-07T19:50:37.4094207Z src/jagged_tensor_ops/jagged_dense_dense_elementwise_add_jagged_output_forward.cu 2025-05-07T19:50:37.4094376Z src/jagged_tensor_ops/jagged_dense_elementwise_mul_backward.cu 2025-05-07T19:50:37.4094539Z src/jagged_tensor_ops/jagged_dense_elementwise_mul_forward.cu 2025-05-07T19:50:37.4094674Z src/jagged_tensor_ops/jagged_index_add_2d_forward.cu 2025-05-07T19:50:37.4094819Z src/jagged_tensor_ops/jagged_index_select_2d_forward.cu 2025-05-07T19:50:37.4094946Z src/jagged_tensor_ops/jagged_jagged_bmm_forward.cu 2025-05-07T19:50:37.4095066Z src/jagged_tensor_ops/jagged_softmax_backward.cu 2025-05-07T19:50:37.4095411Z src/jagged_tensor_ops/jagged_softmax_forward.cu 2025-05-07T19:50:37.4095520Z src/jagged_tensor_ops/jagged_tensor_ops.cu 2025-05-07T19:50:37.4095672Z src/jagged_tensor_ops/jagged_to_padded_dense_backward.cu 2025-05-07T19:50:37.4095823Z src/jagged_tensor_ops/jagged_to_padded_dense_forward.cu 2025-05-07T19:50:37.4095939Z src/jagged_tensor_ops/jagged_unique_indices.cu 2025-05-07T19:50:37.4096122Z src/jagged_tensor_ops/keyed_jagged_index_select_dim1.cu 2025-05-07T19:50:37.4096251Z src/layout_transform_ops/layout_transform_ops.cu 2025-05-07T19:50:37.4096353Z src/metric_ops/metric_ops.cu 2025-05-07T19:50:37.4096560Z src/permute_pooled_embedding_ops/permute_pooled_embedding_ops_split.cu 2025-05-07T19:50:37.4096741Z src/permute_pooled_embedding_ops/permute_pooled_embedding_ops.cu 2025-05-07T19:50:37.4096922Z src/permute_multi_embedding_ops/permute_multi_embedding_ops.cu 2025-05-07T19:50:37.4097019Z src/quantize_ops/quantize_bfloat16.cu 2025-05-07T19:50:37.4097127Z src/quantize_ops/quantize_fp8_rowwise.cu 2025-05-07T19:50:37.4097248Z src/quantize_ops/quantize_fused_8bit_rowwise.cu 2025-05-07T19:50:37.4097381Z src/quantize_ops/quantize_fused_nbit_rowwise.cu 2025-05-07T19:50:37.4097487Z src/quantize_ops/quantize_hfp8.cu 2025-05-07T19:50:37.4097580Z src/quantize_ops/quantize_msfp.cu 2025-05-07T19:50:37.4097698Z src/quantize_ops/quantize_padded_fp8_rowwise.cu 2025-05-07T19:50:37.4097797Z src/quantize_ops/quantize_mx.cu 2025-05-07T19:50:37.4097919Z src/sparse_ops/sparse_async_batched_cumsum.cu 2025-05-07T19:50:37.4098046Z src/sparse_ops/sparse_block_bucketize_features.cu 2025-05-07T19:50:37.4098155Z src/sparse_ops/sparse_bucketize_features.cu 2025-05-07T19:50:37.4098292Z src/sparse_ops/sparse_batched_unary_embeddings.cu 2025-05-07T19:50:37.4098430Z src/sparse_ops/sparse_compute_frequency_sequence.cu 2025-05-07T19:50:37.4098561Z src/sparse_ops/sparse_expand_into_jagged_permute.cu 2025-05-07T19:50:37.4098668Z src/sparse_ops/sparse_group_index.cu 2025-05-07T19:50:37.4098766Z src/sparse_ops/sparse_index_add.cu 2025-05-07T19:50:37.4098868Z src/sparse_ops/sparse_index_select.cu 2025-05-07T19:50:37.4098975Z src/sparse_ops/sparse_invert_permute.cu 2025-05-07T19:50:37.4099103Z src/sparse_ops/sparse_pack_segments_backward.cu 2025-05-07T19:50:37.4099218Z src/sparse_ops/sparse_pack_segments_forward.cu 2025-05-07T19:50:37.4099313Z src/sparse_ops/sparse_permute_1d.cu 2025-05-07T19:50:37.4099417Z src/sparse_ops/sparse_permute_2d.cu 2025-05-07T19:50:37.4099516Z src/sparse_ops/sparse_permute102.cu 2025-05-07T19:50:37.4099624Z src/sparse_ops/sparse_permute_embeddings.cu 2025-05-07T19:50:37.4099724Z src/sparse_ops/sparse_range.cu 2025-05-07T19:50:37.4099842Z src/sparse_ops/sparse_reorder_batched_ad.cu 2025-05-07T19:50:37.4099946Z src/sparse_ops/sparse_segment_sum_csr.cu 2025-05-07T19:50:37.4100042Z src/sparse_ops/sparse_zipf.cu 2025-05-07T19:50:37.4100119Z 2025-05-07T19:50:37.4100203Z HIPified Source Files: 2025-05-07T19:50:37.4100206Z 2025-05-07T19:50:37.4100274Z 2025-05-07T19:50:37.4100362Z Library Dependencies: 2025-05-07T19:50:37.4100440Z torch 2025-05-07T19:50:37.4100512Z torch_library 2025-05-07T19:50:37.4100805Z /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/lib/libc10.so 2025-05-07T19:50:37.4101046Z /github/home/miniconda/envs/build_binary/targets/x86_64-linux/lib/libnvrtc.so 2025-05-07T19:50:37.4101354Z /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/lib/libc10_cuda.so 2025-05-07T19:50:37.4101683Z /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/nvidia/nccl/lib/libnccl.so.2 2025-05-07T19:50:37.4101941Z /github/home/miniconda/envs/build_binary/targets/x86_64-linux/lib/stubs/libcuda.so 2025-05-07T19:50:37.4102011Z fbgemm 2025-05-07T19:50:37.4102105Z fbgemm_gpu_sparse_async_cumsum 2025-05-07T19:50:37.4102203Z fbgemm_gpu_embedding_inplace_ops 2025-05-07T19:50:37.4102301Z fbgemm_gpu_tbe_index_select 2025-05-07T19:50:37.4102381Z fbgemm_gpu_tbe_cache 2025-05-07T19:50:37.4102468Z fbgemm_gpu_tbe_optimizers 2025-05-07T19:50:37.4102605Z fbgemm_gpu_tbe_utils 2025-05-07T19:50:37.4102800Z /github/home/miniconda/envs/build_binary/lib/stubs/libnvidia-ml.so 2025-05-07T19:50:37.4102871Z 2025-05-07T19:50:37.4102949Z Output Library: 2025-05-07T19:50:37.4103036Z fbgemm_gpu_py 2025-05-07T19:50:37.4103104Z 2025-05-07T19:50:37.4103189Z Destination Directory: 2025-05-07T19:50:37.4103275Z fbgemm_gpu 2025-05-07T19:50:37.4103412Z ================================================================================ 2025-05-07T19:50:37.4103417Z 2025-05-07T19:50:37.4103507Z -- Configuring done (9.1s) 2025-05-07T19:50:37.5419464Z -- Generating done (0.2s) 2025-05-07T19:50:37.5433926Z -- Build files have been written to: /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.12/cmake-build 2025-05-07T19:50:37.5611970Z Change Dir: '/__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.12/cmake-build' 2025-05-07T19:50:37.5612061Z 2025-05-07T19:50:37.5612651Z Run Build Command(s): /github/home/miniconda/envs/build_binary/bin/ninja -v -j 48 install 2025-05-07T19:50:37.6838620Z [1/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dasmjit_EXPORTS -I/github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -MD -MT CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/errorhandler.cpp.o -MF CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/errorhandler.cpp.o.d -o CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/errorhandler.cpp.o -c /__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/errorhandler.cpp 2025-05-07T19:50:37.6850266Z clang-16: warning: argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T19:50:37.7041080Z [2/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dasmjit_EXPORTS -I/github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -MD -MT CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/support.cpp.o -MF CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/support.cpp.o.d -o CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/support.cpp.o -c /__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/support.cpp 2025-05-07T19:50:37.7052973Z clang-16: warning: argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T19:50:37.7081219Z [3/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dasmjit_EXPORTS -I/github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -MD -MT CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/globals.cpp.o -MF CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/globals.cpp.o.d -o CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/globals.cpp.o -c /__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/globals.cpp 2025-05-07T19:50:37.7093239Z clang-16: warning: argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T19:50:37.7137582Z [4/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dasmjit_EXPORTS -I/github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -MD -MT CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/zonelist.cpp.o -MF CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/zonelist.cpp.o.d -o CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/zonelist.cpp.o -c /__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/zonelist.cpp 2025-05-07T19:50:37.7148771Z clang-16: warning: argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T19:50:37.7280085Z [5/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dasmjit_EXPORTS -I/github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -MD -MT CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/zonestack.cpp.o -MF CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/zonestack.cpp.o.d -o CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/zonestack.cpp.o -c /__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/zonestack.cpp 2025-05-07T19:50:37.7291681Z clang-16: warning: argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T19:50:37.7306498Z [6/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dasmjit_EXPORTS -I/github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -MD -MT CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/osutils.cpp.o -MF CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/osutils.cpp.o.d -o CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/osutils.cpp.o -c /__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/osutils.cpp 2025-05-07T19:50:37.7317696Z clang-16: warning: argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T19:50:37.7377103Z [7/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dasmjit_EXPORTS -I/github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -MD -MT CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/zonehash.cpp.o -MF CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/zonehash.cpp.o.d -o CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/zonehash.cpp.o -c /__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/zonehash.cpp 2025-05-07T19:50:37.7389716Z clang-16: warning: argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T19:50:37.7401664Z [8/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dasmjit_EXPORTS -I/github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -MD -MT CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/operand.cpp.o -MF CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/operand.cpp.o.d -o CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/operand.cpp.o -c /__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/operand.cpp 2025-05-07T19:50:37.7413873Z clang-16: warning: argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T19:50:37.7426467Z [9/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dasmjit_EXPORTS -I/github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -MD -MT CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/zonetree.cpp.o -MF CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/zonetree.cpp.o.d -o CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/zonetree.cpp.o -c /__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/zonetree.cpp 2025-05-07T19:50:37.7438840Z clang-16: warning: argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T19:50:37.7566705Z [10/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dasmjit_EXPORTS -I/github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -MD -MT CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/environment.cpp.o -MF CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/environment.cpp.o.d -o CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/environment.cpp.o -c /__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/environment.cpp 2025-05-07T19:50:37.7578816Z clang-16: warning: argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T19:50:37.7684352Z [11/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dasmjit_EXPORTS -I/github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -MD -MT CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/zone.cpp.o -MF CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/zone.cpp.o.d -o CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/zone.cpp.o -c /__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/zone.cpp 2025-05-07T19:50:37.7696233Z clang-16: warning: argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T19:50:37.7856532Z [12/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dasmjit_EXPORTS -I/github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -MD -MT CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/string.cpp.o -MF CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/string.cpp.o.d -o CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/string.cpp.o -c /__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/string.cpp 2025-05-07T19:50:37.7868227Z clang-16: warning: argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T19:50:37.8004796Z [13/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dasmjit_EXPORTS -I/github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -MD -MT CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/constpool.cpp.o -MF CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/constpool.cpp.o.d -o CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/constpool.cpp.o -c /__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/constpool.cpp 2025-05-07T19:50:37.8016478Z clang-16: warning: argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T19:50:37.8082354Z [14/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dasmjit_EXPORTS -I/github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -MD -MT CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/target.cpp.o -MF CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/target.cpp.o.d -o CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/target.cpp.o -c /__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/target.cpp 2025-05-07T19:50:37.8094716Z clang-16: warning: argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T19:50:37.8107302Z [15/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dasmjit_EXPORTS -I/github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -MD -MT CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/inst.cpp.o -MF CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/inst.cpp.o.d -o CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/inst.cpp.o -c /__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/inst.cpp 2025-05-07T19:50:37.8118773Z clang-16: warning: argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T19:50:37.8129837Z [16/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dasmjit_EXPORTS -I/github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -MD -MT CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/type.cpp.o -MF CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/type.cpp.o.d -o CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/type.cpp.o -c /__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/type.cpp 2025-05-07T19:50:37.8141348Z clang-16: warning: argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T19:50:37.8247800Z [17/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dasmjit_EXPORTS -I/github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -MD -MT CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/logger.cpp.o -MF CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/logger.cpp.o.d -o CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/logger.cpp.o -c /__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/logger.cpp 2025-05-07T19:50:37.8259556Z clang-16: warning: argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T19:50:37.8295613Z [18/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dasmjit_EXPORTS -I/github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -MD -MT CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/instdb.cpp.o -MF CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/instdb.cpp.o.d -o CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/instdb.cpp.o -c /__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/instdb.cpp 2025-05-07T19:50:37.8307350Z clang-16: warning: argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T19:50:37.8411132Z [19/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dasmjit_EXPORTS -I/github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -MD -MT CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/arm/a64operand.cpp.o -MF CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/arm/a64operand.cpp.o.d -o CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/arm/a64operand.cpp.o -c /__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/arm/a64operand.cpp 2025-05-07T19:50:37.8422927Z clang-16: warning: argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T19:50:37.8687515Z [20/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dasmjit_EXPORTS -I/github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -MD -MT CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/jitruntime.cpp.o -MF CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/jitruntime.cpp.o.d -o CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/jitruntime.cpp.o -c /__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/jitruntime.cpp 2025-05-07T19:50:37.8699298Z clang-16: warning: argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T19:50:37.9052113Z [21/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dasmjit_EXPORTS -I/github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -MD -MT CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/rastack.cpp.o -MF CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/rastack.cpp.o.d -o CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/rastack.cpp.o -c /__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/rastack.cpp 2025-05-07T19:50:37.9064299Z clang-16: warning: argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T19:50:37.9076615Z [22/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dasmjit_EXPORTS -I/github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -MD -MT CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/emitterutils.cpp.o -MF CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/emitterutils.cpp.o.d -o CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/emitterutils.cpp.o -c /__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/emitterutils.cpp 2025-05-07T19:50:37.9088960Z clang-16: warning: argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T19:50:37.9115788Z [23/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dasmjit_EXPORTS -I/github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -MD -MT CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/zonevector.cpp.o -MF CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/zonevector.cpp.o.d -o CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/zonevector.cpp.o -c /__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/zonevector.cpp 2025-05-07T19:50:37.9128186Z clang-16: warning: argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T19:50:37.9142847Z [24/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dasmjit_EXPORTS -I/github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -MD -MT CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/codewriter.cpp.o -MF CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/codewriter.cpp.o.d -o CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/codewriter.cpp.o -c /__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/codewriter.cpp 2025-05-07T19:50:37.9154893Z clang-16: warning: argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T19:50:37.9166762Z [25/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dasmjit_EXPORTS -I/github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -MD -MT CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/func.cpp.o -MF CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/func.cpp.o.d -o CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/func.cpp.o -c /__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/func.cpp 2025-05-07T19:50:37.9178637Z clang-16: warning: argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T19:50:37.9194927Z [26/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dasmjit_EXPORTS -I/github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -MD -MT CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/arm/a64func.cpp.o -MF CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/arm/a64func.cpp.o.d -o CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/arm/a64func.cpp.o -c /__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/arm/a64func.cpp 2025-05-07T19:50:37.9207097Z clang-16: warning: argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T19:50:37.9374282Z [27/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dasmjit_EXPORTS -I/github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -MD -MT CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/emitter.cpp.o -MF CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/emitter.cpp.o.d -o CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/emitter.cpp.o -c /__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/emitter.cpp 2025-05-07T19:50:37.9385876Z clang-16: warning: argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T19:50:37.9513068Z [28/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dasmjit_EXPORTS -I/github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -MD -MT CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/funcargscontext.cpp.o -MF CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/funcargscontext.cpp.o.d -o CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/funcargscontext.cpp.o -c /__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/funcargscontext.cpp 2025-05-07T19:50:37.9525744Z clang-16: warning: argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T19:50:37.9537989Z [29/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dasmjit_EXPORTS -I/github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -MD -MT CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/arm/a64instapi.cpp.o -MF CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/arm/a64instapi.cpp.o.d -o CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/arm/a64instapi.cpp.o -c /__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/arm/a64instapi.cpp 2025-05-07T19:50:37.9549451Z clang-16: warning: argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T19:50:37.9636639Z [30/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dasmjit_EXPORTS -I/github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -MD -MT CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/arm/a64instdb.cpp.o -MF CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/arm/a64instdb.cpp.o.d -o CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/arm/a64instdb.cpp.o -c /__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/arm/a64instdb.cpp 2025-05-07T19:50:37.9648614Z clang-16: warning: argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T19:50:37.9732258Z [31/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dasmjit_EXPORTS -I/github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -MD -MT CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/arm/a64formatter.cpp.o -MF CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/arm/a64formatter.cpp.o.d -o CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/arm/a64formatter.cpp.o -c /__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/arm/a64formatter.cpp 2025-05-07T19:50:37.9744357Z clang-16: warning: argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T19:50:37.9755882Z [32/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dasmjit_EXPORTS -I/github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -MD -MT CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/assembler.cpp.o -MF CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/assembler.cpp.o.d -o CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/assembler.cpp.o -c /__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/assembler.cpp 2025-05-07T19:50:37.9767476Z clang-16: warning: argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T19:50:37.9802484Z [33/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dasmjit_EXPORTS -I/github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -MD -MT CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/emithelper.cpp.o -MF CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/emithelper.cpp.o.d -o CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/emithelper.cpp.o -c /__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/emithelper.cpp 2025-05-07T19:50:37.9814055Z clang-16: warning: argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T19:50:37.9925594Z [34/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dasmjit_EXPORTS -I/github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -MD -MT CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/arm/a64builder.cpp.o -MF CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/arm/a64builder.cpp.o.d -o CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/arm/a64builder.cpp.o -c /__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/arm/a64builder.cpp 2025-05-07T19:50:37.9937369Z clang-16: warning: argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T19:50:38.0408445Z [35/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dasmjit_EXPORTS -I/github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -MD -MT CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/jitallocator.cpp.o -MF CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/jitallocator.cpp.o.d -o CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/jitallocator.cpp.o -c /__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/jitallocator.cpp 2025-05-07T19:50:38.0420230Z clang-16: warning: argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T19:50:38.0551846Z [36/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dasmjit_EXPORTS -I/github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -MD -MT CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/arm/armformatter.cpp.o -MF CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/arm/armformatter.cpp.o.d -o CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/arm/armformatter.cpp.o -c /__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/arm/armformatter.cpp 2025-05-07T19:50:38.0558382Z clang-16: warning: argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T19:50:38.0621339Z [37/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dasmjit_EXPORTS -I/github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -MD -MT CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/x86/x86operand.cpp.o -MF CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/x86/x86operand.cpp.o.d -o CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/x86/x86operand.cpp.o -c /__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/x86/x86operand.cpp 2025-05-07T19:50:38.0633784Z clang-16: warning: argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T19:50:38.0645378Z [38/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dasmjit_EXPORTS -I/github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -MD -MT CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/formatter.cpp.o -MF CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/formatter.cpp.o.d -o CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/formatter.cpp.o -c /__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/formatter.cpp 2025-05-07T19:50:38.0657218Z clang-16: warning: argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T19:50:38.0739487Z [39/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dasmjit_EXPORTS -I/github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -MD -MT CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/arm/a64compiler.cpp.o -MF CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/arm/a64compiler.cpp.o.d -o CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/arm/a64compiler.cpp.o -c /__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/arm/a64compiler.cpp 2025-05-07T19:50:38.0814849Z clang-16: warning: argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T19:50:38.0826570Z [40/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dasmjit_EXPORTS -I/github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -MD -MT CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/archtraits.cpp.o -MF CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/archtraits.cpp.o.d -o CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/archtraits.cpp.o -c /__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/archtraits.cpp 2025-05-07T19:50:38.0837611Z clang-16: warning: argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T19:50:38.0963020Z [41/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dasmjit_EXPORTS -I/github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -MD -MT CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/arm/a64emithelper.cpp.o -MF CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/arm/a64emithelper.cpp.o.d -o CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/arm/a64emithelper.cpp.o -c /__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/arm/a64emithelper.cpp 2025-05-07T19:50:38.0974813Z clang-16: warning: argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T19:50:38.1130910Z [42/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dasmjit_EXPORTS -I/github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -MD -MT CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/virtmem.cpp.o -MF CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/virtmem.cpp.o.d -o CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/virtmem.cpp.o -c /__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/virtmem.cpp 2025-05-07T19:50:38.1142425Z clang-16: warning: argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T19:50:38.1154317Z [43/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dasmjit_EXPORTS -I/github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -MD -MT CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/compiler.cpp.o -MF CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/compiler.cpp.o.d -o CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/compiler.cpp.o -c /__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/compiler.cpp 2025-05-07T19:50:38.1165618Z clang-16: warning: argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T19:50:38.1415877Z [44/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dasmjit_EXPORTS -I/github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -MD -MT CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/builder.cpp.o -MF CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/builder.cpp.o.d -o CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/builder.cpp.o -c /__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/builder.cpp 2025-05-07T19:50:38.1428327Z clang-16: warning: argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T19:50:38.2075377Z [45/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dasmjit_EXPORTS -I/github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -MD -MT CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/cpuinfo.cpp.o -MF CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/cpuinfo.cpp.o.d -o CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/cpuinfo.cpp.o -c /__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/cpuinfo.cpp 2025-05-07T19:50:38.2087599Z clang-16: warning: argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T19:50:38.2370597Z [46/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dasmjit_EXPORTS -I/github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -MD -MT CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/x86/x86builder.cpp.o -MF CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/x86/x86builder.cpp.o.d -o CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/x86/x86builder.cpp.o -c /__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/x86/x86builder.cpp 2025-05-07T19:50:38.2383585Z clang-16: warning: argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T19:50:38.2759080Z [47/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dasmjit_EXPORTS -I/github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -MD -MT CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/x86/x86formatter.cpp.o -MF CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/x86/x86formatter.cpp.o.d -o CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/x86/x86formatter.cpp.o -c /__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/x86/x86formatter.cpp 2025-05-07T19:50:38.2772203Z clang-16: warning: argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T19:50:38.3050692Z [48/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dasmjit_EXPORTS -I/github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -MD -MT CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/x86/x86func.cpp.o -MF CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/x86/x86func.cpp.o.d -o CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/x86/x86func.cpp.o -c /__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/x86/x86func.cpp 2025-05-07T19:50:38.3063143Z clang-16: warning: argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T19:50:38.3229641Z [49/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dasmjit_EXPORTS -I/github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -MD -MT CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/ralocal.cpp.o -MF CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/ralocal.cpp.o.d -o CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/ralocal.cpp.o -c /__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/ralocal.cpp 2025-05-07T19:50:38.3242518Z clang-16: warning: argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T19:50:38.3358547Z [50/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dasmjit_EXPORTS -I/github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -MD -MT CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/x86/x86compiler.cpp.o -MF CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/x86/x86compiler.cpp.o.d -o CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/x86/x86compiler.cpp.o -c /__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/x86/x86compiler.cpp 2025-05-07T19:50:38.3371426Z clang-16: warning: argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T19:50:38.3777425Z [51/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dasmjit_EXPORTS -I/github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -MD -MT CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/x86/x86emithelper.cpp.o -MF CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/x86/x86emithelper.cpp.o.d -o CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/x86/x86emithelper.cpp.o -c /__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/x86/x86emithelper.cpp 2025-05-07T19:50:38.3790321Z clang-16: warning: argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T19:50:38.4296180Z [52/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dasmjit_EXPORTS -I/github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -MD -MT CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/x86/x86instapi.cpp.o -MF CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/x86/x86instapi.cpp.o.d -o CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/x86/x86instapi.cpp.o -c /__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/x86/x86instapi.cpp 2025-05-07T19:50:38.4308625Z clang-16: warning: argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T19:50:38.4788541Z [53/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dasmjit_EXPORTS -I/github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -MD -MT CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/arm/a64rapass.cpp.o -MF CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/arm/a64rapass.cpp.o.d -o CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/arm/a64rapass.cpp.o -c /__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/arm/a64rapass.cpp 2025-05-07T19:50:38.4801186Z clang-16: warning: argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T19:50:38.6141732Z [54/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dasmjit_EXPORTS -I/github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -MD -MT CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/codeholder.cpp.o -MF CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/codeholder.cpp.o.d -o CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/codeholder.cpp.o -c /__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/codeholder.cpp 2025-05-07T19:50:38.6154821Z clang-16: warning: argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T19:50:38.6977500Z [55/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dasmjit_EXPORTS -I/github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -MD -MT CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/rapass.cpp.o -MF CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/rapass.cpp.o.d -o CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/rapass.cpp.o -c /__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/rapass.cpp 2025-05-07T19:50:38.6989814Z clang-16: warning: argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T19:50:38.7901625Z [56/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -mavx512f -mavx512bw -mavx512dq -mavx512vl -fopenmp -MD -MT CMakeFiles/fbgemm.dir/__w/FBGEMM/FBGEMM/src/EmbeddingSpMDMAvx512.cc.o -MF CMakeFiles/fbgemm.dir/__w/FBGEMM/FBGEMM/src/EmbeddingSpMDMAvx512.cc.o.d -o CMakeFiles/fbgemm.dir/__w/FBGEMM/FBGEMM/src/EmbeddingSpMDMAvx512.cc.o -c /__w/FBGEMM/FBGEMM/src/EmbeddingSpMDMAvx512.cc 2025-05-07T19:50:38.7920538Z clang-16: warning: argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T19:50:38.7939249Z [57/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm.dir/__w/FBGEMM/FBGEMM/src/EmbeddingSpMDMAvx2.cc.o -MF CMakeFiles/fbgemm.dir/__w/FBGEMM/FBGEMM/src/EmbeddingSpMDMAvx2.cc.o.d -o CMakeFiles/fbgemm.dir/__w/FBGEMM/FBGEMM/src/EmbeddingSpMDMAvx2.cc.o -c /__w/FBGEMM/FBGEMM/src/EmbeddingSpMDMAvx2.cc 2025-05-07T19:50:38.7957602Z clang-16: warning: argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T19:50:38.8933936Z [58/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dasmjit_EXPORTS -I/github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -MD -MT CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/x86/x86instdb.cpp.o -MF CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/x86/x86instdb.cpp.o.d -o CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/x86/x86instdb.cpp.o -c /__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/x86/x86instdb.cpp 2025-05-07T19:50:38.8946241Z clang-16: warning: argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T19:50:38.9572736Z [59/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dasmjit_EXPORTS -I/github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -MD -MT CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/x86/x86rapass.cpp.o -MF CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/x86/x86rapass.cpp.o.d -o CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/x86/x86rapass.cpp.o -c /__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/x86/x86rapass.cpp 2025-05-07T19:50:38.9585641Z clang-16: warning: argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T19:50:39.0286044Z [60/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dasmjit_EXPORTS -I/github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -MD -MT CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/arm/a64assembler.cpp.o -MF CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/arm/a64assembler.cpp.o.d -o CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/arm/a64assembler.cpp.o -c /__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/arm/a64assembler.cpp 2025-05-07T19:50:39.0298758Z clang-16: warning: argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T19:50:39.1295832Z [61/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -MD -MT CMakeFiles/fbgemm.dir/__w/FBGEMM/FBGEMM/src/QuantUtils.cc.o -MF CMakeFiles/fbgemm.dir/__w/FBGEMM/FBGEMM/src/QuantUtils.cc.o.d -o CMakeFiles/fbgemm.dir/__w/FBGEMM/FBGEMM/src/QuantUtils.cc.o -c /__w/FBGEMM/FBGEMM/src/QuantUtils.cc 2025-05-07T19:50:39.1314549Z clang-16: warning: argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T19:50:39.3461472Z [62/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dasmjit_EXPORTS -I/github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -MD -MT CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/x86/x86assembler.cpp.o -MF CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/x86/x86assembler.cpp.o.d -o CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/x86/x86assembler.cpp.o -c /__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/x86/x86assembler.cpp 2025-05-07T19:50:39.3474509Z clang-16: warning: argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T19:50:39.9344543Z [63/608] : && /github/home/miniconda/envs/build_binary/bin/c++ -fPIC -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -L/github/home/miniconda/envs/build_binary/targets/x86_64-linux/lib -L/github/home/miniconda/envs/build_binary/targets/x86_64-linux/lib/stubs -s -shared -Wl,-soname,asmjit.so -o asmjit.so CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/arm/a64assembler.cpp.o CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/arm/a64builder.cpp.o CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/arm/a64compiler.cpp.o CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/arm/a64emithelper.cpp.o CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/arm/a64formatter.cpp.o CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/arm/a64func.cpp.o CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/arm/a64instapi.cpp.o CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/arm/a64instdb.cpp.o CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/arm/a64operand.cpp.o CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/arm/a64rapass.cpp.o CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/arm/armformatter.cpp.o CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/archtraits.cpp.o CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/assembler.cpp.o CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/builder.cpp.o CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/codeholder.cpp.o CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/codewriter.cpp.o CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/compiler.cpp.o CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/constpool.cpp.o CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/cpuinfo.cpp.o CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/emithelper.cpp.o CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/emitter.cpp.o CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/emitterutils.cpp.o CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/environment.cpp.o CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/errorhandler.cpp.o CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/formatter.cpp.o CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/func.cpp.o CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/funcargscontext.cpp.o CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/globals.cpp.o CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/inst.cpp.o CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/instdb.cpp.o CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/jitallocator.cpp.o CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/jitruntime.cpp.o CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/logger.cpp.o CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/operand.cpp.o CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/osutils.cpp.o CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/ralocal.cpp.o CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/rapass.cpp.o CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/rastack.cpp.o CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/string.cpp.o CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/support.cpp.o CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/target.cpp.o CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/type.cpp.o CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/virtmem.cpp.o CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/zone.cpp.o CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/zonehash.cpp.o CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/zonelist.cpp.o CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/zonestack.cpp.o CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/zonetree.cpp.o CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/zonevector.cpp.o CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/x86/x86assembler.cpp.o CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/x86/x86builder.cpp.o CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/x86/x86compiler.cpp.o CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/x86/x86emithelper.cpp.o CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/x86/x86formatter.cpp.o CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/x86/x86func.cpp.o CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/x86/x86instapi.cpp.o CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/x86/x86instdb.cpp.o CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/x86/x86operand.cpp.o CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/x86/x86rapass.cpp.o -L/lib/intel64 -L/lib/intel64_win -L/lib/win-x64 -Wl,-rpath,/lib/intel64:/lib/intel64_win:/lib/win-x64:/github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/lib:/github/home/miniconda/envs/build_binary/lib/stubs: /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/lib/libtorch.so /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/lib/libc10.so /github/home/miniconda/envs/build_binary/targets/x86_64-linux/lib/libnvrtc.so /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/lib/libc10_cuda.so /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/nvidia/nccl/lib/libnccl.so.2 /github/home/miniconda/envs/build_binary/targets/x86_64-linux/lib/stubs/libcuda.so /github/home/miniconda/envs/build_binary/lib/stubs/libnvidia-ml.so -Wl,--no-as-needed,"/github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/lib/libtorch_cpu.so" -Wl,--as-needed -Wl,--no-as-needed,"/github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/lib/libtorch_cuda.so" -Wl,--as-needed /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/lib/libc10_cuda.so /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/lib/libc10.so /github/home/miniconda/envs/build_binary/targets/x86_64-linux/lib/libcudart.so -Wl,--no-as-needed,"/github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/lib/libtorch.so" -Wl,--as-needed && : 2025-05-07T19:50:40.2348403Z [64/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -MD -MT CMakeFiles/fbgemm.dir/__w/FBGEMM/FBGEMM/src/Utils.cc.o -MF CMakeFiles/fbgemm.dir/__w/FBGEMM/FBGEMM/src/Utils.cc.o.d -o CMakeFiles/fbgemm.dir/__w/FBGEMM/FBGEMM/src/Utils.cc.o -c /__w/FBGEMM/FBGEMM/src/Utils.cc 2025-05-07T19:50:40.2366068Z clang-16: warning: argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T19:50:40.6414928Z [65/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -MD -MT CMakeFiles/fbgemm.dir/__w/FBGEMM/FBGEMM/src/SparseAdagrad.cc.o -MF CMakeFiles/fbgemm.dir/__w/FBGEMM/FBGEMM/src/SparseAdagrad.cc.o.d -o CMakeFiles/fbgemm.dir/__w/FBGEMM/FBGEMM/src/SparseAdagrad.cc.o -c /__w/FBGEMM/FBGEMM/src/SparseAdagrad.cc 2025-05-07T19:50:40.6432985Z clang-16: warning: argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T19:50:43.9717140Z [66/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -MD -MT CMakeFiles/fbgemm.dir/__w/FBGEMM/FBGEMM/src/RefImplementations.cc.o -MF CMakeFiles/fbgemm.dir/__w/FBGEMM/FBGEMM/src/RefImplementations.cc.o.d -o CMakeFiles/fbgemm.dir/__w/FBGEMM/FBGEMM/src/RefImplementations.cc.o -c /__w/FBGEMM/FBGEMM/src/RefImplementations.cc 2025-05-07T19:50:43.9734737Z clang-16: warning: argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T19:50:44.4439595Z [67/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -MD -MT CMakeFiles/fbgemm.dir/__w/FBGEMM/FBGEMM/src/RowWiseSparseAdagradFused.cc.o -MF CMakeFiles/fbgemm.dir/__w/FBGEMM/FBGEMM/src/RowWiseSparseAdagradFused.cc.o.d -o CMakeFiles/fbgemm.dir/__w/FBGEMM/FBGEMM/src/RowWiseSparseAdagradFused.cc.o -c /__w/FBGEMM/FBGEMM/src/RowWiseSparseAdagradFused.cc 2025-05-07T19:50:44.4452796Z clang-16: warning: argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T19:50:46.0028966Z [68/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -MD -MT CMakeFiles/fbgemm.dir/__w/FBGEMM/FBGEMM/src/EmbeddingSpMDMAutovec.cc.o -MF CMakeFiles/fbgemm.dir/__w/FBGEMM/FBGEMM/src/EmbeddingSpMDMAutovec.cc.o.d -o CMakeFiles/fbgemm.dir/__w/FBGEMM/FBGEMM/src/EmbeddingSpMDMAutovec.cc.o -c /__w/FBGEMM/FBGEMM/src/EmbeddingSpMDMAutovec.cc 2025-05-07T19:50:46.0046444Z clang-16: warning: argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T19:50:46.0447088Z [69/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_cache_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_tbe_cache.dir/src/split_embeddings_cache/lfu_cache_populate_byte.cpp.o -MF CMakeFiles/fbgemm_gpu_tbe_cache.dir/src/split_embeddings_cache/lfu_cache_populate_byte.cpp.o.d -o CMakeFiles/fbgemm_gpu_tbe_cache.dir/src/split_embeddings_cache/lfu_cache_populate_byte.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/split_embeddings_cache/lfu_cache_populate_byte.cpp 2025-05-07T19:50:46.0465313Z clang-16: warning: argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T19:50:46.1678458Z [70/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_cache_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_tbe_cache.dir/src/split_embeddings_cache/lru_cache_populate_byte.cpp.o -MF CMakeFiles/fbgemm_gpu_tbe_cache.dir/src/split_embeddings_cache/lru_cache_populate_byte.cpp.o.d -o CMakeFiles/fbgemm_gpu_tbe_cache.dir/src/split_embeddings_cache/lru_cache_populate_byte.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/split_embeddings_cache/lru_cache_populate_byte.cpp 2025-05-07T19:50:46.1696448Z clang-16: warning: argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T19:50:46.2061608Z [71/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_cache_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_tbe_cache.dir/src/split_embeddings_cache/lxu_cache.cpp.o -MF CMakeFiles/fbgemm_gpu_tbe_cache.dir/src/split_embeddings_cache/lxu_cache.cpp.o.d -o CMakeFiles/fbgemm_gpu_tbe_cache.dir/src/split_embeddings_cache/lxu_cache.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/split_embeddings_cache/lxu_cache.cpp 2025-05-07T19:50:46.2080295Z clang-16: warning: argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T19:50:46.2211939Z [72/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_cache_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_tbe_cache.dir/src/split_embeddings_cache/linearize_cache_indices.cpp.o -MF CMakeFiles/fbgemm_gpu_tbe_cache.dir/src/split_embeddings_cache/linearize_cache_indices.cpp.o.d -o CMakeFiles/fbgemm_gpu_tbe_cache.dir/src/split_embeddings_cache/linearize_cache_indices.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/split_embeddings_cache/linearize_cache_indices.cpp 2025-05-07T19:50:46.2231038Z clang-16: warning: argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T19:50:47.1023271Z [73/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_cache_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_tbe_cache.dir/src/split_embeddings_cache/split_embeddings_cache_ops.cpp.o -MF CMakeFiles/fbgemm_gpu_tbe_cache.dir/src/split_embeddings_cache/split_embeddings_cache_ops.cpp.o.d -o CMakeFiles/fbgemm_gpu_tbe_cache.dir/src/split_embeddings_cache/split_embeddings_cache_ops.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/split_embeddings_cache/split_embeddings_cache_ops.cpp 2025-05-07T19:50:47.1041969Z clang-16: warning: argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T19:50:47.9145509Z [74/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_inference_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_tbe_inference.dir/codegen/inference/embedding_forward_quantized_host.cpp.o -MF CMakeFiles/fbgemm_gpu_tbe_inference.dir/codegen/inference/embedding_forward_quantized_host.cpp.o.d -o CMakeFiles/fbgemm_gpu_tbe_inference.dir/codegen/inference/embedding_forward_quantized_host.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/codegen/inference/embedding_forward_quantized_host.cpp 2025-05-07T19:50:47.9163540Z clang-16: warning: argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T19:50:48.5875848Z [75/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_inference_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_tbe_inference.dir/gen_embedding_forward_quantized_weighted_codegen_cpu.cpp.o -MF CMakeFiles/fbgemm_gpu_tbe_inference.dir/gen_embedding_forward_quantized_weighted_codegen_cpu.cpp.o.d -o CMakeFiles/fbgemm_gpu_tbe_inference.dir/gen_embedding_forward_quantized_weighted_codegen_cpu.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.12/cmake-build/gen_embedding_forward_quantized_weighted_codegen_cpu.cpp 2025-05-07T19:50:48.5895392Z clang-16: warning: argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T19:50:49.4166235Z [76/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_config_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_config.dir/src/config/feature_gates.cpp.o -MF CMakeFiles/fbgemm_gpu_config.dir/src/config/feature_gates.cpp.o.d -o CMakeFiles/fbgemm_gpu_config.dir/src/config/feature_gates.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/config/feature_gates.cpp 2025-05-07T19:50:49.4183935Z clang-16: warning: argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T19:50:49.7650365Z [77/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm.dir/__w/FBGEMM/FBGEMM/src/QuantUtilsAvx2.cc.o -MF CMakeFiles/fbgemm.dir/__w/FBGEMM/FBGEMM/src/QuantUtilsAvx2.cc.o.d -o CMakeFiles/fbgemm.dir/__w/FBGEMM/FBGEMM/src/QuantUtilsAvx2.cc.o -c /__w/FBGEMM/FBGEMM/src/QuantUtilsAvx2.cc 2025-05-07T19:50:49.7668706Z clang-16: warning: argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T19:50:49.9851529Z [78/608] : && /github/home/miniconda/envs/build_binary/bin/c++ -fPIC -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -L/github/home/miniconda/envs/build_binary/targets/x86_64-linux/lib -L/github/home/miniconda/envs/build_binary/targets/x86_64-linux/lib/stubs -s -shared -Wl,-soname,fbgemm_gpu_config.so -o fbgemm_gpu_config.so CMakeFiles/fbgemm_gpu_config.dir/src/config/feature_gates.cpp.o -L/lib/intel64 -L/lib/intel64_win -L/lib/win-x64 -Wl,-rpath,/lib/intel64:/lib/intel64_win:/lib/win-x64:/github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/lib:/github/home/miniconda/envs/build_binary/lib/stubs: /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/lib/libtorch.so /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/lib/libc10.so /github/home/miniconda/envs/build_binary/targets/x86_64-linux/lib/libnvrtc.so /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/lib/libc10_cuda.so /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/nvidia/nccl/lib/libnccl.so.2 /github/home/miniconda/envs/build_binary/targets/x86_64-linux/lib/stubs/libcuda.so /github/home/miniconda/envs/build_binary/lib/stubs/libnvidia-ml.so -Wl,--no-as-needed,"/github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/lib/libtorch_cpu.so" -Wl,--as-needed -Wl,--no-as-needed,"/github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/lib/libtorch_cuda.so" -Wl,--as-needed /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/lib/libc10_cuda.so /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/lib/libc10.so /github/home/miniconda/envs/build_binary/targets/x86_64-linux/lib/libcudart.so -Wl,--no-as-needed,"/github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/lib/libtorch.so" -Wl,--as-needed && : 2025-05-07T19:50:50.8945441Z [79/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_inference_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_tbe_inference.dir/gen_embedding_forward_quantized_unweighted_codegen_cpu.cpp.o -MF CMakeFiles/fbgemm_gpu_tbe_inference.dir/gen_embedding_forward_quantized_unweighted_codegen_cpu.cpp.o.d -o CMakeFiles/fbgemm_gpu_tbe_inference.dir/gen_embedding_forward_quantized_unweighted_codegen_cpu.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.12/cmake-build/gen_embedding_forward_quantized_unweighted_codegen_cpu.cpp 2025-05-07T19:50:50.8960306Z clang-16: warning: argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T19:50:53.1352311Z [80/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_utils_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_tbe_utils.dir/src/split_embeddings_utils/split_embeddings_utils_cpu.cpp.o -MF CMakeFiles/fbgemm_gpu_tbe_utils.dir/src/split_embeddings_utils/split_embeddings_utils_cpu.cpp.o.d -o CMakeFiles/fbgemm_gpu_tbe_utils.dir/src/split_embeddings_utils/split_embeddings_utils_cpu.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/split_embeddings_utils/split_embeddings_utils_cpu.cpp 2025-05-07T19:50:53.1366928Z clang-16: warning: argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T19:50:53.3319540Z [81/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_utils_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_tbe_utils.dir/src/split_embeddings_utils/split_embeddings_utils_meta.cpp.o -MF CMakeFiles/fbgemm_gpu_tbe_utils.dir/src/split_embeddings_utils/split_embeddings_utils_meta.cpp.o.d -o CMakeFiles/fbgemm_gpu_tbe_utils.dir/src/split_embeddings_utils/split_embeddings_utils_meta.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/split_embeddings_utils/split_embeddings_utils_meta.cpp 2025-05-07T19:50:53.3337144Z clang-16: warning: argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T19:50:54.9026246Z [82/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_utils_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_tbe_utils.dir/src/split_embeddings_utils/split_embeddings_utils.cpp.o -MF CMakeFiles/fbgemm_gpu_tbe_utils.dir/src/split_embeddings_utils/split_embeddings_utils.cpp.o.d -o CMakeFiles/fbgemm_gpu_tbe_utils.dir/src/split_embeddings_utils/split_embeddings_utils.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/split_embeddings_utils/split_embeddings_utils.cpp 2025-05-07T19:50:54.9044659Z clang-16: warning: argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T19:50:57.2868827Z [83/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_inference_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_tbe_inference.dir/codegen/inference/embedding_forward_quantized_host_cpu.cpp.o -MF CMakeFiles/fbgemm_gpu_tbe_inference.dir/codegen/inference/embedding_forward_quantized_host_cpu.cpp.o.d -o CMakeFiles/fbgemm_gpu_tbe_inference.dir/codegen/inference/embedding_forward_quantized_host_cpu.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/codegen/inference/embedding_forward_quantized_host_cpu.cpp 2025-05-07T19:50:57.2887476Z clang-16: warning: argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T19:50:57.9055514Z [84/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_sparse_async_cumsum_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_sparse_async_cumsum.dir/src/sparse_ops/sparse_async_cumsum.cpp.o -MF CMakeFiles/fbgemm_gpu_sparse_async_cumsum.dir/src/sparse_ops/sparse_async_cumsum.cpp.o.d -o CMakeFiles/fbgemm_gpu_sparse_async_cumsum.dir/src/sparse_ops/sparse_async_cumsum.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/sparse_ops/sparse_async_cumsum.cpp 2025-05-07T19:50:57.9073644Z clang-16: warning: argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T19:50:57.9802216Z [85/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_common_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_tbe_common.dir/codegen/training/pt2/pt2_autograd_utils.cpp.o -MF CMakeFiles/fbgemm_gpu_tbe_common.dir/codegen/training/pt2/pt2_autograd_utils.cpp.o.d -o CMakeFiles/fbgemm_gpu_tbe_common.dir/codegen/training/pt2/pt2_autograd_utils.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/codegen/training/pt2/pt2_autograd_utils.cpp 2025-05-07T19:50:57.9821017Z clang-16: warning: argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T19:51:00.6293295Z [86/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_common_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_tbe_common.dir/codegen/utils/embedding_bounds_check_host.cpp.o -MF CMakeFiles/fbgemm_gpu_tbe_common.dir/codegen/utils/embedding_bounds_check_host.cpp.o.d -o CMakeFiles/fbgemm_gpu_tbe_common.dir/codegen/utils/embedding_bounds_check_host.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/codegen/utils/embedding_bounds_check_host.cpp 2025-05-07T19:51:00.6309992Z clang-16: warning: argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T19:51:01.7761856Z [87/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_common_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_tbe_common.dir/codegen/utils/embedding_bounds_check_host_cpu.cpp.o -MF CMakeFiles/fbgemm_gpu_tbe_common.dir/codegen/utils/embedding_bounds_check_host_cpu.cpp.o.d -o CMakeFiles/fbgemm_gpu_tbe_common.dir/codegen/utils/embedding_bounds_check_host_cpu.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/codegen/utils/embedding_bounds_check_host_cpu.cpp 2025-05-07T19:51:01.7780633Z clang-16: warning: argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T19:51:03.9935187Z [88/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -MD -MT CMakeFiles/fbgemm.dir/__w/FBGEMM/FBGEMM/src/EmbeddingSpMDMNBit.cc.o -MF CMakeFiles/fbgemm.dir/__w/FBGEMM/FBGEMM/src/EmbeddingSpMDMNBit.cc.o.d -o CMakeFiles/fbgemm.dir/__w/FBGEMM/FBGEMM/src/EmbeddingSpMDMNBit.cc.o -c /__w/FBGEMM/FBGEMM/src/EmbeddingSpMDMNBit.cc 2025-05-07T19:51:03.9950994Z clang-16: warning: argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T19:51:07.5641428Z [89/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_optimizers_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_tbe_optimizers.dir/gen_embedding_optimizer_rowwise_adagrad_split.cpp.o -MF CMakeFiles/fbgemm_gpu_tbe_optimizers.dir/gen_embedding_optimizer_rowwise_adagrad_split.cpp.o.d -o CMakeFiles/fbgemm_gpu_tbe_optimizers.dir/gen_embedding_optimizer_rowwise_adagrad_split.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.12/cmake-build/gen_embedding_optimizer_rowwise_adagrad_split.cpp 2025-05-07T19:51:07.5660832Z clang-16: warning: argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T19:51:07.5848027Z [90/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_common_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_tbe_common.dir/codegen/training/forward/embedding_forward_split_cpu.cpp.o -MF CMakeFiles/fbgemm_gpu_tbe_common.dir/codegen/training/forward/embedding_forward_split_cpu.cpp.o.d -o CMakeFiles/fbgemm_gpu_tbe_common.dir/codegen/training/forward/embedding_forward_split_cpu.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/codegen/training/forward/embedding_forward_split_cpu.cpp 2025-05-07T19:51:07.5867541Z clang-16: warning: argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T19:51:09.7585520Z [91/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_forward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_split_weighted_codegen_meta.cpp.o -MF CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_split_weighted_codegen_meta.cpp.o.d -o CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_split_weighted_codegen_meta.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.12/cmake-build/gen_embedding_forward_split_weighted_codegen_meta.cpp 2025-05-07T19:51:09.7604570Z clang-16: warning: argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T19:51:11.0051980Z [92/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_forward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_split_unweighted_codegen_meta.cpp.o -MF CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_split_unweighted_codegen_meta.cpp.o.d -o CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_split_unweighted_codegen_meta.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.12/cmake-build/gen_embedding_forward_split_unweighted_codegen_meta.cpp 2025-05-07T19:51:11.0073444Z clang-16: warning: argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T19:51:12.7825543Z [93/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_forward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_split_pt2_cpu_wrapper.cpp.o -MF CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_split_pt2_cpu_wrapper.cpp.o.d -o CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_split_pt2_cpu_wrapper.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.12/cmake-build/gen_embedding_forward_split_pt2_cpu_wrapper.cpp 2025-05-07T19:51:12.7844123Z clang-16: warning: argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T19:51:12.8884143Z [94/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_forward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_dense_weighted_codegen_meta.cpp.o -MF CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_dense_weighted_codegen_meta.cpp.o.d -o CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_dense_weighted_codegen_meta.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.12/cmake-build/gen_embedding_forward_dense_weighted_codegen_meta.cpp 2025-05-07T19:51:12.8901989Z clang-16: warning: argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T19:51:16.6822618Z [95/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_forward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_dense_unweighted_codegen_meta.cpp.o -MF CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_dense_unweighted_codegen_meta.cpp.o.d -o CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_dense_unweighted_codegen_meta.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.12/cmake-build/gen_embedding_forward_dense_unweighted_codegen_meta.cpp 2025-05-07T19:51:16.6833205Z clang-16: warning: argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T19:51:16.7523254Z [96/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_forward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_ssd_weighted_codegen_meta.cpp.o -MF CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_ssd_weighted_codegen_meta.cpp.o.d -o CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_ssd_weighted_codegen_meta.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.12/cmake-build/gen_embedding_forward_ssd_weighted_codegen_meta.cpp 2025-05-07T19:51:16.7541324Z clang-16: warning: argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T19:51:18.9126223Z [97/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_forward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_ssd_unweighted_codegen_meta.cpp.o -MF CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_ssd_unweighted_codegen_meta.cpp.o.d -o CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_ssd_unweighted_codegen_meta.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.12/cmake-build/gen_embedding_forward_ssd_unweighted_codegen_meta.cpp 2025-05-07T19:51:18.9146940Z clang-16: warning: argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T19:51:20.1686087Z [98/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_forward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_split_weighted_vbe_codegen_meta.cpp.o -MF CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_split_weighted_vbe_codegen_meta.cpp.o.d -o CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_split_weighted_vbe_codegen_meta.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.12/cmake-build/gen_embedding_forward_split_weighted_vbe_codegen_meta.cpp 2025-05-07T19:51:20.1706556Z clang-16: warning: argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T19:51:21.9261608Z [99/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_forward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_ssd_weighted_vbe_codegen_meta.cpp.o -MF CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_ssd_weighted_vbe_codegen_meta.cpp.o.d -o CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_ssd_weighted_vbe_codegen_meta.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.12/cmake-build/gen_embedding_forward_ssd_weighted_vbe_codegen_meta.cpp 2025-05-07T19:51:21.9280988Z clang-16: warning: argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T19:51:21.9700472Z [100/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_forward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_split_unweighted_vbe_codegen_meta.cpp.o -MF CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_split_unweighted_vbe_codegen_meta.cpp.o.d -o CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_split_unweighted_vbe_codegen_meta.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.12/cmake-build/gen_embedding_forward_split_unweighted_vbe_codegen_meta.cpp 2025-05-07T19:51:21.9720050Z clang-16: warning: argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T19:51:26.0217393Z [101/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_forward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_ssd_unweighted_vbe_codegen_meta.cpp.o -MF CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_ssd_unweighted_vbe_codegen_meta.cpp.o.d -o CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_ssd_unweighted_vbe_codegen_meta.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.12/cmake-build/gen_embedding_forward_ssd_unweighted_vbe_codegen_meta.cpp 2025-05-07T19:51:26.0237568Z clang-16: warning: argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T19:51:34.4581439Z [102/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_forward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_split_pt2_cuda_wrapper.cpp.o -MF CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_split_pt2_cuda_wrapper.cpp.o.d -o CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_split_pt2_cuda_wrapper.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.12/cmake-build/gen_embedding_forward_split_pt2_cuda_wrapper.cpp 2025-05-07T19:51:34.4599493Z clang-16: warning: argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T19:51:34.5373511Z [103/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_forward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_ssd_pt2_cuda_wrapper.cpp.o -MF CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_ssd_pt2_cuda_wrapper.cpp.o.d -o CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_ssd_pt2_cuda_wrapper.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.12/cmake-build/gen_embedding_forward_ssd_pt2_cuda_wrapper.cpp 2025-05-07T19:51:34.5389735Z clang-16: warning: argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T19:51:39.5165905Z [104/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_inference_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_inference.dir/codegen/inference/embedding_forward_quantized_split_lookup.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_inference.dir/codegen/inference/embedding_forward_quantized_split_lookup.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/codegen/inference/embedding_forward_quantized_split_lookup.cu -o CMakeFiles/fbgemm_gpu_tbe_inference.dir/codegen/inference/embedding_forward_quantized_split_lookup.cu.o 2025-05-07T19:51:39.5189724Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:51:39.5192622Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:51:39.5193822Z ^ 2025-05-07T19:51:39.5194107Z 2025-05-07T19:51:39.5194579Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:51:39.5195265Z 2025-05-07T19:51:39.5196998Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:51:39.5199717Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:51:39.5200838Z ^ 2025-05-07T19:51:39.5201196Z 2025-05-07T19:51:39.5202643Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:51:39.5205308Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:51:39.5206451Z ^ 2025-05-07T19:51:39.5206710Z 2025-05-07T19:51:39.5207173Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:51:39.5207901Z 2025-05-07T19:51:39.5209542Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:51:39.5212286Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:51:39.5213532Z ^ 2025-05-07T19:51:39.5213945Z 2025-05-07T19:51:39.5215676Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:51:39.5218390Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:51:39.5219541Z ^ 2025-05-07T19:51:39.5219820Z 2025-05-07T19:51:39.5220256Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:51:39.5220938Z 2025-05-07T19:51:39.5222698Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:51:39.5225685Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:51:39.5226858Z ^ 2025-05-07T19:51:39.5227208Z 2025-05-07T19:51:39.5228893Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:51:39.5231701Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:51:39.5232872Z ^ 2025-05-07T19:51:39.5233108Z 2025-05-07T19:51:39.5233536Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:51:39.5234200Z 2025-05-07T19:51:39.5235862Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:51:39.5238450Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:51:39.5239627Z ^ 2025-05-07T19:51:39.5240001Z 2025-05-07T19:51:39.5241655Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:51:39.5244352Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:51:39.5245544Z ^ 2025-05-07T19:51:39.5245802Z 2025-05-07T19:51:39.5246295Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:51:39.5246974Z 2025-05-07T19:51:39.5248660Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:51:39.5251411Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:51:39.5252631Z ^ 2025-05-07T19:51:39.5252997Z 2025-05-07T19:51:39.8527318Z [105/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_cache_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_cache.dir/src/split_embeddings_cache/lfu_cache_populate_byte.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_cache.dir/src/split_embeddings_cache/lfu_cache_populate_byte.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/split_embeddings_cache/lfu_cache_populate_byte.cu -o CMakeFiles/fbgemm_gpu_tbe_cache.dir/src/split_embeddings_cache/lfu_cache_populate_byte.cu.o 2025-05-07T19:51:39.8550472Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:51:39.8553387Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:51:39.8554553Z ^ 2025-05-07T19:51:39.8554827Z 2025-05-07T19:51:39.8555272Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:51:39.8555947Z 2025-05-07T19:51:39.8557440Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:51:39.8559907Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:51:39.8560944Z ^ 2025-05-07T19:51:39.8561309Z 2025-05-07T19:51:39.8563051Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:51:39.8566053Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:51:39.8567254Z ^ 2025-05-07T19:51:39.8567513Z 2025-05-07T19:51:39.8567965Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:51:39.8568658Z 2025-05-07T19:51:39.8570381Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:51:39.8573203Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:51:39.8574419Z ^ 2025-05-07T19:51:39.8574790Z 2025-05-07T19:51:39.8576500Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:51:39.8579263Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:51:39.8580459Z ^ 2025-05-07T19:51:39.8580719Z 2025-05-07T19:51:39.8581166Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:51:39.8582196Z 2025-05-07T19:51:39.8583943Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:51:39.8586453Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:51:39.8587881Z ^ 2025-05-07T19:51:39.8588256Z 2025-05-07T19:51:39.8589720Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:51:39.8592385Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:51:39.8593510Z ^ 2025-05-07T19:51:39.8593792Z 2025-05-07T19:51:39.8594228Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:51:39.8594916Z 2025-05-07T19:51:39.8596587Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:51:39.8599294Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:51:39.8600484Z ^ 2025-05-07T19:51:39.8600877Z 2025-05-07T19:51:39.8602521Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:51:39.8605254Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:51:39.8606427Z ^ 2025-05-07T19:51:39.8606687Z 2025-05-07T19:51:39.8607149Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:51:39.8607806Z 2025-05-07T19:51:39.8609497Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:51:39.8611931Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:51:39.8613100Z ^ 2025-05-07T19:51:39.8613474Z 2025-05-07T19:51:39.9086813Z [106/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_cache_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_cache.dir/src/split_embeddings_cache/split_embeddings_cache_ops.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_cache.dir/src/split_embeddings_cache/split_embeddings_cache_ops.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/split_embeddings_cache/split_embeddings_cache_ops.cu -o CMakeFiles/fbgemm_gpu_tbe_cache.dir/src/split_embeddings_cache/split_embeddings_cache_ops.cu.o 2025-05-07T19:51:39.9108884Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:51:39.9111683Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:51:39.9112821Z ^ 2025-05-07T19:51:39.9113074Z 2025-05-07T19:51:39.9113515Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:51:39.9114203Z 2025-05-07T19:51:39.9115872Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:51:39.9118611Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:51:39.9119780Z ^ 2025-05-07T19:51:39.9120187Z 2025-05-07T19:51:39.9121762Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:51:39.9124099Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:51:39.9125131Z ^ 2025-05-07T19:51:39.9125389Z 2025-05-07T19:51:39.9125784Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:51:39.9126409Z 2025-05-07T19:51:39.9127965Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:51:39.9130464Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:51:39.9131640Z ^ 2025-05-07T19:51:39.9131965Z 2025-05-07T19:51:39.9133437Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:51:39.9135940Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:51:39.9137258Z ^ 2025-05-07T19:51:39.9137486Z 2025-05-07T19:51:39.9137916Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:51:39.9138461Z 2025-05-07T19:51:39.9140084Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:51:39.9142770Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:51:39.9143932Z ^ 2025-05-07T19:51:39.9144308Z 2025-05-07T19:51:39.9145889Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:51:39.9148498Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:51:39.9149678Z ^ 2025-05-07T19:51:39.9149925Z 2025-05-07T19:51:39.9150403Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:51:39.9151070Z 2025-05-07T19:51:39.9152908Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:51:39.9155651Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:51:39.9156880Z ^ 2025-05-07T19:51:39.9157250Z 2025-05-07T19:51:39.9158922Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:51:39.9161656Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:51:39.9162785Z ^ 2025-05-07T19:51:39.9163035Z 2025-05-07T19:51:39.9163542Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:51:39.9164223Z 2025-05-07T19:51:39.9166130Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:51:39.9168867Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:51:39.9170043Z ^ 2025-05-07T19:51:39.9170448Z 2025-05-07T19:51:40.9110779Z [107/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_cache_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_cache.dir/src/split_embeddings_cache/lfu_cache_populate.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_cache.dir/src/split_embeddings_cache/lfu_cache_populate.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/split_embeddings_cache/lfu_cache_populate.cu -o CMakeFiles/fbgemm_gpu_tbe_cache.dir/src/split_embeddings_cache/lfu_cache_populate.cu.o 2025-05-07T19:51:40.9137176Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:51:40.9140049Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:51:40.9141202Z ^ 2025-05-07T19:51:40.9141462Z 2025-05-07T19:51:40.9141917Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:51:40.9142588Z 2025-05-07T19:51:40.9144201Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:51:40.9146979Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:51:40.9148219Z ^ 2025-05-07T19:51:40.9148624Z 2025-05-07T19:51:40.9150360Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:51:40.9153170Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:51:40.9154228Z ^ 2025-05-07T19:51:40.9154462Z 2025-05-07T19:51:40.9154811Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:51:40.9155452Z 2025-05-07T19:51:40.9157126Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:51:40.9159980Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:51:40.9161234Z ^ 2025-05-07T19:51:40.9161615Z 2025-05-07T19:51:40.9163349Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:51:40.9166658Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:51:40.9167853Z ^ 2025-05-07T19:51:40.9168112Z 2025-05-07T19:51:40.9168589Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:51:40.9169282Z 2025-05-07T19:51:40.9171262Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:51:40.9173919Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:51:40.9175071Z ^ 2025-05-07T19:51:40.9175463Z 2025-05-07T19:51:40.9177163Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:51:40.9179952Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:51:40.9181133Z ^ 2025-05-07T19:51:40.9181423Z 2025-05-07T19:51:40.9181903Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:51:40.9182578Z 2025-05-07T19:51:40.9184321Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:51:40.9187049Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:51:40.9188287Z ^ 2025-05-07T19:51:40.9188666Z 2025-05-07T19:51:40.9190355Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:51:40.9193263Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:51:40.9194490Z ^ 2025-05-07T19:51:40.9194755Z 2025-05-07T19:51:40.9195221Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:51:40.9195927Z 2025-05-07T19:51:40.9197633Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:51:40.9200340Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:51:40.9201496Z ^ 2025-05-07T19:51:40.9201832Z 2025-05-07T19:51:40.9273218Z [108/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_cache_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_cache.dir/src/split_embeddings_cache/lxu_cache.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_cache.dir/src/split_embeddings_cache/lxu_cache.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/split_embeddings_cache/lxu_cache.cu -o CMakeFiles/fbgemm_gpu_tbe_cache.dir/src/split_embeddings_cache/lxu_cache.cu.o 2025-05-07T19:51:40.9295353Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:51:40.9298029Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:51:40.9299205Z ^ 2025-05-07T19:51:40.9299450Z 2025-05-07T19:51:40.9299840Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:51:40.9300398Z 2025-05-07T19:51:40.9302074Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:51:40.9304786Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:51:40.9306020Z ^ 2025-05-07T19:51:40.9306401Z 2025-05-07T19:51:40.9308198Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:51:40.9310935Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:51:40.9312278Z ^ 2025-05-07T19:51:40.9312540Z 2025-05-07T19:51:40.9313015Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:51:40.9313724Z 2025-05-07T19:51:40.9315463Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:51:40.9318232Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:51:40.9319367Z ^ 2025-05-07T19:51:40.9319774Z 2025-05-07T19:51:40.9321514Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:51:40.9324542Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:51:40.9325762Z ^ 2025-05-07T19:51:40.9326048Z 2025-05-07T19:51:40.9326655Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:51:40.9327347Z 2025-05-07T19:51:40.9328971Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:51:40.9331797Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:51:40.9333063Z ^ 2025-05-07T19:51:40.9333445Z 2025-05-07T19:51:40.9335176Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:51:40.9337861Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:51:40.9339013Z ^ 2025-05-07T19:51:40.9339261Z 2025-05-07T19:51:40.9339704Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:51:40.9340393Z 2025-05-07T19:51:40.9342015Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:51:40.9344847Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:51:40.9346107Z ^ 2025-05-07T19:51:40.9346505Z 2025-05-07T19:51:40.9348227Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:51:40.9350849Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:51:40.9351963Z ^ 2025-05-07T19:51:40.9352171Z 2025-05-07T19:51:40.9352631Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:51:40.9353263Z 2025-05-07T19:51:40.9354814Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:51:40.9357435Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:51:40.9358544Z ^ 2025-05-07T19:51:40.9358907Z 2025-05-07T19:51:40.9435216Z [109/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_cache_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_cache.dir/src/split_embeddings_cache/reset_weight_momentum.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_cache.dir/src/split_embeddings_cache/reset_weight_momentum.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/split_embeddings_cache/reset_weight_momentum.cu -o CMakeFiles/fbgemm_gpu_tbe_cache.dir/src/split_embeddings_cache/reset_weight_momentum.cu.o 2025-05-07T19:51:40.9455617Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:51:40.9458528Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:51:40.9459792Z ^ 2025-05-07T19:51:40.9474717Z 2025-05-07T19:51:40.9475452Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:51:40.9476249Z 2025-05-07T19:51:40.9478011Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:51:40.9480884Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:51:40.9482098Z ^ 2025-05-07T19:51:40.9482500Z 2025-05-07T19:51:40.9484262Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:51:40.9487091Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:51:40.9488323Z ^ 2025-05-07T19:51:40.9488610Z 2025-05-07T19:51:40.9489109Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:51:40.9489816Z 2025-05-07T19:51:40.9491573Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:51:40.9494236Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:51:40.9495662Z ^ 2025-05-07T19:51:40.9496000Z 2025-05-07T19:51:40.9497599Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:51:40.9500640Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:51:40.9501838Z ^ 2025-05-07T19:51:40.9502104Z 2025-05-07T19:51:40.9502566Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:51:40.9503239Z 2025-05-07T19:51:40.9505029Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:51:40.9507814Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:51:40.9509074Z ^ 2025-05-07T19:51:40.9509455Z 2025-05-07T19:51:40.9511202Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:51:40.9514064Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:51:40.9515289Z ^ 2025-05-07T19:51:40.9515545Z 2025-05-07T19:51:40.9516032Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:51:40.9516708Z 2025-05-07T19:51:40.9518411Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:51:40.9521124Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:51:40.9522329Z ^ 2025-05-07T19:51:40.9522726Z 2025-05-07T19:51:40.9524414Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:51:40.9527139Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:51:40.9528315Z ^ 2025-05-07T19:51:40.9528582Z 2025-05-07T19:51:40.9529027Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:51:40.9529712Z 2025-05-07T19:51:40.9531063Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:51:40.9533155Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:51:40.9534153Z ^ 2025-05-07T19:51:40.9534482Z 2025-05-07T19:51:41.4140278Z [110/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_cache_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_cache.dir/src/split_embeddings_cache/lru_cache_populate.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_cache.dir/src/split_embeddings_cache/lru_cache_populate.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/split_embeddings_cache/lru_cache_populate.cu -o CMakeFiles/fbgemm_gpu_tbe_cache.dir/src/split_embeddings_cache/lru_cache_populate.cu.o 2025-05-07T19:51:41.4160768Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:51:41.4163168Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:51:41.4164221Z ^ 2025-05-07T19:51:41.4164554Z 2025-05-07T19:51:41.4165182Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:51:41.4165759Z 2025-05-07T19:51:41.4167097Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:51:41.4169243Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:51:41.4170443Z ^ 2025-05-07T19:51:41.4170803Z 2025-05-07T19:51:41.4172608Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:51:41.4175024Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:51:41.4175999Z ^ 2025-05-07T19:51:41.4176254Z 2025-05-07T19:51:41.4176735Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:51:41.4177369Z 2025-05-07T19:51:41.4178804Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:51:41.4181781Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:51:41.4183015Z ^ 2025-05-07T19:51:41.4183384Z 2025-05-07T19:51:41.4185251Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:51:41.4187884Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:51:41.4188971Z ^ 2025-05-07T19:51:41.4189235Z 2025-05-07T19:51:41.4189632Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:51:41.4190197Z 2025-05-07T19:51:41.4191933Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:51:41.4194513Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:51:41.4195699Z ^ 2025-05-07T19:51:41.4195998Z 2025-05-07T19:51:41.4197296Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:51:41.4199680Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:51:41.4200795Z ^ 2025-05-07T19:51:41.4201037Z 2025-05-07T19:51:41.4201455Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:51:41.4202065Z 2025-05-07T19:51:41.4203468Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:51:41.4205666Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:51:41.4206564Z ^ 2025-05-07T19:51:41.4206875Z 2025-05-07T19:51:41.4208179Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:51:41.4210388Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:51:41.4211369Z ^ 2025-05-07T19:51:41.4211634Z 2025-05-07T19:51:41.4212028Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:51:41.4212606Z 2025-05-07T19:51:41.4214113Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:51:41.4216479Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:51:41.4217492Z ^ 2025-05-07T19:51:41.4217784Z 2025-05-07T19:51:41.5760434Z [111/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_cache_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_cache.dir/src/split_embeddings_cache/lru_cache_populate_byte.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_cache.dir/src/split_embeddings_cache/lru_cache_populate_byte.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/split_embeddings_cache/lru_cache_populate_byte.cu -o CMakeFiles/fbgemm_gpu_tbe_cache.dir/src/split_embeddings_cache/lru_cache_populate_byte.cu.o 2025-05-07T19:51:41.5783926Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:51:41.5786756Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:51:41.5787993Z ^ 2025-05-07T19:51:41.5788259Z 2025-05-07T19:51:41.5788725Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:51:41.5789413Z 2025-05-07T19:51:41.5791073Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:51:41.5794024Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:51:41.5795259Z ^ 2025-05-07T19:51:41.5795663Z 2025-05-07T19:51:41.5797455Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:51:41.5800144Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:51:41.5801367Z ^ 2025-05-07T19:51:41.5801664Z 2025-05-07T19:51:41.5802128Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:51:41.5802823Z 2025-05-07T19:51:41.5804940Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:51:41.5807764Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:51:41.5809202Z ^ 2025-05-07T19:51:41.5809589Z 2025-05-07T19:51:41.5811372Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:51:41.5814154Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:51:41.5815361Z ^ 2025-05-07T19:51:41.5815636Z 2025-05-07T19:51:41.5816095Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:51:41.5816788Z 2025-05-07T19:51:41.5818476Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:51:41.5821261Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:51:41.5822430Z ^ 2025-05-07T19:51:41.5822841Z 2025-05-07T19:51:41.5824577Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:51:41.5827411Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:51:41.5828658Z ^ 2025-05-07T19:51:41.5828929Z 2025-05-07T19:51:41.5829436Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:51:41.5830141Z 2025-05-07T19:51:41.5832027Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:51:41.5834863Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:51:41.5836306Z ^ 2025-05-07T19:51:41.5836685Z 2025-05-07T19:51:41.5838412Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:51:41.5841011Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:51:41.5842242Z ^ 2025-05-07T19:51:41.5842504Z 2025-05-07T19:51:41.5842972Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:51:41.5843706Z 2025-05-07T19:51:41.5845489Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:51:41.5848334Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:51:41.5849526Z ^ 2025-05-07T19:51:41.5849833Z 2025-05-07T19:51:41.9574875Z [112/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_utils_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -MD -MT CMakeFiles/fbgemm_gpu_tbe_utils.dir/src/split_embeddings_utils/generate_vbe_metadata.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_utils.dir/src/split_embeddings_utils/generate_vbe_metadata.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/split_embeddings_utils/generate_vbe_metadata.cu -o CMakeFiles/fbgemm_gpu_tbe_utils.dir/src/split_embeddings_utils/generate_vbe_metadata.cu.o 2025-05-07T19:51:43.8295139Z [113/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_utils_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -MD -MT CMakeFiles/fbgemm_gpu_tbe_utils.dir/src/split_embeddings_utils/get_infos_metadata.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_utils.dir/src/split_embeddings_utils/get_infos_metadata.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/split_embeddings_utils/get_infos_metadata.cu -o CMakeFiles/fbgemm_gpu_tbe_utils.dir/src/split_embeddings_utils/get_infos_metadata.cu.o 2025-05-07T19:51:43.8316891Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:51:43.8319609Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:51:43.8320630Z ^ 2025-05-07T19:51:43.8320857Z 2025-05-07T19:51:43.8321222Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:51:43.8321781Z 2025-05-07T19:51:43.8323471Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:51:43.8325869Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:51:43.8326945Z ^ 2025-05-07T19:51:43.8327301Z 2025-05-07T19:51:43.8328791Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:51:43.8331253Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:51:43.8332314Z ^ 2025-05-07T19:51:43.8332578Z 2025-05-07T19:51:43.8332973Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:51:43.8333565Z 2025-05-07T19:51:43.8335116Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:51:43.8337549Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:51:43.8338707Z ^ 2025-05-07T19:51:43.8339051Z 2025-05-07T19:51:43.8340579Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:51:43.8343084Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:51:43.8344235Z ^ 2025-05-07T19:51:43.8344476Z 2025-05-07T19:51:43.8344901Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:51:43.8345538Z 2025-05-07T19:51:43.8347115Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:51:43.8349676Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:51:43.8350787Z ^ 2025-05-07T19:51:43.8351599Z 2025-05-07T19:51:43.8353166Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:51:43.8355706Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:51:43.8356799Z ^ 2025-05-07T19:51:43.8357027Z 2025-05-07T19:51:43.8357453Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:51:43.8358035Z 2025-05-07T19:51:43.8359501Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:51:43.8362112Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:51:43.8363328Z ^ 2025-05-07T19:51:43.8363682Z 2025-05-07T19:51:43.8365577Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:51:43.8368299Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:51:43.8369449Z ^ 2025-05-07T19:51:43.8369711Z 2025-05-07T19:51:43.8370185Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:51:43.8370890Z 2025-05-07T19:51:43.8372620Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:51:43.8375423Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:51:43.8376515Z ^ 2025-05-07T19:51:43.8376848Z 2025-05-07T19:51:51.9829153Z [114/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_common_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_common.dir/codegen/utils/embedding_bounds_check_v1.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_common.dir/codegen/utils/embedding_bounds_check_v1.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/codegen/utils/embedding_bounds_check_v1.cu -o CMakeFiles/fbgemm_gpu_tbe_common.dir/codegen/utils/embedding_bounds_check_v1.cu.o 2025-05-07T19:51:51.9852283Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:51:51.9855168Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:51:51.9856362Z ^ 2025-05-07T19:51:51.9856620Z 2025-05-07T19:51:51.9857245Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:51:51.9857971Z 2025-05-07T19:51:51.9859604Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:51:51.9862357Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:51:51.9863621Z ^ 2025-05-07T19:51:51.9864028Z 2025-05-07T19:51:51.9866059Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:51:51.9868925Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:51:51.9870058Z ^ 2025-05-07T19:51:51.9870357Z 2025-05-07T19:51:51.9870822Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:51:51.9871626Z 2025-05-07T19:51:51.9873301Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:51:51.9876040Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:51:51.9877380Z ^ 2025-05-07T19:51:51.9877803Z 2025-05-07T19:51:51.9879518Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:51:51.9882276Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:51:51.9883554Z ^ 2025-05-07T19:51:51.9883845Z 2025-05-07T19:51:51.9884368Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:51:51.9885117Z 2025-05-07T19:51:51.9886819Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:51:51.9889601Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:51:51.9891248Z ^ 2025-05-07T19:51:51.9891685Z 2025-05-07T19:51:51.9893532Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:51:51.9896280Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:51:51.9897516Z ^ 2025-05-07T19:51:51.9897793Z 2025-05-07T19:51:51.9898221Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:51:51.9898861Z 2025-05-07T19:51:51.9900636Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:51:51.9903475Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:51:51.9904772Z ^ 2025-05-07T19:51:51.9905178Z 2025-05-07T19:51:51.9906814Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:51:51.9909555Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:51:51.9910710Z ^ 2025-05-07T19:51:51.9910945Z 2025-05-07T19:51:51.9911566Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:51:51.9912252Z 2025-05-07T19:51:51.9913923Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:51:51.9916761Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:51:51.9918074Z ^ 2025-05-07T19:51:51.9918472Z 2025-05-07T19:51:53.4336351Z [115/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_common_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_common.dir/codegen/utils/embedding_bounds_check_v2.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_common.dir/codegen/utils/embedding_bounds_check_v2.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/codegen/utils/embedding_bounds_check_v2.cu -o CMakeFiles/fbgemm_gpu_tbe_common.dir/codegen/utils/embedding_bounds_check_v2.cu.o 2025-05-07T19:51:53.4358068Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:51:53.4360714Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:51:53.4361845Z ^ 2025-05-07T19:51:53.4362140Z 2025-05-07T19:51:53.4362635Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:51:53.4363276Z 2025-05-07T19:51:53.4365113Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:51:53.4367767Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:51:53.4368959Z ^ 2025-05-07T19:51:53.4369304Z 2025-05-07T19:51:53.4370886Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:51:53.4373516Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:51:53.4374645Z ^ 2025-05-07T19:51:53.4374886Z 2025-05-07T19:51:53.4375314Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:51:53.4375974Z 2025-05-07T19:51:53.4377518Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:51:53.4380100Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:51:53.4381268Z ^ 2025-05-07T19:51:53.4381640Z 2025-05-07T19:51:53.4383243Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:51:53.4385824Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:51:53.4386931Z ^ 2025-05-07T19:51:53.4387192Z 2025-05-07T19:51:53.4387606Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:51:53.4388233Z 2025-05-07T19:51:53.4389846Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:51:53.4392986Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:51:53.4394201Z ^ 2025-05-07T19:51:53.4394571Z 2025-05-07T19:51:53.4397676Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:51:53.4400317Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:51:53.4401465Z ^ 2025-05-07T19:51:53.4401735Z 2025-05-07T19:51:53.4402166Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:51:53.4402815Z 2025-05-07T19:51:53.4404461Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:51:53.4407046Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:51:53.4408195Z ^ 2025-05-07T19:51:53.4408574Z 2025-05-07T19:51:53.4410118Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:51:53.4412673Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:51:53.4413769Z ^ 2025-05-07T19:51:53.4414038Z 2025-05-07T19:51:53.4414486Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:51:53.4415169Z 2025-05-07T19:51:53.4416744Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:51:53.4419342Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:51:53.4420510Z ^ 2025-05-07T19:51:53.4420867Z 2025-05-07T19:51:58.2339622Z [116/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -MD -MT CMakeFiles/fbgemm.dir/__w/FBGEMM/FBGEMM/src/EmbeddingSpMDM.cc.o -MF CMakeFiles/fbgemm.dir/__w/FBGEMM/FBGEMM/src/EmbeddingSpMDM.cc.o.d -o CMakeFiles/fbgemm.dir/__w/FBGEMM/FBGEMM/src/EmbeddingSpMDM.cc.o -c /__w/FBGEMM/FBGEMM/src/EmbeddingSpMDM.cc 2025-05-07T19:51:58.2355733Z clang-16: warning: argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T19:51:58.5168632Z [117/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_optimizers_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_optimizers.dir/gen_embedding_optimizer_rowwise_adagrad_split_cuda.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_optimizers.dir/gen_embedding_optimizer_rowwise_adagrad_split_cuda.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.12/cmake-build/gen_embedding_optimizer_rowwise_adagrad_split_cuda.cu -o CMakeFiles/fbgemm_gpu_tbe_optimizers.dir/gen_embedding_optimizer_rowwise_adagrad_split_cuda.cu.o 2025-05-07T19:51:58.5189946Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:51:58.5192704Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:51:58.5193881Z ^ 2025-05-07T19:51:58.5194131Z 2025-05-07T19:51:58.5194553Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:51:58.5195150Z 2025-05-07T19:51:58.5196672Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:51:58.5199753Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:51:58.5200855Z ^ 2025-05-07T19:51:58.5201239Z 2025-05-07T19:51:58.5203203Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:51:58.5205788Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:51:58.5206906Z ^ 2025-05-07T19:51:58.5207192Z 2025-05-07T19:51:58.5207619Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:51:58.5208264Z 2025-05-07T19:51:58.5209898Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:51:58.5212480Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:51:58.5213649Z ^ 2025-05-07T19:51:58.5214001Z 2025-05-07T19:51:58.5215637Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:51:58.5218197Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:51:58.5219352Z ^ 2025-05-07T19:51:58.5219635Z 2025-05-07T19:51:58.5220086Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:51:58.5220774Z 2025-05-07T19:51:58.5222343Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:51:58.5224949Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:51:58.5226073Z ^ 2025-05-07T19:51:58.5226453Z 2025-05-07T19:51:58.5228036Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:51:58.5230628Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:51:58.5231907Z ^ 2025-05-07T19:51:58.5232195Z 2025-05-07T19:51:58.5232628Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:51:58.5233260Z 2025-05-07T19:51:58.5234855Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:51:58.5237451Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:51:58.5238510Z ^ 2025-05-07T19:51:58.5238805Z 2025-05-07T19:51:58.5240045Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:51:58.5242747Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:51:58.5243847Z ^ 2025-05-07T19:51:58.5244077Z 2025-05-07T19:51:58.5244469Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:51:58.5245319Z 2025-05-07T19:51:58.5246765Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:51:58.5249309Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:51:58.5250415Z ^ 2025-05-07T19:51:58.5250789Z 2025-05-07T19:51:58.8739524Z [118/608] : && /github/home/miniconda/envs/build_binary/bin/c++ -fPIC -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -L/github/home/miniconda/envs/build_binary/targets/x86_64-linux/lib -L/github/home/miniconda/envs/build_binary/targets/x86_64-linux/lib/stubs -s -shared -Wl,-soname,fbgemm.so -o fbgemm.so CMakeFiles/fbgemm.dir/__w/FBGEMM/FBGEMM/src/EmbeddingSpMDM.cc.o CMakeFiles/fbgemm.dir/__w/FBGEMM/FBGEMM/src/EmbeddingSpMDMAutovec.cc.o CMakeFiles/fbgemm.dir/__w/FBGEMM/FBGEMM/src/EmbeddingSpMDMNBit.cc.o CMakeFiles/fbgemm.dir/__w/FBGEMM/FBGEMM/src/QuantUtils.cc.o CMakeFiles/fbgemm.dir/__w/FBGEMM/FBGEMM/src/RefImplementations.cc.o CMakeFiles/fbgemm.dir/__w/FBGEMM/FBGEMM/src/RowWiseSparseAdagradFused.cc.o CMakeFiles/fbgemm.dir/__w/FBGEMM/FBGEMM/src/SparseAdagrad.cc.o CMakeFiles/fbgemm.dir/__w/FBGEMM/FBGEMM/src/Utils.cc.o CMakeFiles/fbgemm.dir/__w/FBGEMM/FBGEMM/src/EmbeddingSpMDMAvx2.cc.o CMakeFiles/fbgemm.dir/__w/FBGEMM/FBGEMM/src/QuantUtilsAvx2.cc.o CMakeFiles/fbgemm.dir/__w/FBGEMM/FBGEMM/src/EmbeddingSpMDMAvx512.cc.o -L/lib/intel64 -L/lib/intel64_win -L/lib/win-x64 -Wl,-rpath,"\$ORIGIN" /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/lib/libc10.so /github/home/miniconda/envs/build_binary/targets/x86_64-linux/lib/libnvrtc.so /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/lib/libc10_cuda.so /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/nvidia/nccl/lib/libnccl.so.2 /github/home/miniconda/envs/build_binary/targets/x86_64-linux/lib/stubs/libcuda.so asmjit.so /github/home/miniconda/envs/build_binary/lib/stubs/libnvidia-ml.so -Wl,--no-as-needed,"/github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/lib/libtorch.so" -Wl,--as-needed /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/lib/libtorch.so -Wl,--no-as-needed,"/github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/lib/libtorch_cpu.so" -Wl,--as-needed -Wl,--no-as-needed,"/github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/lib/libtorch_cuda.so" -Wl,--as-needed /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/lib/libc10_cuda.so /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/lib/libc10.so /github/home/miniconda/envs/build_binary/targets/x86_64-linux/lib/libcudart.so && : 2025-05-07T19:51:59.5156304Z [119/608] : && /github/home/miniconda/envs/build_binary/bin/c++ -fPIC -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -L/github/home/miniconda/envs/build_binary/targets/x86_64-linux/lib -L/github/home/miniconda/envs/build_binary/targets/x86_64-linux/lib/stubs -s -shared -Wl,-soname,fbgemm_gpu_tbe_common.so -o fbgemm_gpu_tbe_common.so CMakeFiles/fbgemm_gpu_tbe_common.dir/codegen/utils/embedding_bounds_check_host_cpu.cpp.o CMakeFiles/fbgemm_gpu_tbe_common.dir/codegen/training/forward/embedding_forward_split_cpu.cpp.o CMakeFiles/fbgemm_gpu_tbe_common.dir/codegen/training/pt2/pt2_autograd_utils.cpp.o CMakeFiles/fbgemm_gpu_tbe_common.dir/codegen/utils/embedding_bounds_check_host.cpp.o CMakeFiles/fbgemm_gpu_tbe_common.dir/codegen/utils/embedding_bounds_check_v1.cu.o CMakeFiles/fbgemm_gpu_tbe_common.dir/codegen/utils/embedding_bounds_check_v2.cu.o -L/lib/intel64 -L/lib/intel64_win -L/lib/win-x64 -Wl,-rpath,"\$ORIGIN" /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/lib/libc10.so /github/home/miniconda/envs/build_binary/targets/x86_64-linux/lib/libnvrtc.so /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/lib/libc10_cuda.so /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/nvidia/nccl/lib/libnccl.so.2 /github/home/miniconda/envs/build_binary/targets/x86_64-linux/lib/stubs/libcuda.so fbgemm.so fbgemm_gpu_config.so /github/home/miniconda/envs/build_binary/lib/stubs/libnvidia-ml.so -Wl,--no-as-needed,"/github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/lib/libtorch.so" -Wl,--as-needed /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/lib/libtorch.so -Wl,--no-as-needed,"/github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/lib/libtorch_cpu.so" -Wl,--as-needed -Wl,--no-as-needed,"/github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/lib/libtorch_cuda.so" -Wl,--as-needed /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/lib/libc10_cuda.so /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/lib/libc10.so /github/home/miniconda/envs/build_binary/targets/x86_64-linux/lib/libcudart.so -lcudadevrt -lcudart_static -ldl -Wl,-rpath-link,/__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.12/cmake-build && : 2025-05-07T19:52:00.8637764Z [120/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_cache_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_cache.dir/src/split_embeddings_cache/lfu_cache_find.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_cache.dir/src/split_embeddings_cache/lfu_cache_find.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/split_embeddings_cache/lfu_cache_find.cu -o CMakeFiles/fbgemm_gpu_tbe_cache.dir/src/split_embeddings_cache/lfu_cache_find.cu.o 2025-05-07T19:52:00.8658965Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:52:00.8661539Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:52:00.8662646Z ^ 2025-05-07T19:52:00.8662905Z 2025-05-07T19:52:00.8663373Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:52:00.8664000Z 2025-05-07T19:52:00.8665894Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:52:00.8668484Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:52:00.8669634Z ^ 2025-05-07T19:52:00.8669991Z 2025-05-07T19:52:00.8671624Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:52:00.8674130Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:52:00.8675374Z ^ 2025-05-07T19:52:00.8675622Z 2025-05-07T19:52:00.8676087Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:52:00.8676732Z 2025-05-07T19:52:00.8678354Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:52:00.8680969Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:52:00.8682098Z ^ 2025-05-07T19:52:00.8682453Z 2025-05-07T19:52:00.8684121Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:52:00.8686634Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:52:00.8687740Z ^ 2025-05-07T19:52:00.8687999Z 2025-05-07T19:52:00.8688407Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:52:00.8689045Z 2025-05-07T19:52:00.8690658Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:52:00.8693162Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:52:00.8694308Z ^ 2025-05-07T19:52:00.8694657Z 2025-05-07T19:52:00.8696269Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:52:00.8699200Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:52:00.8700375Z ^ 2025-05-07T19:52:00.8700629Z 2025-05-07T19:52:00.8701352Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:52:00.8702003Z 2025-05-07T19:52:00.8703619Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:52:00.8706215Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:52:00.8707405Z ^ 2025-05-07T19:52:00.8707772Z 2025-05-07T19:52:00.8709302Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:52:00.8711999Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:52:00.8713081Z ^ 2025-05-07T19:52:00.8713361Z 2025-05-07T19:52:00.8713797Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:52:00.8714450Z 2025-05-07T19:52:00.8715992Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:52:00.8718494Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:52:00.8719699Z ^ 2025-05-07T19:52:00.8720062Z 2025-05-07T19:52:04.4255648Z [121/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_inference_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_inference.dir/gen_embedding_forward_quantized_split_nbit_host_unweighted_nobag_codegen_cuda.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_inference.dir/gen_embedding_forward_quantized_split_nbit_host_unweighted_nobag_codegen_cuda.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.12/cmake-build/gen_embedding_forward_quantized_split_nbit_host_unweighted_nobag_codegen_cuda.cu -o CMakeFiles/fbgemm_gpu_tbe_inference.dir/gen_embedding_forward_quantized_split_nbit_host_unweighted_nobag_codegen_cuda.cu.o 2025-05-07T19:52:04.4281720Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:52:04.4284742Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:52:04.4286014Z ^ 2025-05-07T19:52:04.4286309Z 2025-05-07T19:52:04.4286808Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:52:04.4287539Z 2025-05-07T19:52:04.4289296Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:52:04.4292154Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:52:04.4293400Z ^ 2025-05-07T19:52:04.4293831Z 2025-05-07T19:52:04.4295555Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:52:04.4298490Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:52:04.4299804Z ^ 2025-05-07T19:52:04.4300093Z 2025-05-07T19:52:04.4300616Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:52:04.4301258Z 2025-05-07T19:52:04.4302799Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:52:04.4305606Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:52:04.4306978Z ^ 2025-05-07T19:52:04.4307321Z 2025-05-07T19:52:04.4309033Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:52:04.4311994Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:52:04.4313242Z ^ 2025-05-07T19:52:04.4313536Z 2025-05-07T19:52:04.4314029Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:52:04.4314784Z 2025-05-07T19:52:04.4316566Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:52:04.4319890Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:52:04.4321103Z ^ 2025-05-07T19:52:04.4321507Z 2025-05-07T19:52:04.4323564Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:52:04.4326423Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:52:04.4327709Z ^ 2025-05-07T19:52:04.4327992Z 2025-05-07T19:52:04.4328507Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:52:04.4329235Z 2025-05-07T19:52:04.4331033Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:52:04.4333886Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:52:04.4335218Z ^ 2025-05-07T19:52:04.4335629Z 2025-05-07T19:52:04.4337420Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:52:04.4340293Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:52:04.4341352Z ^ 2025-05-07T19:52:04.4341646Z 2025-05-07T19:52:04.4342084Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:52:04.4342850Z 2025-05-07T19:52:04.4344588Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:52:04.4347452Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:52:04.4348734Z ^ 2025-05-07T19:52:04.4349133Z 2025-05-07T19:52:06.0885810Z [122/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_inference_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_inference.dir/gen_embedding_forward_quantized_split_nbit_host_unweighted_codegen_cuda.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_inference.dir/gen_embedding_forward_quantized_split_nbit_host_unweighted_codegen_cuda.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.12/cmake-build/gen_embedding_forward_quantized_split_nbit_host_unweighted_codegen_cuda.cu -o CMakeFiles/fbgemm_gpu_tbe_inference.dir/gen_embedding_forward_quantized_split_nbit_host_unweighted_codegen_cuda.cu.o 2025-05-07T19:52:06.0907065Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:52:06.0909669Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:52:06.0910745Z ^ 2025-05-07T19:52:06.0910991Z 2025-05-07T19:52:06.0911544Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:52:06.0912181Z 2025-05-07T19:52:06.0913765Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:52:06.0916288Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:52:06.0917376Z ^ 2025-05-07T19:52:06.0917698Z 2025-05-07T19:52:06.0919223Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:52:06.0921485Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:52:06.0922554Z ^ 2025-05-07T19:52:06.0922827Z 2025-05-07T19:52:06.0923250Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:52:06.0923883Z 2025-05-07T19:52:06.0925457Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:52:06.0927991Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:52:06.0929160Z ^ 2025-05-07T19:52:06.0929532Z 2025-05-07T19:52:06.0931110Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:52:06.0933568Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:52:06.0934695Z ^ 2025-05-07T19:52:06.0934960Z 2025-05-07T19:52:06.0935390Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:52:06.0936367Z 2025-05-07T19:52:06.0937928Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:52:06.0940667Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:52:06.0941802Z ^ 2025-05-07T19:52:06.0942189Z 2025-05-07T19:52:06.0943702Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:52:06.0946263Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:52:06.0947370Z ^ 2025-05-07T19:52:06.0947644Z 2025-05-07T19:52:06.0948044Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:52:06.0948580Z 2025-05-07T19:52:06.0950103Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:52:06.0952793Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:52:06.0953962Z ^ 2025-05-07T19:52:06.0954304Z 2025-05-07T19:52:06.0955900Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:52:06.0958576Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:52:06.0959611Z ^ 2025-05-07T19:52:06.0959882Z 2025-05-07T19:52:06.0960302Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:52:06.0960917Z 2025-05-07T19:52:06.0962583Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:52:06.0965489Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:52:06.0966697Z ^ 2025-05-07T19:52:06.0967069Z 2025-05-07T19:52:07.3154724Z [123/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_inference_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_inference.dir/gen_embedding_forward_quantized_split_nbit_host_weighted_codegen_cuda.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_inference.dir/gen_embedding_forward_quantized_split_nbit_host_weighted_codegen_cuda.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.12/cmake-build/gen_embedding_forward_quantized_split_nbit_host_weighted_codegen_cuda.cu -o CMakeFiles/fbgemm_gpu_tbe_inference.dir/gen_embedding_forward_quantized_split_nbit_host_weighted_codegen_cuda.cu.o 2025-05-07T19:52:07.3180425Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:52:07.3183442Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:52:07.3184644Z ^ 2025-05-07T19:52:07.3184941Z 2025-05-07T19:52:07.3185483Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:52:07.3186243Z 2025-05-07T19:52:07.3187872Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:52:07.3190782Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:52:07.3192191Z ^ 2025-05-07T19:52:07.3192596Z 2025-05-07T19:52:07.3194313Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:52:07.3197194Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:52:07.3198440Z ^ 2025-05-07T19:52:07.3198728Z 2025-05-07T19:52:07.3199254Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:52:07.3199989Z 2025-05-07T19:52:07.3201685Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:52:07.3204604Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:52:07.3205951Z ^ 2025-05-07T19:52:07.3206352Z 2025-05-07T19:52:07.3208075Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:52:07.3211347Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:52:07.3212638Z ^ 2025-05-07T19:52:07.3212951Z 2025-05-07T19:52:07.3213439Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:52:07.3214145Z 2025-05-07T19:52:07.3216254Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:52:07.3219192Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:52:07.3220522Z ^ 2025-05-07T19:52:07.3220926Z 2025-05-07T19:52:07.3222728Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:52:07.3225264Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:52:07.3226476Z ^ 2025-05-07T19:52:07.3226771Z 2025-05-07T19:52:07.3227252Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:52:07.3227988Z 2025-05-07T19:52:07.3229788Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:52:07.3232755Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:52:07.3234076Z ^ 2025-05-07T19:52:07.3234407Z 2025-05-07T19:52:07.3236172Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:52:07.3238991Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:52:07.3240288Z ^ 2025-05-07T19:52:07.3240603Z 2025-05-07T19:52:07.3241066Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:52:07.3241731Z 2025-05-07T19:52:07.3243559Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:52:07.3246382Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:52:07.3247620Z ^ 2025-05-07T19:52:07.3247967Z 2025-05-07T19:52:07.4756714Z [124/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_cache_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_cache.dir/src/split_embeddings_cache/lru_cache_find.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_cache.dir/src/split_embeddings_cache/lru_cache_find.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/split_embeddings_cache/lru_cache_find.cu -o CMakeFiles/fbgemm_gpu_tbe_cache.dir/src/split_embeddings_cache/lru_cache_find.cu.o 2025-05-07T19:52:07.4779772Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:52:07.4782494Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:52:07.4783688Z ^ 2025-05-07T19:52:07.4784026Z 2025-05-07T19:52:07.4784488Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:52:07.4785148Z 2025-05-07T19:52:07.4786821Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:52:07.4789531Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:52:07.4790723Z ^ 2025-05-07T19:52:07.4791086Z 2025-05-07T19:52:07.4792808Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:52:07.4795447Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:52:07.4796605Z ^ 2025-05-07T19:52:07.4796887Z 2025-05-07T19:52:07.4797318Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:52:07.4797971Z 2025-05-07T19:52:07.4799560Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:52:07.4802272Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:52:07.4803467Z ^ 2025-05-07T19:52:07.4803822Z 2025-05-07T19:52:07.4805412Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:52:07.4808404Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:52:07.4809568Z ^ 2025-05-07T19:52:07.4809983Z 2025-05-07T19:52:07.4810392Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:52:07.4811024Z 2025-05-07T19:52:07.4812441Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:52:07.4814852Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:52:07.4815847Z ^ 2025-05-07T19:52:07.4816148Z 2025-05-07T19:52:07.4817566Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:52:07.4820243Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:52:07.4821199Z ^ 2025-05-07T19:52:07.4821445Z 2025-05-07T19:52:07.4821841Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:52:07.4822411Z 2025-05-07T19:52:07.4823968Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:52:07.4826673Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:52:07.4827897Z ^ 2025-05-07T19:52:07.4828275Z 2025-05-07T19:52:07.4829976Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:52:07.4832844Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:52:07.4834068Z ^ 2025-05-07T19:52:07.4834328Z 2025-05-07T19:52:07.4834781Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:52:07.4835474Z 2025-05-07T19:52:07.4837133Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:52:07.4839893Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:52:07.4841096Z ^ 2025-05-07T19:52:07.4841464Z 2025-05-07T19:52:08.5954900Z [125/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_sparse_async_cumsum_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -MD -MT CMakeFiles/fbgemm_gpu_sparse_async_cumsum.dir/src/sparse_ops/sparse_async_cumsum.cu.o -MF CMakeFiles/fbgemm_gpu_sparse_async_cumsum.dir/src/sparse_ops/sparse_async_cumsum.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/sparse_ops/sparse_async_cumsum.cu -o CMakeFiles/fbgemm_gpu_sparse_async_cumsum.dir/src/sparse_ops/sparse_async_cumsum.cu.o 2025-05-07T19:52:08.5975942Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:52:08.5978490Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:52:08.5979572Z ^ 2025-05-07T19:52:08.5979807Z 2025-05-07T19:52:08.5980217Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:52:08.5980829Z 2025-05-07T19:52:08.5982433Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:52:08.5984977Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:52:08.5986135Z ^ 2025-05-07T19:52:08.5986486Z 2025-05-07T19:52:08.5988160Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:52:08.5990644Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:52:08.5991825Z ^ 2025-05-07T19:52:08.5992054Z 2025-05-07T19:52:08.5992493Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:52:08.5993114Z 2025-05-07T19:52:08.5994660Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:52:08.5997191Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:52:08.5998289Z ^ 2025-05-07T19:52:08.5998943Z 2025-05-07T19:52:08.6000544Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:52:08.6003165Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:52:08.6004377Z ^ 2025-05-07T19:52:08.6004649Z 2025-05-07T19:52:08.6005052Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:52:08.6005676Z 2025-05-07T19:52:08.6007250Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:52:08.6009728Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:52:08.6010889Z ^ 2025-05-07T19:52:08.6011235Z 2025-05-07T19:52:08.6012825Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:52:08.6015395Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:52:08.6016605Z ^ 2025-05-07T19:52:08.6016847Z 2025-05-07T19:52:08.6017260Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:52:08.6017907Z 2025-05-07T19:52:08.6019438Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:52:08.6021955Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:52:08.6023063Z ^ 2025-05-07T19:52:08.6023430Z 2025-05-07T19:52:08.6024949Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:52:08.6027485Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:52:08.6028594Z ^ 2025-05-07T19:52:08.6028856Z 2025-05-07T19:52:08.6029304Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:52:08.6029932Z 2025-05-07T19:52:08.6031701Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:52:08.6034319Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:52:08.6035437Z ^ 2025-05-07T19:52:08.6035781Z 2025-05-07T19:52:39.7416613Z [126/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_cache_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_cache.dir/src/split_embeddings_cache/linearize_cache_indices.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_cache.dir/src/split_embeddings_cache/linearize_cache_indices.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/split_embeddings_cache/linearize_cache_indices.cu -o CMakeFiles/fbgemm_gpu_tbe_cache.dir/src/split_embeddings_cache/linearize_cache_indices.cu.o 2025-05-07T19:52:39.7437560Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:52:39.7440197Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:52:39.7441324Z ^ 2025-05-07T19:52:39.7441567Z 2025-05-07T19:52:39.7442048Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:52:39.7442681Z 2025-05-07T19:52:39.7444232Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:52:39.7446662Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:52:39.7447579Z ^ 2025-05-07T19:52:39.7447866Z 2025-05-07T19:52:39.7449182Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:52:39.7451686Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:52:39.7452775Z ^ 2025-05-07T19:52:39.7453016Z 2025-05-07T19:52:39.7453430Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:52:39.7454044Z 2025-05-07T19:52:39.7455295Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:52:39.7457802Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:52:39.7459282Z ^ 2025-05-07T19:52:39.7459626Z 2025-05-07T19:52:39.7461376Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:52:39.7463756Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:52:39.7465156Z ^ 2025-05-07T19:52:39.7465404Z 2025-05-07T19:52:39.7465788Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:52:39.7466384Z 2025-05-07T19:52:39.7467806Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:52:39.7470360Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:52:39.7471540Z ^ 2025-05-07T19:52:39.7471881Z 2025-05-07T19:52:39.7473461Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:52:39.7475741Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:52:39.7476787Z ^ 2025-05-07T19:52:39.7477034Z 2025-05-07T19:52:39.7477461Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:52:39.7478066Z 2025-05-07T19:52:39.7479549Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:52:39.7481936Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:52:39.7483035Z ^ 2025-05-07T19:52:39.7483367Z 2025-05-07T19:52:39.7484819Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:52:39.7487215Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:52:39.7488260Z ^ 2025-05-07T19:52:39.7488542Z 2025-05-07T19:52:39.7488946Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:52:39.7489688Z 2025-05-07T19:52:39.7491151Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:52:39.7493159Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:52:39.7494046Z ^ 2025-05-07T19:52:39.7494397Z 2025-05-07T19:52:40.0083433Z [127/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_optimizers_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_optimizers.dir/gen_embedding_optimizer_rowwise_adagrad_split_kernel.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_optimizers.dir/gen_embedding_optimizer_rowwise_adagrad_split_kernel.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.12/cmake-build/gen_embedding_optimizer_rowwise_adagrad_split_kernel.cu -o CMakeFiles/fbgemm_gpu_tbe_optimizers.dir/gen_embedding_optimizer_rowwise_adagrad_split_kernel.cu.o 2025-05-07T19:52:40.0104426Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:52:40.0106937Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:52:40.0108051Z ^ 2025-05-07T19:52:40.0108296Z 2025-05-07T19:52:40.0108747Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:52:40.0109355Z 2025-05-07T19:52:40.0110866Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:52:40.0113517Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:52:40.0114632Z ^ 2025-05-07T19:52:40.0114969Z 2025-05-07T19:52:40.0116475Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:52:40.0118886Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:52:40.0119923Z ^ 2025-05-07T19:52:40.0120171Z 2025-05-07T19:52:40.0120572Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:52:40.0121159Z 2025-05-07T19:52:40.0122660Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:52:40.0125388Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:52:40.0126456Z ^ 2025-05-07T19:52:40.0126969Z 2025-05-07T19:52:40.0128469Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:52:40.0130831Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:52:40.0131903Z ^ 2025-05-07T19:52:40.0132135Z 2025-05-07T19:52:40.0132560Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:52:40.0133203Z 2025-05-07T19:52:40.0134698Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:52:40.0137169Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:52:40.0138237Z ^ 2025-05-07T19:52:40.0138604Z 2025-05-07T19:52:40.0140083Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:52:40.0142510Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:52:40.0143592Z ^ 2025-05-07T19:52:40.0143866Z 2025-05-07T19:52:40.0144286Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:52:40.0144901Z 2025-05-07T19:52:40.0146467Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:52:40.0148915Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:52:40.0150035Z ^ 2025-05-07T19:52:40.0150365Z 2025-05-07T19:52:40.0152029Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:52:40.0154411Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:52:40.0155449Z ^ 2025-05-07T19:52:40.0155696Z 2025-05-07T19:52:40.0156114Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:52:40.0156735Z 2025-05-07T19:52:40.0158200Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:52:40.0160545Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:52:40.0161608Z ^ 2025-05-07T19:52:40.0161972Z 2025-05-07T19:52:40.4008775Z [128/608] : && /github/home/miniconda/envs/build_binary/bin/c++ -fPIC -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -L/github/home/miniconda/envs/build_binary/targets/x86_64-linux/lib -L/github/home/miniconda/envs/build_binary/targets/x86_64-linux/lib/stubs -s -shared -Wl,-soname,fbgemm_gpu_tbe_cache.so -o fbgemm_gpu_tbe_cache.so CMakeFiles/fbgemm_gpu_tbe_cache.dir/src/split_embeddings_cache/lfu_cache_populate_byte.cpp.o CMakeFiles/fbgemm_gpu_tbe_cache.dir/src/split_embeddings_cache/linearize_cache_indices.cpp.o CMakeFiles/fbgemm_gpu_tbe_cache.dir/src/split_embeddings_cache/lru_cache_populate_byte.cpp.o CMakeFiles/fbgemm_gpu_tbe_cache.dir/src/split_embeddings_cache/lxu_cache.cpp.o CMakeFiles/fbgemm_gpu_tbe_cache.dir/src/split_embeddings_cache/split_embeddings_cache_ops.cpp.o CMakeFiles/fbgemm_gpu_tbe_cache.dir/src/split_embeddings_cache/lfu_cache_find.cu.o CMakeFiles/fbgemm_gpu_tbe_cache.dir/src/split_embeddings_cache/lfu_cache_populate.cu.o CMakeFiles/fbgemm_gpu_tbe_cache.dir/src/split_embeddings_cache/lfu_cache_populate_byte.cu.o CMakeFiles/fbgemm_gpu_tbe_cache.dir/src/split_embeddings_cache/linearize_cache_indices.cu.o CMakeFiles/fbgemm_gpu_tbe_cache.dir/src/split_embeddings_cache/lru_cache_find.cu.o CMakeFiles/fbgemm_gpu_tbe_cache.dir/src/split_embeddings_cache/lru_cache_populate.cu.o CMakeFiles/fbgemm_gpu_tbe_cache.dir/src/split_embeddings_cache/lru_cache_populate_byte.cu.o CMakeFiles/fbgemm_gpu_tbe_cache.dir/src/split_embeddings_cache/lxu_cache.cu.o CMakeFiles/fbgemm_gpu_tbe_cache.dir/src/split_embeddings_cache/reset_weight_momentum.cu.o CMakeFiles/fbgemm_gpu_tbe_cache.dir/src/split_embeddings_cache/split_embeddings_cache_ops.cu.o -L/lib/intel64 -L/lib/intel64_win -L/lib/win-x64 -Wl,-rpath,/lib/intel64:/lib/intel64_win:/lib/win-x64:/github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/lib:/github/home/miniconda/envs/build_binary/lib/stubs: /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/lib/libtorch.so /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/lib/libc10.so /github/home/miniconda/envs/build_binary/targets/x86_64-linux/lib/libnvrtc.so /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/lib/libc10_cuda.so /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/nvidia/nccl/lib/libnccl.so.2 /github/home/miniconda/envs/build_binary/targets/x86_64-linux/lib/stubs/libcuda.so /github/home/miniconda/envs/build_binary/lib/stubs/libnvidia-ml.so -Wl,--no-as-needed,"/github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/lib/libtorch_cpu.so" -Wl,--as-needed -Wl,--no-as-needed,"/github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/lib/libtorch_cuda.so" -Wl,--as-needed /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/lib/libc10_cuda.so /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/lib/libc10.so /github/home/miniconda/envs/build_binary/targets/x86_64-linux/lib/libcudart.so -Wl,--no-as-needed,"/github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/lib/libtorch.so" -Wl,--as-needed -lcudadevrt -lcudart_static -ldl && : 2025-05-07T19:52:40.5921232Z [129/608] : && /github/home/miniconda/envs/build_binary/bin/c++ -fPIC -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -L/github/home/miniconda/envs/build_binary/targets/x86_64-linux/lib -L/github/home/miniconda/envs/build_binary/targets/x86_64-linux/lib/stubs -s -shared -Wl,-soname,fbgemm_gpu_tbe_optimizers.so -o fbgemm_gpu_tbe_optimizers.so CMakeFiles/fbgemm_gpu_tbe_optimizers.dir/gen_embedding_optimizer_rowwise_adagrad_split.cpp.o CMakeFiles/fbgemm_gpu_tbe_optimizers.dir/gen_embedding_optimizer_rowwise_adagrad_split_cuda.cu.o CMakeFiles/fbgemm_gpu_tbe_optimizers.dir/gen_embedding_optimizer_rowwise_adagrad_split_kernel.cu.o -L/lib/intel64 -L/lib/intel64_win -L/lib/win-x64 -Wl,-rpath,/lib/intel64:/lib/intel64_win:/lib/win-x64:/github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/lib:/github/home/miniconda/envs/build_binary/lib/stubs: /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/lib/libtorch.so /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/lib/libc10.so /github/home/miniconda/envs/build_binary/targets/x86_64-linux/lib/libnvrtc.so /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/lib/libc10_cuda.so /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/nvidia/nccl/lib/libnccl.so.2 /github/home/miniconda/envs/build_binary/targets/x86_64-linux/lib/stubs/libcuda.so /github/home/miniconda/envs/build_binary/lib/stubs/libnvidia-ml.so -Wl,--no-as-needed,"/github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/lib/libtorch_cpu.so" -Wl,--as-needed -Wl,--no-as-needed,"/github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/lib/libtorch_cuda.so" -Wl,--as-needed /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/lib/libc10_cuda.so /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/lib/libc10.so /github/home/miniconda/envs/build_binary/targets/x86_64-linux/lib/libcudart.so -Wl,--no-as-needed,"/github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/lib/libtorch.so" -Wl,--as-needed -lcudadevrt -lcudart_static -ldl && : 2025-05-07T19:52:42.2189007Z [130/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_forward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_dense_unweighted_nobag_kernel.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_dense_unweighted_nobag_kernel.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.12/cmake-build/gen_embedding_forward_dense_unweighted_nobag_kernel.cu -o CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_dense_unweighted_nobag_kernel.cu.o 2025-05-07T19:52:42.2213942Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:52:42.2217005Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:52:42.2218178Z ^ 2025-05-07T19:52:42.2218461Z 2025-05-07T19:52:42.2218885Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:52:42.2219517Z 2025-05-07T19:52:42.2220964Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:52:42.2223736Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:52:42.2224827Z ^ 2025-05-07T19:52:42.2225183Z 2025-05-07T19:52:42.2226789Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:52:42.2229395Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:52:42.2230555Z ^ 2025-05-07T19:52:42.2230819Z 2025-05-07T19:52:42.2231285Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:52:42.2232110Z 2025-05-07T19:52:42.2233773Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:52:42.2236441Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:52:42.2237593Z ^ 2025-05-07T19:52:42.2237988Z 2025-05-07T19:52:42.2239586Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:52:42.2242290Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:52:42.2243502Z ^ 2025-05-07T19:52:42.2243775Z 2025-05-07T19:52:42.2244230Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:52:42.2244922Z 2025-05-07T19:52:42.2246732Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:52:42.2249315Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:52:42.2250476Z ^ 2025-05-07T19:52:42.2250842Z 2025-05-07T19:52:42.2252488Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:52:42.2255189Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:52:42.2256697Z ^ 2025-05-07T19:52:42.2256948Z 2025-05-07T19:52:42.2257394Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:52:42.2258062Z 2025-05-07T19:52:42.2259803Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:52:42.2262402Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:52:42.2263562Z ^ 2025-05-07T19:52:42.2263957Z 2025-05-07T19:52:42.2265865Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:52:42.2268569Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:52:42.2269762Z ^ 2025-05-07T19:52:42.2270005Z 2025-05-07T19:52:42.2270482Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:52:42.2271105Z 2025-05-07T19:52:42.2272792Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:52:42.2275398Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:52:42.2276590Z ^ 2025-05-07T19:52:42.2276962Z 2025-05-07T19:52:49.7987408Z [131/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_utils_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -MD -MT CMakeFiles/fbgemm_gpu_tbe_utils.dir/src/split_embeddings_utils/transpose_embedding_input.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_utils.dir/src/split_embeddings_utils/transpose_embedding_input.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/split_embeddings_utils/transpose_embedding_input.cu -o CMakeFiles/fbgemm_gpu_tbe_utils.dir/src/split_embeddings_utils/transpose_embedding_input.cu.o 2025-05-07T19:52:49.8010560Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:52:49.8013153Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:52:49.8014284Z ^ 2025-05-07T19:52:49.8014567Z 2025-05-07T19:52:49.8015025Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:52:49.8015680Z 2025-05-07T19:52:49.8017279Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:52:49.8020005Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:52:49.8021204Z ^ 2025-05-07T19:52:49.8021568Z 2025-05-07T19:52:49.8023216Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:52:49.8025928Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:52:49.8027289Z ^ 2025-05-07T19:52:49.8027545Z 2025-05-07T19:52:49.8027989Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:52:49.8028695Z 2025-05-07T19:52:49.8030344Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:52:49.8033216Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:52:49.8034432Z ^ 2025-05-07T19:52:49.8034793Z 2025-05-07T19:52:49.8036495Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:52:49.8039053Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:52:49.8040076Z ^ 2025-05-07T19:52:49.8040281Z 2025-05-07T19:52:49.8040710Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:52:49.8041340Z 2025-05-07T19:52:49.8042940Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:52:49.8045673Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:52:49.8046927Z ^ 2025-05-07T19:52:49.8047312Z 2025-05-07T19:52:49.8048999Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:52:49.8051688Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:52:49.8053223Z ^ 2025-05-07T19:52:49.8053528Z 2025-05-07T19:52:49.8054002Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:52:49.8054700Z 2025-05-07T19:52:49.8056588Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:52:49.8059247Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:52:49.8060496Z ^ 2025-05-07T19:52:49.8060872Z 2025-05-07T19:52:49.8062601Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:52:49.8065639Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:52:49.8066863Z ^ 2025-05-07T19:52:49.8067126Z 2025-05-07T19:52:49.8067622Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:52:49.8068343Z 2025-05-07T19:52:49.8070072Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:52:49.8072858Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:52:49.8073962Z ^ 2025-05-07T19:52:49.8074340Z 2025-05-07T19:52:54.4273686Z [132/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_forward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_dense_weighted_codegen_cuda.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_dense_weighted_codegen_cuda.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.12/cmake-build/gen_embedding_forward_dense_weighted_codegen_cuda.cu -o CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_dense_weighted_codegen_cuda.cu.o 2025-05-07T19:52:54.4297592Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:52:54.4300099Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:52:54.4301000Z ^ 2025-05-07T19:52:54.4301319Z 2025-05-07T19:52:54.4301746Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:52:54.4302380Z 2025-05-07T19:52:54.4303890Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:52:54.4306522Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:52:54.4307736Z ^ 2025-05-07T19:52:54.4308103Z 2025-05-07T19:52:54.4309713Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:52:54.4312580Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:52:54.4313851Z ^ 2025-05-07T19:52:54.4314116Z 2025-05-07T19:52:54.4314590Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:52:54.4315314Z 2025-05-07T19:52:54.4316882Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:52:54.4319639Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:52:54.4320828Z ^ 2025-05-07T19:52:54.4321184Z 2025-05-07T19:52:54.4322830Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:52:54.4325637Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:52:54.4326854Z ^ 2025-05-07T19:52:54.4327139Z 2025-05-07T19:52:54.4327550Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:52:54.4328230Z 2025-05-07T19:52:54.4330000Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:52:54.4332735Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:52:54.4333958Z ^ 2025-05-07T19:52:54.4334286Z 2025-05-07T19:52:54.4336382Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:52:54.4339201Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:52:54.4340429Z ^ 2025-05-07T19:52:54.4340852Z 2025-05-07T19:52:54.4341326Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:52:54.4342180Z 2025-05-07T19:52:54.4343969Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:52:54.4346864Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:52:54.4348136Z ^ 2025-05-07T19:52:54.4348545Z 2025-05-07T19:52:54.4350563Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:52:54.4353554Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:52:54.4354728Z ^ 2025-05-07T19:52:54.4355053Z 2025-05-07T19:52:54.4355510Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:52:54.4356166Z 2025-05-07T19:52:54.4357742Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:52:54.4360543Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:52:54.4361809Z ^ 2025-05-07T19:52:54.4362190Z 2025-05-07T19:52:55.8641364Z [133/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_forward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_split_unweighted_nobag_kernel.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_split_unweighted_nobag_kernel.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.12/cmake-build/gen_embedding_forward_split_unweighted_nobag_kernel.cu -o CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_split_unweighted_nobag_kernel.cu.o 2025-05-07T19:52:55.8666078Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:52:55.8669065Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:52:55.8670310Z ^ 2025-05-07T19:52:55.8670578Z 2025-05-07T19:52:55.8671047Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:52:55.8671891Z 2025-05-07T19:52:55.8673619Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:52:55.8676467Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:52:55.8677731Z ^ 2025-05-07T19:52:55.8678121Z 2025-05-07T19:52:55.8679888Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:52:55.8682705Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:52:55.8683971Z ^ 2025-05-07T19:52:55.8684248Z 2025-05-07T19:52:55.8684745Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:52:55.8685439Z 2025-05-07T19:52:55.8687192Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:52:55.8690045Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:52:55.8691323Z ^ 2025-05-07T19:52:55.8691698Z 2025-05-07T19:52:55.8693437Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:52:55.8696264Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:52:55.8697494Z ^ 2025-05-07T19:52:55.8697792Z 2025-05-07T19:52:55.8698256Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:52:55.8698952Z 2025-05-07T19:52:55.8700716Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:52:55.8703892Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:52:55.8705156Z ^ 2025-05-07T19:52:55.8705537Z 2025-05-07T19:52:55.8707475Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:52:55.8710218Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:52:55.8711575Z ^ 2025-05-07T19:52:55.8711831Z 2025-05-07T19:52:55.8712288Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:52:55.8712993Z 2025-05-07T19:52:55.8714738Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:52:55.8717612Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:52:55.8718871Z ^ 2025-05-07T19:52:55.8719286Z 2025-05-07T19:52:55.8721045Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:52:55.8723895Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:52:55.8725138Z ^ 2025-05-07T19:52:55.8725584Z 2025-05-07T19:52:55.8726055Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:52:55.8726769Z 2025-05-07T19:52:55.8728556Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:52:55.8731374Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:52:55.8732652Z ^ 2025-05-07T19:52:55.8733033Z 2025-05-07T19:52:58.3382915Z [134/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_forward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_dense_unweighted_codegen_cuda.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_dense_unweighted_codegen_cuda.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.12/cmake-build/gen_embedding_forward_dense_unweighted_codegen_cuda.cu -o CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_dense_unweighted_codegen_cuda.cu.o 2025-05-07T19:52:58.3407219Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:52:58.3410092Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:52:58.3411345Z ^ 2025-05-07T19:52:58.3411610Z 2025-05-07T19:52:58.3412227Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:52:58.3413028Z 2025-05-07T19:52:58.3414766Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:52:58.3417608Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:52:58.3418844Z ^ 2025-05-07T19:52:58.3419241Z 2025-05-07T19:52:58.3420964Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:52:58.3423761Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:52:58.3424974Z ^ 2025-05-07T19:52:58.3425265Z 2025-05-07T19:52:58.3425733Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:52:58.3426427Z 2025-05-07T19:52:58.3428195Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:52:58.3430990Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:52:58.3432354Z ^ 2025-05-07T19:52:58.3432730Z 2025-05-07T19:52:58.3434390Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:52:58.3437119Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:52:58.3438353Z ^ 2025-05-07T19:52:58.3438615Z 2025-05-07T19:52:58.3439074Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:52:58.3440039Z 2025-05-07T19:52:58.3441782Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:52:58.3444608Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:52:58.3445998Z ^ 2025-05-07T19:52:58.3446396Z 2025-05-07T19:52:58.3448102Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:52:58.3450849Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:52:58.3452040Z ^ 2025-05-07T19:52:58.3452308Z 2025-05-07T19:52:58.3452798Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:52:58.3453477Z 2025-05-07T19:52:58.3455192Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:52:58.3458021Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:52:58.3459267Z ^ 2025-05-07T19:52:58.3459641Z 2025-05-07T19:52:58.3461367Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:52:58.3463844Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:52:58.3465317Z ^ 2025-05-07T19:52:58.3465549Z 2025-05-07T19:52:58.3465978Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:52:58.3466609Z 2025-05-07T19:52:58.3468259Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:52:58.3470817Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:52:58.3471965Z ^ 2025-05-07T19:52:58.3472325Z 2025-05-07T19:53:01.1751680Z [135/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_forward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_ssd_unweighted_nobag_kernel.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_ssd_unweighted_nobag_kernel.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.12/cmake-build/gen_embedding_forward_ssd_unweighted_nobag_kernel.cu -o CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_ssd_unweighted_nobag_kernel.cu.o 2025-05-07T19:53:01.1775815Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:53:01.1778466Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:53:01.1779628Z ^ 2025-05-07T19:53:01.1779885Z 2025-05-07T19:53:01.1780345Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:53:01.1781014Z 2025-05-07T19:53:01.1782631Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:53:01.1797423Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:53:01.1798663Z ^ 2025-05-07T19:53:01.1799017Z 2025-05-07T19:53:01.1800677Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:53:01.1803337Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:53:01.1804513Z ^ 2025-05-07T19:53:01.1804787Z 2025-05-07T19:53:01.1805231Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:53:01.1805916Z 2025-05-07T19:53:01.1807550Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:53:01.1810271Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:53:01.1811451Z ^ 2025-05-07T19:53:01.1811804Z 2025-05-07T19:53:01.1813424Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:53:01.1816049Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:53:01.1817676Z ^ 2025-05-07T19:53:01.1817915Z 2025-05-07T19:53:01.1818383Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:53:01.1819048Z 2025-05-07T19:53:01.1820887Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:53:01.1823597Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:53:01.1824793Z ^ 2025-05-07T19:53:01.1825165Z 2025-05-07T19:53:01.1826892Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:53:01.1829269Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:53:01.1830114Z ^ 2025-05-07T19:53:01.1830406Z 2025-05-07T19:53:01.1830869Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:53:01.1831722Z 2025-05-07T19:53:01.1833457Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:53:01.1836153Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:53:01.1837345Z ^ 2025-05-07T19:53:01.1837701Z 2025-05-07T19:53:01.1839303Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:53:01.1841945Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:53:01.1843115Z ^ 2025-05-07T19:53:01.1843380Z 2025-05-07T19:53:01.1843815Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:53:01.1844483Z 2025-05-07T19:53:01.1846107Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:53:01.1848779Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:53:01.1849934Z ^ 2025-05-07T19:53:01.1850325Z 2025-05-07T19:53:07.0000486Z [136/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_forward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_split_weighted_codegen_cuda.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_split_weighted_codegen_cuda.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.12/cmake-build/gen_embedding_forward_split_weighted_codegen_cuda.cu -o CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_split_weighted_codegen_cuda.cu.o 2025-05-07T19:53:07.0024089Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:53:07.0026626Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:53:07.0027585Z ^ 2025-05-07T19:53:07.0027851Z 2025-05-07T19:53:07.0028379Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:53:07.0029065Z 2025-05-07T19:53:07.0030785Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:53:07.0033896Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:53:07.0035107Z ^ 2025-05-07T19:53:07.0035465Z 2025-05-07T19:53:07.0037112Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:53:07.0039803Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:53:07.0041005Z ^ 2025-05-07T19:53:07.0041266Z 2025-05-07T19:53:07.0041721Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:53:07.0042407Z 2025-05-07T19:53:07.0044192Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:53:07.0046908Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:53:07.0048085Z ^ 2025-05-07T19:53:07.0048479Z 2025-05-07T19:53:07.0050102Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:53:07.0053156Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:53:07.0054740Z ^ 2025-05-07T19:53:07.0055033Z 2025-05-07T19:53:07.0055678Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:53:07.0056363Z 2025-05-07T19:53:07.0058233Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:53:07.0060991Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:53:07.0062241Z ^ 2025-05-07T19:53:07.0062608Z 2025-05-07T19:53:07.0064415Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:53:07.0067440Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:53:07.0068619Z ^ 2025-05-07T19:53:07.0068869Z 2025-05-07T19:53:07.0069316Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:53:07.0069995Z 2025-05-07T19:53:07.0071830Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:53:07.0074524Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:53:07.0075695Z ^ 2025-05-07T19:53:07.0076056Z 2025-05-07T19:53:07.0077699Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:53:07.0080375Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:53:07.0081571Z ^ 2025-05-07T19:53:07.0081823Z 2025-05-07T19:53:07.0082292Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:53:07.0082956Z 2025-05-07T19:53:07.0084609Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:53:07.0087429Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:53:07.0088842Z ^ 2025-05-07T19:53:07.0089213Z 2025-05-07T19:53:12.5882507Z [137/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_inference_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_inference.dir/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_nobag_fp32_codegen_cuda.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_inference.dir/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_nobag_fp32_codegen_cuda.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.12/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_nobag_fp32_codegen_cuda.cu -o CMakeFiles/fbgemm_gpu_tbe_inference.dir/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_nobag_fp32_codegen_cuda.cu.o 2025-05-07T19:53:12.5908594Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:53:12.5911755Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:53:12.5913033Z ^ 2025-05-07T19:53:12.5913336Z 2025-05-07T19:53:12.5913813Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:53:12.5914516Z 2025-05-07T19:53:12.5916300Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:53:12.5919138Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:53:12.5920437Z ^ 2025-05-07T19:53:12.5920994Z 2025-05-07T19:53:12.5922775Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.12/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_nobag_fp32_codegen_cuda.cu(216): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:53:12.5924968Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:53:12.5925590Z ^ 2025-05-07T19:53:12.5925889Z 2025-05-07T19:53:12.5927566Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.12/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_nobag_fp32_codegen_cuda.cu(233): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:53:12.5929767Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:53:12.5930391Z ^ 2025-05-07T19:53:12.5930970Z 2025-05-07T19:53:12.5932687Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.12/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_nobag_fp32_codegen_cuda.cu(243): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:53:12.5934870Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:53:12.5935502Z ^ 2025-05-07T19:53:12.5935822Z 2025-05-07T19:53:12.5937730Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:53:12.5940747Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:53:12.5941960Z ^ 2025-05-07T19:53:12.5942254Z 2025-05-07T19:53:12.5942721Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:53:12.5943415Z 2025-05-07T19:53:12.5945167Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:53:12.5947956Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:53:12.5949218Z ^ 2025-05-07T19:53:12.5949592Z 2025-05-07T19:53:12.5951270Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.12/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_nobag_fp32_codegen_cuda.cu(216): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:53:12.5953428Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:53:12.5954021Z ^ 2025-05-07T19:53:12.5954331Z 2025-05-07T19:53:12.5955996Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.12/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_nobag_fp32_codegen_cuda.cu(233): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:53:12.5958258Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:53:12.5958885Z ^ 2025-05-07T19:53:12.5959201Z 2025-05-07T19:53:12.5960922Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.12/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_nobag_fp32_codegen_cuda.cu(243): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:53:12.5963250Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:53:12.5963835Z ^ 2025-05-07T19:53:12.5964179Z 2025-05-07T19:53:12.5966172Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:53:12.5969044Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:53:12.5970293Z ^ 2025-05-07T19:53:12.5970585Z 2025-05-07T19:53:12.5971052Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:53:12.5971776Z 2025-05-07T19:53:12.5973581Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:53:12.5976439Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:53:12.5977713Z ^ 2025-05-07T19:53:12.5978193Z 2025-05-07T19:53:12.5980110Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.12/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_nobag_fp32_codegen_cuda.cu(216): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:53:12.5982280Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:53:12.5982865Z ^ 2025-05-07T19:53:12.5983161Z 2025-05-07T19:53:12.5984983Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.12/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_nobag_fp32_codegen_cuda.cu(233): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:53:12.5986933Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:53:12.5987517Z ^ 2025-05-07T19:53:12.5987819Z 2025-05-07T19:53:12.5989429Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.12/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_nobag_fp32_codegen_cuda.cu(243): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:53:12.5991627Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:53:12.5992188Z ^ 2025-05-07T19:53:12.5992523Z 2025-05-07T19:53:12.5994253Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:53:12.5997065Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:53:12.5998289Z ^ 2025-05-07T19:53:12.5998578Z 2025-05-07T19:53:12.5999044Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:53:12.5999746Z 2025-05-07T19:53:12.6001506Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:53:12.6004334Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:53:12.6005597Z ^ 2025-05-07T19:53:12.6006094Z 2025-05-07T19:53:12.6008032Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.12/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_nobag_fp32_codegen_cuda.cu(216): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:53:12.6010212Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:53:12.6010838Z ^ 2025-05-07T19:53:12.6011163Z 2025-05-07T19:53:12.6012888Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.12/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_nobag_fp32_codegen_cuda.cu(233): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:53:12.6015089Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:53:12.6015712Z ^ 2025-05-07T19:53:12.6016148Z 2025-05-07T19:53:12.6017841Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.12/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_nobag_fp32_codegen_cuda.cu(243): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:53:12.6019541Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:53:12.6020014Z ^ 2025-05-07T19:53:12.6020456Z 2025-05-07T19:53:12.6021810Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:53:12.6024179Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:53:12.6025500Z ^ 2025-05-07T19:53:12.6025765Z 2025-05-07T19:53:12.6026165Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:53:12.6026762Z 2025-05-07T19:53:12.6030538Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:53:12.6033331Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:53:12.6034489Z ^ 2025-05-07T19:53:12.6034864Z 2025-05-07T19:53:12.6036614Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.12/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_nobag_fp32_codegen_cuda.cu(216): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:53:12.6038845Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:53:12.6039446Z ^ 2025-05-07T19:53:12.6039743Z 2025-05-07T19:53:12.6041356Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.12/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_nobag_fp32_codegen_cuda.cu(233): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:53:12.6043406Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:53:12.6043959Z ^ 2025-05-07T19:53:12.6044283Z 2025-05-07T19:53:12.6045900Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.12/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_nobag_fp32_codegen_cuda.cu(243): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:53:12.6048027Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:53:12.6048584Z ^ 2025-05-07T19:53:12.6048926Z 2025-05-07T19:53:14.6508407Z [138/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_forward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_split_unweighted_codegen_cuda.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_split_unweighted_codegen_cuda.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.12/cmake-build/gen_embedding_forward_split_unweighted_codegen_cuda.cu -o CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_split_unweighted_codegen_cuda.cu.o 2025-05-07T19:53:14.6532626Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:53:14.6535230Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:53:14.6536475Z ^ 2025-05-07T19:53:14.6536760Z 2025-05-07T19:53:14.6537265Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:53:14.6537953Z 2025-05-07T19:53:14.6539670Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:53:14.6542407Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:53:14.6543600Z ^ 2025-05-07T19:53:14.6543902Z 2025-05-07T19:53:14.6545367Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:53:14.6547852Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:53:14.6548996Z ^ 2025-05-07T19:53:14.6549276Z 2025-05-07T19:53:14.6549720Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:53:14.6550380Z 2025-05-07T19:53:14.6552462Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:53:14.6555352Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:53:14.6556504Z ^ 2025-05-07T19:53:14.6556863Z 2025-05-07T19:53:14.6558559Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:53:14.6561378Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:53:14.6562632Z ^ 2025-05-07T19:53:14.6562900Z 2025-05-07T19:53:14.6563402Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:53:14.6564100Z 2025-05-07T19:53:14.6566135Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:53:14.6568929Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:53:14.6570181Z ^ 2025-05-07T19:53:14.6570977Z 2025-05-07T19:53:14.6572686Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:53:14.6575473Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:53:14.6576864Z ^ 2025-05-07T19:53:14.6577153Z 2025-05-07T19:53:14.6577618Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:53:14.6578300Z 2025-05-07T19:53:14.6579996Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:53:14.6582398Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:53:14.6583575Z ^ 2025-05-07T19:53:14.6583919Z 2025-05-07T19:53:14.6585566Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:53:14.6588202Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:53:14.6589366Z ^ 2025-05-07T19:53:14.6589625Z 2025-05-07T19:53:14.6590059Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:53:14.6590718Z 2025-05-07T19:53:14.6592505Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:53:14.6595001Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:53:14.6596050Z ^ 2025-05-07T19:53:14.6596435Z 2025-05-07T19:53:19.6144281Z [139/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_forward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_dense_weighted_vbe_codegen_cuda.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_dense_weighted_vbe_codegen_cuda.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.12/cmake-build/gen_embedding_forward_dense_weighted_vbe_codegen_cuda.cu -o CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_dense_weighted_vbe_codegen_cuda.cu.o 2025-05-07T19:53:19.6168153Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:53:19.6171399Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:53:19.6172649Z ^ 2025-05-07T19:53:19.6172946Z 2025-05-07T19:53:19.6173425Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:53:19.6174102Z 2025-05-07T19:53:19.6175841Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:53:19.6178604Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:53:19.6179830Z ^ 2025-05-07T19:53:19.6180164Z 2025-05-07T19:53:19.6181737Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:53:19.6184940Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:53:19.6186167Z ^ 2025-05-07T19:53:19.6186413Z 2025-05-07T19:53:19.6186801Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:53:19.6187393Z 2025-05-07T19:53:19.6189163Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:53:19.6191816Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:53:19.6192957Z ^ 2025-05-07T19:53:19.6193339Z 2025-05-07T19:53:19.6194883Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:53:19.6197635Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:53:19.6198701Z ^ 2025-05-07T19:53:19.6198965Z 2025-05-07T19:53:19.6199445Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:53:19.6200088Z 2025-05-07T19:53:19.6201759Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:53:19.6204863Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:53:19.6206020Z ^ 2025-05-07T19:53:19.6206365Z 2025-05-07T19:53:19.6208247Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:53:19.6210824Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:53:19.6211887Z ^ 2025-05-07T19:53:19.6212101Z 2025-05-07T19:53:19.6212478Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:53:19.6213016Z 2025-05-07T19:53:19.6214382Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:53:19.6216885Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:53:19.6218058Z ^ 2025-05-07T19:53:19.6218429Z 2025-05-07T19:53:19.6220036Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:53:19.6222521Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:53:19.6223671Z ^ 2025-05-07T19:53:19.6223909Z 2025-05-07T19:53:19.6224357Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:53:19.6224980Z 2025-05-07T19:53:19.6226488Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:53:19.6229097Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:53:19.6230211Z ^ 2025-05-07T19:53:19.6230578Z 2025-05-07T19:53:21.1434665Z [140/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_inference_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_inference.dir/gen_embedding_forward_quantized_split_nbit_kernel_weighted_fp32_codegen_cuda.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_inference.dir/gen_embedding_forward_quantized_split_nbit_kernel_weighted_fp32_codegen_cuda.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.12/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_weighted_fp32_codegen_cuda.cu -o CMakeFiles/fbgemm_gpu_tbe_inference.dir/gen_embedding_forward_quantized_split_nbit_kernel_weighted_fp32_codegen_cuda.cu.o 2025-05-07T19:53:21.1458894Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:53:21.1461770Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:53:21.1463060Z ^ 2025-05-07T19:53:21.1463310Z 2025-05-07T19:53:21.1463767Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:53:21.1464417Z 2025-05-07T19:53:21.1466759Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:53:21.1469454Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:53:21.1470596Z ^ 2025-05-07T19:53:21.1470950Z 2025-05-07T19:53:21.1472544Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.12/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_weighted_fp32_codegen_cuda.cu(266): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:53:21.1474646Z if (output_d >= 0 && output_d < D && packed_bag_store_idx < num_packed_bags) { 2025-05-07T19:53:21.1475385Z ^ 2025-05-07T19:53:21.1475700Z 2025-05-07T19:53:21.1477202Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.12/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_weighted_fp32_codegen_cuda.cu(271): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:53:21.1479188Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:53:21.1479743Z ^ 2025-05-07T19:53:21.1480052Z 2025-05-07T19:53:21.1481607Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.12/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_weighted_fp32_codegen_cuda.cu(288): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:53:21.1483571Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:53:21.1484106Z ^ 2025-05-07T19:53:21.1484379Z 2025-05-07T19:53:21.1485933Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.12/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_weighted_fp32_codegen_cuda.cu(300): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:53:21.1487727Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:53:21.1488308Z ^ 2025-05-07T19:53:21.1489007Z 2025-05-07T19:53:21.1490660Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:53:21.1493310Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:53:21.1494712Z ^ 2025-05-07T19:53:21.1494958Z 2025-05-07T19:53:21.1495345Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:53:21.1495988Z 2025-05-07T19:53:21.1497656Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:53:21.1500371Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:53:21.1501564Z ^ 2025-05-07T19:53:21.1501952Z 2025-05-07T19:53:21.1503490Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.12/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_weighted_fp32_codegen_cuda.cu(266): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:53:21.1505652Z if (output_d >= 0 && output_d < D && packed_bag_store_idx < num_packed_bags) { 2025-05-07T19:53:21.1506407Z ^ 2025-05-07T19:53:21.1506662Z 2025-05-07T19:53:21.1508317Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.12/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_weighted_fp32_codegen_cuda.cu(271): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:53:21.1510551Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:53:21.1511076Z ^ 2025-05-07T19:53:21.1511354Z 2025-05-07T19:53:21.1512990Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.12/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_weighted_fp32_codegen_cuda.cu(288): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:53:21.1515108Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:53:21.1515679Z ^ 2025-05-07T19:53:21.1515961Z 2025-05-07T19:53:21.1517560Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.12/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_weighted_fp32_codegen_cuda.cu(300): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:53:21.1519455Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:53:21.1519956Z ^ 2025-05-07T19:53:21.1520226Z 2025-05-07T19:53:21.1521762Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:53:21.1524554Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:53:21.1525774Z ^ 2025-05-07T19:53:21.1526038Z 2025-05-07T19:53:21.1526507Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:53:21.1527209Z 2025-05-07T19:53:21.1528943Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:53:21.1531725Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:53:21.1532916Z ^ 2025-05-07T19:53:21.1533286Z 2025-05-07T19:53:21.1534863Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.12/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_weighted_fp32_codegen_cuda.cu(266): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:53:21.1537316Z if (output_d >= 0 && output_d < D && packed_bag_store_idx < num_packed_bags) { 2025-05-07T19:53:21.1538114Z ^ 2025-05-07T19:53:21.1538403Z 2025-05-07T19:53:21.1540133Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.12/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_weighted_fp32_codegen_cuda.cu(271): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:53:21.1542089Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:53:21.1542622Z ^ 2025-05-07T19:53:21.1543074Z 2025-05-07T19:53:21.1544523Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.12/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_weighted_fp32_codegen_cuda.cu(288): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:53:21.1546572Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:53:21.1547134Z ^ 2025-05-07T19:53:21.1547407Z 2025-05-07T19:53:21.1548983Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.12/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_weighted_fp32_codegen_cuda.cu(300): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:53:21.1550999Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:53:21.1551709Z ^ 2025-05-07T19:53:21.1552031Z 2025-05-07T19:53:21.1553735Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:53:21.1556197Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:53:21.1557352Z ^ 2025-05-07T19:53:21.1557623Z 2025-05-07T19:53:21.1558049Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:53:21.1558669Z 2025-05-07T19:53:21.1560267Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:53:21.1562906Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:53:21.1564143Z ^ 2025-05-07T19:53:21.1564523Z 2025-05-07T19:53:21.1566398Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.12/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_weighted_fp32_codegen_cuda.cu(266): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:53:21.1568583Z if (output_d >= 0 && output_d < D && packed_bag_store_idx < num_packed_bags) { 2025-05-07T19:53:21.1569377Z ^ 2025-05-07T19:53:21.1569649Z 2025-05-07T19:53:21.1571191Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.12/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_weighted_fp32_codegen_cuda.cu(271): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:53:21.1573137Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:53:21.1573642Z ^ 2025-05-07T19:53:21.1573906Z 2025-05-07T19:53:21.1575433Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.12/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_weighted_fp32_codegen_cuda.cu(288): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:53:21.1577341Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:53:21.1577888Z ^ 2025-05-07T19:53:21.1578618Z 2025-05-07T19:53:21.1580140Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.12/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_weighted_fp32_codegen_cuda.cu(300): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:53:21.1582060Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:53:21.1582605Z ^ 2025-05-07T19:53:21.1582889Z 2025-05-07T19:53:21.1584842Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:53:21.1587473Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:53:21.1588688Z ^ 2025-05-07T19:53:21.1588934Z 2025-05-07T19:53:21.1589394Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:53:21.1590063Z 2025-05-07T19:53:21.1591867Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:53:21.1594772Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:53:21.1595969Z ^ 2025-05-07T19:53:21.1596324Z 2025-05-07T19:53:21.1597840Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.12/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_weighted_fp32_codegen_cuda.cu(266): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:53:21.1600027Z if (output_d >= 0 && output_d < D && packed_bag_store_idx < num_packed_bags) { 2025-05-07T19:53:21.1600797Z ^ 2025-05-07T19:53:21.1601147Z 2025-05-07T19:53:21.1602773Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.12/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_weighted_fp32_codegen_cuda.cu(271): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:53:21.1604877Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:53:21.1605421Z ^ 2025-05-07T19:53:21.1605724Z 2025-05-07T19:53:21.1607230Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.12/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_weighted_fp32_codegen_cuda.cu(288): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:53:21.1608994Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:53:21.1609511Z ^ 2025-05-07T19:53:21.1609785Z 2025-05-07T19:53:21.1611222Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.12/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_weighted_fp32_codegen_cuda.cu(300): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:53:21.1613128Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:53:21.1613676Z ^ 2025-05-07T19:53:21.1613937Z 2025-05-07T19:53:22.2258473Z [141/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_inference_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_inference.dir/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_fp32_codegen_cuda.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_inference.dir/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_fp32_codegen_cuda.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.12/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_fp32_codegen_cuda.cu -o CMakeFiles/fbgemm_gpu_tbe_inference.dir/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_fp32_codegen_cuda.cu.o 2025-05-07T19:53:22.2281958Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:53:22.2284808Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:53:22.2286152Z ^ 2025-05-07T19:53:22.2286401Z 2025-05-07T19:53:22.2286879Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:53:22.2287562Z 2025-05-07T19:53:22.2289116Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:53:22.2291728Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:53:22.2292935Z ^ 2025-05-07T19:53:22.2293294Z 2025-05-07T19:53:22.2294856Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.12/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_fp32_codegen_cuda.cu(250): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:53:22.2296998Z if (output_d >= 0 && output_d < D && packed_bag_store_idx < num_packed_bags) { 2025-05-07T19:53:22.2297701Z ^ 2025-05-07T19:53:22.2298011Z 2025-05-07T19:53:22.2299524Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.12/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_fp32_codegen_cuda.cu(255): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:53:22.2301722Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:53:22.2302241Z ^ 2025-05-07T19:53:22.2302508Z 2025-05-07T19:53:22.2303889Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.12/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_fp32_codegen_cuda.cu(272): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:53:22.2306201Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:53:22.2306704Z ^ 2025-05-07T19:53:22.2306947Z 2025-05-07T19:53:22.2308651Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.12/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_fp32_codegen_cuda.cu(284): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:53:22.2310686Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:53:22.2311257Z ^ 2025-05-07T19:53:22.2311682Z 2025-05-07T19:53:22.2313390Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:53:22.2316105Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:53:22.2317308Z ^ 2025-05-07T19:53:22.2317523Z 2025-05-07T19:53:22.2317946Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:53:22.2318647Z 2025-05-07T19:53:22.2320351Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:53:22.2323012Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:53:22.2324257Z ^ 2025-05-07T19:53:22.2324615Z 2025-05-07T19:53:22.2326194Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.12/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_fp32_codegen_cuda.cu(250): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:53:22.2328381Z if (output_d >= 0 && output_d < D && packed_bag_store_idx < num_packed_bags) { 2025-05-07T19:53:22.2329178Z ^ 2025-05-07T19:53:22.2329469Z 2025-05-07T19:53:22.2331024Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.12/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_fp32_codegen_cuda.cu(255): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:53:22.2333026Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:53:22.2333630Z ^ 2025-05-07T19:53:22.2333903Z 2025-05-07T19:53:22.2335374Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.12/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_fp32_codegen_cuda.cu(272): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:53:22.2337306Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:53:22.2337909Z ^ 2025-05-07T19:53:22.2338186Z 2025-05-07T19:53:22.2339520Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.12/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_fp32_codegen_cuda.cu(284): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:53:22.2341484Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:53:22.2342051Z ^ 2025-05-07T19:53:22.2342375Z 2025-05-07T19:53:22.2343974Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:53:22.2346496Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:53:22.2347491Z ^ 2025-05-07T19:53:22.2347758Z 2025-05-07T19:53:22.2348165Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:53:22.2349127Z 2025-05-07T19:53:22.2350703Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:53:22.2353500Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:53:22.2354742Z ^ 2025-05-07T19:53:22.2355100Z 2025-05-07T19:53:22.2356499Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.12/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_fp32_codegen_cuda.cu(250): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:53:22.2358599Z if (output_d >= 0 && output_d < D && packed_bag_store_idx < num_packed_bags) { 2025-05-07T19:53:22.2359340Z ^ 2025-05-07T19:53:22.2359621Z 2025-05-07T19:53:22.2361145Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.12/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_fp32_codegen_cuda.cu(255): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:53:22.2363093Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:53:22.2363637Z ^ 2025-05-07T19:53:22.2363971Z 2025-05-07T19:53:22.2365738Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.12/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_fp32_codegen_cuda.cu(272): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:53:22.2367728Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:53:22.2368266Z ^ 2025-05-07T19:53:22.2368565Z 2025-05-07T19:53:22.2369977Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.12/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_fp32_codegen_cuda.cu(284): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:53:22.2371970Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:53:22.2372480Z ^ 2025-05-07T19:53:22.2372745Z 2025-05-07T19:53:22.2374446Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:53:22.2377039Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:53:22.2378105Z ^ 2025-05-07T19:53:22.2378320Z 2025-05-07T19:53:22.2378714Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:53:22.2379325Z 2025-05-07T19:53:22.2380980Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:53:22.2383643Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:53:22.2384835Z ^ 2025-05-07T19:53:22.2385198Z 2025-05-07T19:53:22.2386736Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.12/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_fp32_codegen_cuda.cu(250): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:53:22.2388772Z if (output_d >= 0 && output_d < D && packed_bag_store_idx < num_packed_bags) { 2025-05-07T19:53:22.2389530Z ^ 2025-05-07T19:53:22.2389776Z 2025-05-07T19:53:22.2391357Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.12/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_fp32_codegen_cuda.cu(255): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:53:22.2393925Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:53:22.2394466Z ^ 2025-05-07T19:53:22.2394767Z 2025-05-07T19:53:22.2396553Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.12/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_fp32_codegen_cuda.cu(272): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:53:22.2398279Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:53:22.2398718Z ^ 2025-05-07T19:53:22.2398951Z 2025-05-07T19:53:22.2400306Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.12/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_fp32_codegen_cuda.cu(284): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:53:22.2402170Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:53:22.2402761Z ^ 2025-05-07T19:53:22.2403044Z 2025-05-07T19:53:22.2404780Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:53:22.2407542Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:53:22.2408754Z ^ 2025-05-07T19:53:22.2409000Z 2025-05-07T19:53:22.2409445Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:53:22.2410137Z 2025-05-07T19:53:22.2411688Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:53:22.2414274Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:53:22.2415473Z ^ 2025-05-07T19:53:22.2415854Z 2025-05-07T19:53:22.2417401Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.12/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_fp32_codegen_cuda.cu(250): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:53:22.2419570Z if (output_d >= 0 && output_d < D && packed_bag_store_idx < num_packed_bags) { 2025-05-07T19:53:22.2420246Z ^ 2025-05-07T19:53:22.2420532Z 2025-05-07T19:53:22.2422042Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.12/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_fp32_codegen_cuda.cu(255): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:53:22.2424057Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:53:22.2424597Z ^ 2025-05-07T19:53:22.2424839Z 2025-05-07T19:53:22.2426224Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.12/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_fp32_codegen_cuda.cu(272): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:53:22.2428135Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:53:22.2428685Z ^ 2025-05-07T19:53:22.2428948Z 2025-05-07T19:53:22.2430359Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.12/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_fp32_codegen_cuda.cu(284): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:53:22.2432505Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:53:22.2433086Z ^ 2025-05-07T19:53:22.2433374Z 2025-05-07T19:53:22.8972875Z [142/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_utils_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -MD -MT CMakeFiles/fbgemm_gpu_tbe_utils.dir/src/split_embeddings_utils/radix_sort_pairs.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_utils.dir/src/split_embeddings_utils/radix_sort_pairs.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/split_embeddings_utils/radix_sort_pairs.cu -o CMakeFiles/fbgemm_gpu_tbe_utils.dir/src/split_embeddings_utils/radix_sort_pairs.cu.o 2025-05-07T19:53:22.8994721Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:53:22.8997432Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:53:22.8998585Z ^ 2025-05-07T19:53:22.8999137Z 2025-05-07T19:53:22.8999596Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:53:22.9000302Z 2025-05-07T19:53:22.9001954Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:53:22.9004743Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:53:22.9005902Z ^ 2025-05-07T19:53:22.9006249Z 2025-05-07T19:53:22.9007963Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:53:22.9010814Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:53:22.9012088Z ^ 2025-05-07T19:53:22.9012355Z 2025-05-07T19:53:22.9012854Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:53:22.9013563Z 2025-05-07T19:53:22.9015660Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:53:22.9018333Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:53:22.9019805Z ^ 2025-05-07T19:53:22.9020190Z 2025-05-07T19:53:22.9021737Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:53:22.9024228Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:53:22.9025364Z ^ 2025-05-07T19:53:22.9025660Z 2025-05-07T19:53:22.9026106Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:53:22.9026770Z 2025-05-07T19:53:22.9028452Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:53:22.9031097Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:53:22.9032425Z ^ 2025-05-07T19:53:22.9032795Z 2025-05-07T19:53:22.9034484Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:53:22.9037203Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:53:22.9038636Z ^ 2025-05-07T19:53:22.9038900Z 2025-05-07T19:53:22.9039361Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:53:22.9040076Z 2025-05-07T19:53:22.9041809Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:53:22.9044615Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:53:22.9045854Z ^ 2025-05-07T19:53:22.9046259Z 2025-05-07T19:53:22.9047934Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:53:22.9050843Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:53:22.9052052Z ^ 2025-05-07T19:53:22.9052334Z 2025-05-07T19:53:22.9052790Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:53:22.9053472Z 2025-05-07T19:53:22.9055007Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:53:22.9057609Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:53:22.9058771Z ^ 2025-05-07T19:53:22.9059134Z 2025-05-07T19:53:23.5254520Z [143/608] : && /github/home/miniconda/envs/build_binary/bin/c++ -fPIC -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -L/github/home/miniconda/envs/build_binary/targets/x86_64-linux/lib -L/github/home/miniconda/envs/build_binary/targets/x86_64-linux/lib/stubs -s -shared -Wl,-soname,fbgemm_gpu_tbe_utils.so -o fbgemm_gpu_tbe_utils.so CMakeFiles/fbgemm_gpu_tbe_utils.dir/src/split_embeddings_utils/split_embeddings_utils_cpu.cpp.o CMakeFiles/fbgemm_gpu_tbe_utils.dir/src/split_embeddings_utils/split_embeddings_utils_meta.cpp.o CMakeFiles/fbgemm_gpu_tbe_utils.dir/src/split_embeddings_utils/split_embeddings_utils.cpp.o CMakeFiles/fbgemm_gpu_tbe_utils.dir/src/split_embeddings_utils/generate_vbe_metadata.cu.o CMakeFiles/fbgemm_gpu_tbe_utils.dir/src/split_embeddings_utils/get_infos_metadata.cu.o CMakeFiles/fbgemm_gpu_tbe_utils.dir/src/split_embeddings_utils/radix_sort_pairs.cu.o CMakeFiles/fbgemm_gpu_tbe_utils.dir/src/split_embeddings_utils/transpose_embedding_input.cu.o -L/lib/intel64 -L/lib/intel64_win -L/lib/win-x64 -Wl,-rpath,/lib/intel64:/lib/intel64_win:/lib/win-x64:/github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/lib:/github/home/miniconda/envs/build_binary/lib/stubs: /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/lib/libtorch.so /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/lib/libc10.so /github/home/miniconda/envs/build_binary/targets/x86_64-linux/lib/libnvrtc.so /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/lib/libc10_cuda.so /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/nvidia/nccl/lib/libnccl.so.2 /github/home/miniconda/envs/build_binary/targets/x86_64-linux/lib/stubs/libcuda.so /github/home/miniconda/envs/build_binary/lib/stubs/libnvidia-ml.so -Wl,--no-as-needed,"/github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/lib/libtorch_cpu.so" -Wl,--as-needed -Wl,--no-as-needed,"/github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/lib/libtorch_cuda.so" -Wl,--as-needed /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/lib/libc10_cuda.so /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/lib/libc10.so /github/home/miniconda/envs/build_binary/targets/x86_64-linux/lib/libcudart.so -Wl,--no-as-needed,"/github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/lib/libtorch.so" -Wl,--as-needed -lcudadevrt -lcudart_static -ldl && : 2025-05-07T19:53:24.1206985Z [144/608] : && /github/home/miniconda/envs/build_binary/bin/c++ -fPIC -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -L/github/home/miniconda/envs/build_binary/targets/x86_64-linux/lib -L/github/home/miniconda/envs/build_binary/targets/x86_64-linux/lib/stubs -s -shared -Wl,-soname,fbgemm_gpu_sparse_async_cumsum.so -o fbgemm_gpu_sparse_async_cumsum.so CMakeFiles/fbgemm_gpu_sparse_async_cumsum.dir/src/sparse_ops/sparse_async_cumsum.cpp.o CMakeFiles/fbgemm_gpu_sparse_async_cumsum.dir/src/sparse_ops/sparse_async_cumsum.cu.o -L/lib/intel64 -L/lib/intel64_win -L/lib/win-x64 -Wl,-rpath,"\$ORIGIN" /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/lib/libc10.so /github/home/miniconda/envs/build_binary/targets/x86_64-linux/lib/libnvrtc.so /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/lib/libc10_cuda.so /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/nvidia/nccl/lib/libnccl.so.2 /github/home/miniconda/envs/build_binary/targets/x86_64-linux/lib/stubs/libcuda.so fbgemm_gpu_tbe_utils.so /github/home/miniconda/envs/build_binary/lib/stubs/libnvidia-ml.so -Wl,--no-as-needed,"/github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/lib/libtorch.so" -Wl,--as-needed /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/lib/libtorch.so -Wl,--no-as-needed,"/github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/lib/libtorch_cpu.so" -Wl,--as-needed -Wl,--no-as-needed,"/github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/lib/libtorch_cuda.so" -Wl,--as-needed /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/lib/libc10_cuda.so /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/lib/libc10.so /github/home/miniconda/envs/build_binary/targets/x86_64-linux/lib/libcudart.so -lcudadevrt -lcudart_static -ldl && : 2025-05-07T19:53:25.3916748Z [145/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_forward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_ssd_weighted_codegen_cuda.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_ssd_weighted_codegen_cuda.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.12/cmake-build/gen_embedding_forward_ssd_weighted_codegen_cuda.cu -o CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_ssd_weighted_codegen_cuda.cu.o 2025-05-07T19:53:25.3940603Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:53:25.3943184Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:53:25.3944330Z ^ 2025-05-07T19:53:25.3944608Z 2025-05-07T19:53:25.3945062Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:53:25.3945790Z 2025-05-07T19:53:25.3947422Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:53:25.3950507Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:53:25.3951852Z ^ 2025-05-07T19:53:25.3952213Z 2025-05-07T19:53:25.3954102Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:53:25.3956605Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:53:25.3957777Z ^ 2025-05-07T19:53:25.3958042Z 2025-05-07T19:53:25.3958521Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:53:25.3959365Z 2025-05-07T19:53:25.3961063Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:53:25.3963819Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:53:25.3965301Z ^ 2025-05-07T19:53:25.3965699Z 2025-05-07T19:53:25.3967287Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:53:25.3969941Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:53:25.3971137Z ^ 2025-05-07T19:53:25.3971423Z 2025-05-07T19:53:25.3971870Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:53:25.3972547Z 2025-05-07T19:53:25.3974271Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:53:25.3977018Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:53:25.3978229Z ^ 2025-05-07T19:53:25.3978600Z 2025-05-07T19:53:25.3980322Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:53:25.3983021Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:53:25.3984255Z ^ 2025-05-07T19:53:25.3984510Z 2025-05-07T19:53:25.3984961Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:53:25.3985683Z 2025-05-07T19:53:25.3987408Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:53:25.3990141Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:53:25.3991340Z ^ 2025-05-07T19:53:25.3991862Z 2025-05-07T19:53:25.3993389Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:53:25.3996016Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:53:25.3997090Z ^ 2025-05-07T19:53:25.3997351Z 2025-05-07T19:53:25.3997972Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:53:25.3998652Z 2025-05-07T19:53:25.4000382Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:53:25.4003035Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:53:25.4004231Z ^ 2025-05-07T19:53:25.4004619Z 2025-05-07T19:53:30.6356851Z [146/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_forward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_split_weighted_gwd_codegen_cuda.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_split_weighted_gwd_codegen_cuda.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.12/cmake-build/gen_embedding_forward_split_weighted_gwd_codegen_cuda.cu -o CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_split_weighted_gwd_codegen_cuda.cu.o 2025-05-07T19:53:30.6381176Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:53:30.6384080Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:53:30.6385874Z ^ 2025-05-07T19:53:30.6386150Z 2025-05-07T19:53:30.6386649Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:53:30.6387334Z 2025-05-07T19:53:30.6389074Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:53:30.6392192Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:53:30.6393422Z ^ 2025-05-07T19:53:30.6393838Z 2025-05-07T19:53:30.6395563Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:53:30.6398358Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:53:30.6399541Z ^ 2025-05-07T19:53:30.6399827Z 2025-05-07T19:53:30.6400290Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:53:30.6400983Z 2025-05-07T19:53:30.6402750Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:53:30.6405523Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:53:30.6406734Z ^ 2025-05-07T19:53:30.6407103Z 2025-05-07T19:53:30.6408813Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:53:30.6411552Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:53:30.6412752Z ^ 2025-05-07T19:53:30.6413020Z 2025-05-07T19:53:30.6413479Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:53:30.6414182Z 2025-05-07T19:53:30.6415909Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:53:30.6418670Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:53:30.6419772Z ^ 2025-05-07T19:53:30.6420109Z 2025-05-07T19:53:30.6421599Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:53:30.6424402Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:53:30.6425718Z ^ 2025-05-07T19:53:30.6425983Z 2025-05-07T19:53:30.6426476Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:53:30.6427172Z 2025-05-07T19:53:30.6428935Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:53:30.6431929Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:53:30.6433426Z ^ 2025-05-07T19:53:30.6433839Z 2025-05-07T19:53:30.6435583Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:53:30.6441232Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:53:30.6442586Z ^ 2025-05-07T19:53:30.6442893Z 2025-05-07T19:53:30.6443365Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:53:30.6444060Z 2025-05-07T19:53:30.6445862Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:53:30.6448707Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:53:30.6449981Z ^ 2025-05-07T19:53:30.6450364Z 2025-05-07T19:53:32.4990383Z [147/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_forward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_ssd_unweighted_codegen_cuda.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_ssd_unweighted_codegen_cuda.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.12/cmake-build/gen_embedding_forward_ssd_unweighted_codegen_cuda.cu -o CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_ssd_unweighted_codegen_cuda.cu.o 2025-05-07T19:53:32.5014214Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:53:32.5017477Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:53:32.5018680Z ^ 2025-05-07T19:53:32.5018935Z 2025-05-07T19:53:32.5019388Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:53:32.5020294Z 2025-05-07T19:53:32.5021926Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:53:32.5024235Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:53:32.5025266Z ^ 2025-05-07T19:53:32.5025607Z 2025-05-07T19:53:32.5027246Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:53:32.5030006Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:53:32.5031254Z ^ 2025-05-07T19:53:32.5031676Z 2025-05-07T19:53:32.5032254Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:53:32.5032786Z 2025-05-07T19:53:32.5034078Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:53:32.5036184Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:53:32.5037181Z ^ 2025-05-07T19:53:32.5037517Z 2025-05-07T19:53:32.5038996Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:53:32.5041749Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:53:32.5042922Z ^ 2025-05-07T19:53:32.5043204Z 2025-05-07T19:53:32.5043649Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:53:32.5044303Z 2025-05-07T19:53:32.5046039Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:53:32.5048728Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:53:32.5049960Z ^ 2025-05-07T19:53:32.5050345Z 2025-05-07T19:53:32.5051927Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:53:32.5054251Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:53:32.5055228Z ^ 2025-05-07T19:53:32.5055441Z 2025-05-07T19:53:32.5055829Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:53:32.5056379Z 2025-05-07T19:53:32.5057879Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:53:32.5060830Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:53:32.5062021Z ^ 2025-05-07T19:53:32.5062397Z 2025-05-07T19:53:32.5064203Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:53:32.5067385Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:53:32.5068731Z ^ 2025-05-07T19:53:32.5069016Z 2025-05-07T19:53:32.5069417Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:53:32.5070073Z 2025-05-07T19:53:32.5071816Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:53:32.5074509Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:53:32.5075683Z ^ 2025-05-07T19:53:32.5076051Z 2025-05-07T19:53:32.9168791Z [148/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_forward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_split_weighted_vbe_codegen_cuda.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_split_weighted_vbe_codegen_cuda.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.12/cmake-build/gen_embedding_forward_split_weighted_vbe_codegen_cuda.cu -o CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_split_weighted_vbe_codegen_cuda.cu.o 2025-05-07T19:53:32.9192728Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:53:32.9195771Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:53:32.9196946Z ^ 2025-05-07T19:53:32.9197208Z 2025-05-07T19:53:32.9197650Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:53:32.9198310Z 2025-05-07T19:53:32.9199998Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:53:32.9202678Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:53:32.9203894Z ^ 2025-05-07T19:53:32.9204252Z 2025-05-07T19:53:32.9205941Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:53:32.9208618Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:53:32.9209806Z ^ 2025-05-07T19:53:32.9210066Z 2025-05-07T19:53:32.9210548Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:53:32.9211213Z 2025-05-07T19:53:32.9212854Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:53:32.9215577Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:53:32.9216747Z ^ 2025-05-07T19:53:32.9217136Z 2025-05-07T19:53:32.9218798Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:53:32.9221439Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:53:32.9222601Z ^ 2025-05-07T19:53:32.9222883Z 2025-05-07T19:53:32.9223298Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:53:32.9223952Z 2025-05-07T19:53:32.9225345Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:53:32.9227759Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:53:32.9228774Z ^ 2025-05-07T19:53:32.9229110Z 2025-05-07T19:53:32.9230768Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:53:32.9232986Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:53:32.9234412Z ^ 2025-05-07T19:53:32.9234653Z 2025-05-07T19:53:32.9235107Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:53:32.9235771Z 2025-05-07T19:53:32.9237343Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:53:32.9239754Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:53:32.9240830Z ^ 2025-05-07T19:53:32.9241159Z 2025-05-07T19:53:32.9242546Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:53:32.9244876Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:53:32.9245902Z ^ 2025-05-07T19:53:32.9246168Z 2025-05-07T19:53:32.9246608Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:53:32.9247257Z 2025-05-07T19:53:32.9248887Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:53:32.9251592Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:53:32.9252778Z ^ 2025-05-07T19:53:32.9253121Z 2025-05-07T19:53:46.5982452Z [149/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_pt2_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_split_adagrad_pt2_autograd.cpp.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_split_adagrad_pt2_autograd.cpp.o.d -o CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_split_adagrad_pt2_autograd.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.12/cmake-build/gen_embedding_split_adagrad_pt2_autograd.cpp 2025-05-07T19:53:46.6003233Z clang-16: warning: argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T19:53:46.9076228Z [150/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_forward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_split_weighted_vbe_gwd_codegen_cuda.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_split_weighted_vbe_gwd_codegen_cuda.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.12/cmake-build/gen_embedding_forward_split_weighted_vbe_gwd_codegen_cuda.cu -o CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_split_weighted_vbe_gwd_codegen_cuda.cu.o 2025-05-07T19:53:46.9098732Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:53:46.9101483Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:53:46.9102538Z ^ 2025-05-07T19:53:46.9102802Z 2025-05-07T19:53:46.9103226Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:53:46.9103836Z 2025-05-07T19:53:46.9105471Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:53:46.9108001Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:53:46.9109105Z ^ 2025-05-07T19:53:46.9109432Z 2025-05-07T19:53:46.9111053Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:53:46.9114341Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:53:46.9115393Z ^ 2025-05-07T19:53:46.9115617Z 2025-05-07T19:53:46.9116206Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:53:46.9116807Z 2025-05-07T19:53:46.9118263Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:53:46.9120707Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:53:46.9121856Z ^ 2025-05-07T19:53:46.9122242Z 2025-05-07T19:53:46.9123881Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:53:46.9126412Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:53:46.9127500Z ^ 2025-05-07T19:53:46.9127736Z 2025-05-07T19:53:46.9128087Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:53:46.9128678Z 2025-05-07T19:53:46.9130173Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:53:46.9132643Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:53:46.9133735Z ^ 2025-05-07T19:53:46.9134065Z 2025-05-07T19:53:46.9135563Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:53:46.9137907Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:53:46.9138837Z ^ 2025-05-07T19:53:46.9139040Z 2025-05-07T19:53:46.9139391Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:53:46.9139922Z 2025-05-07T19:53:46.9141245Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:53:46.9143706Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:53:46.9144917Z ^ 2025-05-07T19:53:46.9145296Z 2025-05-07T19:53:46.9146987Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:53:46.9149733Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:53:46.9150733Z ^ 2025-05-07T19:53:46.9150976Z 2025-05-07T19:53:46.9151393Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:53:46.9152359Z 2025-05-07T19:53:46.9153983Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:53:46.9156940Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:53:46.9158122Z ^ 2025-05-07T19:53:46.9158622Z 2025-05-07T19:53:55.4098208Z [151/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_forward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_ssd_weighted_vbe_codegen_cuda.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_ssd_weighted_vbe_codegen_cuda.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.12/cmake-build/gen_embedding_forward_ssd_weighted_vbe_codegen_cuda.cu -o CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_ssd_weighted_vbe_codegen_cuda.cu.o 2025-05-07T19:53:55.4123512Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:53:55.4126308Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:53:55.4127536Z ^ 2025-05-07T19:53:55.4127793Z 2025-05-07T19:53:55.4128279Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:53:55.4128958Z 2025-05-07T19:53:55.4130620Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:53:55.4133282Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:53:55.4135117Z ^ 2025-05-07T19:53:55.4135486Z 2025-05-07T19:53:55.4137177Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:53:55.4140144Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:53:55.4141397Z ^ 2025-05-07T19:53:55.4141664Z 2025-05-07T19:53:55.4142121Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:53:55.4142794Z 2025-05-07T19:53:55.4144429Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:53:55.4147213Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:53:55.4148464Z ^ 2025-05-07T19:53:55.4148837Z 2025-05-07T19:53:55.4150486Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:53:55.4153198Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:53:55.4154409Z ^ 2025-05-07T19:53:55.4154665Z 2025-05-07T19:53:55.4155143Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:53:55.4155813Z 2025-05-07T19:53:55.4157522Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:53:55.4160315Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:53:55.4161555Z ^ 2025-05-07T19:53:55.4161940Z 2025-05-07T19:53:55.4163621Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:53:55.4166792Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:53:55.4168028Z ^ 2025-05-07T19:53:55.4168324Z 2025-05-07T19:53:55.4168795Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:53:55.4169510Z 2025-05-07T19:53:55.4171444Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:53:55.4174235Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:53:55.4175536Z ^ 2025-05-07T19:53:55.4175925Z 2025-05-07T19:53:55.4177796Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:53:55.4180454Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:53:55.4182122Z ^ 2025-05-07T19:53:55.4182379Z 2025-05-07T19:53:55.4182815Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:53:55.4183406Z 2025-05-07T19:53:55.4185212Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:53:55.4188080Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:53:55.4189343Z ^ 2025-05-07T19:53:55.4189751Z 2025-05-07T19:53:56.9837562Z [152/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_pt2_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_split_sgd_pt2_autograd.cpp.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_split_sgd_pt2_autograd.cpp.o.d -o CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_split_sgd_pt2_autograd.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.12/cmake-build/gen_embedding_split_sgd_pt2_autograd.cpp 2025-05-07T19:53:56.9857545Z clang-16: warning: argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T19:54:00.0422380Z [153/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_pt2_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_split_rowwise_adagrad_pt2_autograd.cpp.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_split_rowwise_adagrad_pt2_autograd.cpp.o.d -o CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_split_rowwise_adagrad_pt2_autograd.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.12/cmake-build/gen_embedding_split_rowwise_adagrad_pt2_autograd.cpp 2025-05-07T19:54:00.0441839Z clang-16: warning: argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T19:54:00.0657539Z [154/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_pt2_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_split_lamb_pt2_autograd.cpp.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_split_lamb_pt2_autograd.cpp.o.d -o CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_split_lamb_pt2_autograd.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.12/cmake-build/gen_embedding_split_lamb_pt2_autograd.cpp 2025-05-07T19:54:00.0676337Z clang-16: warning: argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T19:54:03.9731649Z [155/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_pt2_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_split_adam_pt2_autograd.cpp.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_split_adam_pt2_autograd.cpp.o.d -o CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_split_adam_pt2_autograd.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.12/cmake-build/gen_embedding_split_adam_pt2_autograd.cpp 2025-05-07T19:54:03.9751848Z clang-16: warning: argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T19:54:08.8458145Z [156/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_pt2_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_split_lars_sgd_pt2_autograd.cpp.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_split_lars_sgd_pt2_autograd.cpp.o.d -o CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_split_lars_sgd_pt2_autograd.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.12/cmake-build/gen_embedding_split_lars_sgd_pt2_autograd.cpp 2025-05-07T19:54:08.8477841Z clang-16: warning: argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T19:54:10.5643766Z [157/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_pt2_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_split_partial_rowwise_adam_pt2_autograd.cpp.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_split_partial_rowwise_adam_pt2_autograd.cpp.o.d -o CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_split_partial_rowwise_adam_pt2_autograd.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.12/cmake-build/gen_embedding_split_partial_rowwise_adam_pt2_autograd.cpp 2025-05-07T19:54:10.5662996Z clang-16: warning: argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T19:54:13.2223364Z [158/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_forward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_dense_unweighted_vbe_codegen_cuda.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_dense_unweighted_vbe_codegen_cuda.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.12/cmake-build/gen_embedding_forward_dense_unweighted_vbe_codegen_cuda.cu -o CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_dense_unweighted_vbe_codegen_cuda.cu.o 2025-05-07T19:54:13.2240530Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:54:13.2242554Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:54:13.2243612Z ^ 2025-05-07T19:54:13.2243812Z 2025-05-07T19:54:13.2244143Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:54:13.2244665Z 2025-05-07T19:54:13.2245859Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:54:13.2247840Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:54:13.2248698Z ^ 2025-05-07T19:54:13.2248991Z 2025-05-07T19:54:13.2250181Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:54:13.2252138Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:54:13.2252977Z ^ 2025-05-07T19:54:13.2253178Z 2025-05-07T19:54:13.2253531Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:54:13.2254004Z 2025-05-07T19:54:13.2255257Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:54:13.2257248Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:54:13.2258151Z ^ 2025-05-07T19:54:13.2258431Z 2025-05-07T19:54:13.2259632Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:54:13.2261584Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:54:13.2262464Z ^ 2025-05-07T19:54:13.2262982Z 2025-05-07T19:54:13.2263317Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:54:13.2263824Z 2025-05-07T19:54:13.2265348Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:54:13.2269873Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:54:13.2270930Z ^ 2025-05-07T19:54:13.2271199Z 2025-05-07T19:54:13.2272530Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:54:13.2274459Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:54:13.2275338Z ^ 2025-05-07T19:54:13.2275530Z 2025-05-07T19:54:13.2275889Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:54:13.2276389Z 2025-05-07T19:54:13.2277590Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:54:13.2279516Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:54:13.2280394Z ^ 2025-05-07T19:54:13.2280664Z 2025-05-07T19:54:13.2281871Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:54:13.2283820Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:54:13.2284654Z ^ 2025-05-07T19:54:13.2284876Z 2025-05-07T19:54:13.2285207Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:54:13.2285689Z 2025-05-07T19:54:13.2286908Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:54:13.2288834Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:54:13.2289742Z ^ 2025-05-07T19:54:13.2290023Z 2025-05-07T19:54:14.1861814Z [159/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_pt2_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_split_approx_sgd_pt2_autograd.cpp.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_split_approx_sgd_pt2_autograd.cpp.o.d -o CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_split_approx_sgd_pt2_autograd.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.12/cmake-build/gen_embedding_split_approx_sgd_pt2_autograd.cpp 2025-05-07T19:54:14.1882853Z clang-16: warning: argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T19:54:15.9861142Z [160/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_pt2_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_split_approx_rowwise_adagrad_pt2_autograd.cpp.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_split_approx_rowwise_adagrad_pt2_autograd.cpp.o.d -o CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_split_approx_rowwise_adagrad_pt2_autograd.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.12/cmake-build/gen_embedding_split_approx_rowwise_adagrad_pt2_autograd.cpp 2025-05-07T19:54:15.9882420Z clang-16: warning: argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T19:54:17.2608945Z [161/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_forward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_split_unweighted_gwd_codegen_cuda.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_split_unweighted_gwd_codegen_cuda.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.12/cmake-build/gen_embedding_forward_split_unweighted_gwd_codegen_cuda.cu -o CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_split_unweighted_gwd_codegen_cuda.cu.o 2025-05-07T19:54:17.2632460Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:54:17.2635094Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:54:17.2636237Z ^ 2025-05-07T19:54:17.2636504Z 2025-05-07T19:54:17.2636943Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:54:17.2637614Z 2025-05-07T19:54:17.2639291Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:54:17.2641975Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:54:17.2643135Z ^ 2025-05-07T19:54:17.2643511Z 2025-05-07T19:54:17.2645107Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:54:17.2647887Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:54:17.2649109Z ^ 2025-05-07T19:54:17.2649380Z 2025-05-07T19:54:17.2649855Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:54:17.2650574Z 2025-05-07T19:54:17.2652672Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:54:17.2655531Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:54:17.2656921Z ^ 2025-05-07T19:54:17.2669603Z 2025-05-07T19:54:17.2671550Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:54:17.2674288Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:54:17.2675563Z ^ 2025-05-07T19:54:17.2675820Z 2025-05-07T19:54:17.2676323Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:54:17.2677032Z 2025-05-07T19:54:17.2678877Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:54:17.2681605Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:54:17.2682860Z ^ 2025-05-07T19:54:17.2683271Z 2025-05-07T19:54:17.2684772Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:54:17.2687308Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:54:17.2688508Z ^ 2025-05-07T19:54:17.2688777Z 2025-05-07T19:54:17.2689380Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:54:17.2690054Z 2025-05-07T19:54:17.2691712Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:54:17.2694388Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:54:17.2695511Z ^ 2025-05-07T19:54:17.2695853Z 2025-05-07T19:54:17.2697451Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:54:17.2700398Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:54:17.2701614Z ^ 2025-05-07T19:54:17.2701883Z 2025-05-07T19:54:17.2702322Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:54:17.2702984Z 2025-05-07T19:54:17.2704697Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:54:17.2707278Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:54:17.2708436Z ^ 2025-05-07T19:54:17.2708781Z 2025-05-07T19:54:19.0029493Z [162/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_pt2_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_split_none_pt2_autograd.cpp.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_split_none_pt2_autograd.cpp.o.d -o CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_split_none_pt2_autograd.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.12/cmake-build/gen_embedding_split_none_pt2_autograd.cpp 2025-05-07T19:54:19.0050177Z clang-16: warning: argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T19:54:19.0184954Z [163/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_pt2_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_split_partial_rowwise_lamb_pt2_autograd.cpp.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_split_partial_rowwise_lamb_pt2_autograd.cpp.o.d -o CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_split_partial_rowwise_lamb_pt2_autograd.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.12/cmake-build/gen_embedding_split_partial_rowwise_lamb_pt2_autograd.cpp 2025-05-07T19:54:19.0205217Z clang-16: warning: argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T19:54:20.8653719Z [164/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_pt2_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_split_approx_rowwise_adagrad_with_counter_pt2_autograd.cpp.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_split_approx_rowwise_adagrad_with_counter_pt2_autograd.cpp.o.d -o CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_split_approx_rowwise_adagrad_with_counter_pt2_autograd.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.12/cmake-build/gen_embedding_split_approx_rowwise_adagrad_with_counter_pt2_autograd.cpp 2025-05-07T19:54:20.8675383Z clang-16: warning: argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T19:54:20.8997383Z [165/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_pt2_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_backward_split_adam_pt2_cpu_wrapper.cpp.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_backward_split_adam_pt2_cpu_wrapper.cpp.o.d -o CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_backward_split_adam_pt2_cpu_wrapper.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.12/cmake-build/gen_embedding_backward_split_adam_pt2_cpu_wrapper.cpp 2025-05-07T19:54:20.9018478Z clang-16: warning: argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T19:54:20.9344145Z [166/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_pt2_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_backward_split_lamb_pt2_cpu_wrapper.cpp.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_backward_split_lamb_pt2_cpu_wrapper.cpp.o.d -o CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_backward_split_lamb_pt2_cpu_wrapper.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.12/cmake-build/gen_embedding_backward_split_lamb_pt2_cpu_wrapper.cpp 2025-05-07T19:54:20.9364225Z clang-16: warning: argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T19:54:20.9695246Z [167/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_pt2_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_backward_split_lars_sgd_pt2_cpu_wrapper.cpp.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_backward_split_lars_sgd_pt2_cpu_wrapper.cpp.o.d -o CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_backward_split_lars_sgd_pt2_cpu_wrapper.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.12/cmake-build/gen_embedding_backward_split_lars_sgd_pt2_cpu_wrapper.cpp 2025-05-07T19:54:20.9716506Z clang-16: warning: argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T19:54:21.0044340Z [168/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_pt2_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_backward_split_partial_rowwise_adam_pt2_cpu_wrapper.cpp.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_backward_split_partial_rowwise_adam_pt2_cpu_wrapper.cpp.o.d -o CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_backward_split_partial_rowwise_adam_pt2_cpu_wrapper.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.12/cmake-build/gen_embedding_backward_split_partial_rowwise_adam_pt2_cpu_wrapper.cpp 2025-05-07T19:54:21.0065758Z clang-16: warning: argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T19:54:21.0384810Z [169/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_pt2_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_backward_split_partial_rowwise_lamb_pt2_cpu_wrapper.cpp.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_backward_split_partial_rowwise_lamb_pt2_cpu_wrapper.cpp.o.d -o CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_backward_split_partial_rowwise_lamb_pt2_cpu_wrapper.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.12/cmake-build/gen_embedding_backward_split_partial_rowwise_lamb_pt2_cpu_wrapper.cpp 2025-05-07T19:54:21.0405239Z clang-16: warning: argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T19:54:21.0724401Z [170/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_pt2_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_backward_split_none_pt2_cpu_wrapper.cpp.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_backward_split_none_pt2_cpu_wrapper.cpp.o.d -o CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_backward_split_none_pt2_cpu_wrapper.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.12/cmake-build/gen_embedding_backward_split_none_pt2_cpu_wrapper.cpp 2025-05-07T19:54:21.0747717Z clang-16: warning: argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T19:54:21.1062217Z [171/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_pt2_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_backward_split_rowwise_adagrad_with_counter_pt2_cpu_wrapper.cpp.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_backward_split_rowwise_adagrad_with_counter_pt2_cpu_wrapper.cpp.o.d -o CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_backward_split_rowwise_adagrad_with_counter_pt2_cpu_wrapper.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.12/cmake-build/gen_embedding_backward_split_rowwise_adagrad_with_counter_pt2_cpu_wrapper.cpp 2025-05-07T19:54:21.1083972Z clang-16: warning: argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T19:54:21.1405857Z [172/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_pt2_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_backward_split_approx_sgd_pt2_cpu_wrapper.cpp.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_backward_split_approx_sgd_pt2_cpu_wrapper.cpp.o.d -o CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_backward_split_approx_sgd_pt2_cpu_wrapper.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.12/cmake-build/gen_embedding_backward_split_approx_sgd_pt2_cpu_wrapper.cpp 2025-05-07T19:54:21.1424883Z clang-16: warning: argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T19:54:21.1748845Z [173/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_pt2_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_backward_split_approx_rowwise_adagrad_pt2_cpu_wrapper.cpp.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_backward_split_approx_rowwise_adagrad_pt2_cpu_wrapper.cpp.o.d -o CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_backward_split_approx_rowwise_adagrad_pt2_cpu_wrapper.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.12/cmake-build/gen_embedding_backward_split_approx_rowwise_adagrad_pt2_cpu_wrapper.cpp 2025-05-07T19:54:21.1770693Z clang-16: warning: argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T19:54:21.2088951Z [174/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_pt2_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_backward_split_approx_rowwise_adagrad_with_counter_pt2_cpu_wrapper.cpp.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_backward_split_approx_rowwise_adagrad_with_counter_pt2_cpu_wrapper.cpp.o.d -o CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_backward_split_approx_rowwise_adagrad_with_counter_pt2_cpu_wrapper.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.12/cmake-build/gen_embedding_backward_split_approx_rowwise_adagrad_with_counter_pt2_cpu_wrapper.cpp 2025-05-07T19:54:21.2109633Z clang-16: warning: argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T19:54:21.2420138Z [175/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_pt2_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_backward_split_approx_rowwise_adagrad_with_weight_decay_pt2_cpu_wrapper.cpp.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_backward_split_approx_rowwise_adagrad_with_weight_decay_pt2_cpu_wrapper.cpp.o.d -o CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_backward_split_approx_rowwise_adagrad_with_weight_decay_pt2_cpu_wrapper.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.12/cmake-build/gen_embedding_backward_split_approx_rowwise_adagrad_with_weight_decay_pt2_cpu_wrapper.cpp 2025-05-07T19:54:21.2439837Z clang-16: warning: argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T19:54:21.2757366Z [176/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_pt2_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_backward_split_rowwise_adagrad_with_weight_decay_pt2_cpu_wrapper.cpp.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_backward_split_rowwise_adagrad_with_weight_decay_pt2_cpu_wrapper.cpp.o.d -o CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_backward_split_rowwise_adagrad_with_weight_decay_pt2_cpu_wrapper.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.12/cmake-build/gen_embedding_backward_split_rowwise_adagrad_with_weight_decay_pt2_cpu_wrapper.cpp 2025-05-07T19:54:21.2779291Z clang-16: warning: argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T19:54:21.3094568Z [177/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_pt2_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_backward_split_rowwise_weighted_adagrad_pt2_cpu_wrapper.cpp.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_backward_split_rowwise_weighted_adagrad_pt2_cpu_wrapper.cpp.o.d -o CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_backward_split_rowwise_weighted_adagrad_pt2_cpu_wrapper.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.12/cmake-build/gen_embedding_backward_split_rowwise_weighted_adagrad_pt2_cpu_wrapper.cpp 2025-05-07T19:54:21.3116314Z clang-16: warning: argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T19:54:22.5297567Z [178/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_pt2_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_split_approx_rowwise_adagrad_with_weight_decay_pt2_autograd.cpp.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_split_approx_rowwise_adagrad_with_weight_decay_pt2_autograd.cpp.o.d -o CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_split_approx_rowwise_adagrad_with_weight_decay_pt2_autograd.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.12/cmake-build/gen_embedding_split_approx_rowwise_adagrad_with_weight_decay_pt2_autograd.cpp 2025-05-07T19:54:22.5318358Z clang-16: warning: argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T19:54:25.2929742Z [179/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_pt2_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_split_rowwise_adagrad_with_weight_decay_pt2_autograd.cpp.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_split_rowwise_adagrad_with_weight_decay_pt2_autograd.cpp.o.d -o CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_split_rowwise_adagrad_with_weight_decay_pt2_autograd.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.12/cmake-build/gen_embedding_split_rowwise_adagrad_with_weight_decay_pt2_autograd.cpp 2025-05-07T19:54:25.2951334Z clang-16: warning: argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T19:54:28.1575820Z [180/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_pt2_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_split_rowwise_weighted_adagrad_pt2_autograd.cpp.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_split_rowwise_weighted_adagrad_pt2_autograd.cpp.o.d -o CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_split_rowwise_weighted_adagrad_pt2_autograd.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.12/cmake-build/gen_embedding_split_rowwise_weighted_adagrad_pt2_autograd.cpp 2025-05-07T19:54:28.1597877Z clang-16: warning: argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T19:54:29.4281331Z [181/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_forward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_split_weighted_v2_kernel.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_split_weighted_v2_kernel.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.12/cmake-build/gen_embedding_forward_split_weighted_v2_kernel.cu -o CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_split_weighted_v2_kernel.cu.o 2025-05-07T19:54:29.4305349Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:54:29.4307852Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:54:29.4309369Z ^ 2025-05-07T19:54:29.4309615Z 2025-05-07T19:54:29.4310067Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:54:29.4310694Z 2025-05-07T19:54:29.4312413Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:54:29.4314970Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:54:29.4316128Z ^ 2025-05-07T19:54:29.4316469Z 2025-05-07T19:54:29.4317663Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.12/cmake-build/gen_embedding_forward_split_weighted_v2_kernel.cu(895): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:54:29.4319539Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:54:29.4320389Z ^ 2025-05-07T19:54:29.4323634Z detected during instantiation of "void split_embedding_codegen_forward_weighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const float *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=float, cache_t=float, output_t=float, index_t=int32_t, USE_LXU_CACHE=false]" at line 955 2025-05-07T19:54:29.4326505Z 2025-05-07T19:54:29.4327693Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.12/cmake-build/gen_embedding_forward_split_weighted_v2_kernel.cu(895): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:54:29.4329406Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:54:29.4330155Z ^ 2025-05-07T19:54:29.4333368Z detected during instantiation of "void split_embedding_codegen_forward_weighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const float *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=float, cache_t=c10::Half, output_t=float, index_t=int32_t, USE_LXU_CACHE=false]" at line 1007 2025-05-07T19:54:29.4336756Z 2025-05-07T19:54:29.4338190Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.12/cmake-build/gen_embedding_forward_split_weighted_v2_kernel.cu(895): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:54:29.4340035Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:54:29.4340878Z ^ 2025-05-07T19:54:29.4344266Z detected during instantiation of "void split_embedding_codegen_forward_weighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const float *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=c10::Half, cache_t=float, output_t=float, index_t=int32_t, USE_LXU_CACHE=false]" at line 1059 2025-05-07T19:54:29.4347803Z 2025-05-07T19:54:29.4349084Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.12/cmake-build/gen_embedding_forward_split_weighted_v2_kernel.cu(895): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:54:29.4351061Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:54:29.4352248Z ^ 2025-05-07T19:54:29.4355918Z detected during instantiation of "void split_embedding_codegen_forward_weighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const float *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=c10::Half, cache_t=c10::Half, output_t=float, index_t=int32_t, USE_LXU_CACHE=false]" at line 1111 2025-05-07T19:54:29.4359085Z 2025-05-07T19:54:29.4360329Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.12/cmake-build/gen_embedding_forward_split_weighted_v2_kernel.cu(895): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:54:29.4362229Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:54:29.4363122Z ^ 2025-05-07T19:54:29.4366571Z detected during instantiation of "void split_embedding_codegen_forward_weighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const float *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=float, cache_t=float, output_t=float, index_t=int64_t, USE_LXU_CACHE=false]" at line 1163 2025-05-07T19:54:29.4369658Z 2025-05-07T19:54:29.4370920Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.12/cmake-build/gen_embedding_forward_split_weighted_v2_kernel.cu(895): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:54:29.4372812Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:54:29.4373714Z ^ 2025-05-07T19:54:29.4377098Z detected during instantiation of "void split_embedding_codegen_forward_weighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const float *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=float, cache_t=c10::Half, output_t=float, index_t=int64_t, USE_LXU_CACHE=false]" at line 1215 2025-05-07T19:54:29.4380022Z 2025-05-07T19:54:29.4381172Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.12/cmake-build/gen_embedding_forward_split_weighted_v2_kernel.cu(895): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:54:29.4383000Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:54:29.4383887Z ^ 2025-05-07T19:54:29.4386940Z detected during instantiation of "void split_embedding_codegen_forward_weighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const float *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=c10::Half, cache_t=float, output_t=float, index_t=int64_t, USE_LXU_CACHE=false]" at line 1267 2025-05-07T19:54:29.4389836Z 2025-05-07T19:54:29.4391751Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.12/cmake-build/gen_embedding_forward_split_weighted_v2_kernel.cu(895): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:54:29.4393723Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:54:29.4394613Z ^ 2025-05-07T19:54:29.4398504Z detected during instantiation of "void split_embedding_codegen_forward_weighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const float *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=c10::Half, cache_t=c10::Half, output_t=float, index_t=int64_t, USE_LXU_CACHE=false]" at line 1319 2025-05-07T19:54:29.4402047Z 2025-05-07T19:54:29.4403382Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.12/cmake-build/gen_embedding_forward_split_weighted_v2_kernel.cu(895): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:54:29.4405432Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:54:29.4406380Z ^ 2025-05-07T19:54:29.4410005Z detected during instantiation of "void split_embedding_codegen_forward_weighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const float *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=float, cache_t=float, output_t=c10::Half, index_t=int32_t, USE_LXU_CACHE=false]" at line 1371 2025-05-07T19:54:29.4413250Z 2025-05-07T19:54:29.4414526Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.12/cmake-build/gen_embedding_forward_split_weighted_v2_kernel.cu(895): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:54:29.4416566Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:54:29.4417467Z ^ 2025-05-07T19:54:29.4420962Z detected during instantiation of "void split_embedding_codegen_forward_weighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const float *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=float, cache_t=c10::Half, output_t=c10::Half, index_t=int32_t, USE_LXU_CACHE=false]" at line 1423 2025-05-07T19:54:29.4424217Z 2025-05-07T19:54:29.4425505Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.12/cmake-build/gen_embedding_forward_split_weighted_v2_kernel.cu(895): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:54:29.4427730Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:54:29.4428651Z ^ 2025-05-07T19:54:29.4432362Z detected during instantiation of "void split_embedding_codegen_forward_weighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const float *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=c10::Half, cache_t=float, output_t=c10::Half, index_t=int32_t, USE_LXU_CACHE=false]" at line 1475 2025-05-07T19:54:29.4435894Z 2025-05-07T19:54:29.4437174Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.12/cmake-build/gen_embedding_forward_split_weighted_v2_kernel.cu(895): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:54:29.4439445Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:54:29.4440322Z ^ 2025-05-07T19:54:29.4443952Z detected during instantiation of "void split_embedding_codegen_forward_weighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const float *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=c10::Half, cache_t=c10::Half, output_t=c10::Half, index_t=int32_t, USE_LXU_CACHE=false]" at line 1527 2025-05-07T19:54:29.4447229Z 2025-05-07T19:54:29.4448500Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.12/cmake-build/gen_embedding_forward_split_weighted_v2_kernel.cu(895): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:54:29.4450454Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:54:29.4451337Z ^ 2025-05-07T19:54:29.4454807Z detected during instantiation of "void split_embedding_codegen_forward_weighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const float *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=float, cache_t=float, output_t=c10::Half, index_t=int64_t, USE_LXU_CACHE=false]" at line 1579 2025-05-07T19:54:29.4458003Z 2025-05-07T19:54:29.4459313Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.12/cmake-build/gen_embedding_forward_split_weighted_v2_kernel.cu(895): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:54:29.4461328Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:54:29.4462265Z ^ 2025-05-07T19:54:29.4466116Z detected during instantiation of "void split_embedding_codegen_forward_weighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const float *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=float, cache_t=c10::Half, output_t=c10::Half, index_t=int64_t, USE_LXU_CACHE=false]" at line 1631 2025-05-07T19:54:29.4469533Z 2025-05-07T19:54:29.4470774Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.12/cmake-build/gen_embedding_forward_split_weighted_v2_kernel.cu(895): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:54:29.4472843Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:54:29.4473653Z ^ 2025-05-07T19:54:29.4476944Z detected during instantiation of "void split_embedding_codegen_forward_weighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const float *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=c10::Half, cache_t=float, output_t=c10::Half, index_t=int64_t, USE_LXU_CACHE=false]" at line 1683 2025-05-07T19:54:29.4480168Z 2025-05-07T19:54:29.4481388Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.12/cmake-build/gen_embedding_forward_split_weighted_v2_kernel.cu(895): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:54:29.4483686Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:54:29.4484518Z ^ 2025-05-07T19:54:29.4488007Z detected during instantiation of "void split_embedding_codegen_forward_weighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const float *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=c10::Half, cache_t=c10::Half, output_t=c10::Half, index_t=int64_t, USE_LXU_CACHE=false]" at line 1735 2025-05-07T19:54:29.4490773Z 2025-05-07T19:54:29.4492006Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.12/cmake-build/gen_embedding_forward_split_weighted_v2_kernel.cu(895): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:54:29.4493902Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:54:29.4494790Z ^ 2025-05-07T19:54:29.4498255Z detected during instantiation of "void split_embedding_codegen_forward_weighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const float *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=float, cache_t=float, output_t=c10::BFloat16, index_t=int32_t, USE_LXU_CACHE=false]" at line 1787 2025-05-07T19:54:29.4501544Z 2025-05-07T19:54:29.4502865Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.12/cmake-build/gen_embedding_forward_split_weighted_v2_kernel.cu(895): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:54:29.4504927Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:54:29.4505795Z ^ 2025-05-07T19:54:29.4509413Z detected during instantiation of "void split_embedding_codegen_forward_weighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const float *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=float, cache_t=c10::Half, output_t=c10::BFloat16, index_t=int32_t, USE_LXU_CACHE=false]" at line 1839 2025-05-07T19:54:29.4512915Z 2025-05-07T19:54:29.4514148Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.12/cmake-build/gen_embedding_forward_split_weighted_v2_kernel.cu(895): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:54:29.4516060Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:54:29.4516962Z ^ 2025-05-07T19:54:29.4520352Z detected during instantiation of "void split_embedding_codegen_forward_weighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const float *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=c10::Half, cache_t=float, output_t=c10::BFloat16, index_t=int32_t, USE_LXU_CACHE=false]" at line 1891 2025-05-07T19:54:29.4523257Z 2025-05-07T19:54:29.4524505Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.12/cmake-build/gen_embedding_forward_split_weighted_v2_kernel.cu(895): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:54:29.4526593Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:54:29.4527785Z ^ 2025-05-07T19:54:29.4531363Z detected during instantiation of "void split_embedding_codegen_forward_weighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const float *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=c10::Half, cache_t=c10::Half, output_t=c10::BFloat16, index_t=int32_t, USE_LXU_CACHE=false]" at line 1943 2025-05-07T19:54:29.4534579Z 2025-05-07T19:54:29.4535971Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.12/cmake-build/gen_embedding_forward_split_weighted_v2_kernel.cu(895): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:54:29.4537983Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:54:29.4538861Z ^ 2025-05-07T19:54:29.4542214Z detected during instantiation of "void split_embedding_codegen_forward_weighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const float *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=float, cache_t=float, output_t=c10::BFloat16, index_t=int64_t, USE_LXU_CACHE=false]" at line 1995 2025-05-07T19:54:29.4545199Z 2025-05-07T19:54:29.4546437Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.12/cmake-build/gen_embedding_forward_split_weighted_v2_kernel.cu(895): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:54:29.4548177Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:54:29.4549067Z ^ 2025-05-07T19:54:29.4552449Z detected during instantiation of "void split_embedding_codegen_forward_weighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const float *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=float, cache_t=c10::Half, output_t=c10::BFloat16, index_t=int64_t, USE_LXU_CACHE=false]" at line 2047 2025-05-07T19:54:29.4555654Z 2025-05-07T19:54:29.4556941Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.12/cmake-build/gen_embedding_forward_split_weighted_v2_kernel.cu(895): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:54:29.4558906Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:54:29.4559828Z ^ 2025-05-07T19:54:29.4563299Z detected during instantiation of "void split_embedding_codegen_forward_weighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const float *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=c10::Half, cache_t=float, output_t=c10::BFloat16, index_t=int64_t, USE_LXU_CACHE=false]" at line 2099 2025-05-07T19:54:29.4566769Z 2025-05-07T19:54:29.4568050Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.12/cmake-build/gen_embedding_forward_split_weighted_v2_kernel.cu(895): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:54:29.4570064Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:54:29.4571425Z ^ 2025-05-07T19:54:29.4575113Z detected during instantiation of "void split_embedding_codegen_forward_weighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const float *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=c10::Half, cache_t=c10::Half, output_t=c10::BFloat16, index_t=int64_t, USE_LXU_CACHE=false]" at line 2151 2025-05-07T19:54:29.4578287Z 2025-05-07T19:54:29.4579937Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:54:29.4582387Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:54:29.4583513Z ^ 2025-05-07T19:54:29.4583798Z 2025-05-07T19:54:29.4584264Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:54:29.4584961Z 2025-05-07T19:54:29.4586696Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:54:29.4589381Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:54:29.4590649Z ^ 2025-05-07T19:54:29.4590994Z 2025-05-07T19:54:29.4592267Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.12/cmake-build/gen_embedding_forward_split_weighted_v2_kernel.cu(895): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:54:29.4594019Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:54:29.4594867Z ^ 2025-05-07T19:54:29.4598143Z detected during instantiation of "void split_embedding_codegen_forward_weighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const float *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=float, cache_t=float, output_t=float, index_t=int32_t, USE_LXU_CACHE=false]" at line 955 2025-05-07T19:54:29.4601285Z 2025-05-07T19:54:29.4602567Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.12/cmake-build/gen_embedding_forward_split_weighted_v2_kernel.cu(895): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:54:29.4604827Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:54:29.4605798Z ^ 2025-05-07T19:54:29.4609475Z detected during instantiation of "void split_embedding_codegen_forward_weighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const float *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=float, cache_t=c10::Half, output_t=float, index_t=int32_t, USE_LXU_CACHE=false]" at line 1007 2025-05-07T19:54:29.4612364Z 2025-05-07T19:54:29.4613457Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.12/cmake-build/gen_embedding_forward_split_weighted_v2_kernel.cu(895): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:54:29.4615407Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:54:29.4620165Z ^ 2025-05-07T19:54:29.4623348Z detected during instantiation of "void split_embedding_codegen_forward_weighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const float *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=c10::Half, cache_t=float, output_t=float, index_t=int32_t, USE_LXU_CACHE=false]" at line 1059 2025-05-07T19:54:29.4626088Z 2025-05-07T19:54:29.4627291Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.12/cmake-build/gen_embedding_forward_split_weighted_v2_kernel.cu(895): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:54:29.4628962Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:54:29.4629750Z ^ 2025-05-07T19:54:29.4633340Z detected during instantiation of "void split_embedding_codegen_forward_weighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const float *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=c10::Half, cache_t=c10::Half, output_t=float, index_t=int32_t, USE_LXU_CACHE=false]" at line 1111 2025-05-07T19:54:29.4636574Z 2025-05-07T19:54:29.4637846Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.12/cmake-build/gen_embedding_forward_split_weighted_v2_kernel.cu(895): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:54:29.4639841Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:54:29.4640732Z ^ 2025-05-07T19:54:29.4644189Z detected during instantiation of "void split_embedding_codegen_forward_weighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const float *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=float, cache_t=float, output_t=float, index_t=int64_t, USE_LXU_CACHE=false]" at line 1163 2025-05-07T19:54:29.4647414Z 2025-05-07T19:54:29.4648665Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.12/cmake-build/gen_embedding_forward_split_weighted_v2_kernel.cu(895): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:54:29.4650514Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:54:29.4651360Z ^ 2025-05-07T19:54:29.4654758Z detected during instantiation of "void split_embedding_codegen_forward_weighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const float *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=float, cache_t=c10::Half, output_t=float, index_t=int64_t, USE_LXU_CACHE=false]" at line 1215 2025-05-07T19:54:29.4657968Z 2025-05-07T19:54:29.4659023Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.12/cmake-build/gen_embedding_forward_split_weighted_v2_kernel.cu(895): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:54:29.4660904Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:54:29.4662105Z ^ 2025-05-07T19:54:29.4666069Z detected during instantiation of "void split_embedding_codegen_forward_weighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const float *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=c10::Half, cache_t=float, output_t=float, index_t=int64_t, USE_LXU_CACHE=false]" at line 1267 2025-05-07T19:54:29.4669076Z 2025-05-07T19:54:29.4670219Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.12/cmake-build/gen_embedding_forward_split_weighted_v2_kernel.cu(895): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:54:29.4672339Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:54:29.4673201Z ^ 2025-05-07T19:54:29.4676420Z detected during instantiation of "void split_embedding_codegen_forward_weighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const float *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=c10::Half, cache_t=c10::Half, output_t=float, index_t=int64_t, USE_LXU_CACHE=false]" at line 1319 2025-05-07T19:54:29.4679648Z 2025-05-07T19:54:29.4680992Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.12/cmake-build/gen_embedding_forward_split_weighted_v2_kernel.cu(895): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:54:29.4682655Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:54:29.4683546Z ^ 2025-05-07T19:54:29.4686814Z detected during instantiation of "void split_embedding_codegen_forward_weighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const float *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=float, cache_t=float, output_t=c10::Half, index_t=int32_t, USE_LXU_CACHE=false]" at line 1371 2025-05-07T19:54:29.4689899Z 2025-05-07T19:54:29.4691131Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.12/cmake-build/gen_embedding_forward_split_weighted_v2_kernel.cu(895): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:54:29.4692838Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:54:29.4693627Z ^ 2025-05-07T19:54:29.4696983Z detected during instantiation of "void split_embedding_codegen_forward_weighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const float *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=float, cache_t=c10::Half, output_t=c10::Half, index_t=int32_t, USE_LXU_CACHE=false]" at line 1423 2025-05-07T19:54:29.4700146Z 2025-05-07T19:54:29.4701381Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.12/cmake-build/gen_embedding_forward_split_weighted_v2_kernel.cu(895): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:54:29.4703305Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:54:29.4704192Z ^ 2025-05-07T19:54:29.4708077Z detected during instantiation of "void split_embedding_codegen_forward_weighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const float *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=c10::Half, cache_t=float, output_t=c10::Half, index_t=int32_t, USE_LXU_CACHE=false]" at line 1475 2025-05-07T19:54:29.4711207Z 2025-05-07T19:54:29.4712581Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.12/cmake-build/gen_embedding_forward_split_weighted_v2_kernel.cu(895): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:54:29.4714492Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:54:29.4715343Z ^ 2025-05-07T19:54:29.4718475Z detected during instantiation of "void split_embedding_codegen_forward_weighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const float *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=c10::Half, cache_t=c10::Half, output_t=c10::Half, index_t=int32_t, USE_LXU_CACHE=false]" at line 1527 2025-05-07T19:54:29.4721350Z 2025-05-07T19:54:29.4722481Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.12/cmake-build/gen_embedding_forward_split_weighted_v2_kernel.cu(895): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:54:29.4724261Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:54:29.4725072Z ^ 2025-05-07T19:54:29.4728319Z detected during instantiation of "void split_embedding_codegen_forward_weighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const float *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=float, cache_t=float, output_t=c10::Half, index_t=int64_t, USE_LXU_CACHE=false]" at line 1579 2025-05-07T19:54:29.4731305Z 2025-05-07T19:54:29.4732467Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.12/cmake-build/gen_embedding_forward_split_weighted_v2_kernel.cu(895): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:54:29.4734235Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:54:29.4735031Z ^ 2025-05-07T19:54:29.4738334Z detected during instantiation of "void split_embedding_codegen_forward_weighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const float *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=float, cache_t=c10::Half, output_t=c10::Half, index_t=int64_t, USE_LXU_CACHE=false]" at line 1631 2025-05-07T19:54:29.4741303Z 2025-05-07T19:54:29.4742580Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.12/cmake-build/gen_embedding_forward_split_weighted_v2_kernel.cu(895): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:54:29.4744344Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:54:29.4745099Z ^ 2025-05-07T19:54:29.4748910Z detected during instantiation of "void split_embedding_codegen_forward_weighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const float *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=c10::Half, cache_t=float, output_t=c10::Half, index_t=int64_t, USE_LXU_CACHE=false]" at line 1683 2025-05-07T19:54:29.4752049Z 2025-05-07T19:54:29.4753279Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.12/cmake-build/gen_embedding_forward_split_weighted_v2_kernel.cu(895): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:54:29.4755097Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:54:29.4755970Z ^ 2025-05-07T19:54:29.4759761Z detected during instantiation of "void split_embedding_codegen_forward_weighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const float *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=c10::Half, cache_t=c10::Half, output_t=c10::Half, index_t=int64_t, USE_LXU_CACHE=false]" at line 1735 2025-05-07T19:54:29.4763121Z 2025-05-07T19:54:29.4764349Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.12/cmake-build/gen_embedding_forward_split_weighted_v2_kernel.cu(895): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:54:29.4766492Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:54:29.4767422Z ^ 2025-05-07T19:54:29.4771332Z detected during instantiation of "void split_embedding_codegen_forward_weighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const float *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=float, cache_t=float, output_t=c10::BFloat16, index_t=int32_t, USE_LXU_CACHE=false]" at line 1787 2025-05-07T19:54:29.4774782Z 2025-05-07T19:54:29.4775945Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.12/cmake-build/gen_embedding_forward_split_weighted_v2_kernel.cu(895): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:54:29.4777901Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:54:29.4778792Z ^ 2025-05-07T19:54:29.4782281Z detected during instantiation of "void split_embedding_codegen_forward_weighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const float *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=float, cache_t=c10::Half, output_t=c10::BFloat16, index_t=int32_t, USE_LXU_CACHE=false]" at line 1839 2025-05-07T19:54:29.4785573Z 2025-05-07T19:54:29.4786838Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.12/cmake-build/gen_embedding_forward_split_weighted_v2_kernel.cu(895): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:54:29.4788836Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:54:29.4789732Z ^ 2025-05-07T19:54:29.4793362Z detected during instantiation of "void split_embedding_codegen_forward_weighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const float *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=c10::Half, cache_t=float, output_t=c10::BFloat16, index_t=int32_t, USE_LXU_CACHE=false]" at line 1891 2025-05-07T19:54:29.4797073Z 2025-05-07T19:54:29.4798529Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.12/cmake-build/gen_embedding_forward_split_weighted_v2_kernel.cu(895): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:54:29.4800445Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:54:29.4801342Z ^ 2025-05-07T19:54:29.4804835Z detected during instantiation of "void split_embedding_codegen_forward_weighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const float *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=c10::Half, cache_t=c10::Half, output_t=c10::BFloat16, index_t=int32_t, USE_LXU_CACHE=false]" at line 1943 2025-05-07T19:54:29.4808085Z 2025-05-07T19:54:29.4809359Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.12/cmake-build/gen_embedding_forward_split_weighted_v2_kernel.cu(895): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:54:29.4811341Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:54:29.4812210Z ^ 2025-05-07T19:54:29.4815679Z detected during instantiation of "void split_embedding_codegen_forward_weighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const float *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=float, cache_t=float, output_t=c10::BFloat16, index_t=int64_t, USE_LXU_CACHE=false]" at line 1995 2025-05-07T19:54:29.4818766Z 2025-05-07T19:54:29.4819834Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.12/cmake-build/gen_embedding_forward_split_weighted_v2_kernel.cu(895): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:54:29.4821730Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:54:29.4822558Z ^ 2025-05-07T19:54:29.4825971Z detected during instantiation of "void split_embedding_codegen_forward_weighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const float *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=float, cache_t=c10::Half, output_t=c10::BFloat16, index_t=int64_t, USE_LXU_CACHE=false]" at line 2047 2025-05-07T19:54:29.4829362Z 2025-05-07T19:54:29.4830563Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.12/cmake-build/gen_embedding_forward_split_weighted_v2_kernel.cu(895): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:54:29.4832759Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:54:29.4833695Z ^ 2025-05-07T19:54:29.4836812Z detected during instantiation of "void split_embedding_codegen_forward_weighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const float *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=c10::Half, cache_t=float, output_t=c10::BFloat16, index_t=int64_t, USE_LXU_CACHE=false]" at line 2099 2025-05-07T19:54:29.4840557Z 2025-05-07T19:54:29.4841930Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.12/cmake-build/gen_embedding_forward_split_weighted_v2_kernel.cu(895): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:54:29.4844014Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:54:29.4844936Z ^ 2025-05-07T19:54:29.4848366Z detected during instantiation of "void split_embedding_codegen_forward_weighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const float *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=c10::Half, cache_t=c10::Half, output_t=c10::BFloat16, index_t=int64_t, USE_LXU_CACHE=false]" at line 2151 2025-05-07T19:54:29.4851554Z 2025-05-07T19:54:29.4853165Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:54:29.4855701Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:54:29.4856913Z ^ 2025-05-07T19:54:29.4857170Z 2025-05-07T19:54:29.4857640Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:54:29.4858290Z 2025-05-07T19:54:29.4859870Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:54:29.4862387Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:54:29.4863529Z ^ 2025-05-07T19:54:29.4863869Z 2025-05-07T19:54:29.4865274Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.12/cmake-build/gen_embedding_forward_split_weighted_v2_kernel.cu(895): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:54:29.4867068Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:54:29.4867971Z ^ 2025-05-07T19:54:29.4871623Z detected during instantiation of "void split_embedding_codegen_forward_weighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const float *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=float, cache_t=float, output_t=float, index_t=int32_t, USE_LXU_CACHE=false]" at line 955 2025-05-07T19:54:29.4874899Z 2025-05-07T19:54:29.4876201Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.12/cmake-build/gen_embedding_forward_split_weighted_v2_kernel.cu(895): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:54:29.4878226Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:54:29.4879120Z ^ 2025-05-07T19:54:29.4883033Z detected during instantiation of "void split_embedding_codegen_forward_weighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const float *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=float, cache_t=c10::Half, output_t=float, index_t=int32_t, USE_LXU_CACHE=false]" at line 1007 2025-05-07T19:54:29.4886519Z 2025-05-07T19:54:29.4887788Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.12/cmake-build/gen_embedding_forward_split_weighted_v2_kernel.cu(895): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:54:29.4889497Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:54:29.4890273Z ^ 2025-05-07T19:54:29.4893464Z detected during instantiation of "void split_embedding_codegen_forward_weighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const float *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=c10::Half, cache_t=float, output_t=float, index_t=int32_t, USE_LXU_CACHE=false]" at line 1059 2025-05-07T19:54:29.4896533Z 2025-05-07T19:54:29.4897734Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.12/cmake-build/gen_embedding_forward_split_weighted_v2_kernel.cu(895): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:54:29.4899590Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:54:29.4900456Z ^ 2025-05-07T19:54:29.4903813Z detected during instantiation of "void split_embedding_codegen_forward_weighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const float *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=c10::Half, cache_t=c10::Half, output_t=float, index_t=int32_t, USE_LXU_CACHE=false]" at line 1111 2025-05-07T19:54:29.4906738Z 2025-05-07T19:54:29.4907937Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.12/cmake-build/gen_embedding_forward_split_weighted_v2_kernel.cu(895): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:54:29.4909772Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:54:29.4910633Z ^ 2025-05-07T19:54:29.4913972Z detected during instantiation of "void split_embedding_codegen_forward_weighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const float *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=float, cache_t=float, output_t=float, index_t=int64_t, USE_LXU_CACHE=false]" at line 1163 2025-05-07T19:54:29.4917058Z 2025-05-07T19:54:29.4918271Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.12/cmake-build/gen_embedding_forward_split_weighted_v2_kernel.cu(895): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:54:29.4920110Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:54:29.4920977Z ^ 2025-05-07T19:54:29.4924237Z detected during instantiation of "void split_embedding_codegen_forward_weighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const float *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=float, cache_t=c10::Half, output_t=float, index_t=int64_t, USE_LXU_CACHE=false]" at line 1215 2025-05-07T19:54:29.4927633Z 2025-05-07T19:54:29.4929004Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.12/cmake-build/gen_embedding_forward_split_weighted_v2_kernel.cu(895): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:54:29.4930961Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:54:29.4931847Z ^ 2025-05-07T19:54:29.4935195Z detected during instantiation of "void split_embedding_codegen_forward_weighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const float *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=c10::Half, cache_t=float, output_t=float, index_t=int64_t, USE_LXU_CACHE=false]" at line 1267 2025-05-07T19:54:29.4938339Z 2025-05-07T19:54:29.4939604Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.12/cmake-build/gen_embedding_forward_split_weighted_v2_kernel.cu(895): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:54:29.4941351Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:54:29.4942227Z ^ 2025-05-07T19:54:29.4945175Z detected during instantiation of "void split_embedding_codegen_forward_weighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const float *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=c10::Half, cache_t=c10::Half, output_t=float, index_t=int64_t, USE_LXU_CACHE=false]" at line 1319 2025-05-07T19:54:29.4948273Z 2025-05-07T19:54:29.4949587Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.12/cmake-build/gen_embedding_forward_split_weighted_v2_kernel.cu(895): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:54:29.4951760Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:54:29.4952673Z ^ 2025-05-07T19:54:29.4955736Z detected during instantiation of "void split_embedding_codegen_forward_weighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const float *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=float, cache_t=float, output_t=c10::Half, index_t=int32_t, USE_LXU_CACHE=false]" at line 1371 2025-05-07T19:54:29.4958810Z 2025-05-07T19:54:29.4959999Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.12/cmake-build/gen_embedding_forward_split_weighted_v2_kernel.cu(895): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:54:29.4961817Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:54:29.4962720Z ^ 2025-05-07T19:54:29.4966175Z detected during instantiation of "void split_embedding_codegen_forward_weighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const float *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=float, cache_t=c10::Half, output_t=c10::Half, index_t=int32_t, USE_LXU_CACHE=false]" at line 1423 2025-05-07T19:54:29.4969700Z 2025-05-07T19:54:29.4970975Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.12/cmake-build/gen_embedding_forward_split_weighted_v2_kernel.cu(895): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:54:29.4972754Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:54:29.4973582Z ^ 2025-05-07T19:54:29.4976865Z detected during instantiation of "void split_embedding_codegen_forward_weighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const float *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=c10::Half, cache_t=float, output_t=c10::Half, index_t=int32_t, USE_LXU_CACHE=false]" at line 1475 2025-05-07T19:54:29.4980064Z 2025-05-07T19:54:29.4981262Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.12/cmake-build/gen_embedding_forward_split_weighted_v2_kernel.cu(895): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:54:29.4983191Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:54:29.4984127Z ^ 2025-05-07T19:54:29.4987815Z detected during instantiation of "void split_embedding_codegen_forward_weighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const float *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=c10::Half, cache_t=c10::Half, output_t=c10::Half, index_t=int32_t, USE_LXU_CACHE=false]" at line 1527 2025-05-07T19:54:29.4991234Z 2025-05-07T19:54:29.4992722Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.12/cmake-build/gen_embedding_forward_split_weighted_v2_kernel.cu(895): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:54:29.4994638Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:54:29.4995473Z ^ 2025-05-07T19:54:29.4998866Z detected during instantiation of "void split_embedding_codegen_forward_weighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const float *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=float, cache_t=float, output_t=c10::Half, index_t=int64_t, USE_LXU_CACHE=false]" at line 1579 2025-05-07T19:54:29.5002113Z 2025-05-07T19:54:29.5003454Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.12/cmake-build/gen_embedding_forward_split_weighted_v2_kernel.cu(895): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:54:29.5005478Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:54:29.5006340Z ^ 2025-05-07T19:54:29.5009869Z detected during instantiation of "void split_embedding_codegen_forward_weighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const float *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=float, cache_t=c10::Half, output_t=c10::Half, index_t=int64_t, USE_LXU_CACHE=false]" at line 1631 2025-05-07T19:54:29.5013309Z 2025-05-07T19:54:29.5014743Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.12/cmake-build/gen_embedding_forward_split_weighted_v2_kernel.cu(895): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:54:29.5016672Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:54:29.5017592Z ^ 2025-05-07T19:54:29.5021039Z detected during instantiation of "void split_embedding_codegen_forward_weighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const float *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=c10::Half, cache_t=float, output_t=c10::Half, index_t=int64_t, USE_LXU_CACHE=false]" at line 1683 2025-05-07T19:54:29.5024254Z 2025-05-07T19:54:29.5025527Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.12/cmake-build/gen_embedding_forward_split_weighted_v2_kernel.cu(895): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:54:29.5027470Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:54:29.5028365Z ^ 2025-05-07T19:54:29.5031864Z detected during instantiation of "void split_embedding_codegen_forward_weighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const float *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=c10::Half, cache_t=c10::Half, output_t=c10::Half, index_t=int64_t, USE_LXU_CACHE=false]" at line 1735 2025-05-07T19:54:29.5035067Z 2025-05-07T19:54:29.5036348Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.12/cmake-build/gen_embedding_forward_split_weighted_v2_kernel.cu(895): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:54:29.5038264Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:54:29.5039170Z ^ 2025-05-07T19:54:29.5042618Z detected during instantiation of "void split_embedding_codegen_forward_weighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const float *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=float, cache_t=float, output_t=c10::BFloat16, index_t=int32_t, USE_LXU_CACHE=false]" at line 1787 2025-05-07T19:54:29.5045875Z 2025-05-07T19:54:29.5047134Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.12/cmake-build/gen_embedding_forward_split_weighted_v2_kernel.cu(895): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:54:29.5049104Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:54:29.5049983Z ^ 2025-05-07T19:54:29.5053441Z detected during instantiation of "void split_embedding_codegen_forward_weighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const float *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=float, cache_t=c10::Half, output_t=c10::BFloat16, index_t=int32_t, USE_LXU_CACHE=false]" at line 1839 2025-05-07T19:54:29.5056955Z 2025-05-07T19:54:29.5058182Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.12/cmake-build/gen_embedding_forward_split_weighted_v2_kernel.cu(895): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:54:29.5060120Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:54:29.5060935Z ^ 2025-05-07T19:54:29.5064280Z detected during instantiation of "void split_embedding_codegen_forward_weighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const float *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=c10::Half, cache_t=float, output_t=c10::BFloat16, index_t=int32_t, USE_LXU_CACHE=false]" at line 1891 2025-05-07T19:54:29.5067869Z 2025-05-07T19:54:29.5069176Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.12/cmake-build/gen_embedding_forward_split_weighted_v2_kernel.cu(895): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:54:29.5071043Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:54:29.5071798Z ^ 2025-05-07T19:54:29.5075058Z detected during instantiation of "void split_embedding_codegen_forward_weighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const float *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=c10::Half, cache_t=c10::Half, output_t=c10::BFloat16, index_t=int32_t, USE_LXU_CACHE=false]" at line 1943 2025-05-07T19:54:29.5078052Z 2025-05-07T19:54:29.5079344Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.12/cmake-build/gen_embedding_forward_split_weighted_v2_kernel.cu(895): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:54:29.5081233Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:54:29.5082125Z ^ 2025-05-07T19:54:29.5085478Z detected during instantiation of "void split_embedding_codegen_forward_weighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const float *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=float, cache_t=float, output_t=c10::BFloat16, index_t=int64_t, USE_LXU_CACHE=false]" at line 1995 2025-05-07T19:54:29.5088432Z 2025-05-07T19:54:29.5089566Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.12/cmake-build/gen_embedding_forward_split_weighted_v2_kernel.cu(895): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:54:29.5091241Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:54:29.5092034Z ^ 2025-05-07T19:54:29.5095028Z detected during instantiation of "void split_embedding_codegen_forward_weighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const float *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=float, cache_t=c10::Half, output_t=c10::BFloat16, index_t=int64_t, USE_LXU_CACHE=false]" at line 2047 2025-05-07T19:54:29.5098252Z 2025-05-07T19:54:29.5099312Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.12/cmake-build/gen_embedding_forward_split_weighted_v2_kernel.cu(895): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:54:29.5101107Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:54:29.5101954Z ^ 2025-05-07T19:54:29.5105183Z detected during instantiation of "void split_embedding_codegen_forward_weighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const float *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=c10::Half, cache_t=float, output_t=c10::BFloat16, index_t=int64_t, USE_LXU_CACHE=false]" at line 2099 2025-05-07T19:54:29.5108242Z 2025-05-07T19:54:29.5109553Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.12/cmake-build/gen_embedding_forward_split_weighted_v2_kernel.cu(895): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:54:29.5111685Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:54:29.5112572Z ^ 2025-05-07T19:54:29.5116070Z detected during instantiation of "void split_embedding_codegen_forward_weighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const float *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=c10::Half, cache_t=c10::Half, output_t=c10::BFloat16, index_t=int64_t, USE_LXU_CACHE=false]" at line 2151 2025-05-07T19:54:29.5119295Z 2025-05-07T19:54:29.5120912Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:54:29.5123515Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:54:29.5124708Z ^ 2025-05-07T19:54:29.5124971Z 2025-05-07T19:54:29.5125438Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:54:29.5126157Z 2025-05-07T19:54:29.5127872Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:54:29.5130447Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:54:29.5131514Z ^ 2025-05-07T19:54:29.5131878Z 2025-05-07T19:54:29.5133020Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.12/cmake-build/gen_embedding_forward_split_weighted_v2_kernel.cu(895): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:54:29.5134960Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:54:29.5135797Z ^ 2025-05-07T19:54:29.5138839Z detected during instantiation of "void split_embedding_codegen_forward_weighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const float *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=float, cache_t=float, output_t=float, index_t=int32_t, USE_LXU_CACHE=false]" at line 955 2025-05-07T19:54:29.5144590Z 2025-05-07T19:54:29.5145941Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.12/cmake-build/gen_embedding_forward_split_weighted_v2_kernel.cu(895): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:54:29.5147699Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:54:29.5148479Z ^ 2025-05-07T19:54:29.5151685Z detected during instantiation of "void split_embedding_codegen_forward_weighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const float *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=float, cache_t=c10::Half, output_t=float, index_t=int32_t, USE_LXU_CACHE=false]" at line 1007 2025-05-07T19:54:29.5154715Z 2025-05-07T19:54:29.5155955Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.12/cmake-build/gen_embedding_forward_split_weighted_v2_kernel.cu(895): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:54:29.5157873Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:54:29.5158761Z ^ 2025-05-07T19:54:29.5162025Z detected during instantiation of "void split_embedding_codegen_forward_weighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const float *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=c10::Half, cache_t=float, output_t=float, index_t=int32_t, USE_LXU_CACHE=false]" at line 1059 2025-05-07T19:54:29.5165253Z 2025-05-07T19:54:29.5166393Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.12/cmake-build/gen_embedding_forward_split_weighted_v2_kernel.cu(895): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:54:29.5168304Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:54:29.5169161Z ^ 2025-05-07T19:54:29.5172368Z detected during instantiation of "void split_embedding_codegen_forward_weighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const float *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=c10::Half, cache_t=c10::Half, output_t=float, index_t=int32_t, USE_LXU_CACHE=false]" at line 1111 2025-05-07T19:54:29.5175294Z 2025-05-07T19:54:29.5176495Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.12/cmake-build/gen_embedding_forward_split_weighted_v2_kernel.cu(895): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:54:29.5178394Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:54:29.5179252Z ^ 2025-05-07T19:54:29.5182551Z detected during instantiation of "void split_embedding_codegen_forward_weighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const float *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=float, cache_t=float, output_t=float, index_t=int64_t, USE_LXU_CACHE=false]" at line 1163 2025-05-07T19:54:29.5186014Z 2025-05-07T19:54:29.5187221Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.12/cmake-build/gen_embedding_forward_split_weighted_v2_kernel.cu(895): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:54:29.5189215Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:54:29.5190093Z ^ 2025-05-07T19:54:29.5193711Z detected during instantiation of "void split_embedding_codegen_forward_weighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const float *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=float, cache_t=c10::Half, output_t=float, index_t=int64_t, USE_LXU_CACHE=false]" at line 1215 2025-05-07T19:54:29.5197086Z 2025-05-07T19:54:29.5198116Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.12/cmake-build/gen_embedding_forward_split_weighted_v2_kernel.cu(895): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:54:29.5200057Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:54:29.5200856Z ^ 2025-05-07T19:54:29.5204134Z detected during instantiation of "void split_embedding_codegen_forward_weighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const float *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=c10::Half, cache_t=float, output_t=float, index_t=int64_t, USE_LXU_CACHE=false]" at line 1267 2025-05-07T19:54:29.5207066Z 2025-05-07T19:54:29.5208133Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.12/cmake-build/gen_embedding_forward_split_weighted_v2_kernel.cu(895): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:54:29.5209796Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:54:29.5210513Z ^ 2025-05-07T19:54:29.5213630Z detected during instantiation of "void split_embedding_codegen_forward_weighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const float *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=c10::Half, cache_t=c10::Half, output_t=float, index_t=int64_t, USE_LXU_CACHE=false]" at line 1319 2025-05-07T19:54:29.5216280Z 2025-05-07T19:54:29.5217387Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.12/cmake-build/gen_embedding_forward_split_weighted_v2_kernel.cu(895): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:54:29.5219254Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:54:29.5220079Z ^ 2025-05-07T19:54:29.5223389Z detected during instantiation of "void split_embedding_codegen_forward_weighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const float *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=float, cache_t=float, output_t=c10::Half, index_t=int32_t, USE_LXU_CACHE=false]" at line 1371 2025-05-07T19:54:29.5226867Z 2025-05-07T19:54:29.5228182Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.12/cmake-build/gen_embedding_forward_split_weighted_v2_kernel.cu(895): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:54:29.5230174Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:54:29.5231184Z ^ 2025-05-07T19:54:29.5234909Z detected during instantiation of "void split_embedding_codegen_forward_weighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const float *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=float, cache_t=c10::Half, output_t=c10::Half, index_t=int32_t, USE_LXU_CACHE=false]" at line 1423 2025-05-07T19:54:29.5238146Z 2025-05-07T19:54:29.5239459Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.12/cmake-build/gen_embedding_forward_split_weighted_v2_kernel.cu(895): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:54:29.5241437Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:54:29.5242382Z ^ 2025-05-07T19:54:29.5245939Z detected during instantiation of "void split_embedding_codegen_forward_weighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const float *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=c10::Half, cache_t=float, output_t=c10::Half, index_t=int32_t, USE_LXU_CACHE=false]" at line 1475 2025-05-07T19:54:29.5249179Z 2025-05-07T19:54:29.5250656Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.12/cmake-build/gen_embedding_forward_split_weighted_v2_kernel.cu(895): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:54:29.5252679Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:54:29.5253610Z ^ 2025-05-07T19:54:29.5257208Z detected during instantiation of "void split_embedding_codegen_forward_weighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const float *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=c10::Half, cache_t=c10::Half, output_t=c10::Half, index_t=int32_t, USE_LXU_CACHE=false]" at line 1527 2025-05-07T19:54:29.5260912Z 2025-05-07T19:54:29.5262223Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.12/cmake-build/gen_embedding_forward_split_weighted_v2_kernel.cu(895): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:54:29.5264247Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:54:29.5265497Z ^ 2025-05-07T19:54:29.5269037Z detected during instantiation of "void split_embedding_codegen_forward_weighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const float *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=float, cache_t=float, output_t=c10::Half, index_t=int64_t, USE_LXU_CACHE=false]" at line 1579 2025-05-07T19:54:29.5272764Z 2025-05-07T19:54:29.5274038Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.12/cmake-build/gen_embedding_forward_split_weighted_v2_kernel.cu(895): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:54:29.5276003Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:54:29.5277064Z ^ 2025-05-07T19:54:29.5280449Z detected during instantiation of "void split_embedding_codegen_forward_weighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const float *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=float, cache_t=c10::Half, output_t=c10::Half, index_t=int64_t, USE_LXU_CACHE=false]" at line 1631 2025-05-07T19:54:29.5283465Z 2025-05-07T19:54:29.5284675Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.12/cmake-build/gen_embedding_forward_split_weighted_v2_kernel.cu(895): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:54:29.5286376Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:54:29.5287213Z ^ 2025-05-07T19:54:29.5290547Z detected during instantiation of "void split_embedding_codegen_forward_weighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const float *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=c10::Half, cache_t=float, output_t=c10::Half, index_t=int64_t, USE_LXU_CACHE=false]" at line 1683 2025-05-07T19:54:29.5293535Z 2025-05-07T19:54:29.5294723Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.12/cmake-build/gen_embedding_forward_split_weighted_v2_kernel.cu(895): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:54:29.5296678Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:54:29.5297556Z ^ 2025-05-07T19:54:29.5301079Z detected during instantiation of "void split_embedding_codegen_forward_weighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const float *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=c10::Half, cache_t=c10::Half, output_t=c10::Half, index_t=int64_t, USE_LXU_CACHE=false]" at line 1735 2025-05-07T19:54:29.5304337Z 2025-05-07T19:54:29.5305545Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.12/cmake-build/gen_embedding_forward_split_weighted_v2_kernel.cu(895): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:54:29.5307478Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:54:29.5308308Z ^ 2025-05-07T19:54:29.5311950Z detected during instantiation of "void split_embedding_codegen_forward_weighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const float *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=float, cache_t=float, output_t=c10::BFloat16, index_t=int32_t, USE_LXU_CACHE=false]" at line 1787 2025-05-07T19:54:29.5315272Z 2025-05-07T19:54:29.5316865Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.12/cmake-build/gen_embedding_forward_split_weighted_v2_kernel.cu(895): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:54:29.5318834Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:54:29.5319697Z ^ 2025-05-07T19:54:29.5323410Z detected during instantiation of "void split_embedding_codegen_forward_weighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const float *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=float, cache_t=c10::Half, output_t=c10::BFloat16, index_t=int32_t, USE_LXU_CACHE=false]" at line 1839 2025-05-07T19:54:29.5326613Z 2025-05-07T19:54:29.5327770Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.12/cmake-build/gen_embedding_forward_split_weighted_v2_kernel.cu(895): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:54:29.5329609Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:54:29.5330532Z ^ 2025-05-07T19:54:29.5334054Z detected during instantiation of "void split_embedding_codegen_forward_weighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const float *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=c10::Half, cache_t=float, output_t=c10::BFloat16, index_t=int32_t, USE_LXU_CACHE=false]" at line 1891 2025-05-07T19:54:29.5337503Z 2025-05-07T19:54:29.5338839Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.12/cmake-build/gen_embedding_forward_split_weighted_v2_kernel.cu(895): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:54:29.5340827Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:54:29.5341592Z ^ 2025-05-07T19:54:29.5344805Z detected during instantiation of "void split_embedding_codegen_forward_weighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const float *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=c10::Half, cache_t=c10::Half, output_t=c10::BFloat16, index_t=int32_t, USE_LXU_CACHE=false]" at line 1943 2025-05-07T19:54:29.5347933Z 2025-05-07T19:54:29.5349169Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.12/cmake-build/gen_embedding_forward_split_weighted_v2_kernel.cu(895): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:54:29.5351113Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:54:29.5352186Z ^ 2025-05-07T19:54:29.5355459Z detected during instantiation of "void split_embedding_codegen_forward_weighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const float *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=float, cache_t=float, output_t=c10::BFloat16, index_t=int64_t, USE_LXU_CACHE=false]" at line 1995 2025-05-07T19:54:29.5358535Z 2025-05-07T19:54:29.5359732Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.12/cmake-build/gen_embedding_forward_split_weighted_v2_kernel.cu(895): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:54:29.5362060Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:54:29.5362982Z ^ 2025-05-07T19:54:29.5366960Z detected during instantiation of "void split_embedding_codegen_forward_weighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const float *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=float, cache_t=c10::Half, output_t=c10::BFloat16, index_t=int64_t, USE_LXU_CACHE=false]" at line 2047 2025-05-07T19:54:29.5370204Z 2025-05-07T19:54:29.5371499Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.12/cmake-build/gen_embedding_forward_split_weighted_v2_kernel.cu(895): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:54:29.5373509Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:54:29.5374399Z ^ 2025-05-07T19:54:29.5377845Z detected during instantiation of "void split_embedding_codegen_forward_weighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const float *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=c10::Half, cache_t=float, output_t=c10::BFloat16, index_t=int64_t, USE_LXU_CACHE=false]" at line 2099 2025-05-07T19:54:29.5381272Z 2025-05-07T19:54:29.5382435Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.12/cmake-build/gen_embedding_forward_split_weighted_v2_kernel.cu(895): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:54:29.5384111Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:54:29.5384996Z ^ 2025-05-07T19:54:29.5388598Z detected during instantiation of "void split_embedding_codegen_forward_weighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const float *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=c10::Half, cache_t=c10::Half, output_t=c10::BFloat16, index_t=int64_t, USE_LXU_CACHE=false]" at line 2151 2025-05-07T19:54:29.5392174Z 2025-05-07T19:54:29.5394068Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:54:29.5396880Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:54:29.5398103Z ^ 2025-05-07T19:54:29.5398391Z 2025-05-07T19:54:29.5398863Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:54:29.5399560Z 2025-05-07T19:54:29.5401286Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:54:29.5403900Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:54:29.5405117Z ^ 2025-05-07T19:54:29.5405944Z 2025-05-07T19:54:29.5407226Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.12/cmake-build/gen_embedding_forward_split_weighted_v2_kernel.cu(895): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:54:29.5409283Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:54:29.5410395Z ^ 2025-05-07T19:54:29.5414125Z detected during instantiation of "void split_embedding_codegen_forward_weighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const float *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=float, cache_t=float, output_t=float, index_t=int32_t, USE_LXU_CACHE=false]" at line 955 2025-05-07T19:54:29.5417359Z 2025-05-07T19:54:29.5418574Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.12/cmake-build/gen_embedding_forward_split_weighted_v2_kernel.cu(895): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:54:29.5420525Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:54:29.5421385Z ^ 2025-05-07T19:54:29.5424795Z detected during instantiation of "void split_embedding_codegen_forward_weighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const float *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=float, cache_t=c10::Half, output_t=float, index_t=int32_t, USE_LXU_CACHE=false]" at line 1007 2025-05-07T19:54:29.5428068Z 2025-05-07T19:54:29.5429343Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.12/cmake-build/gen_embedding_forward_split_weighted_v2_kernel.cu(895): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:54:29.5431278Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:54:29.5432329Z ^ 2025-05-07T19:54:29.5435899Z detected during instantiation of "void split_embedding_codegen_forward_weighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const float *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=c10::Half, cache_t=float, output_t=float, index_t=int32_t, USE_LXU_CACHE=false]" at line 1059 2025-05-07T19:54:29.5439211Z 2025-05-07T19:54:29.5440453Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.12/cmake-build/gen_embedding_forward_split_weighted_v2_kernel.cu(895): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:54:29.5442392Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:54:29.5443259Z ^ 2025-05-07T19:54:29.5446588Z detected during instantiation of "void split_embedding_codegen_forward_weighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const float *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=c10::Half, cache_t=c10::Half, output_t=float, index_t=int32_t, USE_LXU_CACHE=false]" at line 1111 2025-05-07T19:54:29.5449865Z 2025-05-07T19:54:29.5451176Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.12/cmake-build/gen_embedding_forward_split_weighted_v2_kernel.cu(895): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:54:29.5453505Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:54:29.5454277Z ^ 2025-05-07T19:54:29.5457424Z detected during instantiation of "void split_embedding_codegen_forward_weighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const float *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=float, cache_t=float, output_t=float, index_t=int64_t, USE_LXU_CACHE=false]" at line 1163 2025-05-07T19:54:29.5460743Z 2025-05-07T19:54:29.5462056Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.12/cmake-build/gen_embedding_forward_split_weighted_v2_kernel.cu(895): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:54:29.5464098Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:54:29.5465205Z ^ 2025-05-07T19:54:29.5468400Z detected during instantiation of "void split_embedding_codegen_forward_weighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const float *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=float, cache_t=c10::Half, output_t=float, index_t=int64_t, USE_LXU_CACHE=false]" at line 1215 2025-05-07T19:54:29.5471502Z 2025-05-07T19:54:29.5472698Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.12/cmake-build/gen_embedding_forward_split_weighted_v2_kernel.cu(895): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:54:29.5474624Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:54:29.5475518Z ^ 2025-05-07T19:54:29.5478584Z detected during instantiation of "void split_embedding_codegen_forward_weighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const float *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=c10::Half, cache_t=float, output_t=float, index_t=int64_t, USE_LXU_CACHE=false]" at line 1267 2025-05-07T19:54:29.5481649Z 2025-05-07T19:54:29.5482900Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.12/cmake-build/gen_embedding_forward_split_weighted_v2_kernel.cu(895): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:54:29.5484705Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:54:29.5485616Z ^ 2025-05-07T19:54:29.5488877Z detected during instantiation of "void split_embedding_codegen_forward_weighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const float *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=c10::Half, cache_t=c10::Half, output_t=float, index_t=int64_t, USE_LXU_CACHE=false]" at line 1319 2025-05-07T19:54:29.5491781Z 2025-05-07T19:54:29.5492946Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.12/cmake-build/gen_embedding_forward_split_weighted_v2_kernel.cu(895): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:54:29.5495255Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:54:29.5496185Z ^ 2025-05-07T19:54:29.5499605Z detected during instantiation of "void split_embedding_codegen_forward_weighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const float *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=float, cache_t=float, output_t=c10::Half, index_t=int32_t, USE_LXU_CACHE=false]" at line 1371 2025-05-07T19:54:29.5502584Z 2025-05-07T19:54:29.5503909Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.12/cmake-build/gen_embedding_forward_split_weighted_v2_kernel.cu(895): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:54:29.5505969Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:54:29.5506916Z ^ 2025-05-07T19:54:29.5510553Z detected during instantiation of "void split_embedding_codegen_forward_weighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const float *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=float, cache_t=c10::Half, output_t=c10::Half, index_t=int32_t, USE_LXU_CACHE=false]" at line 1423 2025-05-07T19:54:29.5514130Z 2025-05-07T19:54:29.5515476Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.12/cmake-build/gen_embedding_forward_split_weighted_v2_kernel.cu(895): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:54:29.5517535Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:54:29.5518454Z ^ 2025-05-07T19:54:29.5522226Z detected during instantiation of "void split_embedding_codegen_forward_weighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const float *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=c10::Half, cache_t=float, output_t=c10::Half, index_t=int32_t, USE_LXU_CACHE=false]" at line 1475 2025-05-07T19:54:29.5525668Z 2025-05-07T19:54:29.5526959Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.12/cmake-build/gen_embedding_forward_split_weighted_v2_kernel.cu(895): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:54:29.5528935Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:54:29.5529857Z ^ 2025-05-07T19:54:29.5533447Z detected during instantiation of "void split_embedding_codegen_forward_weighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const float *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=c10::Half, cache_t=c10::Half, output_t=c10::Half, index_t=int32_t, USE_LXU_CACHE=false]" at line 1527 2025-05-07T19:54:29.5536819Z 2025-05-07T19:54:29.5538133Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.12/cmake-build/gen_embedding_forward_split_weighted_v2_kernel.cu(895): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:54:29.5540335Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:54:29.5541214Z ^ 2025-05-07T19:54:29.5544992Z detected during instantiation of "void split_embedding_codegen_forward_weighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const float *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=float, cache_t=float, output_t=c10::Half, index_t=int64_t, USE_LXU_CACHE=false]" at line 1579 2025-05-07T19:54:29.5548349Z 2025-05-07T19:54:29.5549635Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.12/cmake-build/gen_embedding_forward_split_weighted_v2_kernel.cu(895): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:54:29.5551774Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:54:29.5552658Z ^ 2025-05-07T19:54:29.5556089Z detected during instantiation of "void split_embedding_codegen_forward_weighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const float *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=float, cache_t=c10::Half, output_t=c10::Half, index_t=int64_t, USE_LXU_CACHE=false]" at line 1631 2025-05-07T19:54:29.5559219Z 2025-05-07T19:54:29.5560399Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.12/cmake-build/gen_embedding_forward_split_weighted_v2_kernel.cu(895): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:54:29.5562407Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:54:29.5563280Z ^ 2025-05-07T19:54:29.5566934Z detected during instantiation of "void split_embedding_codegen_forward_weighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const float *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=c10::Half, cache_t=float, output_t=c10::Half, index_t=int64_t, USE_LXU_CACHE=false]" at line 1683 2025-05-07T19:54:29.5570161Z 2025-05-07T19:54:29.5571487Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.12/cmake-build/gen_embedding_forward_split_weighted_v2_kernel.cu(895): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:54:29.5573465Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:54:29.5574385Z ^ 2025-05-07T19:54:29.5577710Z detected during instantiation of "void split_embedding_codegen_forward_weighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const float *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=c10::Half, cache_t=c10::Half, output_t=c10::Half, index_t=int64_t, USE_LXU_CACHE=false]" at line 1735 2025-05-07T19:54:29.5580774Z 2025-05-07T19:54:29.5581955Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.12/cmake-build/gen_embedding_forward_split_weighted_v2_kernel.cu(895): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:54:29.5583741Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:54:29.5584955Z ^ 2025-05-07T19:54:29.5588497Z detected during instantiation of "void split_embedding_codegen_forward_weighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const float *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=float, cache_t=float, output_t=c10::BFloat16, index_t=int32_t, USE_LXU_CACHE=false]" at line 1787 2025-05-07T19:54:29.5591848Z 2025-05-07T19:54:29.5593094Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.12/cmake-build/gen_embedding_forward_split_weighted_v2_kernel.cu(895): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:54:29.5595047Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:54:29.5595970Z ^ 2025-05-07T19:54:29.5599471Z detected during instantiation of "void split_embedding_codegen_forward_weighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const float *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=float, cache_t=c10::Half, output_t=c10::BFloat16, index_t=int32_t, USE_LXU_CACHE=false]" at line 1839 2025-05-07T19:54:29.5602828Z 2025-05-07T19:54:29.5604076Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.12/cmake-build/gen_embedding_forward_split_weighted_v2_kernel.cu(895): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:54:29.5606086Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:54:29.5607028Z ^ 2025-05-07T19:54:29.5610501Z detected during instantiation of "void split_embedding_codegen_forward_weighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const float *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=c10::Half, cache_t=float, output_t=c10::BFloat16, index_t=int32_t, USE_LXU_CACHE=false]" at line 1891 2025-05-07T19:54:29.5613695Z 2025-05-07T19:54:29.5614961Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.12/cmake-build/gen_embedding_forward_split_weighted_v2_kernel.cu(895): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:54:29.5616840Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:54:29.5617687Z ^ 2025-05-07T19:54:29.5620971Z detected during instantiation of "void split_embedding_codegen_forward_weighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const float *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=c10::Half, cache_t=c10::Half, output_t=c10::BFloat16, index_t=int32_t, USE_LXU_CACHE=false]" at line 1943 2025-05-07T19:54:29.5623846Z 2025-05-07T19:54:29.5625165Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.12/cmake-build/gen_embedding_forward_split_weighted_v2_kernel.cu(895): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:54:29.5627159Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:54:29.5628367Z ^ 2025-05-07T19:54:29.5632120Z detected during instantiation of "void split_embedding_codegen_forward_weighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const float *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=float, cache_t=float, output_t=c10::BFloat16, index_t=int64_t, USE_LXU_CACHE=false]" at line 1995 2025-05-07T19:54:29.5635307Z 2025-05-07T19:54:29.5636646Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.12/cmake-build/gen_embedding_forward_split_weighted_v2_kernel.cu(895): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:54:29.5638548Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:54:29.5639463Z ^ 2025-05-07T19:54:29.5642823Z detected during instantiation of "void split_embedding_codegen_forward_weighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const float *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=float, cache_t=c10::Half, output_t=c10::BFloat16, index_t=int64_t, USE_LXU_CACHE=false]" at line 2047 2025-05-07T19:54:29.5645912Z 2025-05-07T19:54:29.5647253Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.12/cmake-build/gen_embedding_forward_split_weighted_v2_kernel.cu(895): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:54:29.5649306Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:54:29.5650430Z ^ 2025-05-07T19:54:29.5654101Z detected during instantiation of "void split_embedding_codegen_forward_weighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const float *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=c10::Half, cache_t=float, output_t=c10::BFloat16, index_t=int64_t, USE_LXU_CACHE=false]" at line 2099 2025-05-07T19:54:29.5657517Z 2025-05-07T19:54:29.5658880Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.12/cmake-build/gen_embedding_forward_split_weighted_v2_kernel.cu(895): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:54:29.5660824Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:54:29.5661727Z ^ 2025-05-07T19:54:29.5665430Z detected during instantiation of "void split_embedding_codegen_forward_weighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const float *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=c10::Half, cache_t=c10::Half, output_t=c10::BFloat16, index_t=int64_t, USE_LXU_CACHE=false]" at line 2151 2025-05-07T19:54:29.5668480Z 2025-05-07T19:54:29.7654190Z [182/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_forward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_dense_weighted_kernel.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_dense_weighted_kernel.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.12/cmake-build/gen_embedding_forward_dense_weighted_kernel.cu -o CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_dense_weighted_kernel.cu.o 2025-05-07T19:54:29.7679112Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:54:29.7681847Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:54:29.7683013Z ^ 2025-05-07T19:54:29.7683299Z 2025-05-07T19:54:29.7683720Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:54:29.7684351Z 2025-05-07T19:54:29.7686021Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:54:29.7688299Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:54:29.7689313Z ^ 2025-05-07T19:54:29.7689633Z 2025-05-07T19:54:29.7691268Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:54:29.7693891Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:54:29.7695080Z ^ 2025-05-07T19:54:29.7695342Z 2025-05-07T19:54:29.7695787Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:54:29.7696485Z 2025-05-07T19:54:29.7698153Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:54:29.7701364Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:54:29.7702568Z ^ 2025-05-07T19:54:29.7702958Z 2025-05-07T19:54:29.7704762Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:54:29.7707472Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:54:29.7708603Z ^ 2025-05-07T19:54:29.7708895Z 2025-05-07T19:54:29.7709352Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:54:29.7710040Z 2025-05-07T19:54:29.7711866Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:54:29.7714585Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:54:29.7715731Z ^ 2025-05-07T19:54:29.7716103Z 2025-05-07T19:54:29.7717731Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:54:29.7720008Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:54:29.7721087Z ^ 2025-05-07T19:54:29.7721321Z 2025-05-07T19:54:29.7721743Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:54:29.7722417Z 2025-05-07T19:54:29.7724064Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:54:29.7726917Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:54:29.7728130Z ^ 2025-05-07T19:54:29.7728507Z 2025-05-07T19:54:29.7730163Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:54:29.7732872Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:54:29.7733961Z ^ 2025-05-07T19:54:29.7734188Z 2025-05-07T19:54:29.7734609Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:54:29.7735103Z 2025-05-07T19:54:29.7736662Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:54:29.7739305Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:54:29.7740478Z ^ 2025-05-07T19:54:29.7740863Z 2025-05-07T19:54:30.2117743Z [183/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_forward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_split_unweighted_vbe_codegen_cuda.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_split_unweighted_vbe_codegen_cuda.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.12/cmake-build/gen_embedding_forward_split_unweighted_vbe_codegen_cuda.cu -o CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_split_unweighted_vbe_codegen_cuda.cu.o 2025-05-07T19:54:30.2141062Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:54:30.2143679Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:54:30.2144851Z ^ 2025-05-07T19:54:30.2145100Z 2025-05-07T19:54:30.2145529Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:54:30.2146243Z 2025-05-07T19:54:30.2147905Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:54:30.2150445Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:54:30.2151715Z ^ 2025-05-07T19:54:30.2152116Z 2025-05-07T19:54:30.2153783Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:54:30.2156505Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:54:30.2158030Z ^ 2025-05-07T19:54:30.2158307Z 2025-05-07T19:54:30.2158756Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:54:30.2159437Z 2025-05-07T19:54:30.2161193Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:54:30.2163626Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:54:30.2164954Z ^ 2025-05-07T19:54:30.2165294Z 2025-05-07T19:54:30.2166835Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:54:30.2169501Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:54:30.2170618Z ^ 2025-05-07T19:54:30.2170864Z 2025-05-07T19:54:30.2171309Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:54:30.2172004Z 2025-05-07T19:54:30.2173485Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:54:30.2176117Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:54:30.2177236Z ^ 2025-05-07T19:54:30.2177565Z 2025-05-07T19:54:30.2178959Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:54:30.2181348Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:54:30.2182543Z ^ 2025-05-07T19:54:30.2182771Z 2025-05-07T19:54:30.2183241Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:54:30.2183905Z 2025-05-07T19:54:30.2185536Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:54:30.2188316Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:54:30.2189561Z ^ 2025-05-07T19:54:30.2189954Z 2025-05-07T19:54:30.2191773Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:54:30.2194468Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:54:30.2195614Z ^ 2025-05-07T19:54:30.2195895Z 2025-05-07T19:54:30.2196341Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:54:30.2196918Z 2025-05-07T19:54:30.2198466Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:54:30.2201111Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:54:30.2202793Z ^ 2025-05-07T19:54:30.2203157Z 2025-05-07T19:54:30.3046797Z [184/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_pt2_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_backward_split_adagrad_pt2_cpu_wrapper.cpp.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_backward_split_adagrad_pt2_cpu_wrapper.cpp.o.d -o CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_backward_split_adagrad_pt2_cpu_wrapper.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.12/cmake-build/gen_embedding_backward_split_adagrad_pt2_cpu_wrapper.cpp 2025-05-07T19:54:30.3066694Z clang-16: warning: argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T19:54:32.3525843Z [185/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_pt2_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_split_rowwise_adagrad_with_counter_pt2_autograd.cpp.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_split_rowwise_adagrad_with_counter_pt2_autograd.cpp.o.d -o CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_split_rowwise_adagrad_with_counter_pt2_autograd.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.12/cmake-build/gen_embedding_split_rowwise_adagrad_with_counter_pt2_autograd.cpp 2025-05-07T19:54:32.3546464Z clang-16: warning: argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T19:54:32.7552613Z [186/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_pt2_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_backward_split_rowwise_adagrad_pt2_cpu_wrapper.cpp.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_backward_split_rowwise_adagrad_pt2_cpu_wrapper.cpp.o.d -o CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_backward_split_rowwise_adagrad_pt2_cpu_wrapper.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.12/cmake-build/gen_embedding_backward_split_rowwise_adagrad_pt2_cpu_wrapper.cpp 2025-05-07T19:54:32.7572138Z clang-16: warning: argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T19:54:33.5975374Z [187/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_pt2_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_backward_split_sgd_pt2_cpu_wrapper.cpp.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_backward_split_sgd_pt2_cpu_wrapper.cpp.o.d -o CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_backward_split_sgd_pt2_cpu_wrapper.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.12/cmake-build/gen_embedding_backward_split_sgd_pt2_cpu_wrapper.cpp 2025-05-07T19:54:33.5999476Z clang-16: warning: argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T19:54:33.6420929Z [188/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_pt2_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_backward_split_approx_sgd_pt2_cuda_wrapper.cpp.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_backward_split_approx_sgd_pt2_cuda_wrapper.cpp.o.d -o CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_backward_split_approx_sgd_pt2_cuda_wrapper.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.12/cmake-build/gen_embedding_backward_split_approx_sgd_pt2_cuda_wrapper.cpp 2025-05-07T19:54:33.6440247Z clang-16: warning: argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T19:54:33.6730746Z [189/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_pt2_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_backward_split_approx_rowwise_adagrad_pt2_cuda_wrapper.cpp.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_backward_split_approx_rowwise_adagrad_pt2_cuda_wrapper.cpp.o.d -o CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_backward_split_approx_rowwise_adagrad_pt2_cuda_wrapper.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.12/cmake-build/gen_embedding_backward_split_approx_rowwise_adagrad_pt2_cuda_wrapper.cpp 2025-05-07T19:54:33.6752184Z clang-16: warning: argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T19:54:33.7185405Z [190/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_pt2_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_backward_split_approx_rowwise_adagrad_with_counter_pt2_cuda_wrapper.cpp.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_backward_split_approx_rowwise_adagrad_with_counter_pt2_cuda_wrapper.cpp.o.d -o CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_backward_split_approx_rowwise_adagrad_with_counter_pt2_cuda_wrapper.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.12/cmake-build/gen_embedding_backward_split_approx_rowwise_adagrad_with_counter_pt2_cuda_wrapper.cpp 2025-05-07T19:54:33.7207691Z clang-16: warning: argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T19:54:33.7637973Z [191/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_pt2_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_backward_split_approx_rowwise_adagrad_with_weight_decay_pt2_cuda_wrapper.cpp.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_backward_split_approx_rowwise_adagrad_with_weight_decay_pt2_cuda_wrapper.cpp.o.d -o CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_backward_split_approx_rowwise_adagrad_with_weight_decay_pt2_cuda_wrapper.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.12/cmake-build/gen_embedding_backward_split_approx_rowwise_adagrad_with_weight_decay_pt2_cuda_wrapper.cpp 2025-05-07T19:54:33.7659216Z clang-16: warning: argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T19:54:33.8084129Z [192/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_pt2_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_backward_split_rowwise_adagrad_with_weight_decay_pt2_cuda_wrapper.cpp.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_backward_split_rowwise_adagrad_with_weight_decay_pt2_cuda_wrapper.cpp.o.d -o CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_backward_split_rowwise_adagrad_with_weight_decay_pt2_cuda_wrapper.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.12/cmake-build/gen_embedding_backward_split_rowwise_adagrad_with_weight_decay_pt2_cuda_wrapper.cpp 2025-05-07T19:54:33.8105839Z clang-16: warning: argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T19:54:33.8386600Z [193/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_pt2_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_backward_split_rowwise_weighted_adagrad_pt2_cuda_wrapper.cpp.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_backward_split_rowwise_weighted_adagrad_pt2_cuda_wrapper.cpp.o.d -o CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_backward_split_rowwise_weighted_adagrad_pt2_cuda_wrapper.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.12/cmake-build/gen_embedding_backward_split_rowwise_weighted_adagrad_pt2_cuda_wrapper.cpp 2025-05-07T19:54:33.8407752Z clang-16: warning: argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T19:54:37.8641326Z [194/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_pt2_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_backward_split_adagrad_pt2_cuda_wrapper.cpp.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_backward_split_adagrad_pt2_cuda_wrapper.cpp.o.d -o CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_backward_split_adagrad_pt2_cuda_wrapper.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.12/cmake-build/gen_embedding_backward_split_adagrad_pt2_cuda_wrapper.cpp 2025-05-07T19:54:37.8660673Z clang-16: warning: argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T19:54:39.9541767Z [195/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_forward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_split_unweighted_vbe_gwd_codegen_cuda.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_split_unweighted_vbe_gwd_codegen_cuda.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.12/cmake-build/gen_embedding_forward_split_unweighted_vbe_gwd_codegen_cuda.cu -o CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_split_unweighted_vbe_gwd_codegen_cuda.cu.o 2025-05-07T19:54:39.9566535Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:54:39.9569248Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:54:39.9570376Z ^ 2025-05-07T19:54:39.9570654Z 2025-05-07T19:54:39.9571086Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:54:39.9571733Z 2025-05-07T19:54:39.9573336Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:54:39.9575933Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:54:39.9577107Z ^ 2025-05-07T19:54:39.9577460Z 2025-05-07T19:54:39.9578959Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:54:39.9581327Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:54:39.9582534Z ^ 2025-05-07T19:54:39.9582789Z 2025-05-07T19:54:39.9583249Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:54:39.9583854Z 2025-05-07T19:54:39.9585317Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:54:39.9587708Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:54:39.9588764Z ^ 2025-05-07T19:54:39.9589120Z 2025-05-07T19:54:39.9590609Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:54:39.9593069Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:54:39.9594135Z ^ 2025-05-07T19:54:39.9594760Z 2025-05-07T19:54:39.9595162Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:54:39.9595739Z 2025-05-07T19:54:39.9597190Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:54:39.9600055Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:54:39.9601211Z ^ 2025-05-07T19:54:39.9601566Z 2025-05-07T19:54:39.9603206Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:54:39.9605788Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:54:39.9606970Z ^ 2025-05-07T19:54:39.9607220Z 2025-05-07T19:54:39.9607638Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:54:39.9608327Z 2025-05-07T19:54:39.9609940Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:54:39.9612505Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:54:39.9613631Z ^ 2025-05-07T19:54:39.9614002Z 2025-05-07T19:54:39.9615494Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:54:39.9617777Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:54:39.9618785Z ^ 2025-05-07T19:54:39.9619026Z 2025-05-07T19:54:39.9619412Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:54:39.9619991Z 2025-05-07T19:54:39.9621405Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:54:39.9623672Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:54:39.9624701Z ^ 2025-05-07T19:54:39.9625012Z 2025-05-07T19:54:40.3257784Z [196/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_forward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_dense_unweighted_nobag_kernel_small.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_dense_unweighted_nobag_kernel_small.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.12/cmake-build/gen_embedding_forward_dense_unweighted_nobag_kernel_small.cu -o CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_dense_unweighted_nobag_kernel_small.cu.o 2025-05-07T19:54:40.3282752Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:54:40.3285628Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:54:40.3299202Z ^ 2025-05-07T19:54:40.3299597Z 2025-05-07T19:54:40.3300075Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:54:40.3300782Z 2025-05-07T19:54:40.3302566Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:54:40.3305364Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:54:40.3306623Z ^ 2025-05-07T19:54:40.3307004Z 2025-05-07T19:54:40.3308764Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:54:40.3311707Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:54:40.3312970Z ^ 2025-05-07T19:54:40.3313217Z 2025-05-07T19:54:40.3313665Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:54:40.3314369Z 2025-05-07T19:54:40.3316091Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:54:40.3318887Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:54:40.3320085Z ^ 2025-05-07T19:54:40.3320496Z 2025-05-07T19:54:40.3322223Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:54:40.3325458Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:54:40.3326696Z ^ 2025-05-07T19:54:40.3326986Z 2025-05-07T19:54:40.3327448Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:54:40.3329151Z 2025-05-07T19:54:40.3330944Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:54:40.3333818Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:54:40.3335060Z ^ 2025-05-07T19:54:40.3335436Z 2025-05-07T19:54:40.3337188Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:54:40.3340003Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:54:40.3341244Z ^ 2025-05-07T19:54:40.3341515Z 2025-05-07T19:54:40.3341977Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:54:40.3342693Z 2025-05-07T19:54:40.3344397Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:54:40.3347177Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:54:40.3348414Z ^ 2025-05-07T19:54:40.3348791Z 2025-05-07T19:54:40.3350522Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:54:40.3353474Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:54:40.3354732Z ^ 2025-05-07T19:54:40.3355002Z 2025-05-07T19:54:40.3355494Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:54:40.3356191Z 2025-05-07T19:54:40.3357965Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:54:40.3360829Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:54:40.3362093Z ^ 2025-05-07T19:54:40.3362478Z 2025-05-07T19:54:41.1876911Z [197/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_forward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_split_unweighted_nobag_kernel_small.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_split_unweighted_nobag_kernel_small.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.12/cmake-build/gen_embedding_forward_split_unweighted_nobag_kernel_small.cu -o CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_split_unweighted_nobag_kernel_small.cu.o 2025-05-07T19:54:41.1900016Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:54:41.1902928Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:54:41.1904117Z ^ 2025-05-07T19:54:41.1904390Z 2025-05-07T19:54:41.1904832Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:54:41.1905691Z 2025-05-07T19:54:41.1907421Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:54:41.1910161Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:54:41.1911542Z ^ 2025-05-07T19:54:41.1911920Z 2025-05-07T19:54:41.1913625Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:54:41.1916309Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:54:41.1917514Z ^ 2025-05-07T19:54:41.1917770Z 2025-05-07T19:54:41.1918228Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:54:41.1918943Z 2025-05-07T19:54:41.1920625Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:54:41.1923449Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:54:41.1924948Z ^ 2025-05-07T19:54:41.1925326Z 2025-05-07T19:54:41.1926950Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:54:41.1929744Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:54:41.1930875Z ^ 2025-05-07T19:54:41.1931159Z 2025-05-07T19:54:41.1931616Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:54:41.1932256Z 2025-05-07T19:54:41.1934087Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:54:41.1937018Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:54:41.1938254Z ^ 2025-05-07T19:54:41.1938635Z 2025-05-07T19:54:41.1940326Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:54:41.1943180Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:54:41.1944373Z ^ 2025-05-07T19:54:41.1944634Z 2025-05-07T19:54:41.1945075Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:54:41.1945766Z 2025-05-07T19:54:41.1947426Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:54:41.1949598Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:54:41.1950438Z ^ 2025-05-07T19:54:41.1950731Z 2025-05-07T19:54:41.1952278Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:54:41.1955119Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:54:41.1956068Z ^ 2025-05-07T19:54:41.1956302Z 2025-05-07T19:54:41.1956729Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:54:41.1957412Z 2025-05-07T19:54:41.1959127Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:54:41.1962128Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:54:41.1963116Z ^ 2025-05-07T19:54:41.1963435Z 2025-05-07T19:54:42.5071908Z [198/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_pt2_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_backward_split_sgd_pt2_cuda_wrapper.cpp.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_backward_split_sgd_pt2_cuda_wrapper.cpp.o.d -o CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_backward_split_sgd_pt2_cuda_wrapper.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.12/cmake-build/gen_embedding_backward_split_sgd_pt2_cuda_wrapper.cpp 2025-05-07T19:54:42.5091176Z clang-16: warning: argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T19:54:42.7124855Z [199/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_forward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_dense_unweighted_kernel.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_dense_unweighted_kernel.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.12/cmake-build/gen_embedding_forward_dense_unweighted_kernel.cu -o CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_dense_unweighted_kernel.cu.o 2025-05-07T19:54:42.7146997Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:54:42.7149554Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:54:42.7150651Z ^ 2025-05-07T19:54:42.7150904Z 2025-05-07T19:54:42.7151313Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:54:42.7152126Z 2025-05-07T19:54:42.7153635Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:54:42.7156126Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:54:42.7157202Z ^ 2025-05-07T19:54:42.7157593Z 2025-05-07T19:54:42.7159106Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:54:42.7161725Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:54:42.7162814Z ^ 2025-05-07T19:54:42.7163054Z 2025-05-07T19:54:42.7163517Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:54:42.7164136Z 2025-05-07T19:54:42.7165902Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:54:42.7168350Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:54:42.7169422Z ^ 2025-05-07T19:54:42.7169740Z 2025-05-07T19:54:42.7171143Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:54:42.7173589Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:54:42.7174675Z ^ 2025-05-07T19:54:42.7174914Z 2025-05-07T19:54:42.7175299Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:54:42.7175889Z 2025-05-07T19:54:42.7177399Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:54:42.7179864Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:54:42.7180948Z ^ 2025-05-07T19:54:42.7181278Z 2025-05-07T19:54:42.7182745Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:54:42.7185602Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:54:42.7186670Z ^ 2025-05-07T19:54:42.7186919Z 2025-05-07T19:54:42.7187631Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:54:42.7188228Z 2025-05-07T19:54:42.7189664Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:54:42.7192188Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:54:42.7193309Z ^ 2025-05-07T19:54:42.7193644Z 2025-05-07T19:54:42.7195082Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:54:42.7197414Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:54:42.7198414Z ^ 2025-05-07T19:54:42.7198689Z 2025-05-07T19:54:42.7199085Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:54:42.7199544Z 2025-05-07T19:54:42.7200859Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:54:42.7203127Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:54:42.7204211Z ^ 2025-05-07T19:54:42.7204537Z 2025-05-07T19:54:42.8559999Z [200/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_pt2_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_backward_split_rowwise_adagrad_pt2_cuda_wrapper.cpp.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_backward_split_rowwise_adagrad_pt2_cuda_wrapper.cpp.o.d -o CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_backward_split_rowwise_adagrad_pt2_cuda_wrapper.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.12/cmake-build/gen_embedding_backward_split_rowwise_adagrad_pt2_cuda_wrapper.cpp 2025-05-07T19:54:42.8580788Z clang-16: warning: argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T19:54:43.2131060Z [201/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_pt2_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_ssd_rowwise_adagrad_pt2_autograd.cpp.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_ssd_rowwise_adagrad_pt2_autograd.cpp.o.d -o CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_ssd_rowwise_adagrad_pt2_autograd.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.12/cmake-build/gen_embedding_ssd_rowwise_adagrad_pt2_autograd.cpp 2025-05-07T19:54:43.2149720Z clang-16: warning: argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T19:54:46.4906645Z [202/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_pt2_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_backward_split_lars_sgd_pt2_cuda_wrapper.cpp.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_backward_split_lars_sgd_pt2_cuda_wrapper.cpp.o.d -o CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_backward_split_lars_sgd_pt2_cuda_wrapper.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.12/cmake-build/gen_embedding_backward_split_lars_sgd_pt2_cuda_wrapper.cpp 2025-05-07T19:54:46.4927451Z clang-16: warning: argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T19:54:46.8678747Z [203/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_forward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_ssd_unweighted_nobag_kernel_small.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_ssd_unweighted_nobag_kernel_small.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.12/cmake-build/gen_embedding_forward_ssd_unweighted_nobag_kernel_small.cu -o CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_ssd_unweighted_nobag_kernel_small.cu.o 2025-05-07T19:54:46.8701027Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:54:46.8704061Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:54:46.8705209Z ^ 2025-05-07T19:54:46.8705459Z 2025-05-07T19:54:46.8705876Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:54:46.8706725Z 2025-05-07T19:54:46.8708129Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:54:46.8710719Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:54:46.8712035Z ^ 2025-05-07T19:54:46.8712430Z 2025-05-07T19:54:46.8714034Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:54:46.8716648Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:54:46.8717781Z ^ 2025-05-07T19:54:46.8718035Z 2025-05-07T19:54:46.8718494Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:54:46.8719141Z 2025-05-07T19:54:46.8720727Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:54:46.8723304Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:54:46.8724484Z ^ 2025-05-07T19:54:46.8724832Z 2025-05-07T19:54:46.8726166Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:54:46.8728465Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:54:46.8729453Z ^ 2025-05-07T19:54:46.8729667Z 2025-05-07T19:54:46.8730007Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:54:46.8730603Z 2025-05-07T19:54:46.8732023Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:54:46.8734127Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:54:46.8735054Z ^ 2025-05-07T19:54:46.8735366Z 2025-05-07T19:54:46.8736787Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:54:46.8739191Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:54:46.8740276Z ^ 2025-05-07T19:54:46.8740500Z 2025-05-07T19:54:46.8740916Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:54:46.8741662Z 2025-05-07T19:54:46.8743134Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:54:46.8745832Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:54:46.8746949Z ^ 2025-05-07T19:54:46.8747297Z 2025-05-07T19:54:46.8748965Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:54:46.8751673Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:54:46.8752672Z ^ 2025-05-07T19:54:46.8752910Z 2025-05-07T19:54:46.8753273Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:54:46.8753815Z 2025-05-07T19:54:46.8755241Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:54:46.8757718Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:54:46.8758839Z ^ 2025-05-07T19:54:46.8759163Z 2025-05-07T19:54:47.3083355Z [204/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_pt2_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_backward_split_none_pt2_cuda_wrapper.cpp.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_backward_split_none_pt2_cuda_wrapper.cpp.o.d -o CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_backward_split_none_pt2_cuda_wrapper.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.12/cmake-build/gen_embedding_backward_split_none_pt2_cuda_wrapper.cpp 2025-05-07T19:54:47.3093624Z clang-16: warning: argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T19:54:47.3600093Z [205/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_inference_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_inference.dir/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_nobag_int2_codegen_cuda.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_inference.dir/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_nobag_int2_codegen_cuda.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.12/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_nobag_int2_codegen_cuda.cu -o CMakeFiles/fbgemm_gpu_tbe_inference.dir/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_nobag_int2_codegen_cuda.cu.o 2025-05-07T19:54:47.3612600Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:54:47.3614025Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:54:47.3614676Z ^ 2025-05-07T19:54:47.3614824Z 2025-05-07T19:54:47.3615072Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:54:47.3615442Z 2025-05-07T19:54:47.3616311Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:54:47.3617731Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:54:47.3618367Z ^ 2025-05-07T19:54:47.3618583Z 2025-05-07T19:54:47.3619478Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.12/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_nobag_int2_codegen_cuda.cu(220): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:54:47.3620551Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:54:47.3620972Z ^ 2025-05-07T19:54:47.3621138Z 2025-05-07T19:54:47.3621969Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.12/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_nobag_int2_codegen_cuda.cu(237): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:54:47.3623106Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:54:47.3623416Z ^ 2025-05-07T19:54:47.3623602Z 2025-05-07T19:54:47.3624430Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.12/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_nobag_int2_codegen_cuda.cu(247): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:54:47.3625482Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:54:47.3625786Z ^ 2025-05-07T19:54:47.3625965Z 2025-05-07T19:54:47.3626824Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:54:47.3628238Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:54:47.3628862Z ^ 2025-05-07T19:54:47.3629012Z 2025-05-07T19:54:47.3629272Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:54:47.3629630Z 2025-05-07T19:54:47.3630487Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:54:47.3632053Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:54:47.3632707Z ^ 2025-05-07T19:54:47.3632907Z 2025-05-07T19:54:47.3633736Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.12/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_nobag_int2_codegen_cuda.cu(220): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:54:47.3634796Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:54:47.3635103Z ^ 2025-05-07T19:54:47.3635277Z 2025-05-07T19:54:47.3636100Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.12/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_nobag_int2_codegen_cuda.cu(237): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:54:47.3637147Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:54:47.3637449Z ^ 2025-05-07T19:54:47.3637621Z 2025-05-07T19:54:47.3638446Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.12/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_nobag_int2_codegen_cuda.cu(247): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:54:47.3639496Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:54:47.3639796Z ^ 2025-05-07T19:54:47.3639956Z 2025-05-07T19:54:47.3640832Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:54:47.3642211Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:54:47.3642838Z ^ 2025-05-07T19:54:47.3642977Z 2025-05-07T19:54:47.3643235Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:54:47.3643588Z 2025-05-07T19:54:47.3644542Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:54:47.3645949Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:54:47.3646657Z ^ 2025-05-07T19:54:47.3646860Z 2025-05-07T19:54:47.3647698Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.12/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_nobag_int2_codegen_cuda.cu(220): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:54:47.3648755Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:54:47.3649056Z ^ 2025-05-07T19:54:47.3649233Z 2025-05-07T19:54:47.3650055Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.12/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_nobag_int2_codegen_cuda.cu(237): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:54:47.3651117Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:54:47.3651418Z ^ 2025-05-07T19:54:47.3651597Z 2025-05-07T19:54:47.3652423Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.12/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_nobag_int2_codegen_cuda.cu(247): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:54:47.3653471Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:54:47.3653771Z ^ 2025-05-07T19:54:47.3653935Z 2025-05-07T19:54:47.3654806Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:54:47.3656199Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:54:47.3656829Z ^ 2025-05-07T19:54:47.3656971Z 2025-05-07T19:54:47.3657229Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:54:47.3657581Z 2025-05-07T19:54:47.3658451Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:54:47.3659853Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:54:47.3660480Z ^ 2025-05-07T19:54:47.3660690Z 2025-05-07T19:54:47.3661520Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.12/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_nobag_int2_codegen_cuda.cu(220): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:54:47.3662578Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:54:47.3662882Z ^ 2025-05-07T19:54:47.3663056Z 2025-05-07T19:54:47.3663881Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.12/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_nobag_int2_codegen_cuda.cu(237): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:54:47.3665256Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:54:47.3665564Z ^ 2025-05-07T19:54:47.3665730Z 2025-05-07T19:54:47.3666570Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.12/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_nobag_int2_codegen_cuda.cu(247): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:54:47.3667807Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:54:47.3668127Z ^ 2025-05-07T19:54:47.3668290Z 2025-05-07T19:54:47.3669160Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:54:47.3670669Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:54:47.3671307Z ^ 2025-05-07T19:54:47.3671555Z 2025-05-07T19:54:47.3671800Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:54:47.3672172Z 2025-05-07T19:54:47.3673036Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:54:47.3674450Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:54:47.3675079Z ^ 2025-05-07T19:54:47.3675293Z 2025-05-07T19:54:47.3676129Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.12/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_nobag_int2_codegen_cuda.cu(220): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:54:47.3677182Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:54:47.3677487Z ^ 2025-05-07T19:54:47.3677662Z 2025-05-07T19:54:47.3678483Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.12/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_nobag_int2_codegen_cuda.cu(237): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:54:47.3679537Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:54:47.3679837Z ^ 2025-05-07T19:54:47.3679999Z 2025-05-07T19:54:47.3680836Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.12/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_nobag_int2_codegen_cuda.cu(247): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:54:47.3681878Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:54:47.3682191Z ^ 2025-05-07T19:54:47.3682351Z 2025-05-07T19:54:48.1800513Z [206/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_pt2_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_backward_split_lamb_pt2_cuda_wrapper.cpp.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_backward_split_lamb_pt2_cuda_wrapper.cpp.o.d -o CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_backward_split_lamb_pt2_cuda_wrapper.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.12/cmake-build/gen_embedding_backward_split_lamb_pt2_cuda_wrapper.cpp 2025-05-07T19:54:48.1820692Z clang-16: warning: argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T19:54:48.5693852Z [207/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_pt2_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_backward_split_partial_rowwise_adam_pt2_cuda_wrapper.cpp.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_backward_split_partial_rowwise_adam_pt2_cuda_wrapper.cpp.o.d -o CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_backward_split_partial_rowwise_adam_pt2_cuda_wrapper.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.12/cmake-build/gen_embedding_backward_split_partial_rowwise_adam_pt2_cuda_wrapper.cpp 2025-05-07T19:54:48.5718107Z clang-16: warning: argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T19:54:49.6466149Z [208/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_pt2_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_backward_split_partial_rowwise_lamb_pt2_cuda_wrapper.cpp.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_backward_split_partial_rowwise_lamb_pt2_cuda_wrapper.cpp.o.d -o CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_backward_split_partial_rowwise_lamb_pt2_cuda_wrapper.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.12/cmake-build/gen_embedding_backward_split_partial_rowwise_lamb_pt2_cuda_wrapper.cpp 2025-05-07T19:54:49.6484337Z clang-16: warning: argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T19:54:53.3975464Z [209/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_pt2_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_backward_ssd_rowwise_adagrad_pt2_cuda_wrapper.cpp.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_backward_ssd_rowwise_adagrad_pt2_cuda_wrapper.cpp.o.d -o CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_backward_ssd_rowwise_adagrad_pt2_cuda_wrapper.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.12/cmake-build/gen_embedding_backward_ssd_rowwise_adagrad_pt2_cuda_wrapper.cpp 2025-05-07T19:54:53.3995620Z clang-16: warning: argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T19:54:54.4880311Z [210/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/codegen/training/backward/embedding_backward_dense_host_cpu.cpp.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/codegen/training/backward/embedding_backward_dense_host_cpu.cpp.o.d -o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/codegen/training/backward/embedding_backward_dense_host_cpu.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/codegen/training/backward/embedding_backward_dense_host_cpu.cpp 2025-05-07T19:54:55.3867820Z clang-16: warning: argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T19:54:55.3885358Z [211/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_split_adam_cpu.cpp.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_split_adam_cpu.cpp.o.d -o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_split_adam_cpu.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.12/cmake-build/gen_embedding_backward_split_adam_cpu.cpp 2025-05-07T19:54:55.3904744Z clang-16: warning: argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T19:54:55.6606995Z [212/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_dense_split_cpu.cpp.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_dense_split_cpu.cpp.o.d -o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_dense_split_cpu.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.12/cmake-build/gen_embedding_backward_dense_split_cpu.cpp 2025-05-07T19:54:55.6622931Z clang-16: warning: argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T19:54:55.7472223Z [213/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_split_lars_sgd_cpu.cpp.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_split_lars_sgd_cpu.cpp.o.d -o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_split_lars_sgd_cpu.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.12/cmake-build/gen_embedding_backward_split_lars_sgd_cpu.cpp 2025-05-07T19:54:55.7490267Z clang-16: warning: argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T19:54:56.1832173Z [214/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_split_lamb_cpu.cpp.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_split_lamb_cpu.cpp.o.d -o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_split_lamb_cpu.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.12/cmake-build/gen_embedding_backward_split_lamb_cpu.cpp 2025-05-07T19:54:56.1849963Z clang-16: warning: argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T19:54:57.7689364Z [215/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_pt2_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_backward_split_adam_pt2_cuda_wrapper.cpp.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_backward_split_adam_pt2_cuda_wrapper.cpp.o.d -o CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_backward_split_adam_pt2_cuda_wrapper.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.12/cmake-build/gen_embedding_backward_split_adam_pt2_cuda_wrapper.cpp 2025-05-07T19:54:57.7710233Z clang-16: warning: argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T19:54:58.3408656Z [216/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_forward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_ssd_unweighted_vbe_codegen_cuda.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_ssd_unweighted_vbe_codegen_cuda.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.12/cmake-build/gen_embedding_forward_ssd_unweighted_vbe_codegen_cuda.cu -o CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_ssd_unweighted_vbe_codegen_cuda.cu.o 2025-05-07T19:54:58.3429129Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:54:58.3431233Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:54:58.3432304Z ^ 2025-05-07T19:54:58.3432548Z 2025-05-07T19:54:58.3432915Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:54:58.3433445Z 2025-05-07T19:54:58.3434730Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:54:58.3436785Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:54:58.3437722Z ^ 2025-05-07T19:54:58.3438022Z 2025-05-07T19:54:58.3439302Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:54:58.3441380Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:54:58.3442346Z ^ 2025-05-07T19:54:58.3442570Z 2025-05-07T19:54:58.3442955Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:54:58.3443519Z 2025-05-07T19:54:58.3444810Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:54:58.3447077Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:54:58.3448041Z ^ 2025-05-07T19:54:58.3448378Z 2025-05-07T19:54:58.3449848Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:54:58.3452000Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:54:58.3453176Z ^ 2025-05-07T19:54:58.3453387Z 2025-05-07T19:54:58.3453759Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:54:58.3454296Z 2025-05-07T19:54:58.3455636Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:54:58.3457878Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:54:58.3458837Z ^ 2025-05-07T19:54:58.3459116Z 2025-05-07T19:54:58.3460363Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:54:58.3462557Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:54:58.3463481Z ^ 2025-05-07T19:54:58.3463681Z 2025-05-07T19:54:58.3464202Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:54:58.3465106Z 2025-05-07T19:54:58.3466408Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:54:58.3468452Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:54:58.3469422Z ^ 2025-05-07T19:54:58.3469726Z 2025-05-07T19:54:58.3471015Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:54:58.3473192Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:54:58.3474118Z ^ 2025-05-07T19:54:58.3474343Z 2025-05-07T19:54:58.3474731Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:54:58.3475260Z 2025-05-07T19:54:58.3476542Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:54:58.3478623Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:54:58.3479599Z ^ 2025-05-07T19:54:58.3479897Z 2025-05-07T19:54:58.9867859Z [217/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_split_adagrad_cpu.cpp.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_split_adagrad_cpu.cpp.o.d -o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_split_adagrad_cpu.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.12/cmake-build/gen_embedding_backward_split_adagrad_cpu.cpp 2025-05-07T19:54:58.9888805Z clang-16: warning: argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T19:54:59.0732356Z [218/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_split_partial_rowwise_adam_cpu.cpp.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_split_partial_rowwise_adam_cpu.cpp.o.d -o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_split_partial_rowwise_adam_cpu.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.12/cmake-build/gen_embedding_backward_split_partial_rowwise_adam_cpu.cpp 2025-05-07T19:54:59.0752199Z clang-16: warning: argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T19:54:59.4620748Z [219/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_split_rowwise_adagrad_cpu.cpp.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_split_rowwise_adagrad_cpu.cpp.o.d -o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_split_rowwise_adagrad_cpu.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.12/cmake-build/gen_embedding_backward_split_rowwise_adagrad_cpu.cpp 2025-05-07T19:54:59.4641117Z clang-16: warning: argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T19:54:59.5284120Z [220/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_split_partial_rowwise_lamb_cpu.cpp.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_split_partial_rowwise_lamb_cpu.cpp.o.d -o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_split_partial_rowwise_lamb_cpu.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.12/cmake-build/gen_embedding_backward_split_partial_rowwise_lamb_cpu.cpp 2025-05-07T19:54:59.5304156Z clang-16: warning: argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T19:55:00.0363540Z [221/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_split_rowwise_adagrad_with_counter_cpu.cpp.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_split_rowwise_adagrad_with_counter_cpu.cpp.o.d -o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_split_rowwise_adagrad_with_counter_cpu.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.12/cmake-build/gen_embedding_backward_split_rowwise_adagrad_with_counter_cpu.cpp 2025-05-07T19:55:00.0384659Z clang-16: warning: argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T19:55:00.5057302Z [222/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_pt2_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_backward_split_rowwise_adagrad_with_counter_pt2_cuda_wrapper.cpp.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_backward_split_rowwise_adagrad_with_counter_pt2_cuda_wrapper.cpp.o.d -o CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_backward_split_rowwise_adagrad_with_counter_pt2_cuda_wrapper.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.12/cmake-build/gen_embedding_backward_split_rowwise_adagrad_with_counter_pt2_cuda_wrapper.cpp 2025-05-07T19:55:00.5080094Z clang-16: warning: argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T19:55:00.6207040Z [223/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_split_none_cpu.cpp.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_split_none_cpu.cpp.o.d -o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_split_none_cpu.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.12/cmake-build/gen_embedding_backward_split_none_cpu.cpp 2025-05-07T19:55:00.6224723Z clang-16: warning: argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T19:55:00.6376316Z [224/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_split_approx_sgd_cpu.cpp.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_split_approx_sgd_cpu.cpp.o.d -o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_split_approx_sgd_cpu.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.12/cmake-build/gen_embedding_backward_split_approx_sgd_cpu.cpp 2025-05-07T19:55:00.6395273Z clang-16: warning: argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T19:55:00.8553829Z [225/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_split_sgd_cpu.cpp.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_split_sgd_cpu.cpp.o.d -o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_split_sgd_cpu.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.12/cmake-build/gen_embedding_backward_split_sgd_cpu.cpp 2025-05-07T19:55:00.8572819Z clang-16: warning: argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T19:55:01.0059822Z [226/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_split_approx_rowwise_adagrad_cpu.cpp.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_split_approx_rowwise_adagrad_cpu.cpp.o.d -o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_split_approx_rowwise_adagrad_cpu.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.12/cmake-build/gen_embedding_backward_split_approx_rowwise_adagrad_cpu.cpp 2025-05-07T19:55:01.0078994Z clang-16: warning: argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T19:55:01.2303150Z [227/608] : && /github/home/miniconda/envs/build_binary/bin/c++ -fPIC -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -L/github/home/miniconda/envs/build_binary/targets/x86_64-linux/lib -L/github/home/miniconda/envs/build_binary/targets/x86_64-linux/lib/stubs -s -shared -Wl,-soname,fbgemm_gpu_tbe_training_backward_pt2.so -o fbgemm_gpu_tbe_training_backward_pt2.so CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_split_adagrad_pt2_autograd.cpp.o CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_split_rowwise_adagrad_pt2_autograd.cpp.o CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_split_sgd_pt2_autograd.cpp.o CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_split_adam_pt2_autograd.cpp.o CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_split_lamb_pt2_autograd.cpp.o CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_split_lars_sgd_pt2_autograd.cpp.o CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_split_partial_rowwise_adam_pt2_autograd.cpp.o CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_split_partial_rowwise_lamb_pt2_autograd.cpp.o CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_split_none_pt2_autograd.cpp.o CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_split_rowwise_adagrad_with_counter_pt2_autograd.cpp.o CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_split_approx_sgd_pt2_autograd.cpp.o CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_split_approx_rowwise_adagrad_pt2_autograd.cpp.o CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_split_approx_rowwise_adagrad_with_counter_pt2_autograd.cpp.o CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_split_approx_rowwise_adagrad_with_weight_decay_pt2_autograd.cpp.o CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_split_rowwise_adagrad_with_weight_decay_pt2_autograd.cpp.o CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_split_rowwise_weighted_adagrad_pt2_autograd.cpp.o CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_ssd_rowwise_adagrad_pt2_autograd.cpp.o CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_backward_split_adagrad_pt2_cpu_wrapper.cpp.o CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_backward_split_rowwise_adagrad_pt2_cpu_wrapper.cpp.o CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_backward_split_sgd_pt2_cpu_wrapper.cpp.o CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_backward_split_adam_pt2_cpu_wrapper.cpp.o CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_backward_split_lamb_pt2_cpu_wrapper.cpp.o CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_backward_split_lars_sgd_pt2_cpu_wrapper.cpp.o CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_backward_split_partial_rowwise_adam_pt2_cpu_wrapper.cpp.o CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_backward_split_partial_rowwise_lamb_pt2_cpu_wrapper.cpp.o CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_backward_split_none_pt2_cpu_wrapper.cpp.o CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_backward_split_rowwise_adagrad_with_counter_pt2_cpu_wrapper.cpp.o CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_backward_split_approx_sgd_pt2_cpu_wrapper.cpp.o CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_backward_split_approx_rowwise_adagrad_pt2_cpu_wrapper.cpp.o CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_backward_split_approx_rowwise_adagrad_with_counter_pt2_cpu_wrapper.cpp.o CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_backward_split_approx_rowwise_adagrad_with_weight_decay_pt2_cpu_wrapper.cpp.o CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_backward_split_rowwise_adagrad_with_weight_decay_pt2_cpu_wrapper.cpp.o CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_backward_split_rowwise_weighted_adagrad_pt2_cpu_wrapper.cpp.o CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_backward_split_adagrad_pt2_cuda_wrapper.cpp.o CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_backward_split_rowwise_adagrad_pt2_cuda_wrapper.cpp.o CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_backward_split_sgd_pt2_cuda_wrapper.cpp.o CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_backward_split_adam_pt2_cuda_wrapper.cpp.o CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_backward_split_lamb_pt2_cuda_wrapper.cpp.o CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_backward_split_lars_sgd_pt2_cuda_wrapper.cpp.o CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_backward_split_partial_rowwise_adam_pt2_cuda_wrapper.cpp.o CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_backward_split_partial_rowwise_lamb_pt2_cuda_wrapper.cpp.o CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_backward_split_none_pt2_cuda_wrapper.cpp.o CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_backward_split_rowwise_adagrad_with_counter_pt2_cuda_wrapper.cpp.o CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_backward_split_approx_sgd_pt2_cuda_wrapper.cpp.o CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_backward_split_approx_rowwise_adagrad_pt2_cuda_wrapper.cpp.o CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_backward_split_approx_rowwise_adagrad_with_counter_pt2_cuda_wrapper.cpp.o CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_backward_split_approx_rowwise_adagrad_with_weight_decay_pt2_cuda_wrapper.cpp.o CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_backward_split_rowwise_adagrad_with_weight_decay_pt2_cuda_wrapper.cpp.o CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_backward_split_rowwise_weighted_adagrad_pt2_cuda_wrapper.cpp.o CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_backward_ssd_rowwise_adagrad_pt2_cuda_wrapper.cpp.o -L/lib/intel64 -L/lib/intel64_win -L/lib/win-x64 -Wl,-rpath,"\$ORIGIN" /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/lib/libc10.so /github/home/miniconda/envs/build_binary/targets/x86_64-linux/lib/libnvrtc.so /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/lib/libc10_cuda.so /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/nvidia/nccl/lib/libnccl.so.2 /github/home/miniconda/envs/build_binary/targets/x86_64-linux/lib/stubs/libcuda.so fbgemm_gpu_tbe_cache.so fbgemm_gpu_tbe_common.so fbgemm_gpu_sparse_async_cumsum.so /github/home/miniconda/envs/build_binary/lib/stubs/libnvidia-ml.so -Wl,--no-as-needed,"/github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/lib/libtorch.so" -Wl,--as-needed fbgemm.so fbgemm_gpu_config.so fbgemm_gpu_tbe_utils.so /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/lib/libtorch.so -Wl,--no-as-needed,"/github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/lib/libtorch_cpu.so" -Wl,--as-needed -Wl,--no-as-needed,"/github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/lib/libtorch_cuda.so" -Wl,--as-needed /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/lib/libc10_cuda.so /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/lib/libc10.so /github/home/miniconda/envs/build_binary/targets/x86_64-linux/lib/libcudart.so -Wl,-rpath-link,/__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.12/cmake-build && : 2025-05-07T19:55:02.9872874Z [228/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_split_approx_rowwise_adagrad_with_counter_cpu.cpp.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_split_approx_rowwise_adagrad_with_counter_cpu.cpp.o.d -o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_split_approx_rowwise_adagrad_with_counter_cpu.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.12/cmake-build/gen_embedding_backward_split_approx_rowwise_adagrad_with_counter_cpu.cpp 2025-05-07T19:55:02.9891565Z clang-16: warning: argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T19:55:05.8489027Z [229/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_inference_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_inference.dir/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_nobag_fp16_codegen_cuda.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_inference.dir/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_nobag_fp16_codegen_cuda.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.12/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_nobag_fp16_codegen_cuda.cu -o CMakeFiles/fbgemm_gpu_tbe_inference.dir/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_nobag_fp16_codegen_cuda.cu.o 2025-05-07T19:55:05.8513542Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:55:05.8516098Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:55:05.8517275Z ^ 2025-05-07T19:55:05.8517514Z 2025-05-07T19:55:05.8517962Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:55:05.8518580Z 2025-05-07T19:55:05.8520134Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:55:05.8523172Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:55:05.8524370Z ^ 2025-05-07T19:55:05.8524736Z 2025-05-07T19:55:05.8526450Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.12/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_nobag_fp16_codegen_cuda.cu(216): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:55:05.8528477Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:55:05.8529063Z ^ 2025-05-07T19:55:05.8529469Z 2025-05-07T19:55:05.8531068Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.12/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_nobag_fp16_codegen_cuda.cu(233): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:55:05.8533009Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:55:05.8533695Z ^ 2025-05-07T19:55:05.8533980Z 2025-05-07T19:55:05.8535633Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.12/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_nobag_fp16_codegen_cuda.cu(243): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:55:05.8537712Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:55:05.8538236Z ^ 2025-05-07T19:55:05.8538541Z 2025-05-07T19:55:05.8540074Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:55:05.8542650Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:55:05.8543777Z ^ 2025-05-07T19:55:05.8544037Z 2025-05-07T19:55:05.8544461Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:55:05.8545096Z 2025-05-07T19:55:05.8546538Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:55:05.8548966Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:55:05.8550084Z ^ 2025-05-07T19:55:05.8550441Z 2025-05-07T19:55:05.8552196Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.12/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_nobag_fp16_codegen_cuda.cu(216): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:55:05.8554361Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:55:05.8554948Z ^ 2025-05-07T19:55:05.8555254Z 2025-05-07T19:55:05.8556778Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.12/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_nobag_fp16_codegen_cuda.cu(233): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:55:05.8558844Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:55:05.8559322Z ^ 2025-05-07T19:55:05.8559631Z 2025-05-07T19:55:05.8561158Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.12/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_nobag_fp16_codegen_cuda.cu(243): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:55:05.8563187Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:55:05.8564001Z ^ 2025-05-07T19:55:05.8564328Z 2025-05-07T19:55:05.8566271Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:55:05.8569045Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:55:05.8570274Z ^ 2025-05-07T19:55:05.8570536Z 2025-05-07T19:55:05.8571028Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:55:05.8571724Z 2025-05-07T19:55:05.8573417Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:55:05.8576501Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:55:05.8577887Z ^ 2025-05-07T19:55:05.8578260Z 2025-05-07T19:55:05.8580123Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.12/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_nobag_fp16_codegen_cuda.cu(216): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:55:05.8582093Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:55:05.8582701Z ^ 2025-05-07T19:55:05.8583008Z 2025-05-07T19:55:05.8584652Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.12/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_nobag_fp16_codegen_cuda.cu(233): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:55:05.8586721Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:55:05.8587285Z ^ 2025-05-07T19:55:05.8587619Z 2025-05-07T19:55:05.8589241Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.12/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_nobag_fp16_codegen_cuda.cu(243): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:55:05.8591262Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:55:05.8591973Z ^ 2025-05-07T19:55:05.8592297Z 2025-05-07T19:55:05.8593968Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:55:05.8597000Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:55:05.8598116Z ^ 2025-05-07T19:55:05.8598366Z 2025-05-07T19:55:05.8598831Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:55:05.8599459Z 2025-05-07T19:55:05.8601041Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:55:05.8603915Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:55:05.8605143Z ^ 2025-05-07T19:55:05.8605505Z 2025-05-07T19:55:05.8606763Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.12/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_nobag_fp16_codegen_cuda.cu(216): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:55:05.8608540Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:55:05.8609329Z ^ 2025-05-07T19:55:05.8609624Z 2025-05-07T19:55:05.8611150Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.12/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_nobag_fp16_codegen_cuda.cu(233): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:55:05.8613209Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:55:05.8613772Z ^ 2025-05-07T19:55:05.8614124Z 2025-05-07T19:55:05.8615802Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.12/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_nobag_fp16_codegen_cuda.cu(243): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:55:05.8617936Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:55:05.8618444Z ^ 2025-05-07T19:55:05.8618901Z 2025-05-07T19:55:05.8620542Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:55:05.8623530Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:55:05.8624677Z ^ 2025-05-07T19:55:05.8624926Z 2025-05-07T19:55:05.8625382Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:55:05.8626034Z 2025-05-07T19:55:05.8627690Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:55:05.8630323Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:55:05.8631600Z ^ 2025-05-07T19:55:05.8631978Z 2025-05-07T19:55:05.8633549Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.12/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_nobag_fp16_codegen_cuda.cu(216): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:55:05.8635553Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:55:05.8636086Z ^ 2025-05-07T19:55:05.8636398Z 2025-05-07T19:55:05.8637811Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.12/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_nobag_fp16_codegen_cuda.cu(233): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:55:05.8639630Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:55:05.8640169Z ^ 2025-05-07T19:55:05.8640531Z 2025-05-07T19:55:05.8642038Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.12/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_nobag_fp16_codegen_cuda.cu(243): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:55:05.8644207Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:55:05.8644845Z ^ 2025-05-07T19:55:05.8645156Z 2025-05-07T19:55:06.3413370Z [230/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_split_approx_rowwise_adagrad_with_weight_decay_cpu.cpp.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_split_approx_rowwise_adagrad_with_weight_decay_cpu.cpp.o.d -o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_split_approx_rowwise_adagrad_with_weight_decay_cpu.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.12/cmake-build/gen_embedding_backward_split_approx_rowwise_adagrad_with_weight_decay_cpu.cpp 2025-05-07T19:55:06.3435101Z clang-16: warning: argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T19:55:06.9681254Z [231/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_split_rowwise_adagrad_with_weight_decay_cpu.cpp.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_split_rowwise_adagrad_with_weight_decay_cpu.cpp.o.d -o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_split_rowwise_adagrad_with_weight_decay_cpu.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.12/cmake-build/gen_embedding_backward_split_rowwise_adagrad_with_weight_decay_cpu.cpp 2025-05-07T19:55:06.9702526Z clang-16: warning: argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T19:55:09.6740243Z [232/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_sgd_split_cpu.cpp.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_sgd_split_cpu.cpp.o.d -o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_sgd_split_cpu.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.12/cmake-build/gen_embedding_backward_sgd_split_cpu.cpp 2025-05-07T19:55:09.6758613Z clang-16: warning: argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T19:55:09.7602942Z [233/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_rowwise_adagrad_split_cpu.cpp.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_rowwise_adagrad_split_cpu.cpp.o.d -o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_rowwise_adagrad_split_cpu.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.12/cmake-build/gen_embedding_backward_rowwise_adagrad_split_cpu.cpp 2025-05-07T19:55:09.7622042Z clang-16: warning: argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T19:55:09.7768914Z [234/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_adagrad_split_cpu.cpp.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_adagrad_split_cpu.cpp.o.d -o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_adagrad_split_cpu.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.12/cmake-build/gen_embedding_backward_adagrad_split_cpu.cpp 2025-05-07T19:55:09.7788043Z clang-16: warning: argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T19:55:10.4631733Z [235/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_split_rowwise_weighted_adagrad_cpu.cpp.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_split_rowwise_weighted_adagrad_cpu.cpp.o.d -o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_split_rowwise_weighted_adagrad_cpu.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.12/cmake-build/gen_embedding_backward_split_rowwise_weighted_adagrad_cpu.cpp 2025-05-07T19:55:10.4650973Z clang-16: warning: argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T19:55:10.6079092Z [236/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_inference_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_inference.dir/gen_embedding_forward_quantized_split_nbit_kernel_weighted_fp16_codegen_cuda.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_inference.dir/gen_embedding_forward_quantized_split_nbit_kernel_weighted_fp16_codegen_cuda.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.12/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_weighted_fp16_codegen_cuda.cu -o CMakeFiles/fbgemm_gpu_tbe_inference.dir/gen_embedding_forward_quantized_split_nbit_kernel_weighted_fp16_codegen_cuda.cu.o 2025-05-07T19:55:10.6101982Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:55:10.6104643Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:55:10.6105823Z ^ 2025-05-07T19:55:10.6106098Z 2025-05-07T19:55:10.6106501Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:55:10.6107124Z 2025-05-07T19:55:10.6108633Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:55:10.6111744Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:55:10.6112962Z ^ 2025-05-07T19:55:10.6113308Z 2025-05-07T19:55:10.6115190Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.12/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_weighted_fp16_codegen_cuda.cu(266): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:55:10.6117002Z if (output_d >= 0 && output_d < D && packed_bag_store_idx < num_packed_bags) { 2025-05-07T19:55:10.6117595Z ^ 2025-05-07T19:55:10.6117857Z 2025-05-07T19:55:10.6119205Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.12/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_weighted_fp16_codegen_cuda.cu(271): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:55:10.6120879Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:55:10.6121414Z ^ 2025-05-07T19:55:10.6121711Z 2025-05-07T19:55:10.6123089Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.12/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_weighted_fp16_codegen_cuda.cu(288): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:55:10.6124853Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:55:10.6125383Z ^ 2025-05-07T19:55:10.6125666Z 2025-05-07T19:55:10.6127075Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.12/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_weighted_fp16_codegen_cuda.cu(300): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:55:10.6128862Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:55:10.6129376Z ^ 2025-05-07T19:55:10.6129635Z 2025-05-07T19:55:10.6131192Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:55:10.6133643Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:55:10.6134775Z ^ 2025-05-07T19:55:10.6135018Z 2025-05-07T19:55:10.6135473Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:55:10.6136078Z 2025-05-07T19:55:10.6137596Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:55:10.6140036Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:55:10.6141126Z ^ 2025-05-07T19:55:10.6141496Z 2025-05-07T19:55:10.6142909Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.12/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_weighted_fp16_codegen_cuda.cu(266): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:55:10.6145047Z if (output_d >= 0 && output_d < D && packed_bag_store_idx < num_packed_bags) { 2025-05-07T19:55:10.6145778Z ^ 2025-05-07T19:55:10.6146082Z 2025-05-07T19:55:10.6147460Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.12/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_weighted_fp16_codegen_cuda.cu(271): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:55:10.6149334Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:55:10.6149846Z ^ 2025-05-07T19:55:10.6150107Z 2025-05-07T19:55:10.6151683Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.12/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_weighted_fp16_codegen_cuda.cu(288): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:55:10.6153669Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:55:10.6154192Z ^ 2025-05-07T19:55:10.6154450Z 2025-05-07T19:55:10.6156071Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.12/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_weighted_fp16_codegen_cuda.cu(300): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:55:10.6157954Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:55:10.6158501Z ^ 2025-05-07T19:55:10.6158725Z 2025-05-07T19:55:10.6160219Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:55:10.6163063Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:55:10.6164235Z ^ 2025-05-07T19:55:10.6164482Z 2025-05-07T19:55:10.6165168Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:55:10.6165868Z 2025-05-07T19:55:10.6167671Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:55:10.6169808Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:55:10.6170857Z ^ 2025-05-07T19:55:10.6171196Z 2025-05-07T19:55:10.6172483Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.12/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_weighted_fp16_codegen_cuda.cu(266): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:55:10.6174417Z if (output_d >= 0 && output_d < D && packed_bag_store_idx < num_packed_bags) { 2025-05-07T19:55:10.6175101Z ^ 2025-05-07T19:55:10.6175361Z 2025-05-07T19:55:10.6176783Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.12/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_weighted_fp16_codegen_cuda.cu(271): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:55:10.6178583Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:55:10.6179125Z ^ 2025-05-07T19:55:10.6179395Z 2025-05-07T19:55:10.6180784Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.12/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_weighted_fp16_codegen_cuda.cu(288): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:55:10.6182631Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:55:10.6183177Z ^ 2025-05-07T19:55:10.6183423Z 2025-05-07T19:55:10.6185133Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.12/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_weighted_fp16_codegen_cuda.cu(300): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:55:10.6187193Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:55:10.6187740Z ^ 2025-05-07T19:55:10.6187995Z 2025-05-07T19:55:10.6189519Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:55:10.6192161Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:55:10.6193248Z ^ 2025-05-07T19:55:10.6195365Z 2025-05-07T19:55:10.6195788Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:55:10.6196423Z 2025-05-07T19:55:10.6198083Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:55:10.6200836Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:55:10.6202010Z ^ 2025-05-07T19:55:10.6202339Z 2025-05-07T19:55:10.6203784Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.12/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_weighted_fp16_codegen_cuda.cu(266): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:55:10.6205810Z if (output_d >= 0 && output_d < D && packed_bag_store_idx < num_packed_bags) { 2025-05-07T19:55:10.6206535Z ^ 2025-05-07T19:55:10.6206805Z 2025-05-07T19:55:10.6208257Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.12/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_weighted_fp16_codegen_cuda.cu(271): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:55:10.6210283Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:55:10.6210789Z ^ 2025-05-07T19:55:10.6211045Z 2025-05-07T19:55:10.6212432Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.12/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_weighted_fp16_codegen_cuda.cu(288): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:55:10.6214213Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:55:10.6214698Z ^ 2025-05-07T19:55:10.6214934Z 2025-05-07T19:55:10.6216279Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.12/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_weighted_fp16_codegen_cuda.cu(300): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:55:10.6218031Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:55:10.6218557Z ^ 2025-05-07T19:55:10.6218829Z 2025-05-07T19:55:10.6220350Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:55:10.6222854Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:55:10.6223927Z ^ 2025-05-07T19:55:10.6224207Z 2025-05-07T19:55:10.6224621Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:55:10.6225227Z 2025-05-07T19:55:10.6226806Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:55:10.6229465Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:55:10.6230640Z ^ 2025-05-07T19:55:10.6230993Z 2025-05-07T19:55:10.6232588Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.12/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_weighted_fp16_codegen_cuda.cu(266): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:55:10.6234602Z if (output_d >= 0 && output_d < D && packed_bag_store_idx < num_packed_bags) { 2025-05-07T19:55:10.6235274Z ^ 2025-05-07T19:55:10.6235534Z 2025-05-07T19:55:10.6236931Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.12/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_weighted_fp16_codegen_cuda.cu(271): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:55:10.6238854Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:55:10.6239313Z ^ 2025-05-07T19:55:10.6239575Z 2025-05-07T19:55:10.6241166Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.12/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_weighted_fp16_codegen_cuda.cu(288): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:55:10.6242921Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:55:10.6243403Z ^ 2025-05-07T19:55:10.6243669Z 2025-05-07T19:55:10.6245090Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.12/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_weighted_fp16_codegen_cuda.cu(300): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:55:10.6246901Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:55:10.6247399Z ^ 2025-05-07T19:55:10.6247647Z 2025-05-07T19:55:17.4718062Z [237/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_forward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_dense_weighted_vbe_kernel.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_dense_weighted_vbe_kernel.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.12/cmake-build/gen_embedding_forward_dense_weighted_vbe_kernel.cu -o CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_dense_weighted_vbe_kernel.cu.o 2025-05-07T19:55:17.4741478Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:55:17.4744108Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:55:17.4745837Z ^ 2025-05-07T19:55:17.4746115Z 2025-05-07T19:55:17.4746578Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:55:17.4747263Z 2025-05-07T19:55:17.4749224Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:55:17.4752143Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:55:17.4753404Z ^ 2025-05-07T19:55:17.4753783Z 2025-05-07T19:55:17.4755484Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:55:17.4758138Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:55:17.4759297Z ^ 2025-05-07T19:55:17.4759552Z 2025-05-07T19:55:17.4760041Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:55:17.4760614Z 2025-05-07T19:55:17.4762237Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:55:17.4765272Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:55:17.4766466Z ^ 2025-05-07T19:55:17.4766839Z 2025-05-07T19:55:17.4768460Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:55:17.4771214Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:55:17.4772574Z ^ 2025-05-07T19:55:17.4772883Z 2025-05-07T19:55:17.4773353Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:55:17.4774040Z 2025-05-07T19:55:17.4775836Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:55:17.4778636Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:55:17.4779893Z ^ 2025-05-07T19:55:17.4780248Z 2025-05-07T19:55:17.4781846Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:55:17.4784760Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:55:17.4785938Z ^ 2025-05-07T19:55:17.4786190Z 2025-05-07T19:55:17.4786659Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:55:17.4787347Z 2025-05-07T19:55:17.4789017Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:55:17.4791940Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:55:17.4793147Z ^ 2025-05-07T19:55:17.4793529Z 2025-05-07T19:55:17.4795401Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:55:17.4798133Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:55:17.4799325Z ^ 2025-05-07T19:55:17.4799618Z 2025-05-07T19:55:17.4800047Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:55:17.4800731Z 2025-05-07T19:55:17.4802400Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:55:17.4804771Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:55:17.4805865Z ^ 2025-05-07T19:55:17.4806208Z 2025-05-07T19:55:18.4830668Z [238/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_inference_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_inference.dir/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_fp16_codegen_cuda.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_inference.dir/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_fp16_codegen_cuda.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.12/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_fp16_codegen_cuda.cu -o CMakeFiles/fbgemm_gpu_tbe_inference.dir/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_fp16_codegen_cuda.cu.o 2025-05-07T19:55:18.4854039Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:55:18.4856926Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:55:18.4858516Z ^ 2025-05-07T19:55:18.4858786Z 2025-05-07T19:55:18.4859278Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:55:18.4859973Z 2025-05-07T19:55:18.4861703Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:55:18.4864499Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:55:18.4866065Z ^ 2025-05-07T19:55:18.4866462Z 2025-05-07T19:55:18.4867894Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.12/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_fp16_codegen_cuda.cu(250): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:55:18.4870170Z if (output_d >= 0 && output_d < D && packed_bag_store_idx < num_packed_bags) { 2025-05-07T19:55:18.4870950Z ^ 2025-05-07T19:55:18.4871225Z 2025-05-07T19:55:18.4872790Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.12/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_fp16_codegen_cuda.cu(255): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:55:18.4874687Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:55:18.4875222Z ^ 2025-05-07T19:55:18.4875494Z 2025-05-07T19:55:18.4876860Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.12/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_fp16_codegen_cuda.cu(272): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:55:18.4878603Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:55:18.4879130Z ^ 2025-05-07T19:55:18.4879426Z 2025-05-07T19:55:18.4880877Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.12/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_fp16_codegen_cuda.cu(284): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:55:18.4882770Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:55:18.4883282Z ^ 2025-05-07T19:55:18.4883572Z 2025-05-07T19:55:18.4885100Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:55:18.4887715Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:55:18.4889083Z ^ 2025-05-07T19:55:18.4889341Z 2025-05-07T19:55:18.4889815Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:55:18.4890460Z 2025-05-07T19:55:18.4892032Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:55:18.4895044Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:55:18.4896279Z ^ 2025-05-07T19:55:18.4896646Z 2025-05-07T19:55:18.4898405Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.12/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_fp16_codegen_cuda.cu(250): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:55:18.4900523Z if (output_d >= 0 && output_d < D && packed_bag_store_idx < num_packed_bags) { 2025-05-07T19:55:18.4901262Z ^ 2025-05-07T19:55:18.4901541Z 2025-05-07T19:55:18.4903326Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.12/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_fp16_codegen_cuda.cu(255): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:55:18.4905400Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:55:18.4905962Z ^ 2025-05-07T19:55:18.4906287Z 2025-05-07T19:55:18.4907970Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.12/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_fp16_codegen_cuda.cu(272): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:55:18.4909843Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:55:18.4910368Z ^ 2025-05-07T19:55:18.4910652Z 2025-05-07T19:55:18.4912241Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.12/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_fp16_codegen_cuda.cu(284): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:55:18.4914116Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:55:18.4914659Z ^ 2025-05-07T19:55:18.4914902Z 2025-05-07T19:55:18.4916380Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:55:18.4918759Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:55:18.4919897Z ^ 2025-05-07T19:55:18.4920132Z 2025-05-07T19:55:18.4920561Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:55:18.4921179Z 2025-05-07T19:55:18.4922806Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:55:18.4925021Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:55:18.4926073Z ^ 2025-05-07T19:55:18.4926439Z 2025-05-07T19:55:18.4927953Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.12/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_fp16_codegen_cuda.cu(250): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:55:18.4929950Z if (output_d >= 0 && output_d < D && packed_bag_store_idx < num_packed_bags) { 2025-05-07T19:55:18.4930782Z ^ 2025-05-07T19:55:18.4931067Z 2025-05-07T19:55:18.4932484Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.12/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_fp16_codegen_cuda.cu(255): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:55:18.4934216Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:55:18.4934741Z ^ 2025-05-07T19:55:18.4934987Z 2025-05-07T19:55:18.4936269Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.12/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_fp16_codegen_cuda.cu(272): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:55:18.4937997Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:55:18.4938637Z ^ 2025-05-07T19:55:18.4938902Z 2025-05-07T19:55:18.4940315Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.12/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_fp16_codegen_cuda.cu(284): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:55:18.4942056Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:55:18.4942578Z ^ 2025-05-07T19:55:18.4943034Z 2025-05-07T19:55:18.4944489Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:55:18.4947075Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:55:18.4948415Z ^ 2025-05-07T19:55:18.4948653Z 2025-05-07T19:55:18.4949057Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:55:18.4949720Z 2025-05-07T19:55:18.4951298Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:55:18.4954017Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:55:18.4955138Z ^ 2025-05-07T19:55:18.4955496Z 2025-05-07T19:55:18.4956894Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.12/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_fp16_codegen_cuda.cu(250): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:55:18.4958905Z if (output_d >= 0 && output_d < D && packed_bag_store_idx < num_packed_bags) { 2025-05-07T19:55:18.4959619Z ^ 2025-05-07T19:55:18.4959891Z 2025-05-07T19:55:18.4961366Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.12/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_fp16_codegen_cuda.cu(255): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:55:18.4963224Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:55:18.4963822Z ^ 2025-05-07T19:55:18.4964122Z 2025-05-07T19:55:18.4965927Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.12/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_fp16_codegen_cuda.cu(272): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:55:18.4968019Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:55:18.4968544Z ^ 2025-05-07T19:55:18.4968791Z 2025-05-07T19:55:18.4970184Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.12/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_fp16_codegen_cuda.cu(284): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:55:18.4971998Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:55:18.4972788Z ^ 2025-05-07T19:55:18.4973056Z 2025-05-07T19:55:18.4974496Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:55:18.4976937Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:55:18.4978066Z ^ 2025-05-07T19:55:18.4978311Z 2025-05-07T19:55:18.4978729Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:55:18.4979342Z 2025-05-07T19:55:18.4980946Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:55:18.4983660Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:55:18.4984754Z ^ 2025-05-07T19:55:18.4985095Z 2025-05-07T19:55:18.4986834Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.12/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_fp16_codegen_cuda.cu(250): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:55:18.4988798Z if (output_d >= 0 && output_d < D && packed_bag_store_idx < num_packed_bags) { 2025-05-07T19:55:18.4989525Z ^ 2025-05-07T19:55:18.4989809Z 2025-05-07T19:55:18.4991218Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.12/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_fp16_codegen_cuda.cu(255): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:55:18.4993210Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:55:18.4993777Z ^ 2025-05-07T19:55:18.4994031Z 2025-05-07T19:55:18.4995436Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.12/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_fp16_codegen_cuda.cu(272): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:55:18.4997273Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:55:18.4997805Z ^ 2025-05-07T19:55:18.4998074Z 2025-05-07T19:55:18.4999499Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.12/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_fp16_codegen_cuda.cu(284): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:55:18.5001284Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:55:18.5001804Z ^ 2025-05-07T19:55:18.5002091Z 2025-05-07T19:55:28.8802862Z [239/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_inference_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_inference.dir/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_int2_codegen_cuda.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_inference.dir/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_int2_codegen_cuda.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.12/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_int2_codegen_cuda.cu -o CMakeFiles/fbgemm_gpu_tbe_inference.dir/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_int2_codegen_cuda.cu.o 2025-05-07T19:55:28.8827790Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:55:28.8830828Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:55:28.8850202Z ^ 2025-05-07T19:55:28.8850715Z 2025-05-07T19:55:28.8851200Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:55:28.8851900Z 2025-05-07T19:55:28.8853688Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:55:28.8856495Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:55:28.8857717Z ^ 2025-05-07T19:55:28.8858115Z 2025-05-07T19:55:28.8859705Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.12/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_int2_codegen_cuda.cu(254): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:55:28.8861921Z if (output_d >= 0 && output_d < D && packed_bag_store_idx < num_packed_bags) { 2025-05-07T19:55:28.8862716Z ^ 2025-05-07T19:55:28.8863010Z 2025-05-07T19:55:28.8864967Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.12/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_int2_codegen_cuda.cu(259): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:55:28.8867031Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:55:28.8867627Z ^ 2025-05-07T19:55:28.8867912Z 2025-05-07T19:55:28.8869495Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.12/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_int2_codegen_cuda.cu(276): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:55:28.8871613Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:55:28.8872212Z ^ 2025-05-07T19:55:28.8872492Z 2025-05-07T19:55:28.8874059Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.12/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_int2_codegen_cuda.cu(288): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:55:28.8876442Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:55:28.8877063Z ^ 2025-05-07T19:55:28.8877357Z 2025-05-07T19:55:28.8879083Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:55:28.8881784Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:55:28.8882990Z ^ 2025-05-07T19:55:28.8883465Z 2025-05-07T19:55:28.8883925Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:55:28.8884604Z 2025-05-07T19:55:28.8886310Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:55:28.8888701Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:55:28.8889596Z ^ 2025-05-07T19:55:28.8889866Z 2025-05-07T19:55:28.8890985Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.12/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_int2_codegen_cuda.cu(254): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:55:28.8892721Z if (output_d >= 0 && output_d < D && packed_bag_store_idx < num_packed_bags) { 2025-05-07T19:55:28.8893473Z ^ 2025-05-07T19:55:28.8893760Z 2025-05-07T19:55:28.8895394Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.12/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_int2_codegen_cuda.cu(259): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:55:28.8897355Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:55:28.8897972Z ^ 2025-05-07T19:55:28.8898270Z 2025-05-07T19:55:28.8899691Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.12/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_int2_codegen_cuda.cu(276): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:55:28.8901351Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:55:28.8901851Z ^ 2025-05-07T19:55:28.8902091Z 2025-05-07T19:55:28.8903405Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.12/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_int2_codegen_cuda.cu(288): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:55:28.8905535Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:55:28.8906171Z ^ 2025-05-07T19:55:28.8906498Z 2025-05-07T19:55:28.8908230Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:55:28.8910778Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:55:28.8912067Z ^ 2025-05-07T19:55:28.8912365Z 2025-05-07T19:55:28.8912828Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:55:28.8913416Z 2025-05-07T19:55:28.8915088Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:55:28.8917909Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:55:28.8919193Z ^ 2025-05-07T19:55:28.8919577Z 2025-05-07T19:55:28.8921213Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.12/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_int2_codegen_cuda.cu(254): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:55:28.8923418Z if (output_d >= 0 && output_d < D && packed_bag_store_idx < num_packed_bags) { 2025-05-07T19:55:28.8924231Z ^ 2025-05-07T19:55:28.8924527Z 2025-05-07T19:55:28.8926122Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.12/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_int2_codegen_cuda.cu(259): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:55:28.8928375Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:55:28.8928984Z ^ 2025-05-07T19:55:28.8929275Z 2025-05-07T19:55:28.8931046Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.12/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_int2_codegen_cuda.cu(276): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:55:28.8933168Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:55:28.8933762Z ^ 2025-05-07T19:55:28.8934088Z 2025-05-07T19:55:28.8935781Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.12/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_int2_codegen_cuda.cu(288): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:55:28.8937934Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:55:28.8938517Z ^ 2025-05-07T19:55:28.8938813Z 2025-05-07T19:55:28.8940628Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:55:28.8943512Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:55:28.8944789Z ^ 2025-05-07T19:55:28.8945067Z 2025-05-07T19:55:28.8945582Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:55:28.8946299Z 2025-05-07T19:55:28.8948097Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:55:28.8950796Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:55:28.8952165Z ^ 2025-05-07T19:55:28.8952531Z 2025-05-07T19:55:28.8954126Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.12/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_int2_codegen_cuda.cu(254): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:55:28.8956392Z if (output_d >= 0 && output_d < D && packed_bag_store_idx < num_packed_bags) { 2025-05-07T19:55:28.8957148Z ^ 2025-05-07T19:55:28.8957467Z 2025-05-07T19:55:28.8959108Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.12/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_int2_codegen_cuda.cu(259): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:55:28.8961204Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:55:28.8961786Z ^ 2025-05-07T19:55:28.8962112Z 2025-05-07T19:55:28.8963881Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.12/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_int2_codegen_cuda.cu(276): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:55:28.8966248Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:55:28.8966819Z ^ 2025-05-07T19:55:28.8967108Z 2025-05-07T19:55:28.8968755Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.12/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_int2_codegen_cuda.cu(288): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:55:28.8970804Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:55:28.8971402Z ^ 2025-05-07T19:55:28.8971694Z 2025-05-07T19:55:28.8973471Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:55:28.8976525Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:55:28.8977753Z ^ 2025-05-07T19:55:28.8978018Z 2025-05-07T19:55:28.8980915Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:55:28.8981726Z 2025-05-07T19:55:28.8983415Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:55:28.8986236Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:55:28.8987487Z ^ 2025-05-07T19:55:28.8987891Z 2025-05-07T19:55:28.8989486Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.12/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_int2_codegen_cuda.cu(254): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:55:28.8991953Z if (output_d >= 0 && output_d < D && packed_bag_store_idx < num_packed_bags) { 2025-05-07T19:55:28.8992767Z ^ 2025-05-07T19:55:28.8993103Z 2025-05-07T19:55:28.8994785Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.12/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_int2_codegen_cuda.cu(259): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:55:28.8996924Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:55:28.8997520Z ^ 2025-05-07T19:55:28.8997823Z 2025-05-07T19:55:28.8999522Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.12/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_int2_codegen_cuda.cu(276): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:55:28.9001636Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:55:28.9002238Z ^ 2025-05-07T19:55:28.9002536Z 2025-05-07T19:55:28.9004254Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.12/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_int2_codegen_cuda.cu(288): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:55:28.9006358Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:55:28.9006970Z ^ 2025-05-07T19:55:28.9007261Z 2025-05-07T19:55:30.0578596Z [240/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_inference_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_inference.dir/gen_embedding_forward_quantized_split_nbit_kernel_weighted_int2_codegen_cuda.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_inference.dir/gen_embedding_forward_quantized_split_nbit_kernel_weighted_int2_codegen_cuda.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.12/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_weighted_int2_codegen_cuda.cu -o CMakeFiles/fbgemm_gpu_tbe_inference.dir/gen_embedding_forward_quantized_split_nbit_kernel_weighted_int2_codegen_cuda.cu.o 2025-05-07T19:55:30.0591748Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:55:30.0593262Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:55:30.0593934Z ^ 2025-05-07T19:55:30.0594192Z 2025-05-07T19:55:30.0594457Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:55:30.0594849Z 2025-05-07T19:55:30.0595799Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:55:30.0597267Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:55:30.0597970Z ^ 2025-05-07T19:55:30.0598186Z 2025-05-07T19:55:30.0599059Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.12/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_weighted_int2_codegen_cuda.cu(270): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:55:30.0600233Z if (output_d >= 0 && output_d < D && packed_bag_store_idx < num_packed_bags) { 2025-05-07T19:55:30.0600692Z ^ 2025-05-07T19:55:30.0600863Z 2025-05-07T19:55:30.0601716Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.12/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_weighted_int2_codegen_cuda.cu(275): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:55:30.0602917Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:55:30.0603277Z ^ 2025-05-07T19:55:30.0603448Z 2025-05-07T19:55:30.0604278Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.12/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_weighted_int2_codegen_cuda.cu(292): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:55:30.0605367Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:55:30.0605720Z ^ 2025-05-07T19:55:30.0605887Z 2025-05-07T19:55:30.0606714Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.12/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_weighted_int2_codegen_cuda.cu(304): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:55:30.0607855Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:55:30.0608184Z ^ 2025-05-07T19:55:30.0608389Z 2025-05-07T19:55:30.0609296Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:55:30.0610865Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:55:30.0611531Z ^ 2025-05-07T19:55:30.0611712Z 2025-05-07T19:55:30.0611971Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:55:30.0612347Z 2025-05-07T19:55:30.0613252Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:55:30.0614747Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:55:30.0615443Z ^ 2025-05-07T19:55:30.0615653Z 2025-05-07T19:55:30.0616488Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.12/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_weighted_int2_codegen_cuda.cu(270): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:55:30.0617666Z if (output_d >= 0 && output_d < D && packed_bag_store_idx < num_packed_bags) { 2025-05-07T19:55:30.0618126Z ^ 2025-05-07T19:55:30.0618300Z 2025-05-07T19:55:30.0619133Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.12/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_weighted_int2_codegen_cuda.cu(275): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:55:30.0620212Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:55:30.0620550Z ^ 2025-05-07T19:55:30.0620747Z 2025-05-07T19:55:30.0621577Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.12/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_weighted_int2_codegen_cuda.cu(292): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:55:30.0622661Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:55:30.0622993Z ^ 2025-05-07T19:55:30.0623186Z 2025-05-07T19:55:30.0624014Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.12/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_weighted_int2_codegen_cuda.cu(304): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:55:30.0625088Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:55:30.0625417Z ^ 2025-05-07T19:55:30.0625584Z 2025-05-07T19:55:30.0626503Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:55:30.0628020Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:55:30.0628705Z ^ 2025-05-07T19:55:30.0628864Z 2025-05-07T19:55:30.0629147Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:55:30.0629527Z 2025-05-07T19:55:30.0630432Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:55:30.0632052Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:55:30.0632806Z ^ 2025-05-07T19:55:30.0633023Z 2025-05-07T19:55:30.0633860Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.12/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_weighted_int2_codegen_cuda.cu(270): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:55:30.0635162Z if (output_d >= 0 && output_d < D && packed_bag_store_idx < num_packed_bags) { 2025-05-07T19:55:30.0635597Z ^ 2025-05-07T19:55:30.0635796Z 2025-05-07T19:55:30.0636630Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.12/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_weighted_int2_codegen_cuda.cu(275): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:55:30.0637712Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:55:30.0638047Z ^ 2025-05-07T19:55:30.0638243Z 2025-05-07T19:55:30.0639066Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.12/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_weighted_int2_codegen_cuda.cu(292): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:55:30.0640133Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:55:30.0640480Z ^ 2025-05-07T19:55:30.0640649Z 2025-05-07T19:55:30.0641509Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.12/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_weighted_int2_codegen_cuda.cu(304): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:55:30.0642577Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:55:30.0642929Z ^ 2025-05-07T19:55:30.0643098Z 2025-05-07T19:55:30.0643995Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:55:30.0645471Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:55:30.0646155Z ^ 2025-05-07T19:55:30.0646313Z 2025-05-07T19:55:30.0646577Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:55:30.0646978Z 2025-05-07T19:55:30.0647883Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:55:30.0649369Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:55:30.0650041Z ^ 2025-05-07T19:55:30.0650279Z 2025-05-07T19:55:30.0651117Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.12/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_weighted_int2_codegen_cuda.cu(270): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:55:30.0652354Z if (output_d >= 0 && output_d < D && packed_bag_store_idx < num_packed_bags) { 2025-05-07T19:55:30.0652789Z ^ 2025-05-07T19:55:30.0652960Z 2025-05-07T19:55:30.0653825Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.12/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_weighted_int2_codegen_cuda.cu(275): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:55:30.0654931Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:55:30.0655288Z ^ 2025-05-07T19:55:30.0655462Z 2025-05-07T19:55:30.0656293Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.12/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_weighted_int2_codegen_cuda.cu(292): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:55:30.0657418Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:55:30.0657751Z ^ 2025-05-07T19:55:30.0657945Z 2025-05-07T19:55:30.0658772Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.12/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_weighted_int2_codegen_cuda.cu(304): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:55:30.0661146Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:55:30.0661506Z ^ 2025-05-07T19:55:30.0661673Z 2025-05-07T19:55:30.0662609Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:55:30.0664072Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:55:30.0665083Z ^ 2025-05-07T19:55:30.0665251Z 2025-05-07T19:55:30.0665547Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:55:30.0665940Z 2025-05-07T19:55:30.0666856Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:55:30.0668346Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:55:30.0669051Z ^ 2025-05-07T19:55:30.0669267Z 2025-05-07T19:55:30.0670111Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.12/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_weighted_int2_codegen_cuda.cu(270): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:55:30.0671302Z if (output_d >= 0 && output_d < D && packed_bag_store_idx < num_packed_bags) { 2025-05-07T19:55:30.0671842Z ^ 2025-05-07T19:55:30.0672050Z 2025-05-07T19:55:30.0672884Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.12/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_weighted_int2_codegen_cuda.cu(275): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:55:30.0673978Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:55:30.0674318Z ^ 2025-05-07T19:55:30.0674518Z 2025-05-07T19:55:30.0675348Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.12/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_weighted_int2_codegen_cuda.cu(292): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:55:30.0676436Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:55:30.0676764Z ^ 2025-05-07T19:55:30.0676937Z 2025-05-07T19:55:30.0677793Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.12/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_weighted_int2_codegen_cuda.cu(304): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:55:30.0678998Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:55:30.0679351Z ^ 2025-05-07T19:55:30.0679513Z 2025-05-07T19:55:49.4260887Z [241/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_forward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_split_weighted_kernel.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_split_weighted_kernel.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.12/cmake-build/gen_embedding_forward_split_weighted_kernel.cu -o CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_split_weighted_kernel.cu.o 2025-05-07T19:55:49.4283419Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:55:49.4286218Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:55:49.4287365Z ^ 2025-05-07T19:55:49.4287621Z 2025-05-07T19:55:49.4288090Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:55:49.4288780Z 2025-05-07T19:55:49.4290513Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:55:49.4293216Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:55:49.4294412Z ^ 2025-05-07T19:55:49.4294795Z 2025-05-07T19:55:49.4296605Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:55:49.4299508Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:55:49.4300740Z ^ 2025-05-07T19:55:49.4301008Z 2025-05-07T19:55:49.4301490Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:55:49.4302168Z 2025-05-07T19:55:49.4303935Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:55:49.4306708Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:55:49.4308114Z ^ 2025-05-07T19:55:49.4308497Z 2025-05-07T19:55:49.4310176Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:55:49.4313219Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:55:49.4314421Z ^ 2025-05-07T19:55:49.4314675Z 2025-05-07T19:55:49.4315121Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:55:49.4315920Z 2025-05-07T19:55:49.4317599Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:55:49.4320188Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:55:49.4321415Z ^ 2025-05-07T19:55:49.4321789Z 2025-05-07T19:55:49.4323509Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:55:49.4326139Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:55:49.4327339Z ^ 2025-05-07T19:55:49.4327601Z 2025-05-07T19:55:49.4328088Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:55:49.4328772Z 2025-05-07T19:55:49.4330397Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:55:49.4333141Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:55:49.4334312Z ^ 2025-05-07T19:55:49.4334691Z 2025-05-07T19:55:49.4336287Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:55:49.4338973Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:55:49.4340089Z ^ 2025-05-07T19:55:49.4340367Z 2025-05-07T19:55:49.4340825Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:55:49.4341480Z 2025-05-07T19:55:49.4343186Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:55:49.4345981Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:55:49.4347151Z ^ 2025-05-07T19:55:49.4347508Z 2025-05-07T19:55:50.0562706Z [242/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_inference_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_inference.dir/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_nobag_fp8_codegen_cuda.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_inference.dir/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_nobag_fp8_codegen_cuda.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.12/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_nobag_fp8_codegen_cuda.cu -o CMakeFiles/fbgemm_gpu_tbe_inference.dir/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_nobag_fp8_codegen_cuda.cu.o 2025-05-07T19:55:50.0587699Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:55:50.0590434Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:55:50.0591720Z ^ 2025-05-07T19:55:50.0591983Z 2025-05-07T19:55:50.0592477Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:55:50.0593157Z 2025-05-07T19:55:50.0594800Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:55:50.0597843Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:55:50.0599035Z ^ 2025-05-07T19:55:50.0599399Z 2025-05-07T19:55:50.0600987Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.12/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_nobag_fp8_codegen_cuda.cu(219): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:55:50.0602936Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:55:50.0603520Z ^ 2025-05-07T19:55:50.0603814Z 2025-05-07T19:55:50.0605402Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.12/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_nobag_fp8_codegen_cuda.cu(236): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:55:50.0607624Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:55:50.0608175Z ^ 2025-05-07T19:55:50.0608504Z 2025-05-07T19:55:50.0610298Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.12/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_nobag_fp8_codegen_cuda.cu(246): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:55:50.0612339Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:55:50.0612887Z ^ 2025-05-07T19:55:50.0613218Z 2025-05-07T19:55:50.0614854Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:55:50.0617493Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:55:50.0618672Z ^ 2025-05-07T19:55:50.0618935Z 2025-05-07T19:55:50.0619408Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:55:50.0620065Z 2025-05-07T19:55:50.0621740Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:55:50.0624438Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:55:50.0625635Z ^ 2025-05-07T19:55:50.0626003Z 2025-05-07T19:55:50.0627608Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.12/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_nobag_fp8_codegen_cuda.cu(219): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:55:50.0629803Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:55:50.0630366Z ^ 2025-05-07T19:55:50.0630686Z 2025-05-07T19:55:50.0632430Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.12/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_nobag_fp8_codegen_cuda.cu(236): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:55:50.0634434Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:55:50.0634992Z ^ 2025-05-07T19:55:50.0635320Z 2025-05-07T19:55:50.0636875Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.12/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_nobag_fp8_codegen_cuda.cu(246): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:55:50.0638886Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:55:50.0639458Z ^ 2025-05-07T19:55:50.0639744Z 2025-05-07T19:55:50.0641412Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:55:50.0644242Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:55:50.0645431Z ^ 2025-05-07T19:55:50.0645681Z 2025-05-07T19:55:50.0646158Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:55:50.0646817Z 2025-05-07T19:55:50.0648466Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:55:50.0651141Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:55:50.0652467Z ^ 2025-05-07T19:55:50.0652842Z 2025-05-07T19:55:50.0654534Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.12/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_nobag_fp8_codegen_cuda.cu(219): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:55:50.0656546Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:55:50.0657108Z ^ 2025-05-07T19:55:50.0657438Z 2025-05-07T19:55:50.0659022Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.12/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_nobag_fp8_codegen_cuda.cu(236): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:55:50.0661021Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:55:50.0661578Z ^ 2025-05-07T19:55:50.0661911Z 2025-05-07T19:55:50.0663502Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.12/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_nobag_fp8_codegen_cuda.cu(246): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:55:50.0665751Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:55:50.0666281Z ^ 2025-05-07T19:55:50.0666597Z 2025-05-07T19:55:50.0668270Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:55:50.0670924Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:55:50.0672214Z ^ 2025-05-07T19:55:50.0672475Z 2025-05-07T19:55:50.0672947Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:55:50.0673633Z 2025-05-07T19:55:50.0675255Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:55:50.0677979Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:55:50.0679164Z ^ 2025-05-07T19:55:50.0679573Z 2025-05-07T19:55:50.0681161Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.12/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_nobag_fp8_codegen_cuda.cu(219): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:55:50.0683182Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:55:50.0683747Z ^ 2025-05-07T19:55:50.0684072Z 2025-05-07T19:55:50.0685608Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.12/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_nobag_fp8_codegen_cuda.cu(236): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:55:50.0687849Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:55:50.0688407Z ^ 2025-05-07T19:55:50.0688710Z 2025-05-07T19:55:50.0690330Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.12/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_nobag_fp8_codegen_cuda.cu(246): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:55:50.0692317Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:55:50.0692904Z ^ 2025-05-07T19:55:50.0693204Z 2025-05-07T19:55:50.0694880Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:55:50.0697683Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:55:50.0698875Z ^ 2025-05-07T19:55:50.0699133Z 2025-05-07T19:55:50.0699597Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:55:50.0700523Z 2025-05-07T19:55:50.0702202Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:55:50.0704905Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:55:50.0706107Z ^ 2025-05-07T19:55:50.0706500Z 2025-05-07T19:55:50.0708117Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.12/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_nobag_fp8_codegen_cuda.cu(219): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:55:50.0710138Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:55:50.0710694Z ^ 2025-05-07T19:55:50.0711024Z 2025-05-07T19:55:50.0712751Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.12/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_nobag_fp8_codegen_cuda.cu(236): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:55:50.0714740Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:55:50.0715290Z ^ 2025-05-07T19:55:50.0715590Z 2025-05-07T19:55:50.0717193Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.12/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_nobag_fp8_codegen_cuda.cu(246): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:55:50.0719179Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:55:50.0719768Z ^ 2025-05-07T19:55:50.0720064Z 2025-05-07T19:55:56.7496823Z [243/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_split_grad_embedding_ops.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_split_grad_embedding_ops.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.12/cmake-build/gen_embedding_backward_split_grad_embedding_ops.cu -o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_split_grad_embedding_ops.cu.o 2025-05-07T19:55:56.7519113Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:55:56.7521868Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:55:56.7523056Z ^ 2025-05-07T19:55:56.7523310Z 2025-05-07T19:55:56.7523767Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:55:56.7524418Z 2025-05-07T19:55:56.7526172Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:55:56.7528932Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:55:56.7530106Z ^ 2025-05-07T19:55:56.7530458Z 2025-05-07T19:55:56.7532030Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:55:56.7534679Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:55:56.7535779Z ^ 2025-05-07T19:55:56.7536032Z 2025-05-07T19:55:56.7536491Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:55:56.7537148Z 2025-05-07T19:55:56.7538827Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:55:56.7541553Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:55:56.7542838Z ^ 2025-05-07T19:55:56.7543191Z 2025-05-07T19:55:56.7544875Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:55:56.7547905Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:55:56.7549095Z ^ 2025-05-07T19:55:56.7549352Z 2025-05-07T19:55:56.7549804Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:55:56.7550488Z 2025-05-07T19:55:56.7552306Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:55:56.7555062Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:55:56.7556488Z ^ 2025-05-07T19:55:56.7556871Z 2025-05-07T19:55:56.7558699Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:55:56.7561540Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:55:56.7562723Z ^ 2025-05-07T19:55:56.7562990Z 2025-05-07T19:55:56.7563437Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:55:56.7564074Z 2025-05-07T19:55:56.7565889Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:55:56.7568366Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:55:56.7569542Z ^ 2025-05-07T19:55:56.7569876Z 2025-05-07T19:55:56.7571459Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:55:56.7574138Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:55:56.7575213Z ^ 2025-05-07T19:55:56.7575465Z 2025-05-07T19:55:56.7575850Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:55:56.7576373Z 2025-05-07T19:55:56.7577896Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:55:56.7580440Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:55:56.7581579Z ^ 2025-05-07T19:55:56.7581950Z 2025-05-07T19:56:00.6420401Z [244/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_forward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_split_unweighted_kernel.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_split_unweighted_kernel.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.12/cmake-build/gen_embedding_forward_split_unweighted_kernel.cu -o CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_split_unweighted_kernel.cu.o 2025-05-07T19:56:00.6442168Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:56:00.6444938Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:56:00.6446063Z ^ 2025-05-07T19:56:00.6446313Z 2025-05-07T19:56:00.6446744Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:56:00.6447371Z 2025-05-07T19:56:00.6448583Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:56:00.6451065Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:56:00.6452179Z ^ 2025-05-07T19:56:00.6452553Z 2025-05-07T19:56:00.6454061Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:56:00.6456580Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:56:00.6457641Z ^ 2025-05-07T19:56:00.6457905Z 2025-05-07T19:56:00.6458323Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:56:00.6458940Z 2025-05-07T19:56:00.6460454Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:56:00.6463044Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:56:00.6464369Z ^ 2025-05-07T19:56:00.6465017Z 2025-05-07T19:56:00.6466528Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:56:00.6468979Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:56:00.6470062Z ^ 2025-05-07T19:56:00.6470300Z 2025-05-07T19:56:00.6470713Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:56:00.6471353Z 2025-05-07T19:56:00.6472939Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:56:00.6475725Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:56:00.6477021Z ^ 2025-05-07T19:56:00.6477397Z 2025-05-07T19:56:00.6478903Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:56:00.6481341Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:56:00.6482448Z ^ 2025-05-07T19:56:00.6482691Z 2025-05-07T19:56:00.6483138Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:56:00.6483747Z 2025-05-07T19:56:00.6485321Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:56:00.6487877Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:56:00.6488971Z ^ 2025-05-07T19:56:00.6489299Z 2025-05-07T19:56:00.6490838Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:56:00.6493264Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:56:00.6494404Z ^ 2025-05-07T19:56:00.6494653Z 2025-05-07T19:56:00.6495079Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:56:00.6495670Z 2025-05-07T19:56:00.6497274Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:56:00.6499793Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:56:00.6500894Z ^ 2025-05-07T19:56:00.6501233Z 2025-05-07T19:56:03.7750996Z [245/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_forward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_dense_unweighted_vbe_kernel.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_dense_unweighted_vbe_kernel.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.12/cmake-build/gen_embedding_forward_dense_unweighted_vbe_kernel.cu -o CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_dense_unweighted_vbe_kernel.cu.o 2025-05-07T19:56:03.7774592Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:56:03.7777338Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:56:03.7778519Z ^ 2025-05-07T19:56:03.7778804Z 2025-05-07T19:56:03.7779252Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:56:03.7779918Z 2025-05-07T19:56:03.7781592Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:56:03.7784353Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:56:03.7785563Z ^ 2025-05-07T19:56:03.7785934Z 2025-05-07T19:56:03.7787644Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:56:03.7790211Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:56:03.7791572Z ^ 2025-05-07T19:56:03.7791823Z 2025-05-07T19:56:03.7792266Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:56:03.7792980Z 2025-05-07T19:56:03.7794642Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:56:03.7797869Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:56:03.7799082Z ^ 2025-05-07T19:56:03.7799488Z 2025-05-07T19:56:03.7801179Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:56:03.7803944Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:56:03.7805215Z ^ 2025-05-07T19:56:03.7805497Z 2025-05-07T19:56:03.7805941Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:56:03.7806561Z 2025-05-07T19:56:03.7808466Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:56:03.7810926Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:56:03.7812117Z ^ 2025-05-07T19:56:03.7812460Z 2025-05-07T19:56:03.7813887Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:56:03.7816570Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:56:03.7817800Z ^ 2025-05-07T19:56:03.7818057Z 2025-05-07T19:56:03.7818518Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:56:03.7819229Z 2025-05-07T19:56:03.7820961Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:56:03.7823628Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:56:03.7824813Z ^ 2025-05-07T19:56:03.7825180Z 2025-05-07T19:56:03.7826741Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:56:03.7829337Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:56:03.7830483Z ^ 2025-05-07T19:56:03.7830741Z 2025-05-07T19:56:03.7831200Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:56:03.7831959Z 2025-05-07T19:56:03.7833531Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:56:03.7836085Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:56:03.7837170Z ^ 2025-05-07T19:56:03.7837534Z 2025-05-07T19:56:08.3132731Z [246/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_forward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_split_unweighted_v2_kernel.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_split_unweighted_v2_kernel.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.12/cmake-build/gen_embedding_forward_split_unweighted_v2_kernel.cu -o CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_split_unweighted_v2_kernel.cu.o 2025-05-07T19:56:08.3155789Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:56:08.3158384Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:56:08.3159652Z ^ 2025-05-07T19:56:08.3159915Z 2025-05-07T19:56:08.3160374Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:56:08.3161061Z 2025-05-07T19:56:08.3162824Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:56:08.3165857Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:56:08.3167073Z ^ 2025-05-07T19:56:08.3167432Z 2025-05-07T19:56:08.3168726Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.12/cmake-build/gen_embedding_forward_split_unweighted_v2_kernel.cu(873): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:56:08.3170696Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:56:08.3171565Z ^ 2025-05-07T19:56:08.3175294Z detected during instantiation of "void split_embedding_codegen_forward_unweighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=float, cache_t=float, output_t=float, index_t=int32_t, USE_LXU_CACHE=false]" at line 946 2025-05-07T19:56:08.3178190Z 2025-05-07T19:56:08.3179484Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.12/cmake-build/gen_embedding_forward_split_unweighted_v2_kernel.cu(873): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:56:08.3181364Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:56:08.3182393Z ^ 2025-05-07T19:56:08.3185653Z detected during instantiation of "void split_embedding_codegen_forward_unweighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=float, cache_t=c10::Half, output_t=float, index_t=int32_t, USE_LXU_CACHE=false]" at line 996 2025-05-07T19:56:08.3188455Z 2025-05-07T19:56:08.3189707Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.12/cmake-build/gen_embedding_forward_split_unweighted_v2_kernel.cu(873): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:56:08.3191862Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:56:08.3192780Z ^ 2025-05-07T19:56:08.3195954Z detected during instantiation of "void split_embedding_codegen_forward_unweighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=c10::Half, cache_t=float, output_t=float, index_t=int32_t, USE_LXU_CACHE=false]" at line 1046 2025-05-07T19:56:08.3198982Z 2025-05-07T19:56:08.3200173Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.12/cmake-build/gen_embedding_forward_split_unweighted_v2_kernel.cu(873): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:56:08.3202111Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:56:08.3202971Z ^ 2025-05-07T19:56:08.3206461Z detected during instantiation of "void split_embedding_codegen_forward_unweighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=c10::Half, cache_t=c10::Half, output_t=float, index_t=int32_t, USE_LXU_CACHE=false]" at line 1096 2025-05-07T19:56:08.3209661Z 2025-05-07T19:56:08.3210968Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.12/cmake-build/gen_embedding_forward_split_unweighted_v2_kernel.cu(873): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:56:08.3212925Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:56:08.3213823Z ^ 2025-05-07T19:56:08.3217275Z detected during instantiation of "void split_embedding_codegen_forward_unweighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=float, cache_t=float, output_t=float, index_t=int64_t, USE_LXU_CACHE=false]" at line 1146 2025-05-07T19:56:08.3220638Z 2025-05-07T19:56:08.3221915Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.12/cmake-build/gen_embedding_forward_split_unweighted_v2_kernel.cu(873): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:56:08.3223858Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:56:08.3224752Z ^ 2025-05-07T19:56:08.3228385Z detected during instantiation of "void split_embedding_codegen_forward_unweighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=float, cache_t=c10::Half, output_t=float, index_t=int64_t, USE_LXU_CACHE=false]" at line 1196 2025-05-07T19:56:08.3231757Z 2025-05-07T19:56:08.3233084Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.12/cmake-build/gen_embedding_forward_split_unweighted_v2_kernel.cu(873): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:56:08.3234982Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:56:08.3235913Z ^ 2025-05-07T19:56:08.3239435Z detected during instantiation of "void split_embedding_codegen_forward_unweighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=c10::Half, cache_t=float, output_t=float, index_t=int64_t, USE_LXU_CACHE=false]" at line 1246 2025-05-07T19:56:08.3242539Z 2025-05-07T19:56:08.3243922Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.12/cmake-build/gen_embedding_forward_split_unweighted_v2_kernel.cu(873): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:56:08.3245967Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:56:08.3246856Z ^ 2025-05-07T19:56:08.3250350Z detected during instantiation of "void split_embedding_codegen_forward_unweighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=c10::Half, cache_t=c10::Half, output_t=float, index_t=int64_t, USE_LXU_CACHE=false]" at line 1296 2025-05-07T19:56:08.3253416Z 2025-05-07T19:56:08.3254780Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.12/cmake-build/gen_embedding_forward_split_unweighted_v2_kernel.cu(873): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:56:08.3256787Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:56:08.3257645Z ^ 2025-05-07T19:56:08.3261219Z detected during instantiation of "void split_embedding_codegen_forward_unweighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=float, cache_t=float, output_t=c10::Half, index_t=int32_t, USE_LXU_CACHE=false]" at line 1346 2025-05-07T19:56:08.3264522Z 2025-05-07T19:56:08.3266111Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.12/cmake-build/gen_embedding_forward_split_unweighted_v2_kernel.cu(873): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:56:08.3267881Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:56:08.3268716Z ^ 2025-05-07T19:56:08.3271888Z detected during instantiation of "void split_embedding_codegen_forward_unweighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=float, cache_t=c10::Half, output_t=c10::Half, index_t=int32_t, USE_LXU_CACHE=false]" at line 1396 2025-05-07T19:56:08.3274933Z 2025-05-07T19:56:08.3276308Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.12/cmake-build/gen_embedding_forward_split_unweighted_v2_kernel.cu(873): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:56:08.3278119Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:56:08.3278935Z ^ 2025-05-07T19:56:08.3281947Z detected during instantiation of "void split_embedding_codegen_forward_unweighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=c10::Half, cache_t=float, output_t=c10::Half, index_t=int32_t, USE_LXU_CACHE=false]" at line 1446 2025-05-07T19:56:08.3284808Z 2025-05-07T19:56:08.3286097Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.12/cmake-build/gen_embedding_forward_split_unweighted_v2_kernel.cu(873): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:56:08.3288101Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:56:08.3288983Z ^ 2025-05-07T19:56:08.3292263Z detected during instantiation of "void split_embedding_codegen_forward_unweighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=c10::Half, cache_t=c10::Half, output_t=c10::Half, index_t=int32_t, USE_LXU_CACHE=false]" at line 1496 2025-05-07T19:56:08.3295154Z 2025-05-07T19:56:08.3296330Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.12/cmake-build/gen_embedding_forward_split_unweighted_v2_kernel.cu(873): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:56:08.3298155Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:56:08.3299045Z ^ 2025-05-07T19:56:08.3302232Z detected during instantiation of "void split_embedding_codegen_forward_unweighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=float, cache_t=float, output_t=c10::Half, index_t=int64_t, USE_LXU_CACHE=false]" at line 1546 2025-05-07T19:56:08.3305434Z 2025-05-07T19:56:08.3306776Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.12/cmake-build/gen_embedding_forward_split_unweighted_v2_kernel.cu(873): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:56:08.3308858Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:56:08.3309738Z ^ 2025-05-07T19:56:08.3313156Z detected during instantiation of "void split_embedding_codegen_forward_unweighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=float, cache_t=c10::Half, output_t=c10::Half, index_t=int64_t, USE_LXU_CACHE=false]" at line 1596 2025-05-07T19:56:08.3316439Z 2025-05-07T19:56:08.3317836Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.12/cmake-build/gen_embedding_forward_split_unweighted_v2_kernel.cu(873): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:56:08.3319831Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:56:08.3320748Z ^ 2025-05-07T19:56:08.3324266Z detected during instantiation of "void split_embedding_codegen_forward_unweighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=c10::Half, cache_t=float, output_t=c10::Half, index_t=int64_t, USE_LXU_CACHE=false]" at line 1646 2025-05-07T19:56:08.3327541Z 2025-05-07T19:56:08.3328912Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.12/cmake-build/gen_embedding_forward_split_unweighted_v2_kernel.cu(873): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:56:08.3330963Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:56:08.3331921Z ^ 2025-05-07T19:56:08.3335369Z detected during instantiation of "void split_embedding_codegen_forward_unweighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=c10::Half, cache_t=c10::Half, output_t=c10::Half, index_t=int64_t, USE_LXU_CACHE=false]" at line 1696 2025-05-07T19:56:08.3338470Z 2025-05-07T19:56:08.3339848Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.12/cmake-build/gen_embedding_forward_split_unweighted_v2_kernel.cu(873): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:56:08.3341913Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:56:08.3342851Z ^ 2025-05-07T19:56:08.3346371Z detected during instantiation of "void split_embedding_codegen_forward_unweighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=float, cache_t=float, output_t=c10::BFloat16, index_t=int32_t, USE_LXU_CACHE=false]" at line 1746 2025-05-07T19:56:08.3349738Z 2025-05-07T19:56:08.3351112Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.12/cmake-build/gen_embedding_forward_split_unweighted_v2_kernel.cu(873): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:56:08.3353253Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:56:08.3354170Z ^ 2025-05-07T19:56:08.3357548Z detected during instantiation of "void split_embedding_codegen_forward_unweighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=float, cache_t=c10::Half, output_t=c10::BFloat16, index_t=int32_t, USE_LXU_CACHE=false]" at line 1796 2025-05-07T19:56:08.3360836Z 2025-05-07T19:56:08.3362136Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.12/cmake-build/gen_embedding_forward_split_unweighted_v2_kernel.cu(873): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:56:08.3364376Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:56:08.3365529Z ^ 2025-05-07T19:56:08.3369018Z detected during instantiation of "void split_embedding_codegen_forward_unweighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=c10::Half, cache_t=float, output_t=c10::BFloat16, index_t=int32_t, USE_LXU_CACHE=false]" at line 1846 2025-05-07T19:56:08.3372314Z 2025-05-07T19:56:08.3373664Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.12/cmake-build/gen_embedding_forward_split_unweighted_v2_kernel.cu(873): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:56:08.3375703Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:56:08.3376655Z ^ 2025-05-07T19:56:08.3380199Z detected during instantiation of "void split_embedding_codegen_forward_unweighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=c10::Half, cache_t=c10::Half, output_t=c10::BFloat16, index_t=int32_t, USE_LXU_CACHE=false]" at line 1896 2025-05-07T19:56:08.3383533Z 2025-05-07T19:56:08.3384912Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.12/cmake-build/gen_embedding_forward_split_unweighted_v2_kernel.cu(873): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:56:08.3386977Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:56:08.3387936Z ^ 2025-05-07T19:56:08.3391069Z detected during instantiation of "void split_embedding_codegen_forward_unweighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=float, cache_t=float, output_t=c10::BFloat16, index_t=int64_t, USE_LXU_CACHE=false]" at line 1946 2025-05-07T19:56:08.3394366Z 2025-05-07T19:56:08.3395769Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.12/cmake-build/gen_embedding_forward_split_unweighted_v2_kernel.cu(873): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:56:08.3400708Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:56:08.3401640Z ^ 2025-05-07T19:56:08.3404625Z detected during instantiation of "void split_embedding_codegen_forward_unweighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=float, cache_t=c10::Half, output_t=c10::BFloat16, index_t=int64_t, USE_LXU_CACHE=false]" at line 1996 2025-05-07T19:56:08.3407861Z 2025-05-07T19:56:08.3409079Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.12/cmake-build/gen_embedding_forward_split_unweighted_v2_kernel.cu(873): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:56:08.3411037Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:56:08.3411956Z ^ 2025-05-07T19:56:08.3415529Z detected during instantiation of "void split_embedding_codegen_forward_unweighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=c10::Half, cache_t=float, output_t=c10::BFloat16, index_t=int64_t, USE_LXU_CACHE=false]" at line 2046 2025-05-07T19:56:08.3418725Z 2025-05-07T19:56:08.3420045Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.12/cmake-build/gen_embedding_forward_split_unweighted_v2_kernel.cu(873): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:56:08.3422077Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:56:08.3422959Z ^ 2025-05-07T19:56:08.3426475Z detected during instantiation of "void split_embedding_codegen_forward_unweighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=c10::Half, cache_t=c10::Half, output_t=c10::BFloat16, index_t=int64_t, USE_LXU_CACHE=false]" at line 2096 2025-05-07T19:56:08.3429610Z 2025-05-07T19:56:08.3431275Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:56:08.3434024Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:56:08.3435186Z ^ 2025-05-07T19:56:08.3435469Z 2025-05-07T19:56:08.3435929Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:56:08.3436554Z 2025-05-07T19:56:08.3438282Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:56:08.3440880Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:56:08.3442017Z ^ 2025-05-07T19:56:08.3442381Z 2025-05-07T19:56:08.3443555Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.12/cmake-build/gen_embedding_forward_split_unweighted_v2_kernel.cu(873): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:56:08.3445715Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:56:08.3446647Z ^ 2025-05-07T19:56:08.3450121Z detected during instantiation of "void split_embedding_codegen_forward_unweighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=float, cache_t=float, output_t=float, index_t=int32_t, USE_LXU_CACHE=false]" at line 946 2025-05-07T19:56:08.3453723Z 2025-05-07T19:56:08.3455064Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.12/cmake-build/gen_embedding_forward_split_unweighted_v2_kernel.cu(873): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:56:08.3457131Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:56:08.3458384Z ^ 2025-05-07T19:56:08.3461591Z detected during instantiation of "void split_embedding_codegen_forward_unweighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=float, cache_t=c10::Half, output_t=float, index_t=int32_t, USE_LXU_CACHE=false]" at line 996 2025-05-07T19:56:08.3465232Z 2025-05-07T19:56:08.3466458Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.12/cmake-build/gen_embedding_forward_split_unweighted_v2_kernel.cu(873): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:56:08.3468429Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:56:08.3469389Z ^ 2025-05-07T19:56:08.3472957Z detected during instantiation of "void split_embedding_codegen_forward_unweighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=c10::Half, cache_t=float, output_t=float, index_t=int32_t, USE_LXU_CACHE=false]" at line 1046 2025-05-07T19:56:08.3476042Z 2025-05-07T19:56:08.3477368Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.12/cmake-build/gen_embedding_forward_split_unweighted_v2_kernel.cu(873): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:56:08.3479438Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:56:08.3480353Z ^ 2025-05-07T19:56:08.3483798Z detected during instantiation of "void split_embedding_codegen_forward_unweighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=c10::Half, cache_t=c10::Half, output_t=float, index_t=int32_t, USE_LXU_CACHE=false]" at line 1096 2025-05-07T19:56:08.3487072Z 2025-05-07T19:56:08.3488393Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.12/cmake-build/gen_embedding_forward_split_unweighted_v2_kernel.cu(873): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:56:08.3490707Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:56:08.3491579Z ^ 2025-05-07T19:56:08.3495056Z detected during instantiation of "void split_embedding_codegen_forward_unweighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=float, cache_t=float, output_t=float, index_t=int64_t, USE_LXU_CACHE=false]" at line 1146 2025-05-07T19:56:08.3498206Z 2025-05-07T19:56:08.3499543Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.12/cmake-build/gen_embedding_forward_split_unweighted_v2_kernel.cu(873): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:56:08.3501808Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:56:08.3502735Z ^ 2025-05-07T19:56:08.3506464Z detected during instantiation of "void split_embedding_codegen_forward_unweighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=float, cache_t=c10::Half, output_t=float, index_t=int64_t, USE_LXU_CACHE=false]" at line 1196 2025-05-07T19:56:08.3509725Z 2025-05-07T19:56:08.3511090Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.12/cmake-build/gen_embedding_forward_split_unweighted_v2_kernel.cu(873): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:56:08.3513218Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:56:08.3514121Z ^ 2025-05-07T19:56:08.3517414Z detected during instantiation of "void split_embedding_codegen_forward_unweighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=c10::Half, cache_t=float, output_t=float, index_t=int64_t, USE_LXU_CACHE=false]" at line 1246 2025-05-07T19:56:08.3520506Z 2025-05-07T19:56:08.3521790Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.12/cmake-build/gen_embedding_forward_split_unweighted_v2_kernel.cu(873): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:56:08.3523714Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:56:08.3524595Z ^ 2025-05-07T19:56:08.3527894Z detected during instantiation of "void split_embedding_codegen_forward_unweighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=c10::Half, cache_t=c10::Half, output_t=float, index_t=int64_t, USE_LXU_CACHE=false]" at line 1296 2025-05-07T19:56:08.3531122Z 2025-05-07T19:56:08.3532507Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.12/cmake-build/gen_embedding_forward_split_unweighted_v2_kernel.cu(873): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:56:08.3534441Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:56:08.3535449Z ^ 2025-05-07T19:56:08.3538483Z detected during instantiation of "void split_embedding_codegen_forward_unweighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=float, cache_t=float, output_t=c10::Half, index_t=int32_t, USE_LXU_CACHE=false]" at line 1346 2025-05-07T19:56:08.3541631Z 2025-05-07T19:56:08.3542985Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.12/cmake-build/gen_embedding_forward_split_unweighted_v2_kernel.cu(873): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:56:08.3545213Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:56:08.3546160Z ^ 2025-05-07T19:56:08.3549829Z detected during instantiation of "void split_embedding_codegen_forward_unweighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=float, cache_t=c10::Half, output_t=c10::Half, index_t=int32_t, USE_LXU_CACHE=false]" at line 1396 2025-05-07T19:56:08.3553165Z 2025-05-07T19:56:08.3554470Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.12/cmake-build/gen_embedding_forward_split_unweighted_v2_kernel.cu(873): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:56:08.3556545Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:56:08.3557456Z ^ 2025-05-07T19:56:08.3561005Z detected during instantiation of "void split_embedding_codegen_forward_unweighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=c10::Half, cache_t=float, output_t=c10::Half, index_t=int32_t, USE_LXU_CACHE=false]" at line 1446 2025-05-07T19:56:08.3564074Z 2025-05-07T19:56:08.3565733Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.12/cmake-build/gen_embedding_forward_split_unweighted_v2_kernel.cu(873): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:56:08.3567833Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:56:08.3568766Z ^ 2025-05-07T19:56:08.3572405Z detected during instantiation of "void split_embedding_codegen_forward_unweighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=c10::Half, cache_t=c10::Half, output_t=c10::Half, index_t=int32_t, USE_LXU_CACHE=false]" at line 1496 2025-05-07T19:56:08.3575787Z 2025-05-07T19:56:08.3577157Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.12/cmake-build/gen_embedding_forward_split_unweighted_v2_kernel.cu(873): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:56:08.3579277Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:56:08.3580223Z ^ 2025-05-07T19:56:08.3584069Z detected during instantiation of "void split_embedding_codegen_forward_unweighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=float, cache_t=float, output_t=c10::Half, index_t=int64_t, USE_LXU_CACHE=false]" at line 1546 2025-05-07T19:56:08.3587418Z 2025-05-07T19:56:08.3588779Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.12/cmake-build/gen_embedding_forward_split_unweighted_v2_kernel.cu(873): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:56:08.3590896Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:56:08.3592121Z ^ 2025-05-07T19:56:08.3595884Z detected during instantiation of "void split_embedding_codegen_forward_unweighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=float, cache_t=c10::Half, output_t=c10::Half, index_t=int64_t, USE_LXU_CACHE=false]" at line 1596 2025-05-07T19:56:08.3598756Z 2025-05-07T19:56:08.3599996Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.12/cmake-build/gen_embedding_forward_split_unweighted_v2_kernel.cu(873): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:56:08.3602052Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:56:08.3603001Z ^ 2025-05-07T19:56:08.3606557Z detected during instantiation of "void split_embedding_codegen_forward_unweighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=c10::Half, cache_t=float, output_t=c10::Half, index_t=int64_t, USE_LXU_CACHE=false]" at line 1646 2025-05-07T19:56:08.3609870Z 2025-05-07T19:56:08.3611247Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.12/cmake-build/gen_embedding_forward_split_unweighted_v2_kernel.cu(873): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:56:08.3613168Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:56:08.3613997Z ^ 2025-05-07T19:56:08.3616971Z detected during instantiation of "void split_embedding_codegen_forward_unweighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=c10::Half, cache_t=c10::Half, output_t=c10::Half, index_t=int64_t, USE_LXU_CACHE=false]" at line 1696 2025-05-07T19:56:08.3619648Z 2025-05-07T19:56:08.3620986Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.12/cmake-build/gen_embedding_forward_split_unweighted_v2_kernel.cu(873): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:56:08.3623001Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:56:08.3623952Z ^ 2025-05-07T19:56:08.3627495Z detected during instantiation of "void split_embedding_codegen_forward_unweighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=float, cache_t=float, output_t=c10::BFloat16, index_t=int32_t, USE_LXU_CACHE=false]" at line 1746 2025-05-07T19:56:08.3630900Z 2025-05-07T19:56:08.3632379Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.12/cmake-build/gen_embedding_forward_split_unweighted_v2_kernel.cu(873): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:56:08.3634395Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:56:08.3635293Z ^ 2025-05-07T19:56:08.3638875Z detected during instantiation of "void split_embedding_codegen_forward_unweighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=float, cache_t=c10::Half, output_t=c10::BFloat16, index_t=int32_t, USE_LXU_CACHE=false]" at line 1796 2025-05-07T19:56:08.3642274Z 2025-05-07T19:56:08.3643640Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.12/cmake-build/gen_embedding_forward_split_unweighted_v2_kernel.cu(873): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:56:08.3645703Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:56:08.3646576Z ^ 2025-05-07T19:56:08.3650026Z detected during instantiation of "void split_embedding_codegen_forward_unweighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=c10::Half, cache_t=float, output_t=c10::BFloat16, index_t=int32_t, USE_LXU_CACHE=false]" at line 1846 2025-05-07T19:56:08.3653217Z 2025-05-07T19:56:08.3654461Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.12/cmake-build/gen_embedding_forward_split_unweighted_v2_kernel.cu(873): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:56:08.3656553Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:56:08.3657489Z ^ 2025-05-07T19:56:08.3661105Z detected during instantiation of "void split_embedding_codegen_forward_unweighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=c10::Half, cache_t=c10::Half, output_t=c10::BFloat16, index_t=int32_t, USE_LXU_CACHE=false]" at line 1896 2025-05-07T19:56:08.3664322Z 2025-05-07T19:56:08.3665916Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.12/cmake-build/gen_embedding_forward_split_unweighted_v2_kernel.cu(873): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:56:08.3667810Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:56:08.3668753Z ^ 2025-05-07T19:56:08.3672434Z detected during instantiation of "void split_embedding_codegen_forward_unweighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=float, cache_t=float, output_t=c10::BFloat16, index_t=int64_t, USE_LXU_CACHE=false]" at line 1946 2025-05-07T19:56:08.3675946Z 2025-05-07T19:56:08.3677143Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.12/cmake-build/gen_embedding_forward_split_unweighted_v2_kernel.cu(873): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:56:08.3679033Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:56:08.3679897Z ^ 2025-05-07T19:56:08.3683423Z detected during instantiation of "void split_embedding_codegen_forward_unweighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=float, cache_t=c10::Half, output_t=c10::BFloat16, index_t=int64_t, USE_LXU_CACHE=false]" at line 1996 2025-05-07T19:56:08.3686583Z 2025-05-07T19:56:08.3687816Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.12/cmake-build/gen_embedding_forward_split_unweighted_v2_kernel.cu(873): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:56:08.3689674Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:56:08.3690578Z ^ 2025-05-07T19:56:08.3693881Z detected during instantiation of "void split_embedding_codegen_forward_unweighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=c10::Half, cache_t=float, output_t=c10::BFloat16, index_t=int64_t, USE_LXU_CACHE=false]" at line 2046 2025-05-07T19:56:08.3696631Z 2025-05-07T19:56:08.3697873Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.12/cmake-build/gen_embedding_forward_split_unweighted_v2_kernel.cu(873): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:56:08.3699729Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:56:08.3700623Z ^ 2025-05-07T19:56:08.3704013Z detected during instantiation of "void split_embedding_codegen_forward_unweighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=c10::Half, cache_t=c10::Half, output_t=c10::BFloat16, index_t=int64_t, USE_LXU_CACHE=false]" at line 2096 2025-05-07T19:56:08.3707326Z 2025-05-07T19:56:08.3709093Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:56:08.3711903Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:56:08.3713062Z ^ 2025-05-07T19:56:08.3713310Z 2025-05-07T19:56:08.3713766Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:56:08.3714444Z 2025-05-07T19:56:08.3716048Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:56:08.3718786Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:56:08.3720002Z ^ 2025-05-07T19:56:08.3720411Z 2025-05-07T19:56:08.3721770Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.12/cmake-build/gen_embedding_forward_split_unweighted_v2_kernel.cu(873): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:56:08.3723876Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:56:08.3724816Z ^ 2025-05-07T19:56:08.3728651Z detected during instantiation of "void split_embedding_codegen_forward_unweighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=float, cache_t=float, output_t=float, index_t=int32_t, USE_LXU_CACHE=false]" at line 946 2025-05-07T19:56:08.3731970Z 2025-05-07T19:56:08.3733359Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.12/cmake-build/gen_embedding_forward_split_unweighted_v2_kernel.cu(873): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:56:08.3735405Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:56:08.3736374Z ^ 2025-05-07T19:56:08.3739791Z detected during instantiation of "void split_embedding_codegen_forward_unweighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=float, cache_t=c10::Half, output_t=float, index_t=int32_t, USE_LXU_CACHE=false]" at line 996 2025-05-07T19:56:08.3742718Z 2025-05-07T19:56:08.3743958Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.12/cmake-build/gen_embedding_forward_split_unweighted_v2_kernel.cu(873): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:56:08.3745817Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:56:08.3746785Z ^ 2025-05-07T19:56:08.3750389Z detected during instantiation of "void split_embedding_codegen_forward_unweighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=c10::Half, cache_t=float, output_t=float, index_t=int32_t, USE_LXU_CACHE=false]" at line 1046 2025-05-07T19:56:08.3753884Z 2025-05-07T19:56:08.3755280Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.12/cmake-build/gen_embedding_forward_split_unweighted_v2_kernel.cu(873): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:56:08.3757382Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:56:08.3758353Z ^ 2025-05-07T19:56:08.3761917Z detected during instantiation of "void split_embedding_codegen_forward_unweighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=c10::Half, cache_t=c10::Half, output_t=float, index_t=int32_t, USE_LXU_CACHE=false]" at line 1096 2025-05-07T19:56:08.3765522Z 2025-05-07T19:56:08.3766718Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.12/cmake-build/gen_embedding_forward_split_unweighted_v2_kernel.cu(873): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:56:08.3768587Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:56:08.3769464Z ^ 2025-05-07T19:56:08.3773022Z detected during instantiation of "void split_embedding_codegen_forward_unweighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=float, cache_t=float, output_t=float, index_t=int64_t, USE_LXU_CACHE=false]" at line 1146 2025-05-07T19:56:08.3776560Z 2025-05-07T19:56:08.3778080Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.12/cmake-build/gen_embedding_forward_split_unweighted_v2_kernel.cu(873): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:56:08.3780159Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:56:08.3781116Z ^ 2025-05-07T19:56:08.3784428Z detected during instantiation of "void split_embedding_codegen_forward_unweighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=float, cache_t=c10::Half, output_t=float, index_t=int64_t, USE_LXU_CACHE=false]" at line 1196 2025-05-07T19:56:08.3787116Z 2025-05-07T19:56:08.3788237Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.12/cmake-build/gen_embedding_forward_split_unweighted_v2_kernel.cu(873): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:56:08.3790124Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:56:08.3790856Z ^ 2025-05-07T19:56:08.3794180Z detected during instantiation of "void split_embedding_codegen_forward_unweighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=c10::Half, cache_t=float, output_t=float, index_t=int64_t, USE_LXU_CACHE=false]" at line 1246 2025-05-07T19:56:08.3797267Z 2025-05-07T19:56:08.3798558Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.12/cmake-build/gen_embedding_forward_split_unweighted_v2_kernel.cu(873): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:56:08.3800466Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:56:08.3801290Z ^ 2025-05-07T19:56:08.3804656Z detected during instantiation of "void split_embedding_codegen_forward_unweighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=c10::Half, cache_t=c10::Half, output_t=float, index_t=int64_t, USE_LXU_CACHE=false]" at line 1296 2025-05-07T19:56:08.3807830Z 2025-05-07T19:56:08.3808997Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.12/cmake-build/gen_embedding_forward_split_unweighted_v2_kernel.cu(873): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:56:08.3810813Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:56:08.3811673Z ^ 2025-05-07T19:56:08.3814898Z detected during instantiation of "void split_embedding_codegen_forward_unweighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=float, cache_t=float, output_t=c10::Half, index_t=int32_t, USE_LXU_CACHE=false]" at line 1346 2025-05-07T19:56:08.3818332Z 2025-05-07T19:56:08.3819875Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.12/cmake-build/gen_embedding_forward_split_unweighted_v2_kernel.cu(873): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:56:08.3821936Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:56:08.3822886Z ^ 2025-05-07T19:56:08.3826458Z detected during instantiation of "void split_embedding_codegen_forward_unweighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=float, cache_t=c10::Half, output_t=c10::Half, index_t=int32_t, USE_LXU_CACHE=false]" at line 1396 2025-05-07T19:56:08.3829624Z 2025-05-07T19:56:08.3830965Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.12/cmake-build/gen_embedding_forward_split_unweighted_v2_kernel.cu(873): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:56:08.3833120Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:56:08.3834052Z ^ 2025-05-07T19:56:08.3837271Z detected during instantiation of "void split_embedding_codegen_forward_unweighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=c10::Half, cache_t=float, output_t=c10::Half, index_t=int32_t, USE_LXU_CACHE=false]" at line 1446 2025-05-07T19:56:08.3840215Z 2025-05-07T19:56:08.3841522Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.12/cmake-build/gen_embedding_forward_split_unweighted_v2_kernel.cu(873): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:56:08.3843496Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:56:08.3844438Z ^ 2025-05-07T19:56:08.3848015Z detected during instantiation of "void split_embedding_codegen_forward_unweighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=c10::Half, cache_t=c10::Half, output_t=c10::Half, index_t=int32_t, USE_LXU_CACHE=false]" at line 1496 2025-05-07T19:56:08.3851450Z 2025-05-07T19:56:08.3852808Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.12/cmake-build/gen_embedding_forward_split_unweighted_v2_kernel.cu(873): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:56:08.3854898Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:56:08.3855864Z ^ 2025-05-07T19:56:08.3859324Z detected during instantiation of "void split_embedding_codegen_forward_unweighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=float, cache_t=float, output_t=c10::Half, index_t=int64_t, USE_LXU_CACHE=false]" at line 1546 2025-05-07T19:56:08.3862632Z 2025-05-07T19:56:08.3863971Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.12/cmake-build/gen_embedding_forward_split_unweighted_v2_kernel.cu(873): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:56:08.3866596Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:56:08.3867505Z ^ 2025-05-07T19:56:08.3871091Z detected during instantiation of "void split_embedding_codegen_forward_unweighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=float, cache_t=c10::Half, output_t=c10::Half, index_t=int64_t, USE_LXU_CACHE=false]" at line 1596 2025-05-07T19:56:08.3874448Z 2025-05-07T19:56:08.3875784Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.12/cmake-build/gen_embedding_forward_split_unweighted_v2_kernel.cu(873): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:56:08.3877857Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:56:08.3878766Z ^ 2025-05-07T19:56:08.3882254Z detected during instantiation of "void split_embedding_codegen_forward_unweighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=c10::Half, cache_t=float, output_t=c10::Half, index_t=int64_t, USE_LXU_CACHE=false]" at line 1646 2025-05-07T19:56:08.3885555Z 2025-05-07T19:56:08.3886912Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.12/cmake-build/gen_embedding_forward_split_unweighted_v2_kernel.cu(873): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:56:08.3888928Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:56:08.3889840Z ^ 2025-05-07T19:56:08.3893266Z detected during instantiation of "void split_embedding_codegen_forward_unweighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=c10::Half, cache_t=c10::Half, output_t=c10::Half, index_t=int64_t, USE_LXU_CACHE=false]" at line 1696 2025-05-07T19:56:08.3896521Z 2025-05-07T19:56:08.3897960Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.12/cmake-build/gen_embedding_forward_split_unweighted_v2_kernel.cu(873): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:56:08.3899847Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:56:08.3900780Z ^ 2025-05-07T19:56:08.3904160Z detected during instantiation of "void split_embedding_codegen_forward_unweighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=float, cache_t=float, output_t=c10::BFloat16, index_t=int32_t, USE_LXU_CACHE=false]" at line 1746 2025-05-07T19:56:08.3907272Z 2025-05-07T19:56:08.3908696Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.12/cmake-build/gen_embedding_forward_split_unweighted_v2_kernel.cu(873): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:56:08.3910444Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:56:08.3911631Z ^ 2025-05-07T19:56:08.3914473Z detected during instantiation of "void split_embedding_codegen_forward_unweighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=float, cache_t=c10::Half, output_t=c10::BFloat16, index_t=int32_t, USE_LXU_CACHE=false]" at line 1796 2025-05-07T19:56:08.3917012Z 2025-05-07T19:56:08.3918070Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.12/cmake-build/gen_embedding_forward_split_unweighted_v2_kernel.cu(873): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:56:08.3919727Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:56:08.3920474Z ^ 2025-05-07T19:56:08.3923464Z detected during instantiation of "void split_embedding_codegen_forward_unweighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=c10::Half, cache_t=float, output_t=c10::BFloat16, index_t=int32_t, USE_LXU_CACHE=false]" at line 1846 2025-05-07T19:56:08.3926235Z 2025-05-07T19:56:08.3927436Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.12/cmake-build/gen_embedding_forward_split_unweighted_v2_kernel.cu(873): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:56:08.3929200Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:56:08.3930010Z ^ 2025-05-07T19:56:08.3933136Z detected during instantiation of "void split_embedding_codegen_forward_unweighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=c10::Half, cache_t=c10::Half, output_t=c10::BFloat16, index_t=int32_t, USE_LXU_CACHE=false]" at line 1896 2025-05-07T19:56:08.3936032Z 2025-05-07T19:56:08.3937251Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.12/cmake-build/gen_embedding_forward_split_unweighted_v2_kernel.cu(873): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:56:08.3939183Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:56:08.3939969Z ^ 2025-05-07T19:56:08.3943530Z detected during instantiation of "void split_embedding_codegen_forward_unweighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=float, cache_t=float, output_t=c10::BFloat16, index_t=int64_t, USE_LXU_CACHE=false]" at line 1946 2025-05-07T19:56:08.3948098Z 2025-05-07T19:56:08.3949466Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.12/cmake-build/gen_embedding_forward_split_unweighted_v2_kernel.cu(873): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:56:08.3951760Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:56:08.3952708Z ^ 2025-05-07T19:56:08.3956504Z detected during instantiation of "void split_embedding_codegen_forward_unweighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=float, cache_t=c10::Half, output_t=c10::BFloat16, index_t=int64_t, USE_LXU_CACHE=false]" at line 1996 2025-05-07T19:56:08.3960073Z 2025-05-07T19:56:08.3961432Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.12/cmake-build/gen_embedding_forward_split_unweighted_v2_kernel.cu(873): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:56:08.3963585Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:56:08.3964541Z ^ 2025-05-07T19:56:08.3968750Z detected during instantiation of "void split_embedding_codegen_forward_unweighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=c10::Half, cache_t=float, output_t=c10::BFloat16, index_t=int64_t, USE_LXU_CACHE=false]" at line 2046 2025-05-07T19:56:08.3972114Z 2025-05-07T19:56:08.3973475Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.12/cmake-build/gen_embedding_forward_split_unweighted_v2_kernel.cu(873): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:56:08.3975583Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:56:08.3976516Z ^ 2025-05-07T19:56:08.3980162Z detected during instantiation of "void split_embedding_codegen_forward_unweighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=c10::Half, cache_t=c10::Half, output_t=c10::BFloat16, index_t=int64_t, USE_LXU_CACHE=false]" at line 2096 2025-05-07T19:56:08.3983545Z 2025-05-07T19:56:08.3985251Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:56:08.3988267Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:56:08.3989496Z ^ 2025-05-07T19:56:08.3989762Z 2025-05-07T19:56:08.3990279Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:56:08.3990984Z 2025-05-07T19:56:08.3992851Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:56:08.3995668Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:56:08.3997058Z ^ 2025-05-07T19:56:08.3997447Z 2025-05-07T19:56:08.3998805Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.12/cmake-build/gen_embedding_forward_split_unweighted_v2_kernel.cu(873): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:56:08.4000902Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:56:08.4002020Z ^ 2025-05-07T19:56:08.4005363Z detected during instantiation of "void split_embedding_codegen_forward_unweighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=float, cache_t=float, output_t=float, index_t=int32_t, USE_LXU_CACHE=false]" at line 946 2025-05-07T19:56:08.4008559Z 2025-05-07T19:56:08.4009873Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.12/cmake-build/gen_embedding_forward_split_unweighted_v2_kernel.cu(873): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:56:08.4011950Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:56:08.4012865Z ^ 2025-05-07T19:56:08.4016378Z detected during instantiation of "void split_embedding_codegen_forward_unweighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=float, cache_t=c10::Half, output_t=float, index_t=int32_t, USE_LXU_CACHE=false]" at line 996 2025-05-07T19:56:08.4019646Z 2025-05-07T19:56:08.4020982Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.12/cmake-build/gen_embedding_forward_split_unweighted_v2_kernel.cu(873): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:56:08.4023104Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:56:08.4024019Z ^ 2025-05-07T19:56:08.4027539Z detected during instantiation of "void split_embedding_codegen_forward_unweighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=c10::Half, cache_t=float, output_t=float, index_t=int32_t, USE_LXU_CACHE=false]" at line 1046 2025-05-07T19:56:08.4030740Z 2025-05-07T19:56:08.4032354Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.12/cmake-build/gen_embedding_forward_split_unweighted_v2_kernel.cu(873): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:56:08.4034522Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:56:08.4035408Z ^ 2025-05-07T19:56:08.4038691Z detected during instantiation of "void split_embedding_codegen_forward_unweighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=c10::Half, cache_t=c10::Half, output_t=float, index_t=int32_t, USE_LXU_CACHE=false]" at line 1096 2025-05-07T19:56:08.4041715Z 2025-05-07T19:56:08.4042852Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.12/cmake-build/gen_embedding_forward_split_unweighted_v2_kernel.cu(873): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:56:08.4044612Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:56:08.4045353Z ^ 2025-05-07T19:56:08.4048863Z detected during instantiation of "void split_embedding_codegen_forward_unweighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=float, cache_t=float, output_t=float, index_t=int64_t, USE_LXU_CACHE=false]" at line 1146 2025-05-07T19:56:08.4052404Z 2025-05-07T19:56:08.4053832Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.12/cmake-build/gen_embedding_forward_split_unweighted_v2_kernel.cu(873): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:56:08.4055956Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:56:08.4056927Z ^ 2025-05-07T19:56:08.4060656Z detected during instantiation of "void split_embedding_codegen_forward_unweighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=float, cache_t=c10::Half, output_t=float, index_t=int64_t, USE_LXU_CACHE=false]" at line 1196 2025-05-07T19:56:08.4064135Z 2025-05-07T19:56:08.4065775Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.12/cmake-build/gen_embedding_forward_split_unweighted_v2_kernel.cu(873): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:56:08.4067845Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:56:08.4068769Z ^ 2025-05-07T19:56:08.4072454Z detected during instantiation of "void split_embedding_codegen_forward_unweighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=c10::Half, cache_t=float, output_t=float, index_t=int64_t, USE_LXU_CACHE=false]" at line 1246 2025-05-07T19:56:08.4075739Z 2025-05-07T19:56:08.4077084Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.12/cmake-build/gen_embedding_forward_split_unweighted_v2_kernel.cu(873): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:56:08.4079176Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:56:08.4080297Z ^ 2025-05-07T19:56:08.4083899Z detected during instantiation of "void split_embedding_codegen_forward_unweighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=c10::Half, cache_t=c10::Half, output_t=float, index_t=int64_t, USE_LXU_CACHE=false]" at line 1296 2025-05-07T19:56:08.4087233Z 2025-05-07T19:56:08.4088713Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.12/cmake-build/gen_embedding_forward_split_unweighted_v2_kernel.cu(873): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:56:08.4090994Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:56:08.4091982Z ^ 2025-05-07T19:56:08.4095968Z detected during instantiation of "void split_embedding_codegen_forward_unweighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=float, cache_t=float, output_t=c10::Half, index_t=int32_t, USE_LXU_CACHE=false]" at line 1346 2025-05-07T19:56:08.4099428Z 2025-05-07T19:56:08.4100825Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.12/cmake-build/gen_embedding_forward_split_unweighted_v2_kernel.cu(873): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:56:08.4102998Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:56:08.4103976Z ^ 2025-05-07T19:56:08.4107527Z detected during instantiation of "void split_embedding_codegen_forward_unweighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=float, cache_t=c10::Half, output_t=c10::Half, index_t=int32_t, USE_LXU_CACHE=false]" at line 1396 2025-05-07T19:56:08.4110735Z 2025-05-07T19:56:08.4112305Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.12/cmake-build/gen_embedding_forward_split_unweighted_v2_kernel.cu(873): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:56:08.4114377Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:56:08.4115289Z ^ 2025-05-07T19:56:08.4118838Z detected during instantiation of "void split_embedding_codegen_forward_unweighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=c10::Half, cache_t=float, output_t=c10::Half, index_t=int32_t, USE_LXU_CACHE=false]" at line 1446 2025-05-07T19:56:08.4122110Z 2025-05-07T19:56:08.4123440Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.12/cmake-build/gen_embedding_forward_split_unweighted_v2_kernel.cu(873): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:56:08.4125500Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:56:08.4126428Z ^ 2025-05-07T19:56:08.4130104Z detected during instantiation of "void split_embedding_codegen_forward_unweighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=c10::Half, cache_t=c10::Half, output_t=c10::Half, index_t=int32_t, USE_LXU_CACHE=false]" at line 1496 2025-05-07T19:56:08.4133361Z 2025-05-07T19:56:08.4134721Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.12/cmake-build/gen_embedding_forward_split_unweighted_v2_kernel.cu(873): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:56:08.4136754Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:56:08.4137759Z ^ 2025-05-07T19:56:08.4141310Z detected during instantiation of "void split_embedding_codegen_forward_unweighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=float, cache_t=float, output_t=c10::Half, index_t=int64_t, USE_LXU_CACHE=false]" at line 1546 2025-05-07T19:56:08.4143995Z 2025-05-07T19:56:08.4145178Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.12/cmake-build/gen_embedding_forward_split_unweighted_v2_kernel.cu(873): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:56:08.4147068Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:56:08.4147909Z ^ 2025-05-07T19:56:08.4151167Z detected during instantiation of "void split_embedding_codegen_forward_unweighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=float, cache_t=c10::Half, output_t=c10::Half, index_t=int64_t, USE_LXU_CACHE=false]" at line 1596 2025-05-07T19:56:08.4154917Z 2025-05-07T19:56:08.4156211Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.12/cmake-build/gen_embedding_forward_split_unweighted_v2_kernel.cu(873): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:56:08.4178998Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:56:08.4180032Z ^ 2025-05-07T19:56:08.4183601Z detected during instantiation of "void split_embedding_codegen_forward_unweighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=c10::Half, cache_t=float, output_t=c10::Half, index_t=int64_t, USE_LXU_CACHE=false]" at line 1646 2025-05-07T19:56:08.4186821Z 2025-05-07T19:56:08.4188047Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.12/cmake-build/gen_embedding_forward_split_unweighted_v2_kernel.cu(873): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:56:08.4189834Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:56:08.4190677Z ^ 2025-05-07T19:56:08.4193928Z detected during instantiation of "void split_embedding_codegen_forward_unweighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=c10::Half, cache_t=c10::Half, output_t=c10::Half, index_t=int64_t, USE_LXU_CACHE=false]" at line 1696 2025-05-07T19:56:08.4197216Z 2025-05-07T19:56:08.4198606Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.12/cmake-build/gen_embedding_forward_split_unweighted_v2_kernel.cu(873): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:56:08.4200678Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:56:08.4201663Z ^ 2025-05-07T19:56:08.4205962Z detected during instantiation of "void split_embedding_codegen_forward_unweighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=float, cache_t=float, output_t=c10::BFloat16, index_t=int32_t, USE_LXU_CACHE=false]" at line 1746 2025-05-07T19:56:08.4209385Z 2025-05-07T19:56:08.4210836Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.12/cmake-build/gen_embedding_forward_split_unweighted_v2_kernel.cu(873): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:56:08.4213006Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:56:08.4213992Z ^ 2025-05-07T19:56:08.4217701Z detected during instantiation of "void split_embedding_codegen_forward_unweighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=float, cache_t=c10::Half, output_t=c10::BFloat16, index_t=int32_t, USE_LXU_CACHE=false]" at line 1796 2025-05-07T19:56:08.4221121Z 2025-05-07T19:56:08.4222537Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.12/cmake-build/gen_embedding_forward_split_unweighted_v2_kernel.cu(873): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:56:08.4224625Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:56:08.4225769Z ^ 2025-05-07T19:56:08.4229399Z detected during instantiation of "void split_embedding_codegen_forward_unweighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=c10::Half, cache_t=float, output_t=c10::BFloat16, index_t=int32_t, USE_LXU_CACHE=false]" at line 1846 2025-05-07T19:56:08.4232816Z 2025-05-07T19:56:08.4234150Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.12/cmake-build/gen_embedding_forward_split_unweighted_v2_kernel.cu(873): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:56:08.4236231Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:56:08.4237181Z ^ 2025-05-07T19:56:08.4240745Z detected during instantiation of "void split_embedding_codegen_forward_unweighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=c10::Half, cache_t=c10::Half, output_t=c10::BFloat16, index_t=int32_t, USE_LXU_CACHE=false]" at line 1896 2025-05-07T19:56:08.4244184Z 2025-05-07T19:56:08.4245525Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.12/cmake-build/gen_embedding_forward_split_unweighted_v2_kernel.cu(873): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:56:08.4247572Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:56:08.4248489Z ^ 2025-05-07T19:56:08.4251990Z detected during instantiation of "void split_embedding_codegen_forward_unweighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=float, cache_t=float, output_t=c10::BFloat16, index_t=int64_t, USE_LXU_CACHE=false]" at line 1946 2025-05-07T19:56:08.4255210Z 2025-05-07T19:56:08.4256383Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.12/cmake-build/gen_embedding_forward_split_unweighted_v2_kernel.cu(873): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:56:08.4257991Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:56:08.4258865Z ^ 2025-05-07T19:56:08.4262333Z detected during instantiation of "void split_embedding_codegen_forward_unweighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=float, cache_t=c10::Half, output_t=c10::BFloat16, index_t=int64_t, USE_LXU_CACHE=false]" at line 1996 2025-05-07T19:56:08.4265976Z 2025-05-07T19:56:08.4267256Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.12/cmake-build/gen_embedding_forward_split_unweighted_v2_kernel.cu(873): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:56:08.4269290Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:56:08.4270166Z ^ 2025-05-07T19:56:08.4273944Z detected during instantiation of "void split_embedding_codegen_forward_unweighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=c10::Half, cache_t=float, output_t=c10::BFloat16, index_t=int64_t, USE_LXU_CACHE=false]" at line 2046 2025-05-07T19:56:08.4277248Z 2025-05-07T19:56:08.4278586Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.12/cmake-build/gen_embedding_forward_split_unweighted_v2_kernel.cu(873): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:56:08.4280780Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:56:08.4281675Z ^ 2025-05-07T19:56:08.4285276Z detected during instantiation of "void split_embedding_codegen_forward_unweighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=c10::Half, cache_t=c10::Half, output_t=c10::BFloat16, index_t=int64_t, USE_LXU_CACHE=false]" at line 2096 2025-05-07T19:56:08.4288732Z 2025-05-07T19:56:08.4290414Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:56:08.4292796Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:56:08.4293912Z ^ 2025-05-07T19:56:08.4294171Z 2025-05-07T19:56:08.4294670Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:56:08.4295505Z 2025-05-07T19:56:08.4297203Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:56:08.4300548Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:56:08.4301855Z ^ 2025-05-07T19:56:08.4302244Z 2025-05-07T19:56:08.4303623Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.12/cmake-build/gen_embedding_forward_split_unweighted_v2_kernel.cu(873): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:56:08.4305746Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:56:08.4306721Z ^ 2025-05-07T19:56:08.4310378Z detected during instantiation of "void split_embedding_codegen_forward_unweighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=float, cache_t=float, output_t=float, index_t=int32_t, USE_LXU_CACHE=false]" at line 946 2025-05-07T19:56:08.4314062Z 2025-05-07T19:56:08.4315414Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.12/cmake-build/gen_embedding_forward_split_unweighted_v2_kernel.cu(873): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:56:08.4317471Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:56:08.4318362Z ^ 2025-05-07T19:56:08.4321889Z detected during instantiation of "void split_embedding_codegen_forward_unweighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=float, cache_t=c10::Half, output_t=float, index_t=int32_t, USE_LXU_CACHE=false]" at line 996 2025-05-07T19:56:08.4325167Z 2025-05-07T19:56:08.4326522Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.12/cmake-build/gen_embedding_forward_split_unweighted_v2_kernel.cu(873): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:56:08.4328618Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:56:08.4329548Z ^ 2025-05-07T19:56:08.4333075Z detected during instantiation of "void split_embedding_codegen_forward_unweighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=c10::Half, cache_t=float, output_t=float, index_t=int32_t, USE_LXU_CACHE=false]" at line 1046 2025-05-07T19:56:08.4336479Z 2025-05-07T19:56:08.4337818Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.12/cmake-build/gen_embedding_forward_split_unweighted_v2_kernel.cu(873): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:56:08.4339856Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:56:08.4340940Z ^ 2025-05-07T19:56:08.4344478Z detected during instantiation of "void split_embedding_codegen_forward_unweighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=c10::Half, cache_t=c10::Half, output_t=float, index_t=int32_t, USE_LXU_CACHE=false]" at line 1096 2025-05-07T19:56:08.4347760Z 2025-05-07T19:56:08.4349338Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.12/cmake-build/gen_embedding_forward_split_unweighted_v2_kernel.cu(873): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:56:08.4351765Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:56:08.4352655Z ^ 2025-05-07T19:56:08.4356057Z detected during instantiation of "void split_embedding_codegen_forward_unweighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=float, cache_t=float, output_t=float, index_t=int64_t, USE_LXU_CACHE=false]" at line 1146 2025-05-07T19:56:08.4358909Z 2025-05-07T19:56:08.4360242Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.12/cmake-build/gen_embedding_forward_split_unweighted_v2_kernel.cu(873): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:56:08.4362119Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:56:08.4363022Z ^ 2025-05-07T19:56:08.4366691Z detected during instantiation of "void split_embedding_codegen_forward_unweighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=float, cache_t=c10::Half, output_t=float, index_t=int64_t, USE_LXU_CACHE=false]" at line 1196 2025-05-07T19:56:08.4369789Z 2025-05-07T19:56:08.4371042Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.12/cmake-build/gen_embedding_forward_split_unweighted_v2_kernel.cu(873): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:56:08.4373089Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:56:08.4374053Z ^ 2025-05-07T19:56:08.4377613Z detected during instantiation of "void split_embedding_codegen_forward_unweighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=c10::Half, cache_t=float, output_t=float, index_t=int64_t, USE_LXU_CACHE=false]" at line 1246 2025-05-07T19:56:08.4381123Z 2025-05-07T19:56:08.4382428Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.12/cmake-build/gen_embedding_forward_split_unweighted_v2_kernel.cu(873): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:56:08.4384381Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:56:08.4385305Z ^ 2025-05-07T19:56:08.4388771Z detected during instantiation of "void split_embedding_codegen_forward_unweighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=c10::Half, cache_t=c10::Half, output_t=float, index_t=int64_t, USE_LXU_CACHE=false]" at line 1296 2025-05-07T19:56:08.4392283Z 2025-05-07T19:56:08.4393803Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.12/cmake-build/gen_embedding_forward_split_unweighted_v2_kernel.cu(873): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:56:08.4395450Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:56:08.4396130Z ^ 2025-05-07T19:56:08.4398640Z detected during instantiation of "void split_embedding_codegen_forward_unweighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=float, cache_t=float, output_t=c10::Half, index_t=int32_t, USE_LXU_CACHE=false]" at line 1346 2025-05-07T19:56:08.4401141Z 2025-05-07T19:56:08.4402202Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.12/cmake-build/gen_embedding_forward_split_unweighted_v2_kernel.cu(873): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:56:08.4403827Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:56:08.4404569Z ^ 2025-05-07T19:56:08.4407150Z detected during instantiation of "void split_embedding_codegen_forward_unweighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=float, cache_t=c10::Half, output_t=c10::Half, index_t=int32_t, USE_LXU_CACHE=false]" at line 1396 2025-05-07T19:56:08.4409818Z 2025-05-07T19:56:08.4410991Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.12/cmake-build/gen_embedding_forward_split_unweighted_v2_kernel.cu(873): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:56:08.4412679Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:56:08.4413442Z ^ 2025-05-07T19:56:08.4416424Z detected during instantiation of "void split_embedding_codegen_forward_unweighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=c10::Half, cache_t=float, output_t=c10::Half, index_t=int32_t, USE_LXU_CACHE=false]" at line 1446 2025-05-07T19:56:08.4419320Z 2025-05-07T19:56:08.4420496Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.12/cmake-build/gen_embedding_forward_split_unweighted_v2_kernel.cu(873): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:56:08.4422163Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:56:08.4422970Z ^ 2025-05-07T19:56:08.4425892Z detected during instantiation of "void split_embedding_codegen_forward_unweighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=c10::Half, cache_t=c10::Half, output_t=c10::Half, index_t=int32_t, USE_LXU_CACHE=false]" at line 1496 2025-05-07T19:56:08.4429042Z 2025-05-07T19:56:08.4430367Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.12/cmake-build/gen_embedding_forward_split_unweighted_v2_kernel.cu(873): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:56:08.4432591Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:56:08.4433472Z ^ 2025-05-07T19:56:08.4436701Z detected during instantiation of "void split_embedding_codegen_forward_unweighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=float, cache_t=float, output_t=c10::Half, index_t=int64_t, USE_LXU_CACHE=false]" at line 1546 2025-05-07T19:56:08.4439805Z 2025-05-07T19:56:08.4441122Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.12/cmake-build/gen_embedding_forward_split_unweighted_v2_kernel.cu(873): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:56:08.4443216Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:56:08.4444117Z ^ 2025-05-07T19:56:08.4447527Z detected during instantiation of "void split_embedding_codegen_forward_unweighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=float, cache_t=c10::Half, output_t=c10::Half, index_t=int64_t, USE_LXU_CACHE=false]" at line 1596 2025-05-07T19:56:08.4450911Z 2025-05-07T19:56:08.4452304Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.12/cmake-build/gen_embedding_forward_split_unweighted_v2_kernel.cu(873): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:56:08.4454280Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:56:08.4455221Z ^ 2025-05-07T19:56:08.4458126Z detected during instantiation of "void split_embedding_codegen_forward_unweighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=c10::Half, cache_t=float, output_t=c10::Half, index_t=int64_t, USE_LXU_CACHE=false]" at line 1646 2025-05-07T19:56:08.4461243Z 2025-05-07T19:56:08.4462785Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.12/cmake-build/gen_embedding_forward_split_unweighted_v2_kernel.cu(873): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:56:08.4465142Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:56:08.4466049Z ^ 2025-05-07T19:56:08.4469584Z detected during instantiation of "void split_embedding_codegen_forward_unweighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=c10::Half, cache_t=c10::Half, output_t=c10::Half, index_t=int64_t, USE_LXU_CACHE=false]" at line 1696 2025-05-07T19:56:08.4473272Z 2025-05-07T19:56:08.4474663Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.12/cmake-build/gen_embedding_forward_split_unweighted_v2_kernel.cu(873): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:56:08.4476771Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:56:08.4477980Z ^ 2025-05-07T19:56:08.4481687Z detected during instantiation of "void split_embedding_codegen_forward_unweighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=float, cache_t=float, output_t=c10::BFloat16, index_t=int32_t, USE_LXU_CACHE=false]" at line 1746 2025-05-07T19:56:08.4485128Z 2025-05-07T19:56:08.4486489Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.12/cmake-build/gen_embedding_forward_split_unweighted_v2_kernel.cu(873): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:56:08.4488536Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:56:08.4489441Z ^ 2025-05-07T19:56:08.4492518Z detected during instantiation of "void split_embedding_codegen_forward_unweighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=float, cache_t=c10::Half, output_t=c10::BFloat16, index_t=int32_t, USE_LXU_CACHE=false]" at line 1796 2025-05-07T19:56:08.4495697Z 2025-05-07T19:56:08.4497095Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.12/cmake-build/gen_embedding_forward_split_unweighted_v2_kernel.cu(873): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:56:08.4499133Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:56:08.4500093Z ^ 2025-05-07T19:56:08.4503656Z detected during instantiation of "void split_embedding_codegen_forward_unweighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=c10::Half, cache_t=float, output_t=c10::BFloat16, index_t=int32_t, USE_LXU_CACHE=false]" at line 1846 2025-05-07T19:56:08.4506935Z 2025-05-07T19:56:08.4508290Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.12/cmake-build/gen_embedding_forward_split_unweighted_v2_kernel.cu(873): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:56:08.4510502Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:56:08.4511613Z ^ 2025-05-07T19:56:08.4515204Z detected during instantiation of "void split_embedding_codegen_forward_unweighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=c10::Half, cache_t=c10::Half, output_t=c10::BFloat16, index_t=int32_t, USE_LXU_CACHE=false]" at line 1896 2025-05-07T19:56:08.4518765Z 2025-05-07T19:56:08.4520120Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.12/cmake-build/gen_embedding_forward_split_unweighted_v2_kernel.cu(873): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:56:08.4522272Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:56:08.4523140Z ^ 2025-05-07T19:56:08.4527510Z detected during instantiation of "void split_embedding_codegen_forward_unweighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=float, cache_t=float, output_t=c10::BFloat16, index_t=int64_t, USE_LXU_CACHE=false]" at line 1946 2025-05-07T19:56:08.4530831Z 2025-05-07T19:56:08.4532126Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.12/cmake-build/gen_embedding_forward_split_unweighted_v2_kernel.cu(873): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:56:08.4534052Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:56:08.4534987Z ^ 2025-05-07T19:56:08.4538546Z detected during instantiation of "void split_embedding_codegen_forward_unweighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=float, cache_t=c10::Half, output_t=c10::BFloat16, index_t=int64_t, USE_LXU_CACHE=false]" at line 1996 2025-05-07T19:56:08.4541826Z 2025-05-07T19:56:08.4543097Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.12/cmake-build/gen_embedding_forward_split_unweighted_v2_kernel.cu(873): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:56:08.4545171Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:56:08.4546260Z ^ 2025-05-07T19:56:08.4549986Z detected during instantiation of "void split_embedding_codegen_forward_unweighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=c10::Half, cache_t=float, output_t=c10::BFloat16, index_t=int64_t, USE_LXU_CACHE=false]" at line 2046 2025-05-07T19:56:08.4553536Z 2025-05-07T19:56:08.4554883Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.12/cmake-build/gen_embedding_forward_split_unweighted_v2_kernel.cu(873): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:56:08.4557145Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:56:08.4558039Z ^ 2025-05-07T19:56:08.4561675Z detected during instantiation of "void split_embedding_codegen_forward_unweighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=c10::Half, cache_t=c10::Half, output_t=c10::BFloat16, index_t=int64_t, USE_LXU_CACHE=false]" at line 2096 2025-05-07T19:56:08.4565236Z 2025-05-07T19:56:16.8206992Z [247/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_inference_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_inference.dir/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_nobag_int4_codegen_cuda.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_inference.dir/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_nobag_int4_codegen_cuda.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.12/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_nobag_int4_codegen_cuda.cu -o CMakeFiles/fbgemm_gpu_tbe_inference.dir/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_nobag_int4_codegen_cuda.cu.o 2025-05-07T19:56:16.8231775Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:56:16.8234522Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:56:16.8235748Z ^ 2025-05-07T19:56:16.8236044Z 2025-05-07T19:56:16.8236504Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:56:16.8237451Z 2025-05-07T19:56:16.8239196Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:56:16.8241835Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:56:16.8243033Z ^ 2025-05-07T19:56:16.8243408Z 2025-05-07T19:56:16.8245092Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.12/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_nobag_int4_codegen_cuda.cu(220): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:56:16.8246960Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:56:16.8247676Z ^ 2025-05-07T19:56:16.8247966Z 2025-05-07T19:56:16.8249552Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.12/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_nobag_int4_codegen_cuda.cu(237): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:56:16.8251727Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:56:16.8252329Z ^ 2025-05-07T19:56:16.8252603Z 2025-05-07T19:56:16.8254221Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.12/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_nobag_int4_codegen_cuda.cu(247): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:56:16.8256210Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:56:16.8256798Z ^ 2025-05-07T19:56:16.8257135Z 2025-05-07T19:56:16.8258816Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:56:16.8261364Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:56:16.8262499Z ^ 2025-05-07T19:56:16.8262796Z 2025-05-07T19:56:16.8263257Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:56:16.8263908Z 2025-05-07T19:56:16.8265836Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:56:16.8268553Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:56:16.8269789Z ^ 2025-05-07T19:56:16.8270077Z 2025-05-07T19:56:16.8271633Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.12/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_nobag_int4_codegen_cuda.cu(220): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:56:16.8273299Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:56:16.8273853Z ^ 2025-05-07T19:56:16.8274130Z 2025-05-07T19:56:16.8275642Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.12/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_nobag_int4_codegen_cuda.cu(237): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:56:16.8277296Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:56:16.8277786Z ^ 2025-05-07T19:56:16.8278036Z 2025-05-07T19:56:16.8279448Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.12/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_nobag_int4_codegen_cuda.cu(247): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:56:16.8281581Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:56:16.8282126Z ^ 2025-05-07T19:56:16.8282437Z 2025-05-07T19:56:16.8284004Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:56:16.8286572Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:56:16.8287749Z ^ 2025-05-07T19:56:16.8288043Z 2025-05-07T19:56:16.8288499Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:56:16.8289194Z 2025-05-07T19:56:16.8291041Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:56:16.8293743Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:56:16.8295223Z ^ 2025-05-07T19:56:16.8295611Z 2025-05-07T19:56:16.8297203Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.12/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_nobag_int4_codegen_cuda.cu(220): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:56:16.8299295Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:56:16.8299921Z ^ 2025-05-07T19:56:16.8300234Z 2025-05-07T19:56:16.8301814Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.12/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_nobag_int4_codegen_cuda.cu(237): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:56:16.8303725Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:56:16.8304292Z ^ 2025-05-07T19:56:16.8304610Z 2025-05-07T19:56:16.8306226Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.12/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_nobag_int4_codegen_cuda.cu(247): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:56:16.8308159Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:56:16.8308700Z ^ 2025-05-07T19:56:16.8309018Z 2025-05-07T19:56:16.8310660Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:56:16.8313484Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:56:16.8314688Z ^ 2025-05-07T19:56:16.8314967Z 2025-05-07T19:56:16.8315407Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:56:16.8316034Z 2025-05-07T19:56:16.8317685Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:56:16.8320347Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:56:16.8321554Z ^ 2025-05-07T19:56:16.8321905Z 2025-05-07T19:56:16.8323465Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.12/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_nobag_int4_codegen_cuda.cu(220): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:56:16.8325681Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:56:16.8326274Z ^ 2025-05-07T19:56:16.8326591Z 2025-05-07T19:56:16.8328202Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.12/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_nobag_int4_codegen_cuda.cu(237): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:56:16.8330239Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:56:16.8330797Z ^ 2025-05-07T19:56:16.8331131Z 2025-05-07T19:56:16.8332691Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.12/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_nobag_int4_codegen_cuda.cu(247): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:56:16.8334796Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:56:16.8335346Z ^ 2025-05-07T19:56:16.8335684Z 2025-05-07T19:56:16.8337304Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:56:16.8340167Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:56:16.8341359Z ^ 2025-05-07T19:56:16.8341629Z 2025-05-07T19:56:16.8342079Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:56:16.8342744Z 2025-05-07T19:56:16.8344410Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:56:16.8347096Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:56:16.8348323Z ^ 2025-05-07T19:56:16.8348670Z 2025-05-07T19:56:16.8350224Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.12/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_nobag_int4_codegen_cuda.cu(220): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:56:16.8352372Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:56:16.8352900Z ^ 2025-05-07T19:56:16.8353225Z 2025-05-07T19:56:16.8354815Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.12/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_nobag_int4_codegen_cuda.cu(237): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:56:16.8356851Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:56:16.8357387Z ^ 2025-05-07T19:56:16.8357710Z 2025-05-07T19:56:16.8359283Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.12/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_nobag_int4_codegen_cuda.cu(247): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:56:16.8361325Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:56:16.8361866Z ^ 2025-05-07T19:56:16.8362158Z 2025-05-07T19:56:30.6195741Z [248/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_inference_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_inference.dir/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_int4_codegen_cuda.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_inference.dir/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_int4_codegen_cuda.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.12/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_int4_codegen_cuda.cu -o CMakeFiles/fbgemm_gpu_tbe_inference.dir/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_int4_codegen_cuda.cu.o 2025-05-07T19:56:30.6218026Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:56:30.6220710Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:56:30.6221802Z ^ 2025-05-07T19:56:30.6222048Z 2025-05-07T19:56:30.6222482Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:56:30.6223120Z 2025-05-07T19:56:30.6224678Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:56:30.6227331Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:56:30.6228469Z ^ 2025-05-07T19:56:30.6228814Z 2025-05-07T19:56:30.6230315Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.12/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_int4_codegen_cuda.cu(254): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:56:30.6232584Z if (output_d >= 0 && output_d < D && packed_bag_store_idx < num_packed_bags) { 2025-05-07T19:56:30.6233317Z ^ 2025-05-07T19:56:30.6233590Z 2025-05-07T19:56:30.6235095Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.12/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_int4_codegen_cuda.cu(259): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:56:30.6237035Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:56:30.6237602Z ^ 2025-05-07T19:56:30.6238182Z 2025-05-07T19:56:30.6239676Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.12/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_int4_codegen_cuda.cu(276): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:56:30.6241596Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:56:30.6242167Z ^ 2025-05-07T19:56:30.6242464Z 2025-05-07T19:56:30.6243944Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.12/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_int4_codegen_cuda.cu(288): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:56:30.6245959Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:56:30.6246531Z ^ 2025-05-07T19:56:30.6246818Z 2025-05-07T19:56:30.6248607Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:56:30.6251303Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:56:30.6252719Z ^ 2025-05-07T19:56:30.6252988Z 2025-05-07T19:56:30.6253451Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:56:30.6254103Z 2025-05-07T19:56:30.6255721Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:56:30.6258435Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:56:30.6259665Z ^ 2025-05-07T19:56:30.6260048Z 2025-05-07T19:56:30.6261594Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.12/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_int4_codegen_cuda.cu(254): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:56:30.6263789Z if (output_d >= 0 && output_d < D && packed_bag_store_idx < num_packed_bags) { 2025-05-07T19:56:30.6264552Z ^ 2025-05-07T19:56:30.6265227Z 2025-05-07T19:56:30.6266950Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.12/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_int4_codegen_cuda.cu(259): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:56:30.6268843Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:56:30.6269387Z ^ 2025-05-07T19:56:30.6269711Z 2025-05-07T19:56:30.6271230Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.12/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_int4_codegen_cuda.cu(276): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:56:30.6273218Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:56:30.6273733Z ^ 2025-05-07T19:56:30.6273972Z 2025-05-07T19:56:30.6275270Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.12/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_int4_codegen_cuda.cu(288): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:56:30.6277124Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:56:30.6277688Z ^ 2025-05-07T19:56:30.6277964Z 2025-05-07T19:56:30.6279553Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:56:30.6282094Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:56:30.6283563Z ^ 2025-05-07T19:56:30.6283817Z 2025-05-07T19:56:30.6284254Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:56:30.6284768Z 2025-05-07T19:56:30.6286226Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:56:30.6288762Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:56:30.6289929Z ^ 2025-05-07T19:56:30.6290326Z 2025-05-07T19:56:30.6291841Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.12/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_int4_codegen_cuda.cu(254): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:56:30.6294239Z if (output_d >= 0 && output_d < D && packed_bag_store_idx < num_packed_bags) { 2025-05-07T19:56:30.6294983Z ^ 2025-05-07T19:56:30.6295271Z 2025-05-07T19:56:30.6297040Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.12/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_int4_codegen_cuda.cu(259): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:56:30.6299160Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:56:30.6299748Z ^ 2025-05-07T19:56:30.6300035Z 2025-05-07T19:56:30.6301623Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.12/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_int4_codegen_cuda.cu(276): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:56:30.6303608Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:56:30.6304183Z ^ 2025-05-07T19:56:30.6304444Z 2025-05-07T19:56:30.6305988Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.12/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_int4_codegen_cuda.cu(288): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:56:30.6308001Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:56:30.6308554Z ^ 2025-05-07T19:56:30.6308853Z 2025-05-07T19:56:30.6310567Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:56:30.6313416Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:56:30.6314509Z ^ 2025-05-07T19:56:30.6314743Z 2025-05-07T19:56:30.6315171Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:56:30.6315778Z 2025-05-07T19:56:30.6317391Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:56:30.6320076Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:56:30.6321319Z ^ 2025-05-07T19:56:30.6321692Z 2025-05-07T19:56:30.6323250Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.12/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_int4_codegen_cuda.cu(254): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:56:30.6325406Z if (output_d >= 0 && output_d < D && packed_bag_store_idx < num_packed_bags) { 2025-05-07T19:56:30.6330259Z ^ 2025-05-07T19:56:30.6330549Z 2025-05-07T19:56:30.6332132Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.12/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_int4_codegen_cuda.cu(259): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:56:30.6334072Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:56:30.6334688Z ^ 2025-05-07T19:56:30.6335007Z 2025-05-07T19:56:30.6336634Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.12/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_int4_codegen_cuda.cu(276): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:56:30.6338515Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:56:30.6339089Z ^ 2025-05-07T19:56:30.6339503Z 2025-05-07T19:56:30.6340990Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.12/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_int4_codegen_cuda.cu(288): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:56:30.6343094Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:56:30.6343678Z ^ 2025-05-07T19:56:30.6344004Z 2025-05-07T19:56:30.6345722Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:56:30.6348273Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:56:30.6349329Z ^ 2025-05-07T19:56:30.6349626Z 2025-05-07T19:56:30.6350051Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:56:30.6350668Z 2025-05-07T19:56:30.6352406Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:56:30.6355025Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:56:30.6356148Z ^ 2025-05-07T19:56:30.6356532Z 2025-05-07T19:56:30.6358016Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.12/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_int4_codegen_cuda.cu(254): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:56:30.6360156Z if (output_d >= 0 && output_d < D && packed_bag_store_idx < num_packed_bags) { 2025-05-07T19:56:30.6360917Z ^ 2025-05-07T19:56:30.6361254Z 2025-05-07T19:56:30.6362689Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.12/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_int4_codegen_cuda.cu(259): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:56:30.6364610Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:56:30.6365448Z ^ 2025-05-07T19:56:30.6365743Z 2025-05-07T19:56:30.6367288Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.12/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_int4_codegen_cuda.cu(276): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:56:30.6369142Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:56:30.6369691Z ^ 2025-05-07T19:56:30.6369949Z 2025-05-07T19:56:30.6371413Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.12/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_int4_codegen_cuda.cu(288): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:56:30.6373299Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:56:30.6374118Z ^ 2025-05-07T19:56:30.6374385Z 2025-05-07T19:56:36.0042458Z [249/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_inference_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_inference.dir/gen_embedding_forward_quantized_split_nbit_kernel_weighted_int4_codegen_cuda.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_inference.dir/gen_embedding_forward_quantized_split_nbit_kernel_weighted_int4_codegen_cuda.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.12/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_weighted_int4_codegen_cuda.cu -o CMakeFiles/fbgemm_gpu_tbe_inference.dir/gen_embedding_forward_quantized_split_nbit_kernel_weighted_int4_codegen_cuda.cu.o 2025-05-07T19:56:36.0067522Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:56:36.0070270Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:56:36.0071588Z ^ 2025-05-07T19:56:36.0071846Z 2025-05-07T19:56:36.0072331Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:56:36.0073169Z 2025-05-07T19:56:36.0074847Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:56:36.0077698Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:56:36.0078889Z ^ 2025-05-07T19:56:36.0079244Z 2025-05-07T19:56:36.0080782Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.12/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_weighted_int4_codegen_cuda.cu(270): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:56:36.0083117Z if (output_d >= 0 && output_d < D && packed_bag_store_idx < num_packed_bags) { 2025-05-07T19:56:36.0083850Z ^ 2025-05-07T19:56:36.0084137Z 2025-05-07T19:56:36.0085685Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.12/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_weighted_int4_codegen_cuda.cu(275): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:56:36.0087626Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:56:36.0088172Z ^ 2025-05-07T19:56:36.0088468Z 2025-05-07T19:56:36.0090024Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.12/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_weighted_int4_codegen_cuda.cu(292): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:56:36.0092258Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:56:36.0092812Z ^ 2025-05-07T19:56:36.0093086Z 2025-05-07T19:56:36.0094986Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.12/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_weighted_int4_codegen_cuda.cu(304): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:56:36.0097106Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:56:36.0097701Z ^ 2025-05-07T19:56:36.0097979Z 2025-05-07T19:56:36.0099664Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:56:36.0102386Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:56:36.0103601Z ^ 2025-05-07T19:56:36.0103863Z 2025-05-07T19:56:36.0104318Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:56:36.0105022Z 2025-05-07T19:56:36.0106673Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:56:36.0109454Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:56:36.0110674Z ^ 2025-05-07T19:56:36.0111064Z 2025-05-07T19:56:36.0112807Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.12/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_weighted_int4_codegen_cuda.cu(270): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:56:36.0114969Z if (output_d >= 0 && output_d < D && packed_bag_store_idx < num_packed_bags) { 2025-05-07T19:56:36.0115860Z ^ 2025-05-07T19:56:36.0116154Z 2025-05-07T19:56:36.0117646Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.12/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_weighted_int4_codegen_cuda.cu(275): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:56:36.0119499Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:56:36.0120043Z ^ 2025-05-07T19:56:36.0120333Z 2025-05-07T19:56:36.0121846Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.12/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_weighted_int4_codegen_cuda.cu(292): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:56:36.0123764Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:56:36.0124344Z ^ 2025-05-07T19:56:36.0124629Z 2025-05-07T19:56:36.0126152Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.12/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_weighted_int4_codegen_cuda.cu(304): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:56:36.0131339Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:56:36.0131894Z ^ 2025-05-07T19:56:36.0132165Z 2025-05-07T19:56:36.0133788Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:56:36.0136432Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:56:36.0137593Z ^ 2025-05-07T19:56:36.0137845Z 2025-05-07T19:56:36.0138257Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:56:36.0139103Z 2025-05-07T19:56:36.0140648Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:56:36.0143627Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:56:36.0144828Z ^ 2025-05-07T19:56:36.0145211Z 2025-05-07T19:56:36.0146824Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.12/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_weighted_int4_codegen_cuda.cu(270): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:56:36.0149159Z if (output_d >= 0 && output_d < D && packed_bag_store_idx < num_packed_bags) { 2025-05-07T19:56:36.0149949Z ^ 2025-05-07T19:56:36.0150265Z 2025-05-07T19:56:36.0152047Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.12/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_weighted_int4_codegen_cuda.cu(275): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:56:36.0153996Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:56:36.0154577Z ^ 2025-05-07T19:56:36.0154858Z 2025-05-07T19:56:36.0156355Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.12/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_weighted_int4_codegen_cuda.cu(292): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:56:36.0158381Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:56:36.0158980Z ^ 2025-05-07T19:56:36.0159262Z 2025-05-07T19:56:36.0160815Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.12/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_weighted_int4_codegen_cuda.cu(304): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:56:36.0162856Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:56:36.0163508Z ^ 2025-05-07T19:56:36.0163803Z 2025-05-07T19:56:36.0165643Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:56:36.0168454Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:56:36.0169604Z ^ 2025-05-07T19:56:36.0169901Z 2025-05-07T19:56:36.0170336Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:56:36.0170983Z 2025-05-07T19:56:36.0172620Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:56:36.0175654Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:56:36.0176861Z ^ 2025-05-07T19:56:36.0177206Z 2025-05-07T19:56:36.0178710Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.12/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_weighted_int4_codegen_cuda.cu(270): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:56:36.0180734Z if (output_d >= 0 && output_d < D && packed_bag_store_idx < num_packed_bags) { 2025-05-07T19:56:36.0181489Z ^ 2025-05-07T19:56:36.0181770Z 2025-05-07T19:56:36.0183399Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.12/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_weighted_int4_codegen_cuda.cu(275): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:56:36.0185579Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:56:36.0186170Z ^ 2025-05-07T19:56:36.0186625Z 2025-05-07T19:56:36.0188375Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.12/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_weighted_int4_codegen_cuda.cu(292): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:56:36.0190350Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:56:36.0190900Z ^ 2025-05-07T19:56:36.0191194Z 2025-05-07T19:56:36.0192880Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.12/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_weighted_int4_codegen_cuda.cu(304): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:56:36.0194881Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:56:36.0195411Z ^ 2025-05-07T19:56:36.0195711Z 2025-05-07T19:56:36.0197333Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:56:36.0199994Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:56:36.0201150Z ^ 2025-05-07T19:56:36.0201401Z 2025-05-07T19:56:36.0201859Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:56:36.0202546Z 2025-05-07T19:56:36.0204187Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:56:36.0207141Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:56:36.0208311Z ^ 2025-05-07T19:56:36.0208681Z 2025-05-07T19:56:36.0210171Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.12/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_weighted_int4_codegen_cuda.cu(270): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:56:36.0212282Z if (output_d >= 0 && output_d < D && packed_bag_store_idx < num_packed_bags) { 2025-05-07T19:56:36.0212989Z ^ 2025-05-07T19:56:36.0213285Z 2025-05-07T19:56:36.0214763Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.12/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_weighted_int4_codegen_cuda.cu(275): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:56:36.0216649Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:56:36.0217180Z ^ 2025-05-07T19:56:36.0217487Z 2025-05-07T19:56:36.0218990Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.12/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_weighted_int4_codegen_cuda.cu(292): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:56:36.0220979Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:56:36.0221497Z ^ 2025-05-07T19:56:36.0221763Z 2025-05-07T19:56:36.0223268Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.12/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_weighted_int4_codegen_cuda.cu(304): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:56:36.0225144Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:56:36.0225698Z ^ 2025-05-07T19:56:36.0225965Z 2025-05-07T19:56:42.2397034Z [250/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_inference_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_inference.dir/gen_embedding_forward_quantized_split_nbit_kernel_weighted_fp8_codegen_cuda.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_inference.dir/gen_embedding_forward_quantized_split_nbit_kernel_weighted_fp8_codegen_cuda.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.12/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_weighted_fp8_codegen_cuda.cu -o CMakeFiles/fbgemm_gpu_tbe_inference.dir/gen_embedding_forward_quantized_split_nbit_kernel_weighted_fp8_codegen_cuda.cu.o 2025-05-07T19:56:42.2422819Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:56:42.2425833Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:56:42.2427074Z ^ 2025-05-07T19:56:42.2427340Z 2025-05-07T19:56:42.2427839Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:56:42.2428626Z 2025-05-07T19:56:42.2430442Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:56:42.2433911Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:56:42.2435255Z ^ 2025-05-07T19:56:42.2435663Z 2025-05-07T19:56:42.2437351Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.12/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_weighted_fp8_codegen_cuda.cu(269): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:56:42.2439647Z if (output_d >= 0 && output_d < D && packed_bag_store_idx < num_packed_bags) { 2025-05-07T19:56:42.2440663Z ^ 2025-05-07T19:56:42.2440988Z 2025-05-07T19:56:42.2442642Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.12/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_weighted_fp8_codegen_cuda.cu(274): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:56:42.2444797Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:56:42.2445446Z ^ 2025-05-07T19:56:42.2445941Z 2025-05-07T19:56:42.2447608Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.12/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_weighted_fp8_codegen_cuda.cu(291): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:56:42.2449736Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:56:42.2450354Z ^ 2025-05-07T19:56:42.2450699Z 2025-05-07T19:56:42.2452333Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.12/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_weighted_fp8_codegen_cuda.cu(303): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:56:42.2454471Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:56:42.2455104Z ^ 2025-05-07T19:56:42.2455445Z 2025-05-07T19:56:42.2457241Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:56:42.2460160Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:56:42.2461466Z ^ 2025-05-07T19:56:42.2461754Z 2025-05-07T19:56:42.2462273Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:56:42.2463004Z 2025-05-07T19:56:42.2465097Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:56:42.2467959Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:56:42.2469006Z ^ 2025-05-07T19:56:42.2469411Z 2025-05-07T19:56:42.2471095Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.12/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_weighted_fp8_codegen_cuda.cu(269): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:56:42.2473370Z if (output_d >= 0 && output_d < D && packed_bag_store_idx < num_packed_bags) { 2025-05-07T19:56:42.2474164Z ^ 2025-05-07T19:56:42.2474485Z 2025-05-07T19:56:42.2476153Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.12/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_weighted_fp8_codegen_cuda.cu(274): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:56:42.2478270Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:56:42.2479180Z ^ 2025-05-07T19:56:42.2479532Z 2025-05-07T19:56:42.2481193Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.12/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_weighted_fp8_codegen_cuda.cu(291): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:56:42.2483301Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:56:42.2483800Z ^ 2025-05-07T19:56:42.2484115Z 2025-05-07T19:56:42.2485764Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.12/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_weighted_fp8_codegen_cuda.cu(303): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:56:42.2487862Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:56:42.2488579Z ^ 2025-05-07T19:56:42.2488841Z 2025-05-07T19:56:42.2490641Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:56:42.2493670Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:56:42.2494999Z ^ 2025-05-07T19:56:42.2495289Z 2025-05-07T19:56:42.2495809Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:56:42.2496536Z 2025-05-07T19:56:42.2498348Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:56:42.2501292Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:56:42.2502509Z ^ 2025-05-07T19:56:42.2502963Z 2025-05-07T19:56:42.2504598Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.12/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_weighted_fp8_codegen_cuda.cu(269): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:56:42.2506818Z if (output_d >= 0 && output_d < D && packed_bag_store_idx < num_packed_bags) { 2025-05-07T19:56:42.2507605Z ^ 2025-05-07T19:56:42.2507962Z 2025-05-07T19:56:42.2509609Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.12/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_weighted_fp8_codegen_cuda.cu(274): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:56:42.2511910Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:56:42.2512413Z ^ 2025-05-07T19:56:42.2512691Z 2025-05-07T19:56:42.2514379Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.12/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_weighted_fp8_codegen_cuda.cu(291): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:56:42.2516481Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:56:42.2517135Z ^ 2025-05-07T19:56:42.2517463Z 2025-05-07T19:56:42.2519147Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.12/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_weighted_fp8_codegen_cuda.cu(303): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:56:42.2521258Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:56:42.2521912Z ^ 2025-05-07T19:56:42.2522224Z 2025-05-07T19:56:42.2523993Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:56:42.2527120Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:56:42.2528447Z ^ 2025-05-07T19:56:42.2528730Z 2025-05-07T19:56:42.2529234Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:56:42.2529989Z 2025-05-07T19:56:42.2531786Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:56:42.2534722Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:56:42.2536033Z ^ 2025-05-07T19:56:42.2536660Z 2025-05-07T19:56:42.2538305Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.12/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_weighted_fp8_codegen_cuda.cu(269): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:56:42.2540670Z if (output_d >= 0 && output_d < D && packed_bag_store_idx < num_packed_bags) { 2025-05-07T19:56:42.2541539Z ^ 2025-05-07T19:56:42.2541863Z 2025-05-07T19:56:42.2543693Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.12/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_weighted_fp8_codegen_cuda.cu(274): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:56:42.2545851Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:56:42.2546511Z ^ 2025-05-07T19:56:42.2546832Z 2025-05-07T19:56:42.2548475Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.12/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_weighted_fp8_codegen_cuda.cu(291): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:56:42.2550454Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:56:42.2551088Z ^ 2025-05-07T19:56:42.2551584Z 2025-05-07T19:56:42.2553233Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.12/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_weighted_fp8_codegen_cuda.cu(303): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:56:42.2555371Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:56:42.2555996Z ^ 2025-05-07T19:56:42.2556310Z 2025-05-07T19:56:42.2558106Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:56:42.2561006Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:56:42.2562347Z ^ 2025-05-07T19:56:42.2562645Z 2025-05-07T19:56:42.2563169Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:56:42.2563897Z 2025-05-07T19:56:42.2565979Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:56:42.2568932Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:56:42.2570271Z ^ 2025-05-07T19:56:42.2570673Z 2025-05-07T19:56:42.2572324Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.12/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_weighted_fp8_codegen_cuda.cu(269): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:56:42.2574601Z if (output_d >= 0 && output_d < D && packed_bag_store_idx < num_packed_bags) { 2025-05-07T19:56:42.2575713Z ^ 2025-05-07T19:56:42.2576062Z 2025-05-07T19:56:42.2577722Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.12/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_weighted_fp8_codegen_cuda.cu(274): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:56:42.2579812Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:56:42.2580434Z ^ 2025-05-07T19:56:42.2580782Z 2025-05-07T19:56:42.2582439Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.12/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_weighted_fp8_codegen_cuda.cu(291): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:56:42.2584531Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:56:42.2585360Z ^ 2025-05-07T19:56:42.2585671Z 2025-05-07T19:56:42.2587303Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.12/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_weighted_fp8_codegen_cuda.cu(303): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:56:42.2589391Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:56:42.2590047Z ^ 2025-05-07T19:56:42.2590610Z 2025-05-07T19:56:43.9211645Z [251/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_inference_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_inference.dir/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_fp8_codegen_cuda.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_inference.dir/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_fp8_codegen_cuda.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.12/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_fp8_codegen_cuda.cu -o CMakeFiles/fbgemm_gpu_tbe_inference.dir/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_fp8_codegen_cuda.cu.o 2025-05-07T19:56:43.9236296Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:56:43.9239435Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:56:43.9240629Z ^ 2025-05-07T19:56:43.9240920Z 2025-05-07T19:56:43.9241364Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:56:43.9242044Z 2025-05-07T19:56:43.9243779Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:56:43.9246480Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:56:43.9248142Z ^ 2025-05-07T19:56:43.9248534Z 2025-05-07T19:56:43.9250401Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.12/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_fp8_codegen_cuda.cu(253): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:56:43.9252684Z if (output_d >= 0 && output_d < D && packed_bag_store_idx < num_packed_bags) { 2025-05-07T19:56:43.9253515Z ^ 2025-05-07T19:56:43.9253823Z 2025-05-07T19:56:43.9255480Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.12/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_fp8_codegen_cuda.cu(258): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:56:43.9257527Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:56:43.9258134Z ^ 2025-05-07T19:56:43.9258432Z 2025-05-07T19:56:43.9260060Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.12/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_fp8_codegen_cuda.cu(275): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:56:43.9262138Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:56:43.9262693Z ^ 2025-05-07T19:56:43.9263018Z 2025-05-07T19:56:43.9264649Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.12/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_fp8_codegen_cuda.cu(287): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:56:43.9266966Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:56:43.9267532Z ^ 2025-05-07T19:56:43.9267847Z 2025-05-07T19:56:43.9269384Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:56:43.9272302Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:56:43.9273486Z ^ 2025-05-07T19:56:43.9273750Z 2025-05-07T19:56:43.9274229Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:56:43.9274932Z 2025-05-07T19:56:43.9276517Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:56:43.9279160Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:56:43.9280370Z ^ 2025-05-07T19:56:43.9280742Z 2025-05-07T19:56:43.9282118Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.12/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_fp8_codegen_cuda.cu(253): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:56:43.9284560Z if (output_d >= 0 && output_d < D && packed_bag_store_idx < num_packed_bags) { 2025-05-07T19:56:43.9285393Z ^ 2025-05-07T19:56:43.9285711Z 2025-05-07T19:56:43.9287379Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.12/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_fp8_codegen_cuda.cu(258): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:56:43.9289245Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:56:43.9289781Z ^ 2025-05-07T19:56:43.9290091Z 2025-05-07T19:56:43.9291687Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.12/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_fp8_codegen_cuda.cu(275): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:56:43.9294012Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:56:43.9294529Z ^ 2025-05-07T19:56:43.9294779Z 2025-05-07T19:56:43.9296394Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.12/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_fp8_codegen_cuda.cu(287): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:56:43.9298226Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:56:43.9298802Z ^ 2025-05-07T19:56:43.9299084Z 2025-05-07T19:56:43.9300718Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:56:43.9303116Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:56:43.9304251Z ^ 2025-05-07T19:56:43.9304516Z 2025-05-07T19:56:43.9304971Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:56:43.9305626Z 2025-05-07T19:56:43.9307435Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:56:43.9310127Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:56:43.9311330Z ^ 2025-05-07T19:56:43.9311871Z 2025-05-07T19:56:43.9313494Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.12/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_fp8_codegen_cuda.cu(253): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:56:43.9315752Z if (output_d >= 0 && output_d < D && packed_bag_store_idx < num_packed_bags) { 2025-05-07T19:56:43.9316533Z ^ 2025-05-07T19:56:43.9316853Z 2025-05-07T19:56:43.9318448Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.12/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_fp8_codegen_cuda.cu(258): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:56:43.9320498Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:56:43.9321073Z ^ 2025-05-07T19:56:43.9321364Z 2025-05-07T19:56:43.9322994Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.12/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_fp8_codegen_cuda.cu(275): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:56:43.9325051Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:56:43.9325674Z ^ 2025-05-07T19:56:43.9325974Z 2025-05-07T19:56:43.9327623Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.12/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_fp8_codegen_cuda.cu(287): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:56:43.9329769Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:56:43.9330381Z ^ 2025-05-07T19:56:43.9330616Z 2025-05-07T19:56:43.9332258Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:56:43.9334831Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:56:43.9335998Z ^ 2025-05-07T19:56:43.9336247Z 2025-05-07T19:56:43.9336823Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:56:43.9337646Z 2025-05-07T19:56:43.9339337Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:56:43.9342311Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:56:43.9343621Z ^ 2025-05-07T19:56:43.9343993Z 2025-05-07T19:56:43.9345337Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.12/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_fp8_codegen_cuda.cu(253): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:56:43.9347348Z if (output_d >= 0 && output_d < D && packed_bag_store_idx < num_packed_bags) { 2025-05-07T19:56:43.9348061Z ^ 2025-05-07T19:56:43.9348344Z 2025-05-07T19:56:43.9349891Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.12/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_fp8_codegen_cuda.cu(258): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:56:43.9351960Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:56:43.9352486Z ^ 2025-05-07T19:56:43.9352748Z 2025-05-07T19:56:43.9354277Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.12/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_fp8_codegen_cuda.cu(275): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:56:43.9356244Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:56:43.9356819Z ^ 2025-05-07T19:56:43.9357093Z 2025-05-07T19:56:43.9358669Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.12/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_fp8_codegen_cuda.cu(287): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:56:43.9360706Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:56:43.9361248Z ^ 2025-05-07T19:56:43.9361527Z 2025-05-07T19:56:43.9362996Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:56:43.9366040Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:56:43.9367373Z ^ 2025-05-07T19:56:43.9367619Z 2025-05-07T19:56:43.9368035Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:56:43.9368618Z 2025-05-07T19:56:43.9370290Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:56:43.9373052Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:56:43.9374242Z ^ 2025-05-07T19:56:43.9374582Z 2025-05-07T19:56:43.9375978Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.12/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_fp8_codegen_cuda.cu(253): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:56:43.9377765Z if (output_d >= 0 && output_d < D && packed_bag_store_idx < num_packed_bags) { 2025-05-07T19:56:43.9378396Z ^ 2025-05-07T19:56:43.9378623Z 2025-05-07T19:56:43.9379898Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.12/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_fp8_codegen_cuda.cu(258): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:56:43.9382044Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:56:43.9382606Z ^ 2025-05-07T19:56:43.9382893Z 2025-05-07T19:56:43.9384631Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.12/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_fp8_codegen_cuda.cu(275): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:56:43.9386561Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:56:43.9387352Z ^ 2025-05-07T19:56:43.9387613Z 2025-05-07T19:56:43.9389010Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.12/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_fp8_codegen_cuda.cu(287): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:56:43.9390919Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:56:43.9391605Z ^ 2025-05-07T19:56:43.9391883Z 2025-05-07T19:56:45.5577710Z [252/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_rowwise_adagrad_ssd_weighted_cuda.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_rowwise_adagrad_ssd_weighted_cuda.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.12/cmake-build/gen_embedding_backward_rowwise_adagrad_ssd_weighted_cuda.cu -o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_rowwise_adagrad_ssd_weighted_cuda.cu.o 2025-05-07T19:56:45.5602233Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:56:45.5605094Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:56:45.5606617Z ^ 2025-05-07T19:56:45.5606893Z 2025-05-07T19:56:45.5607402Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:56:45.5608108Z 2025-05-07T19:56:45.5610135Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:56:45.5613184Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:56:45.5614463Z ^ 2025-05-07T19:56:45.5615063Z 2025-05-07T19:56:45.5616672Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:56:45.5619401Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:56:45.5620498Z ^ 2025-05-07T19:56:45.5620808Z 2025-05-07T19:56:45.5621259Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:56:45.5621946Z 2025-05-07T19:56:45.5623658Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:56:45.5626463Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:56:45.5627765Z ^ 2025-05-07T19:56:45.5628139Z 2025-05-07T19:56:45.5629790Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:56:45.5632852Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:56:45.5633980Z ^ 2025-05-07T19:56:45.5634434Z 2025-05-07T19:56:45.5634919Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:56:45.5635660Z 2025-05-07T19:56:45.5637392Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:56:45.5640310Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:56:45.5641624Z ^ 2025-05-07T19:56:45.5641986Z 2025-05-07T19:56:45.5643874Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:56:45.5646687Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:56:45.5647856Z ^ 2025-05-07T19:56:45.5648147Z 2025-05-07T19:56:45.5648658Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:56:45.5649336Z 2025-05-07T19:56:45.5651095Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:56:45.5654203Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:56:45.5655609Z ^ 2025-05-07T19:56:45.5655986Z 2025-05-07T19:56:45.5657802Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:56:45.5660479Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:56:45.5661682Z ^ 2025-05-07T19:56:45.5661961Z 2025-05-07T19:56:45.5662422Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:56:45.5663124Z 2025-05-07T19:56:45.5665085Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:56:45.5667900Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:56:45.5669081Z ^ 2025-05-07T19:56:45.5669450Z 2025-05-07T19:56:47.8919669Z [253/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_rowwise_adagrad_ssd_unweighted_nobag_cuda.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_rowwise_adagrad_ssd_unweighted_nobag_cuda.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.12/cmake-build/gen_embedding_backward_rowwise_adagrad_ssd_unweighted_nobag_cuda.cu -o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_rowwise_adagrad_ssd_unweighted_nobag_cuda.cu.o 2025-05-07T19:56:47.8943246Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:56:47.8945806Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:56:47.8947340Z ^ 2025-05-07T19:56:47.8947615Z 2025-05-07T19:56:47.8948048Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:56:47.8948717Z 2025-05-07T19:56:47.8950335Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:56:47.8953112Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:56:47.8954267Z ^ 2025-05-07T19:56:47.8954683Z 2025-05-07T19:56:47.8956265Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:56:47.8958868Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:56:47.8960042Z ^ 2025-05-07T19:56:47.8960287Z 2025-05-07T19:56:47.8960735Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:56:47.8961385Z 2025-05-07T19:56:47.8962964Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:56:47.8966099Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:56:47.8967371Z ^ 2025-05-07T19:56:47.8967781Z 2025-05-07T19:56:47.8969401Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:56:47.8971864Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:56:47.8972999Z ^ 2025-05-07T19:56:47.8973294Z 2025-05-07T19:56:47.8973719Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:56:47.8974366Z 2025-05-07T19:56:47.8976011Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:56:47.8978895Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:56:47.8980075Z ^ 2025-05-07T19:56:47.8980441Z 2025-05-07T19:56:47.8982059Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:56:47.8984615Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:56:47.8985769Z ^ 2025-05-07T19:56:47.8986013Z 2025-05-07T19:56:47.8986436Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:56:47.8987339Z 2025-05-07T19:56:47.8988886Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:56:47.8991893Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:56:47.8993171Z ^ 2025-05-07T19:56:47.8993588Z 2025-05-07T19:56:47.8995181Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:56:47.8997625Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:56:47.8998761Z ^ 2025-05-07T19:56:47.8999050Z 2025-05-07T19:56:47.8999482Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:56:47.9000142Z 2025-05-07T19:56:47.9001750Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:56:47.9004322Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:56:47.9005486Z ^ 2025-05-07T19:56:47.9005844Z 2025-05-07T19:56:49.7581331Z [254/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_rowwise_adagrad_ssd_unweighted_cuda.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_rowwise_adagrad_ssd_unweighted_cuda.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.12/cmake-build/gen_embedding_backward_rowwise_adagrad_ssd_unweighted_cuda.cu -o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_rowwise_adagrad_ssd_unweighted_cuda.cu.o 2025-05-07T19:56:49.7608438Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:56:49.7611198Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:56:49.7612419Z ^ 2025-05-07T19:56:49.7612674Z 2025-05-07T19:56:49.7613155Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:56:49.7613848Z 2025-05-07T19:56:49.7615562Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:56:49.7618271Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:56:49.7619391Z ^ 2025-05-07T19:56:49.7619767Z 2025-05-07T19:56:49.7621420Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:56:49.7624091Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:56:49.7625243Z ^ 2025-05-07T19:56:49.7625522Z 2025-05-07T19:56:49.7625975Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:56:49.7626660Z 2025-05-07T19:56:49.7628374Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:56:49.7630839Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:56:49.7632202Z ^ 2025-05-07T19:56:49.7632556Z 2025-05-07T19:56:49.7634145Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:56:49.7636810Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:56:49.7638063Z ^ 2025-05-07T19:56:49.7638325Z 2025-05-07T19:56:49.7638984Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:56:49.7639657Z 2025-05-07T19:56:49.7641366Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:56:49.7644102Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:56:49.7645258Z ^ 2025-05-07T19:56:49.7645586Z 2025-05-07T19:56:49.7647048Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:56:49.7649892Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:56:49.7651101Z ^ 2025-05-07T19:56:49.7651393Z 2025-05-07T19:56:49.7651859Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:56:49.7652554Z 2025-05-07T19:56:49.7654423Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:56:49.7656946Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:56:49.7658155Z ^ 2025-05-07T19:56:49.7658494Z 2025-05-07T19:56:49.7659984Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:56:49.7662645Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:56:49.7663855Z ^ 2025-05-07T19:56:49.7664120Z 2025-05-07T19:56:49.7664598Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:56:49.7665571Z 2025-05-07T19:56:49.7667309Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:56:49.7670081Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:56:49.7671208Z ^ 2025-05-07T19:56:49.7671707Z 2025-05-07T19:56:52.3858298Z [255/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_adagrad_split_unweighted_nobag_cuda.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_adagrad_split_unweighted_nobag_cuda.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.12/cmake-build/gen_embedding_backward_adagrad_split_unweighted_nobag_cuda.cu -o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_adagrad_split_unweighted_nobag_cuda.cu.o 2025-05-07T19:56:52.3881802Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:56:52.3884420Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:56:52.3885663Z ^ 2025-05-07T19:56:52.3885933Z 2025-05-07T19:56:52.3886425Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:56:52.3887096Z 2025-05-07T19:56:52.3888771Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:56:52.3891345Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:56:52.3892567Z ^ 2025-05-07T19:56:52.3892938Z 2025-05-07T19:56:52.3894502Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:56:52.3897203Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:56:52.3898406Z ^ 2025-05-07T19:56:52.3898674Z 2025-05-07T19:56:52.3899105Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:56:52.3899757Z 2025-05-07T19:56:52.3901391Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:56:52.3904026Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:56:52.3905235Z ^ 2025-05-07T19:56:52.3905613Z 2025-05-07T19:56:52.3907251Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:56:52.3910157Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:56:52.3911319Z ^ 2025-05-07T19:56:52.3911726Z 2025-05-07T19:56:52.3912166Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:56:52.3912843Z 2025-05-07T19:56:52.3914487Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:56:52.3917192Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:56:52.3918633Z ^ 2025-05-07T19:56:52.3919020Z 2025-05-07T19:56:52.3920582Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:56:52.3923501Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:56:52.3924697Z ^ 2025-05-07T19:56:52.3924982Z 2025-05-07T19:56:52.3925397Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:56:52.3926059Z 2025-05-07T19:56:52.3927675Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:56:52.3930309Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:56:52.3931538Z ^ 2025-05-07T19:56:52.3931911Z 2025-05-07T19:56:52.3933485Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:56:52.3936113Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:56:52.3937281Z ^ 2025-05-07T19:56:52.3937542Z 2025-05-07T19:56:52.3938009Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:56:52.3938638Z 2025-05-07T19:56:52.3940247Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:56:52.3942904Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:56:52.3944090Z ^ 2025-05-07T19:56:52.3944488Z 2025-05-07T19:56:52.6986687Z [256/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_adagrad_split_weighted_cuda.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_adagrad_split_weighted_cuda.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.12/cmake-build/gen_embedding_backward_adagrad_split_weighted_cuda.cu -o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_adagrad_split_weighted_cuda.cu.o 2025-05-07T19:56:52.7010894Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:56:52.7013680Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:56:52.7014875Z ^ 2025-05-07T19:56:52.7015150Z 2025-05-07T19:56:52.7015637Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:56:52.7016249Z 2025-05-07T19:56:52.7033801Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:56:52.7036984Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:56:52.7038190Z ^ 2025-05-07T19:56:52.7038596Z 2025-05-07T19:56:52.7040238Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:56:52.7043002Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:56:52.7044234Z ^ 2025-05-07T19:56:52.7044491Z 2025-05-07T19:56:52.7044917Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:56:52.7045627Z 2025-05-07T19:56:52.7047344Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:56:52.7050158Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:56:52.7051595Z ^ 2025-05-07T19:56:52.7051997Z 2025-05-07T19:56:52.7053729Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:56:52.7056429Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:56:52.7057733Z ^ 2025-05-07T19:56:52.7058031Z 2025-05-07T19:56:52.7058512Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:56:52.7059193Z 2025-05-07T19:56:52.7061132Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:56:52.7064120Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:56:52.7065678Z ^ 2025-05-07T19:56:52.7066063Z 2025-05-07T19:56:52.7068149Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:56:52.7070948Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:56:52.7072189Z ^ 2025-05-07T19:56:52.7072456Z 2025-05-07T19:56:52.7072920Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:56:52.7073631Z 2025-05-07T19:56:52.7075349Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:56:52.7078076Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:56:52.7079316Z ^ 2025-05-07T19:56:52.7079723Z 2025-05-07T19:56:52.7081354Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:56:52.7084005Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:56:52.7085202Z ^ 2025-05-07T19:56:52.7085472Z 2025-05-07T19:56:52.7085961Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:56:52.7086786Z 2025-05-07T19:56:52.7088731Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:56:52.7091609Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:56:52.7092898Z ^ 2025-05-07T19:56:52.7093285Z 2025-05-07T19:56:55.7384479Z [257/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_adagrad_split_unweighted_cuda.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_adagrad_split_unweighted_cuda.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.12/cmake-build/gen_embedding_backward_adagrad_split_unweighted_cuda.cu -o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_adagrad_split_unweighted_cuda.cu.o 2025-05-07T19:56:55.7409682Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:56:55.7412788Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:56:55.7414028Z ^ 2025-05-07T19:56:55.7414286Z 2025-05-07T19:56:55.7414747Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:56:55.7415405Z 2025-05-07T19:56:55.7417023Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:56:55.7419799Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:56:55.7421039Z ^ 2025-05-07T19:56:55.7421446Z 2025-05-07T19:56:55.7423168Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:56:55.7426300Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:56:55.7427542Z ^ 2025-05-07T19:56:55.7427847Z 2025-05-07T19:56:55.7428328Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:56:55.7429041Z 2025-05-07T19:56:55.7430786Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:56:55.7434096Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:56:55.7435391Z ^ 2025-05-07T19:56:55.7435778Z 2025-05-07T19:56:55.7437505Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:56:55.7440207Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:56:55.7441412Z ^ 2025-05-07T19:56:55.7441821Z 2025-05-07T19:56:55.7442282Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:56:55.7442964Z 2025-05-07T19:56:55.7444582Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:56:55.7447559Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:56:55.7448800Z ^ 2025-05-07T19:56:55.7449170Z 2025-05-07T19:56:55.7450824Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:56:55.7453801Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:56:55.7455020Z ^ 2025-05-07T19:56:55.7455276Z 2025-05-07T19:56:55.7455777Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:56:55.7456482Z 2025-05-07T19:56:55.7458227Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:56:55.7461082Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:56:55.7462216Z ^ 2025-05-07T19:56:55.7462594Z 2025-05-07T19:56:55.7464336Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:56:55.7467685Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:56:55.7468946Z ^ 2025-05-07T19:56:55.7469203Z 2025-05-07T19:56:55.7469646Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:56:55.7470348Z 2025-05-07T19:56:55.7472130Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:56:55.7474862Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:56:55.7476124Z ^ 2025-05-07T19:56:55.7476507Z 2025-05-07T19:57:02.9687060Z [258/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_adagrad_split_weighted_kernel_warp.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_adagrad_split_weighted_kernel_warp.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.12/cmake-build/gen_embedding_backward_adagrad_split_weighted_kernel_warp.cu -o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_adagrad_split_weighted_kernel_warp.cu.o 2025-05-07T19:57:02.9709634Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:57:02.9712427Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:57:02.9713552Z ^ 2025-05-07T19:57:02.9713859Z 2025-05-07T19:57:02.9714332Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:57:02.9714975Z 2025-05-07T19:57:02.9716524Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:57:02.9719186Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:57:02.9720370Z ^ 2025-05-07T19:57:02.9720735Z 2025-05-07T19:57:02.9722317Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:57:02.9724816Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:57:02.9726005Z ^ 2025-05-07T19:57:02.9726516Z 2025-05-07T19:57:02.9726937Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:57:02.9727582Z 2025-05-07T19:57:02.9729243Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:57:02.9731855Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:57:02.9733065Z ^ 2025-05-07T19:57:02.9733412Z 2025-05-07T19:57:02.9735046Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:57:02.9737842Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:57:02.9739058Z ^ 2025-05-07T19:57:02.9739325Z 2025-05-07T19:57:02.9739805Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:57:02.9740479Z 2025-05-07T19:57:02.9742363Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:57:02.9744994Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:57:02.9746187Z ^ 2025-05-07T19:57:02.9746549Z 2025-05-07T19:57:02.9748136Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:57:02.9750547Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:57:02.9751610Z ^ 2025-05-07T19:57:02.9751868Z 2025-05-07T19:57:02.9752223Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:57:02.9752738Z 2025-05-07T19:57:02.9754068Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:57:02.9756227Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:57:02.9757204Z ^ 2025-05-07T19:57:02.9757485Z 2025-05-07T19:57:02.9758789Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:57:02.9760893Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:57:02.9761862Z ^ 2025-05-07T19:57:02.9762070Z 2025-05-07T19:57:02.9762420Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:57:02.9762962Z 2025-05-07T19:57:02.9764279Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:57:02.9766763Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:57:02.9768061Z ^ 2025-05-07T19:57:02.9768364Z 2025-05-07T19:57:07.0761431Z [259/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_adagrad_split_unweighted_nobag_kernel_warp.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_adagrad_split_unweighted_nobag_kernel_warp.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.12/cmake-build/gen_embedding_backward_adagrad_split_unweighted_nobag_kernel_warp.cu -o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_adagrad_split_unweighted_nobag_kernel_warp.cu.o 2025-05-07T19:57:07.0786185Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:57:07.0788789Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:57:07.0789967Z ^ 2025-05-07T19:57:07.0790197Z 2025-05-07T19:57:07.0790648Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:57:07.0791255Z 2025-05-07T19:57:07.0792927Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:57:07.0795724Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:57:07.0796939Z ^ 2025-05-07T19:57:07.0797344Z 2025-05-07T19:57:07.0798856Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:57:07.0801800Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:57:07.0803139Z ^ 2025-05-07T19:57:07.0803426Z 2025-05-07T19:57:07.0803882Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:57:07.0804570Z 2025-05-07T19:57:07.0806343Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:57:07.0809063Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:57:07.0810467Z ^ 2025-05-07T19:57:07.0810852Z 2025-05-07T19:57:07.0812527Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:57:07.0815588Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:57:07.0816957Z ^ 2025-05-07T19:57:07.0817214Z 2025-05-07T19:57:07.0817641Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:57:07.0818261Z 2025-05-07T19:57:07.0819867Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:57:07.0822435Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:57:07.0823433Z ^ 2025-05-07T19:57:07.0823778Z 2025-05-07T19:57:07.0825293Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:57:07.0827809Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:57:07.0828848Z ^ 2025-05-07T19:57:07.0829096Z 2025-05-07T19:57:07.0829514Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:57:07.0830094Z 2025-05-07T19:57:07.0831871Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:57:07.0834607Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:57:07.0835834Z ^ 2025-05-07T19:57:07.0836241Z 2025-05-07T19:57:07.0837966Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:57:07.0840751Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:57:07.0842101Z ^ 2025-05-07T19:57:07.0842372Z 2025-05-07T19:57:07.0842852Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:57:07.0843755Z 2025-05-07T19:57:07.0845540Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:57:07.0848646Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:57:07.0849923Z ^ 2025-05-07T19:57:07.0850307Z 2025-05-07T19:57:09.0260480Z [260/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_rowwise_adagrad_split_weighted_cuda.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_rowwise_adagrad_split_weighted_cuda.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.12/cmake-build/gen_embedding_backward_rowwise_adagrad_split_weighted_cuda.cu -o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_rowwise_adagrad_split_weighted_cuda.cu.o 2025-05-07T19:57:09.0283147Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:57:09.0285812Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:57:09.0286996Z ^ 2025-05-07T19:57:09.0287255Z 2025-05-07T19:57:09.0287674Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:57:09.0288315Z 2025-05-07T19:57:09.0289919Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:57:09.0292509Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:57:09.0293996Z ^ 2025-05-07T19:57:09.0294375Z 2025-05-07T19:57:09.0296060Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:57:09.0298711Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:57:09.0299889Z ^ 2025-05-07T19:57:09.0300156Z 2025-05-07T19:57:09.0300630Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:57:09.0301298Z 2025-05-07T19:57:09.0302940Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:57:09.0305756Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:57:09.0306934Z ^ 2025-05-07T19:57:09.0307585Z 2025-05-07T19:57:09.0309149Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:57:09.0311752Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:57:09.0312893Z ^ 2025-05-07T19:57:09.0313151Z 2025-05-07T19:57:09.0313595Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:57:09.0314268Z 2025-05-07T19:57:09.0315869Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:57:09.0318380Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:57:09.0319551Z ^ 2025-05-07T19:57:09.0319911Z 2025-05-07T19:57:09.0321500Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:57:09.0324077Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:57:09.0325238Z ^ 2025-05-07T19:57:09.0325498Z 2025-05-07T19:57:09.0325971Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:57:09.0326607Z 2025-05-07T19:57:09.0328184Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:57:09.0330739Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:57:09.0331886Z ^ 2025-05-07T19:57:09.0332286Z 2025-05-07T19:57:09.0333798Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:57:09.0336416Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:57:09.0337705Z ^ 2025-05-07T19:57:09.0337979Z 2025-05-07T19:57:09.0338387Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:57:09.0338994Z 2025-05-07T19:57:09.0340629Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:57:09.0343214Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:57:09.0344349Z ^ 2025-05-07T19:57:09.0344711Z 2025-05-07T19:57:13.7623496Z [261/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_adagrad_split_unweighted_kernel_warp.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_adagrad_split_unweighted_kernel_warp.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.12/cmake-build/gen_embedding_backward_adagrad_split_unweighted_kernel_warp.cu -o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_adagrad_split_unweighted_kernel_warp.cu.o 2025-05-07T19:57:13.7646696Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:57:13.7649414Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:57:13.7650545Z ^ 2025-05-07T19:57:13.7650800Z 2025-05-07T19:57:13.7651291Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:57:13.7651984Z 2025-05-07T19:57:13.7653654Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:57:13.7656324Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:57:13.7657592Z ^ 2025-05-07T19:57:13.7657974Z 2025-05-07T19:57:13.7659636Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:57:13.7662137Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:57:13.7663477Z ^ 2025-05-07T19:57:13.7663740Z 2025-05-07T19:57:13.7664138Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:57:13.7665001Z 2025-05-07T19:57:13.7666765Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:57:13.7669457Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:57:13.7670659Z ^ 2025-05-07T19:57:13.7671052Z 2025-05-07T19:57:13.7672789Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:57:13.7675264Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:57:13.7676385Z ^ 2025-05-07T19:57:13.7676630Z 2025-05-07T19:57:13.7676981Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:57:13.7677545Z 2025-05-07T19:57:13.7679225Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:57:13.7681914Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:57:13.7683120Z ^ 2025-05-07T19:57:13.7683481Z 2025-05-07T19:57:13.7685063Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:57:13.7687448Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:57:13.7688698Z ^ 2025-05-07T19:57:13.7688966Z 2025-05-07T19:57:13.7689426Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:57:13.7690079Z 2025-05-07T19:57:13.7691443Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:57:13.7693754Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:57:13.7694872Z ^ 2025-05-07T19:57:13.7695253Z 2025-05-07T19:57:13.7696887Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:57:13.7699852Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:57:13.7701011Z ^ 2025-05-07T19:57:13.7701266Z 2025-05-07T19:57:13.7701728Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:57:13.7702259Z 2025-05-07T19:57:13.7703851Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:57:13.7706383Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:57:13.7707563Z ^ 2025-05-07T19:57:13.7707933Z 2025-05-07T19:57:14.2468228Z [262/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_rowwise_adagrad_ssd_weighted_kernel_warp.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_rowwise_adagrad_ssd_weighted_kernel_warp.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.12/cmake-build/gen_embedding_backward_rowwise_adagrad_ssd_weighted_kernel_warp.cu -o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_rowwise_adagrad_ssd_weighted_kernel_warp.cu.o 2025-05-07T19:57:14.2491472Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:57:14.2494500Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:57:14.2495846Z ^ 2025-05-07T19:57:14.2496091Z 2025-05-07T19:57:14.2496518Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:57:14.2497071Z 2025-05-07T19:57:14.2498467Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:57:14.2500664Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:57:14.2501655Z ^ 2025-05-07T19:57:14.2501963Z 2025-05-07T19:57:14.2503532Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:57:14.2505804Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:57:14.2506852Z ^ 2025-05-07T19:57:14.2507251Z 2025-05-07T19:57:14.2507684Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:57:14.2508262Z 2025-05-07T19:57:14.2509697Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:57:14.2512297Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:57:14.2513623Z ^ 2025-05-07T19:57:14.2514005Z 2025-05-07T19:57:14.2515617Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:57:14.2518194Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:57:14.2519329Z ^ 2025-05-07T19:57:14.2519602Z 2025-05-07T19:57:14.2520033Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:57:14.2520692Z 2025-05-07T19:57:14.2522486Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:57:14.2525043Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:57:14.2526197Z ^ 2025-05-07T19:57:14.2526554Z 2025-05-07T19:57:14.2528324Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:57:14.2530927Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:57:14.2532061Z ^ 2025-05-07T19:57:14.2532316Z 2025-05-07T19:57:14.2532742Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:57:14.2533588Z 2025-05-07T19:57:14.2535256Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:57:14.2538090Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:57:14.2539252Z ^ 2025-05-07T19:57:14.2539629Z 2025-05-07T19:57:14.2541261Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:57:14.2543991Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:57:14.2545120Z ^ 2025-05-07T19:57:14.2545355Z 2025-05-07T19:57:14.2545783Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:57:14.2546485Z 2025-05-07T19:57:14.2548062Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:57:14.2550644Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:57:14.2551879Z ^ 2025-05-07T19:57:14.2552197Z 2025-05-07T19:57:14.4101442Z [263/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_rowwise_adagrad_ssd_unweighted_kernel_warp.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_rowwise_adagrad_ssd_unweighted_kernel_warp.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.12/cmake-build/gen_embedding_backward_rowwise_adagrad_ssd_unweighted_kernel_warp.cu -o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_rowwise_adagrad_ssd_unweighted_kernel_warp.cu.o 2025-05-07T19:57:14.4126254Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:57:14.4128771Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:57:14.4129819Z ^ 2025-05-07T19:57:14.4130051Z 2025-05-07T19:57:14.4130643Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:57:14.4131253Z 2025-05-07T19:57:14.4132751Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:57:14.4135676Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:57:14.4136886Z ^ 2025-05-07T19:57:14.4137209Z 2025-05-07T19:57:14.4138969Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:57:14.4141297Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:57:14.4142256Z ^ 2025-05-07T19:57:14.4142500Z 2025-05-07T19:57:14.4142875Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:57:14.4143418Z 2025-05-07T19:57:14.4144808Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:57:14.4147050Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:57:14.4148084Z ^ 2025-05-07T19:57:14.4148417Z 2025-05-07T19:57:14.4149839Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:57:14.4152277Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:57:14.4153288Z ^ 2025-05-07T19:57:14.4153504Z 2025-05-07T19:57:14.4153903Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:57:14.4154576Z 2025-05-07T19:57:14.4156117Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:57:14.4158684Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:57:14.4159775Z ^ 2025-05-07T19:57:14.4160132Z 2025-05-07T19:57:14.4161666Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:57:14.4164212Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:57:14.4165618Z ^ 2025-05-07T19:57:14.4165893Z 2025-05-07T19:57:14.4166322Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:57:14.4167269Z 2025-05-07T19:57:14.4168887Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:57:14.4171434Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:57:14.4172595Z ^ 2025-05-07T19:57:14.4172944Z 2025-05-07T19:57:14.4174529Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:57:14.4177527Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:57:14.4178724Z ^ 2025-05-07T19:57:14.4178980Z 2025-05-07T19:57:14.4179420Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:57:14.4180123Z 2025-05-07T19:57:14.4181960Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:57:14.4184565Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:57:14.4185723Z ^ 2025-05-07T19:57:14.4186214Z 2025-05-07T19:57:21.4882620Z [264/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_rowwise_adagrad_ssd_unweighted_nobag_kernel_warp.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_rowwise_adagrad_ssd_unweighted_nobag_kernel_warp.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.12/cmake-build/gen_embedding_backward_rowwise_adagrad_ssd_unweighted_nobag_kernel_warp.cu -o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_rowwise_adagrad_ssd_unweighted_nobag_kernel_warp.cu.o 2025-05-07T19:57:21.4907025Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:57:21.4909722Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:57:21.4910930Z ^ 2025-05-07T19:57:21.4911598Z 2025-05-07T19:57:21.4912062Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:57:21.4912765Z 2025-05-07T19:57:21.4914669Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:57:21.4917547Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:57:21.4918734Z ^ 2025-05-07T19:57:21.4919095Z 2025-05-07T19:57:21.4920689Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:57:21.4923291Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:57:21.4924455Z ^ 2025-05-07T19:57:21.4924721Z 2025-05-07T19:57:21.4925163Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:57:21.4925731Z 2025-05-07T19:57:21.4927218Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:57:21.4929654Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:57:21.4930792Z ^ 2025-05-07T19:57:21.4931174Z 2025-05-07T19:57:21.4932721Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:57:21.4935397Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:57:21.4936598Z ^ 2025-05-07T19:57:21.4936854Z 2025-05-07T19:57:21.4937334Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:57:21.4938027Z 2025-05-07T19:57:21.4939708Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:57:21.4942454Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:57:21.4943675Z ^ 2025-05-07T19:57:21.4944079Z 2025-05-07T19:57:21.4945742Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:57:21.4948612Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:57:21.4949812Z ^ 2025-05-07T19:57:21.4950096Z 2025-05-07T19:57:21.4950544Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:57:21.4951216Z 2025-05-07T19:57:21.4953055Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:57:21.4955776Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:57:21.4957108Z ^ 2025-05-07T19:57:21.4957481Z 2025-05-07T19:57:21.4959167Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:57:21.4961982Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:57:21.4963185Z ^ 2025-05-07T19:57:21.4963444Z 2025-05-07T19:57:21.4963901Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:57:21.4964587Z 2025-05-07T19:57:21.4966694Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:57:21.4969672Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:57:21.4970929Z ^ 2025-05-07T19:57:21.4971321Z 2025-05-07T19:57:35.6503025Z [265/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_adagrad_split_weighted_kernel_cta.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_adagrad_split_weighted_kernel_cta.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.12/cmake-build/gen_embedding_backward_adagrad_split_weighted_kernel_cta.cu -o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_adagrad_split_weighted_kernel_cta.cu.o 2025-05-07T19:57:35.6524687Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:57:35.6527662Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:57:35.6528938Z ^ 2025-05-07T19:57:35.6529269Z 2025-05-07T19:57:35.6529684Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:57:35.6530625Z 2025-05-07T19:57:35.6532044Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:57:35.6534733Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:57:35.6535761Z ^ 2025-05-07T19:57:35.6536084Z 2025-05-07T19:57:35.6537563Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:57:35.6539833Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:57:35.6540877Z ^ 2025-05-07T19:57:35.6541100Z 2025-05-07T19:57:35.6541471Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:57:35.6542029Z 2025-05-07T19:57:35.6543459Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:57:35.6545821Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:57:35.6546760Z ^ 2025-05-07T19:57:35.6547041Z 2025-05-07T19:57:35.6548441Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:57:35.6550798Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:57:35.6551895Z ^ 2025-05-07T19:57:35.6552087Z 2025-05-07T19:57:35.6552457Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:57:35.6552955Z 2025-05-07T19:57:35.6554532Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:57:35.6556856Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:57:35.6558065Z ^ 2025-05-07T19:57:35.6558377Z 2025-05-07T19:57:35.6559817Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:57:35.6562080Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:57:35.6563085Z ^ 2025-05-07T19:57:35.6563335Z 2025-05-07T19:57:35.6563749Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:57:35.6564432Z 2025-05-07T19:57:35.6566146Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:57:35.6568772Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:57:35.6569782Z ^ 2025-05-07T19:57:35.6570107Z 2025-05-07T19:57:35.6571804Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:57:35.6574141Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:57:35.6575220Z ^ 2025-05-07T19:57:35.6575469Z 2025-05-07T19:57:35.6575852Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:57:35.6576487Z 2025-05-07T19:57:35.6577948Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:57:35.6580338Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:57:35.6581388Z ^ 2025-05-07T19:57:35.6581756Z 2025-05-07T19:57:37.1232881Z [266/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_rowwise_adagrad_split_unweighted_nobag_cuda.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_rowwise_adagrad_split_unweighted_nobag_cuda.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.12/cmake-build/gen_embedding_backward_rowwise_adagrad_split_unweighted_nobag_cuda.cu -o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_rowwise_adagrad_split_unweighted_nobag_cuda.cu.o 2025-05-07T19:57:37.1258369Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:57:37.1261485Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:57:37.1262896Z ^ 2025-05-07T19:57:37.1263210Z 2025-05-07T19:57:37.1263704Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:57:37.1264418Z 2025-05-07T19:57:37.1266455Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:57:37.1269404Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:57:37.1270733Z ^ 2025-05-07T19:57:37.1271128Z 2025-05-07T19:57:37.1273021Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:57:37.1275934Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:57:37.1277247Z ^ 2025-05-07T19:57:37.1277521Z 2025-05-07T19:57:37.1278001Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:57:37.1278756Z 2025-05-07T19:57:37.1280554Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:57:37.1283492Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:57:37.1284769Z ^ 2025-05-07T19:57:37.1285151Z 2025-05-07T19:57:37.1286997Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:57:37.1289893Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:57:37.1291189Z ^ 2025-05-07T19:57:37.1291459Z 2025-05-07T19:57:37.1291961Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:57:37.1292681Z 2025-05-07T19:57:37.1294711Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:57:37.1297633Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:57:37.1298924Z ^ 2025-05-07T19:57:37.1299320Z 2025-05-07T19:57:37.1301110Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:57:37.1303986Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:57:37.1306663Z ^ 2025-05-07T19:57:37.1306932Z 2025-05-07T19:57:37.1307419Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:57:37.1308138Z 2025-05-07T19:57:37.1310150Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:57:37.1313183Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:57:37.1314480Z ^ 2025-05-07T19:57:37.1314867Z 2025-05-07T19:57:37.1316671Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:57:37.1319539Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:57:37.1320824Z ^ 2025-05-07T19:57:37.1321093Z 2025-05-07T19:57:37.1321600Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:57:37.1322332Z 2025-05-07T19:57:37.1324152Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:57:37.1327088Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:57:37.1328368Z ^ 2025-05-07T19:57:37.1328785Z 2025-05-07T19:57:39.7908326Z [267/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_adagrad_split_unweighted_kernel_cta.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_adagrad_split_unweighted_kernel_cta.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.12/cmake-build/gen_embedding_backward_adagrad_split_unweighted_kernel_cta.cu -o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_adagrad_split_unweighted_kernel_cta.cu.o 2025-05-07T19:57:39.7932543Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:57:39.7935480Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:57:39.7936716Z ^ 2025-05-07T19:57:39.7937108Z 2025-05-07T19:57:39.7937559Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:57:39.7938281Z 2025-05-07T19:57:39.7940040Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:57:39.7942737Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:57:39.7943942Z ^ 2025-05-07T19:57:39.7944338Z 2025-05-07T19:57:39.7945995Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:57:39.7948743Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:57:39.7949960Z ^ 2025-05-07T19:57:39.7950229Z 2025-05-07T19:57:39.7950872Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:57:39.7951651Z 2025-05-07T19:57:39.7953265Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:57:39.7955933Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:57:39.7957109Z ^ 2025-05-07T19:57:39.7957463Z 2025-05-07T19:57:39.7959122Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:57:39.7961750Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:57:39.7963137Z ^ 2025-05-07T19:57:39.7963381Z 2025-05-07T19:57:39.7963816Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:57:39.7964476Z 2025-05-07T19:57:39.7966392Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:57:39.7969022Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:57:39.7970214Z ^ 2025-05-07T19:57:39.7970559Z 2025-05-07T19:57:39.7972101Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:57:39.7974953Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:57:39.7976128Z ^ 2025-05-07T19:57:39.7976361Z 2025-05-07T19:57:39.7977037Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:57:39.7977677Z 2025-05-07T19:57:39.7979375Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:57:39.7981992Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:57:39.7983200Z ^ 2025-05-07T19:57:39.7983565Z 2025-05-07T19:57:39.7985181Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:57:39.7987913Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:57:39.7989112Z ^ 2025-05-07T19:57:39.7989395Z 2025-05-07T19:57:39.7989840Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:57:39.7990500Z 2025-05-07T19:57:39.7992356Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:57:39.7995240Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:57:39.7996517Z ^ 2025-05-07T19:57:39.7996896Z 2025-05-07T19:57:39.9568257Z [268/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_adagrad_split_unweighted_nobag_kernel_cta.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_adagrad_split_unweighted_nobag_kernel_cta.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.12/cmake-build/gen_embedding_backward_adagrad_split_unweighted_nobag_kernel_cta.cu -o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_adagrad_split_unweighted_nobag_kernel_cta.cu.o 2025-05-07T19:57:39.9592974Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:57:39.9595848Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:57:39.9597025Z ^ 2025-05-07T19:57:39.9597300Z 2025-05-07T19:57:39.9597748Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:57:39.9598438Z 2025-05-07T19:57:39.9600162Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:57:39.9602888Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:57:39.9604109Z ^ 2025-05-07T19:57:39.9604461Z 2025-05-07T19:57:39.9606139Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:57:39.9608844Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:57:39.9610040Z ^ 2025-05-07T19:57:39.9610294Z 2025-05-07T19:57:39.9610748Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:57:39.9611569Z 2025-05-07T19:57:39.9613240Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:57:39.9615910Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:57:39.9617233Z ^ 2025-05-07T19:57:39.9617617Z 2025-05-07T19:57:39.9619431Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:57:39.9622076Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:57:39.9623213Z ^ 2025-05-07T19:57:39.9623448Z 2025-05-07T19:57:39.9623903Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:57:39.9624736Z 2025-05-07T19:57:39.9626234Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:57:39.9628837Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:57:39.9629949Z ^ 2025-05-07T19:57:39.9630298Z 2025-05-07T19:57:39.9632342Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:57:39.9634996Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:57:39.9635902Z ^ 2025-05-07T19:57:39.9636102Z 2025-05-07T19:57:39.9636470Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:57:39.9636990Z 2025-05-07T19:57:39.9638256Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:57:39.9640761Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:57:39.9641840Z ^ 2025-05-07T19:57:39.9642173Z 2025-05-07T19:57:39.9643807Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:57:39.9646478Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:57:39.9647678Z ^ 2025-05-07T19:57:39.9647942Z 2025-05-07T19:57:39.9648420Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:57:39.9649048Z 2025-05-07T19:57:39.9650473Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:57:39.9652675Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:57:39.9653652Z ^ 2025-05-07T19:57:39.9653999Z 2025-05-07T19:57:41.2526846Z [269/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_ssd_indice_weights_codegen_cuda.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_ssd_indice_weights_codegen_cuda.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.12/cmake-build/gen_embedding_backward_ssd_indice_weights_codegen_cuda.cu -o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_ssd_indice_weights_codegen_cuda.cu.o 2025-05-07T19:57:41.2551361Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:57:41.2554312Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:57:41.2555584Z ^ 2025-05-07T19:57:41.2555851Z 2025-05-07T19:57:41.2556322Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:57:41.2557030Z 2025-05-07T19:57:41.2558812Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:57:41.2561698Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:57:41.2562935Z ^ 2025-05-07T19:57:41.2563335Z 2025-05-07T19:57:41.2565482Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:57:41.2568357Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:57:41.2569720Z ^ 2025-05-07T19:57:41.2569984Z 2025-05-07T19:57:41.2570436Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:57:41.2571159Z 2025-05-07T19:57:41.2572899Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:57:41.2575941Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:57:41.2577170Z ^ 2025-05-07T19:57:41.2577560Z 2025-05-07T19:57:41.2579011Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.12/cmake-build/gen_embedding_backward_ssd_indice_weights_codegen_cuda.cu(288): warning #177-D: variable "error_code" was declared but never referenced 2025-05-07T19:57:41.2580875Z int error_code = 0; 2025-05-07T19:57:41.2581315Z ^ 2025-05-07T19:57:41.2581541Z 2025-05-07T19:57:41.2583033Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.12/cmake-build/gen_embedding_backward_ssd_indice_weights_codegen_cuda.cu(289): warning #177-D: variable "error_value" was declared but never referenced 2025-05-07T19:57:41.2585009Z int64_t error_value; 2025-05-07T19:57:41.2585466Z ^ 2025-05-07T19:57:41.2585709Z 2025-05-07T19:57:41.2587168Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.12/cmake-build/gen_embedding_backward_ssd_indice_weights_codegen_cuda.cu(136): warning #177-D: variable "error_code" was declared but never referenced 2025-05-07T19:57:41.2588982Z int error_code = 0; 2025-05-07T19:57:41.2589616Z ^ 2025-05-07T19:57:41.2589821Z 2025-05-07T19:57:41.2591328Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.12/cmake-build/gen_embedding_backward_ssd_indice_weights_codegen_cuda.cu(137): warning #177-D: variable "error_value" was declared but never referenced 2025-05-07T19:57:41.2593247Z int64_t error_value; 2025-05-07T19:57:41.2593728Z ^ 2025-05-07T19:57:41.2593966Z 2025-05-07T19:57:41.2595414Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.12/cmake-build/gen_embedding_backward_ssd_indice_weights_codegen_cuda.cu(774): warning #177-D: variable "error_code" was declared but never referenced 2025-05-07T19:57:41.2597292Z int error_code = 0; 2025-05-07T19:57:41.2597761Z ^ 2025-05-07T19:57:41.2597977Z 2025-05-07T19:57:41.2599432Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.12/cmake-build/gen_embedding_backward_ssd_indice_weights_codegen_cuda.cu(775): warning #177-D: variable "error_value" was declared but never referenced 2025-05-07T19:57:41.2601287Z int64_t error_value; 2025-05-07T19:57:41.2601751Z ^ 2025-05-07T19:57:41.2602019Z 2025-05-07T19:57:41.2603448Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.12/cmake-build/gen_embedding_backward_ssd_indice_weights_codegen_cuda.cu(627): warning #177-D: variable "error_code" was declared but never referenced 2025-05-07T19:57:41.2605309Z int error_code = 0; 2025-05-07T19:57:41.2605767Z ^ 2025-05-07T19:57:41.2605987Z 2025-05-07T19:57:41.2607484Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.12/cmake-build/gen_embedding_backward_ssd_indice_weights_codegen_cuda.cu(628): warning #177-D: variable "error_value" was declared but never referenced 2025-05-07T19:57:41.2609317Z int64_t error_value; 2025-05-07T19:57:41.2609803Z ^ 2025-05-07T19:57:41.2610043Z 2025-05-07T19:57:41.2611772Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:57:41.2614552Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:57:41.2615795Z ^ 2025-05-07T19:57:41.2616057Z 2025-05-07T19:57:41.2616522Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:57:41.2617235Z 2025-05-07T19:57:41.2618960Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:57:41.2621896Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:57:41.2623101Z ^ 2025-05-07T19:57:41.2623504Z 2025-05-07T19:57:41.2624973Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.12/cmake-build/gen_embedding_backward_ssd_indice_weights_codegen_cuda.cu(288): warning #177-D: variable "error_code" was declared but never referenced 2025-05-07T19:57:41.2626834Z int error_code = 0; 2025-05-07T19:57:41.2627288Z ^ 2025-05-07T19:57:41.2627507Z 2025-05-07T19:57:41.2628973Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.12/cmake-build/gen_embedding_backward_ssd_indice_weights_codegen_cuda.cu(289): warning #177-D: variable "error_value" was declared but never referenced 2025-05-07T19:57:41.2630889Z int64_t error_value; 2025-05-07T19:57:41.2631361Z ^ 2025-05-07T19:57:41.2631706Z 2025-05-07T19:57:41.2633151Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.12/cmake-build/gen_embedding_backward_ssd_indice_weights_codegen_cuda.cu(136): warning #177-D: variable "error_code" was declared but never referenced 2025-05-07T19:57:41.2634943Z int error_code = 0; 2025-05-07T19:57:41.2635403Z ^ 2025-05-07T19:57:41.2635748Z 2025-05-07T19:57:41.2637207Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.12/cmake-build/gen_embedding_backward_ssd_indice_weights_codegen_cuda.cu(137): warning #177-D: variable "error_value" was declared but never referenced 2025-05-07T19:57:41.2639049Z int64_t error_value; 2025-05-07T19:57:41.2639522Z ^ 2025-05-07T19:57:41.2639758Z 2025-05-07T19:57:41.2641192Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.12/cmake-build/gen_embedding_backward_ssd_indice_weights_codegen_cuda.cu(774): warning #177-D: variable "error_code" was declared but never referenced 2025-05-07T19:57:41.2643007Z int error_code = 0; 2025-05-07T19:57:41.2643433Z ^ 2025-05-07T19:57:41.2643646Z 2025-05-07T19:57:41.2645092Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.12/cmake-build/gen_embedding_backward_ssd_indice_weights_codegen_cuda.cu(775): warning #177-D: variable "error_value" was declared but never referenced 2025-05-07T19:57:41.2646800Z int64_t error_value; 2025-05-07T19:57:41.2647175Z ^ 2025-05-07T19:57:41.2647352Z 2025-05-07T19:57:41.2648766Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.12/cmake-build/gen_embedding_backward_ssd_indice_weights_codegen_cuda.cu(627): warning #177-D: variable "error_code" was declared but never referenced 2025-05-07T19:57:41.2650232Z int error_code = 0; 2025-05-07T19:57:41.2650649Z ^ 2025-05-07T19:57:41.2650851Z 2025-05-07T19:57:41.2652306Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.12/cmake-build/gen_embedding_backward_ssd_indice_weights_codegen_cuda.cu(628): warning #177-D: variable "error_value" was declared but never referenced 2025-05-07T19:57:41.2654177Z int64_t error_value; 2025-05-07T19:57:41.2654649Z ^ 2025-05-07T19:57:41.2654884Z 2025-05-07T19:57:41.2656636Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:57:41.2659458Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:57:41.2660674Z ^ 2025-05-07T19:57:41.2660930Z 2025-05-07T19:57:41.2661398Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:57:41.2662097Z 2025-05-07T19:57:41.2663884Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:57:41.2667162Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:57:41.2668430Z ^ 2025-05-07T19:57:41.2668817Z 2025-05-07T19:57:41.2670327Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.12/cmake-build/gen_embedding_backward_ssd_indice_weights_codegen_cuda.cu(288): warning #177-D: variable "error_code" was declared but never referenced 2025-05-07T19:57:41.2672256Z int error_code = 0; 2025-05-07T19:57:41.2672733Z ^ 2025-05-07T19:57:41.2672944Z 2025-05-07T19:57:41.2674409Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.12/cmake-build/gen_embedding_backward_ssd_indice_weights_codegen_cuda.cu(289): warning #177-D: variable "error_value" was declared but never referenced 2025-05-07T19:57:41.2676486Z int64_t error_value; 2025-05-07T19:57:41.2676973Z ^ 2025-05-07T19:57:41.2677213Z 2025-05-07T19:57:41.2678673Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.12/cmake-build/gen_embedding_backward_ssd_indice_weights_codegen_cuda.cu(136): warning #177-D: variable "error_code" was declared but never referenced 2025-05-07T19:57:41.2680533Z int error_code = 0; 2025-05-07T19:57:41.2680982Z ^ 2025-05-07T19:57:41.2681443Z 2025-05-07T19:57:41.2682928Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.12/cmake-build/gen_embedding_backward_ssd_indice_weights_codegen_cuda.cu(137): warning #177-D: variable "error_value" was declared but never referenced 2025-05-07T19:57:41.2684803Z int64_t error_value; 2025-05-07T19:57:41.2685278Z ^ 2025-05-07T19:57:41.2685549Z 2025-05-07T19:57:41.2686999Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.12/cmake-build/gen_embedding_backward_ssd_indice_weights_codegen_cuda.cu(774): warning #177-D: variable "error_code" was declared but never referenced 2025-05-07T19:57:41.2688867Z int error_code = 0; 2025-05-07T19:57:41.2689334Z ^ 2025-05-07T19:57:41.2689552Z 2025-05-07T19:57:41.2691029Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.12/cmake-build/gen_embedding_backward_ssd_indice_weights_codegen_cuda.cu(775): warning #177-D: variable "error_value" was declared but never referenced 2025-05-07T19:57:41.2692900Z int64_t error_value; 2025-05-07T19:57:41.2693388Z ^ 2025-05-07T19:57:41.2693625Z 2025-05-07T19:57:41.2695116Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.12/cmake-build/gen_embedding_backward_ssd_indice_weights_codegen_cuda.cu(627): warning #177-D: variable "error_code" was declared but never referenced 2025-05-07T19:57:41.2696961Z int error_code = 0; 2025-05-07T19:57:41.2697451Z ^ 2025-05-07T19:57:41.2697665Z 2025-05-07T19:57:41.2699134Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.12/cmake-build/gen_embedding_backward_ssd_indice_weights_codegen_cuda.cu(628): warning #177-D: variable "error_value" was declared but never referenced 2025-05-07T19:57:41.2701015Z int64_t error_value; 2025-05-07T19:57:41.2701473Z ^ 2025-05-07T19:57:41.2701734Z 2025-05-07T19:57:41.2703457Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:57:41.2706278Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:57:41.2707493Z ^ 2025-05-07T19:57:41.2707780Z 2025-05-07T19:57:41.2708251Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:57:41.2708944Z 2025-05-07T19:57:41.2710702Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:57:41.2713799Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:57:41.2715058Z ^ 2025-05-07T19:57:41.2715434Z 2025-05-07T19:57:41.2716900Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.12/cmake-build/gen_embedding_backward_ssd_indice_weights_codegen_cuda.cu(288): warning #177-D: variable "error_code" was declared but never referenced 2025-05-07T19:57:41.2718780Z int error_code = 0; 2025-05-07T19:57:41.2719255Z ^ 2025-05-07T19:57:41.2719473Z 2025-05-07T19:57:41.2720960Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.12/cmake-build/gen_embedding_backward_ssd_indice_weights_codegen_cuda.cu(289): warning #177-D: variable "error_value" was declared but never referenced 2025-05-07T19:57:41.2722822Z int64_t error_value; 2025-05-07T19:57:41.2723380Z ^ 2025-05-07T19:57:41.2723650Z 2025-05-07T19:57:41.2725117Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.12/cmake-build/gen_embedding_backward_ssd_indice_weights_codegen_cuda.cu(136): warning #177-D: variable "error_code" was declared but never referenced 2025-05-07T19:57:41.2726984Z int error_code = 0; 2025-05-07T19:57:41.2727439Z ^ 2025-05-07T19:57:41.2727659Z 2025-05-07T19:57:41.2729260Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.12/cmake-build/gen_embedding_backward_ssd_indice_weights_codegen_cuda.cu(137): warning #177-D: variable "error_value" was declared but never referenced 2025-05-07T19:57:41.2731151Z int64_t error_value; 2025-05-07T19:57:41.2731633Z ^ 2025-05-07T19:57:41.2731873Z 2025-05-07T19:57:41.2733361Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.12/cmake-build/gen_embedding_backward_ssd_indice_weights_codegen_cuda.cu(774): warning #177-D: variable "error_code" was declared but never referenced 2025-05-07T19:57:41.2735194Z int error_code = 0; 2025-05-07T19:57:41.2735674Z ^ 2025-05-07T19:57:41.2735898Z 2025-05-07T19:57:41.2737382Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.12/cmake-build/gen_embedding_backward_ssd_indice_weights_codegen_cuda.cu(775): warning #177-D: variable "error_value" was declared but never referenced 2025-05-07T19:57:41.2739260Z int64_t error_value; 2025-05-07T19:57:41.2739770Z ^ 2025-05-07T19:57:41.2740016Z 2025-05-07T19:57:41.2741478Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.12/cmake-build/gen_embedding_backward_ssd_indice_weights_codegen_cuda.cu(627): warning #177-D: variable "error_code" was declared but never referenced 2025-05-07T19:57:41.2743379Z int error_code = 0; 2025-05-07T19:57:41.2743832Z ^ 2025-05-07T19:57:41.2744071Z 2025-05-07T19:57:41.2745532Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.12/cmake-build/gen_embedding_backward_ssd_indice_weights_codegen_cuda.cu(628): warning #177-D: variable "error_value" was declared but never referenced 2025-05-07T19:57:41.2747435Z int64_t error_value; 2025-05-07T19:57:41.2747907Z ^ 2025-05-07T19:57:41.2748147Z 2025-05-07T19:57:45.4456330Z [270/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_forward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_split_weighted_gwd_kernel.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_split_weighted_gwd_kernel.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.12/cmake-build/gen_embedding_forward_split_weighted_gwd_kernel.cu -o CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_split_weighted_gwd_kernel.cu.o 2025-05-07T19:57:45.4480770Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:57:45.4483607Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:57:45.4484880Z ^ 2025-05-07T19:57:45.4485143Z 2025-05-07T19:57:45.4485649Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:57:45.4486339Z 2025-05-07T19:57:45.4488109Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:57:45.4490925Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:57:45.4492170Z ^ 2025-05-07T19:57:45.4492577Z 2025-05-07T19:57:45.4494289Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:57:45.4497095Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:57:45.4498297Z ^ 2025-05-07T19:57:45.4498584Z 2025-05-07T19:57:45.4499060Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:57:45.4499763Z 2025-05-07T19:57:45.4501608Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:57:45.4504402Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:57:45.4505682Z ^ 2025-05-07T19:57:45.4506069Z 2025-05-07T19:57:45.4507483Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.12/cmake-build/gen_embedding_forward_split_weighted_gwd_kernel.cu(240): warning #177-D: variable "offset_idx" was declared but never referenced 2025-05-07T19:57:45.4509519Z const auto offset_idx = idx * D_emb; 2025-05-07T19:57:45.4510107Z ^ 2025-05-07T19:57:45.4510376Z 2025-05-07T19:57:45.4512284Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:57:45.4514800Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:57:45.4515788Z ^ 2025-05-07T19:57:45.4516020Z 2025-05-07T19:57:45.4516431Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:57:45.4517288Z 2025-05-07T19:57:45.4519043Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:57:45.4522084Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:57:45.4523334Z ^ 2025-05-07T19:57:45.4523729Z 2025-05-07T19:57:45.4525144Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.12/cmake-build/gen_embedding_forward_split_weighted_gwd_kernel.cu(240): warning #177-D: variable "offset_idx" was declared but never referenced 2025-05-07T19:57:45.4526974Z const auto offset_idx = idx * D_emb; 2025-05-07T19:57:45.4527520Z ^ 2025-05-07T19:57:45.4527788Z 2025-05-07T19:57:45.4529536Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:57:45.4532335Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:57:45.4533571Z ^ 2025-05-07T19:57:45.4533818Z 2025-05-07T19:57:45.4534328Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:57:45.4535037Z 2025-05-07T19:57:45.4536753Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:57:45.4539574Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:57:45.4540803Z ^ 2025-05-07T19:57:45.4541212Z 2025-05-07T19:57:45.4542615Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.12/cmake-build/gen_embedding_forward_split_weighted_gwd_kernel.cu(240): warning #177-D: variable "offset_idx" was declared but never referenced 2025-05-07T19:57:45.4544465Z const auto offset_idx = idx * D_emb; 2025-05-07T19:57:45.4545041Z ^ 2025-05-07T19:57:45.4545346Z 2025-05-07T19:57:45.4547056Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:57:45.4549862Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:57:45.4551107Z ^ 2025-05-07T19:57:45.4551403Z 2025-05-07T19:57:45.4552175Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:57:45.4553052Z 2025-05-07T19:57:45.4554840Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:57:45.4557881Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:57:45.4559269Z ^ 2025-05-07T19:57:45.4559644Z 2025-05-07T19:57:45.4561202Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.12/cmake-build/gen_embedding_forward_split_weighted_gwd_kernel.cu(240): warning #177-D: variable "offset_idx" was declared but never referenced 2025-05-07T19:57:45.4563056Z const auto offset_idx = idx * D_emb; 2025-05-07T19:57:45.4563641Z ^ 2025-05-07T19:57:45.4563912Z 2025-05-07T19:57:46.4211019Z [271/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_dense_indice_weights_codegen_cuda.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_dense_indice_weights_codegen_cuda.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.12/cmake-build/gen_embedding_backward_dense_indice_weights_codegen_cuda.cu -o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_dense_indice_weights_codegen_cuda.cu.o 2025-05-07T19:57:46.4236213Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:57:46.4239123Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:57:46.4240430Z ^ 2025-05-07T19:57:46.4240785Z 2025-05-07T19:57:46.4241255Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:57:46.4242187Z 2025-05-07T19:57:46.4244005Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:57:46.4246929Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:57:46.4248184Z ^ 2025-05-07T19:57:46.4248598Z 2025-05-07T19:57:46.4250507Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:57:46.4253336Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:57:46.4254700Z ^ 2025-05-07T19:57:46.4254962Z 2025-05-07T19:57:46.4255461Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:57:46.4256164Z 2025-05-07T19:57:46.4258028Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:57:46.4260898Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:57:46.4262194Z ^ 2025-05-07T19:57:46.4262738Z 2025-05-07T19:57:46.4264518Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:57:46.4267765Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:57:46.4269062Z ^ 2025-05-07T19:57:46.4269341Z 2025-05-07T19:57:46.4269831Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:57:46.4270567Z 2025-05-07T19:57:46.4272479Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:57:46.4275393Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:57:46.4276670Z ^ 2025-05-07T19:57:46.4277066Z 2025-05-07T19:57:46.4278892Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:57:46.4281794Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:57:46.4283125Z ^ 2025-05-07T19:57:46.4283396Z 2025-05-07T19:57:46.4283877Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:57:46.4284616Z 2025-05-07T19:57:46.4286444Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:57:46.4289566Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:57:46.4290850Z ^ 2025-05-07T19:57:46.4291411Z 2025-05-07T19:57:46.4293160Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:57:46.4295959Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:57:46.4297217Z ^ 2025-05-07T19:57:46.4297492Z 2025-05-07T19:57:46.4297993Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:57:46.4298824Z 2025-05-07T19:57:46.4300650Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:57:46.4303851Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:57:46.4305138Z ^ 2025-05-07T19:57:46.4305556Z 2025-05-07T19:57:46.8761135Z [272/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_rowwise_adagrad_ssd_weighted_kernel_cta.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_rowwise_adagrad_ssd_weighted_kernel_cta.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.12/cmake-build/gen_embedding_backward_rowwise_adagrad_ssd_weighted_kernel_cta.cu -o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_rowwise_adagrad_ssd_weighted_kernel_cta.cu.o 2025-05-07T19:57:46.8785704Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:57:46.8788842Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:57:46.8790212Z ^ 2025-05-07T19:57:46.8790473Z 2025-05-07T19:57:46.8790961Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:57:46.8791746Z 2025-05-07T19:57:46.8793489Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:57:46.8796262Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:57:46.8797536Z ^ 2025-05-07T19:57:46.8798076Z 2025-05-07T19:57:46.8799713Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:57:46.8802652Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:57:46.8803855Z ^ 2025-05-07T19:57:46.8804114Z 2025-05-07T19:57:46.8804594Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:57:46.8805280Z 2025-05-07T19:57:46.8806860Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:57:46.8809723Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:57:46.8811022Z ^ 2025-05-07T19:57:46.8811394Z 2025-05-07T19:57:46.8813118Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:57:46.8816025Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:57:46.8817268Z ^ 2025-05-07T19:57:46.8817578Z 2025-05-07T19:57:46.8818071Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:57:46.8818797Z 2025-05-07T19:57:46.8820566Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:57:46.8823403Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:57:46.8824699Z ^ 2025-05-07T19:57:46.8825081Z 2025-05-07T19:57:46.8826786Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:57:46.8829788Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:57:46.8830984Z ^ 2025-05-07T19:57:46.8831257Z 2025-05-07T19:57:46.8831857Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:57:46.8832569Z 2025-05-07T19:57:46.8834329Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:57:46.8837380Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:57:46.8838620Z ^ 2025-05-07T19:57:46.8839015Z 2025-05-07T19:57:46.8840708Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:57:46.8843480Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:57:46.8844604Z ^ 2025-05-07T19:57:46.8844878Z 2025-05-07T19:57:46.8845425Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:57:46.8846028Z 2025-05-07T19:57:46.8847642Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:57:46.8850484Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:57:46.8851733Z ^ 2025-05-07T19:57:46.8852099Z 2025-05-07T19:57:50.2178925Z [273/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_rowwise_adagrad_split_unweighted_cuda.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_rowwise_adagrad_split_unweighted_cuda.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.12/cmake-build/gen_embedding_backward_rowwise_adagrad_split_unweighted_cuda.cu -o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_rowwise_adagrad_split_unweighted_cuda.cu.o 2025-05-07T19:57:50.2204494Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:57:50.2207035Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:57:50.2208317Z ^ 2025-05-07T19:57:50.2208585Z 2025-05-07T19:57:50.2209077Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:57:50.2209952Z 2025-05-07T19:57:50.2211672Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:57:50.2214690Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:57:50.2215847Z ^ 2025-05-07T19:57:50.2216206Z 2025-05-07T19:57:50.2218069Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:57:50.2220682Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:57:50.2221715Z ^ 2025-05-07T19:57:50.2221993Z 2025-05-07T19:57:50.2222385Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:57:50.2223000Z 2025-05-07T19:57:50.2224642Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:57:50.2227180Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:57:50.2228400Z ^ 2025-05-07T19:57:50.2228760Z 2025-05-07T19:57:50.2230452Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:57:50.2233259Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:57:50.2234436Z ^ 2025-05-07T19:57:50.2234705Z 2025-05-07T19:57:50.2235144Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:57:50.2235816Z 2025-05-07T19:57:50.2237425Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:57:50.2240026Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:57:50.2241234Z ^ 2025-05-07T19:57:50.2241655Z 2025-05-07T19:57:50.2243420Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:57:50.2246279Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:57:50.2247534Z ^ 2025-05-07T19:57:50.2248001Z 2025-05-07T19:57:50.2248475Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:57:50.2249187Z 2025-05-07T19:57:50.2250990Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:57:50.2253772Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:57:50.2254988Z ^ 2025-05-07T19:57:50.2255346Z 2025-05-07T19:57:50.2256978Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:57:50.2261645Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:57:50.2262850Z ^ 2025-05-07T19:57:50.2263109Z 2025-05-07T19:57:50.2263552Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:57:50.2264258Z 2025-05-07T19:57:50.2266522Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:57:50.2269312Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:57:50.2270475Z ^ 2025-05-07T19:57:50.2270880Z 2025-05-07T19:57:50.4927330Z [274/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_rowwise_adagrad_split_weighted_kernel_warp.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_rowwise_adagrad_split_weighted_kernel_warp.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.12/cmake-build/gen_embedding_backward_rowwise_adagrad_split_weighted_kernel_warp.cu -o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_rowwise_adagrad_split_weighted_kernel_warp.cu.o 2025-05-07T19:57:50.4952521Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:57:50.4955213Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:57:50.4956735Z ^ 2025-05-07T19:57:50.4957001Z 2025-05-07T19:57:50.4957458Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:57:50.4958173Z 2025-05-07T19:57:50.4959852Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:57:50.4962901Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:57:50.4964146Z ^ 2025-05-07T19:57:50.4964530Z 2025-05-07T19:57:50.4966514Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:57:50.4969067Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:57:50.4985994Z ^ 2025-05-07T19:57:50.4986484Z 2025-05-07T19:57:50.4986974Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:57:50.4987679Z 2025-05-07T19:57:50.4989436Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:57:50.4992300Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:57:50.4993496Z ^ 2025-05-07T19:57:50.4993874Z 2025-05-07T19:57:50.4995550Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:57:50.4998172Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:57:50.4999359Z ^ 2025-05-07T19:57:50.4999609Z 2025-05-07T19:57:50.5000066Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:57:50.5000783Z 2025-05-07T19:57:50.5002421Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:57:50.5004979Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:57:50.5006116Z ^ 2025-05-07T19:57:50.5006487Z 2025-05-07T19:57:50.5008076Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:57:50.5010954Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:57:50.5012119Z ^ 2025-05-07T19:57:50.5012374Z 2025-05-07T19:57:50.5012836Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:57:50.5013514Z 2025-05-07T19:57:50.5015351Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:57:50.5018112Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:57:50.5019548Z ^ 2025-05-07T19:57:50.5019893Z 2025-05-07T19:57:50.5021370Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:57:50.5024117Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:57:50.5025269Z ^ 2025-05-07T19:57:50.5025519Z 2025-05-07T19:57:50.5025954Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:57:50.5026592Z 2025-05-07T19:57:50.5028174Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:57:50.5030619Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:57:50.5031930Z ^ 2025-05-07T19:57:50.5032442Z 2025-05-07T19:57:50.5728442Z [275/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_rowwise_adagrad_ssd_unweighted_nobag_kernel_cta.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_rowwise_adagrad_ssd_unweighted_nobag_kernel_cta.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.12/cmake-build/gen_embedding_backward_rowwise_adagrad_ssd_unweighted_nobag_kernel_cta.cu -o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_rowwise_adagrad_ssd_unweighted_nobag_kernel_cta.cu.o 2025-05-07T19:57:50.5752940Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:57:50.5756090Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:57:50.5757314Z ^ 2025-05-07T19:57:50.5757606Z 2025-05-07T19:57:50.5758299Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:57:50.5759009Z 2025-05-07T19:57:50.5760784Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:57:50.5763368Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:57:50.5764613Z ^ 2025-05-07T19:57:50.5765296Z 2025-05-07T19:57:50.5766942Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:57:50.5769749Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:57:50.5770968Z ^ 2025-05-07T19:57:50.5771234Z 2025-05-07T19:57:50.5771695Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:57:50.5772407Z 2025-05-07T19:57:50.5774136Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:57:50.5776955Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:57:50.5778188Z ^ 2025-05-07T19:57:50.5778604Z 2025-05-07T19:57:50.5780293Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:57:50.5783056Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:57:50.5784223Z ^ 2025-05-07T19:57:50.5784527Z 2025-05-07T19:57:50.5784985Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:57:50.5785660Z 2025-05-07T19:57:50.5787363Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:57:50.5790153Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:57:50.5791739Z ^ 2025-05-07T19:57:50.5792117Z 2025-05-07T19:57:50.5793847Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:57:50.5796642Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:57:50.5797843Z ^ 2025-05-07T19:57:50.5798092Z 2025-05-07T19:57:50.5798557Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:57:50.5799243Z 2025-05-07T19:57:50.5801002Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:57:50.5803823Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:57:50.5805291Z ^ 2025-05-07T19:57:50.5805667Z 2025-05-07T19:57:50.5807230Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:57:50.5810034Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:57:50.5811305Z ^ 2025-05-07T19:57:50.5811578Z 2025-05-07T19:57:50.5812084Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:57:50.5812773Z 2025-05-07T19:57:50.5814477Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:57:50.5817451Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:57:50.5818675Z ^ 2025-05-07T19:57:50.5818995Z 2025-05-07T19:57:51.4189380Z [276/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_rowwise_adagrad_ssd_unweighted_kernel_cta.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_rowwise_adagrad_ssd_unweighted_kernel_cta.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.12/cmake-build/gen_embedding_backward_rowwise_adagrad_ssd_unweighted_kernel_cta.cu -o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_rowwise_adagrad_ssd_unweighted_kernel_cta.cu.o 2025-05-07T19:57:51.4214782Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:57:51.4217630Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:57:51.4218889Z ^ 2025-05-07T19:57:51.4219158Z 2025-05-07T19:57:51.4219605Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:57:51.4220333Z 2025-05-07T19:57:51.4222031Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:57:51.4224789Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:57:51.4225983Z ^ 2025-05-07T19:57:51.4226388Z 2025-05-07T19:57:51.4227998Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:57:51.4230825Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:57:51.4232080Z ^ 2025-05-07T19:57:51.4232343Z 2025-05-07T19:57:51.4232813Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:57:51.4233461Z 2025-05-07T19:57:51.4235081Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:57:51.4237788Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:57:51.4238928Z ^ 2025-05-07T19:57:51.4239301Z 2025-05-07T19:57:51.4240967Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:57:51.4243586Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:57:51.4244776Z ^ 2025-05-07T19:57:51.4245038Z 2025-05-07T19:57:51.4245504Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:57:51.4246361Z 2025-05-07T19:57:51.4247917Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:57:51.4250945Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:57:51.4252166Z ^ 2025-05-07T19:57:51.4252530Z 2025-05-07T19:57:51.4254193Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:57:51.4257004Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:57:51.4258443Z ^ 2025-05-07T19:57:51.4258716Z 2025-05-07T19:57:51.4259206Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:57:51.4259894Z 2025-05-07T19:57:51.4261674Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:57:51.4263921Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:57:51.4265357Z ^ 2025-05-07T19:57:51.4265714Z 2025-05-07T19:57:51.4267225Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:57:51.4269876Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:57:51.4271050Z ^ 2025-05-07T19:57:51.4271338Z 2025-05-07T19:57:51.4271929Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:57:51.4272637Z 2025-05-07T19:57:51.4274339Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:57:51.4277212Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:57:51.4278509Z ^ 2025-05-07T19:57:51.4278840Z 2025-05-07T19:58:04.5003126Z [277/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_sgd_split_weighted_cuda.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_sgd_split_weighted_cuda.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.12/cmake-build/gen_embedding_backward_sgd_split_weighted_cuda.cu -o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_sgd_split_weighted_cuda.cu.o 2025-05-07T19:58:04.5026453Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:58:04.5029162Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:58:04.5030402Z ^ 2025-05-07T19:58:04.5030716Z 2025-05-07T19:58:04.5031188Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:58:04.5032053Z 2025-05-07T19:58:04.5033780Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:58:04.5036781Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:58:04.5038003Z ^ 2025-05-07T19:58:04.5038404Z 2025-05-07T19:58:04.5040198Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:58:04.5042967Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:58:04.5044194Z ^ 2025-05-07T19:58:04.5044488Z 2025-05-07T19:58:04.5044946Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:58:04.5045599Z 2025-05-07T19:58:04.5047352Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:58:04.5050131Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:58:04.5051384Z ^ 2025-05-07T19:58:04.5051771Z 2025-05-07T19:58:04.5053515Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:58:04.5056418Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:58:04.5057975Z ^ 2025-05-07T19:58:04.5058251Z 2025-05-07T19:58:04.5058736Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:58:04.5059484Z 2025-05-07T19:58:04.5061279Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:58:04.5064185Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:58:04.5065718Z ^ 2025-05-07T19:58:04.5066122Z 2025-05-07T19:58:04.5068049Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:58:04.5071170Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:58:04.5072561Z ^ 2025-05-07T19:58:04.5073067Z 2025-05-07T19:58:04.5073556Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:58:04.5074273Z 2025-05-07T19:58:04.5076042Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:58:04.5078873Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:58:04.5080153Z ^ 2025-05-07T19:58:04.5080542Z 2025-05-07T19:58:04.5082308Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:58:04.5084959Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:58:04.5086162Z ^ 2025-05-07T19:58:04.5086415Z 2025-05-07T19:58:04.5086780Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:58:04.5087346Z 2025-05-07T19:58:04.5088754Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:58:04.5091265Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:58:04.5092416Z ^ 2025-05-07T19:58:04.5092773Z 2025-05-07T19:58:14.1469429Z [278/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_rowwise_adagrad_split_unweighted_nobag_kernel_warp.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_rowwise_adagrad_split_unweighted_nobag_kernel_warp.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.12/cmake-build/gen_embedding_backward_rowwise_adagrad_split_unweighted_nobag_kernel_warp.cu -o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_rowwise_adagrad_split_unweighted_nobag_kernel_warp.cu.o 2025-05-07T19:58:14.1491808Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:58:14.1494376Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:58:14.1495460Z ^ 2025-05-07T19:58:14.1495716Z 2025-05-07T19:58:14.1496206Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:58:14.1496788Z 2025-05-07T19:58:14.1498368Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:58:14.1500756Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:58:14.1501829Z ^ 2025-05-07T19:58:14.1502202Z 2025-05-07T19:58:14.1503740Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:58:14.1506254Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:58:14.1507307Z ^ 2025-05-07T19:58:14.1507585Z 2025-05-07T19:58:14.1507986Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:58:14.1508575Z 2025-05-07T19:58:14.1509825Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:58:14.1512230Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:58:14.1513317Z ^ 2025-05-07T19:58:14.1513857Z 2025-05-07T19:58:14.1515351Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:58:14.1517858Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:58:14.1518941Z ^ 2025-05-07T19:58:14.1519176Z 2025-05-07T19:58:14.1519582Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:58:14.1520230Z 2025-05-07T19:58:14.1521577Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:58:14.1524217Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:58:14.1525306Z ^ 2025-05-07T19:58:14.1525676Z 2025-05-07T19:58:14.1527246Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:58:14.1529682Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:58:14.1530654Z ^ 2025-05-07T19:58:14.1530884Z 2025-05-07T19:58:14.1531221Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:58:14.1531687Z 2025-05-07T19:58:14.1533147Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:58:14.1535880Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:58:14.1536960Z ^ 2025-05-07T19:58:14.1537290Z 2025-05-07T19:58:14.1538719Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:58:14.1541342Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:58:14.1542426Z ^ 2025-05-07T19:58:14.1542651Z 2025-05-07T19:58:14.1543071Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:58:14.1543719Z 2025-05-07T19:58:14.1545223Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:58:14.1547693Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:58:14.1548618Z ^ 2025-05-07T19:58:14.1548989Z 2025-05-07T19:58:18.1505587Z [279/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_adam_split_weighted_kernel_warp.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_adam_split_weighted_kernel_warp.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.12/cmake-build/gen_embedding_backward_adam_split_weighted_kernel_warp.cu -o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_adam_split_weighted_kernel_warp.cu.o 2025-05-07T19:58:18.1530302Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:58:18.1533067Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:58:18.1534274Z ^ 2025-05-07T19:58:18.1534529Z 2025-05-07T19:58:18.1534973Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:58:18.1535689Z 2025-05-07T19:58:18.1537374Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:58:18.1540317Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:58:18.1541717Z ^ 2025-05-07T19:58:18.1542127Z 2025-05-07T19:58:18.1543842Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:58:18.1546643Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:58:18.1547874Z ^ 2025-05-07T19:58:18.1548131Z 2025-05-07T19:58:18.1548607Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:58:18.1549322Z 2025-05-07T19:58:18.1551208Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:58:18.1554434Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:58:18.1555706Z ^ 2025-05-07T19:58:18.1556072Z 2025-05-07T19:58:18.1557814Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:58:18.1560662Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:58:18.1561882Z ^ 2025-05-07T19:58:18.1562136Z 2025-05-07T19:58:18.1562739Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:58:18.1563416Z 2025-05-07T19:58:18.1565321Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:58:18.1568437Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:58:18.1569689Z ^ 2025-05-07T19:58:18.1570093Z 2025-05-07T19:58:18.1571770Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:58:18.1574659Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:58:18.1575798Z ^ 2025-05-07T19:58:18.1576105Z 2025-05-07T19:58:18.1576601Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:58:18.1577302Z 2025-05-07T19:58:18.1579145Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:58:18.1581863Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:58:18.1583024Z ^ 2025-05-07T19:58:18.1583410Z 2025-05-07T19:58:18.1585044Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:58:18.1587786Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:58:18.1588988Z ^ 2025-05-07T19:58:18.1589241Z 2025-05-07T19:58:18.1589696Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:58:18.1590428Z 2025-05-07T19:58:18.1592202Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:58:18.1594933Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:58:18.1596075Z ^ 2025-05-07T19:58:18.1596454Z 2025-05-07T19:58:18.2249729Z [280/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_adam_split_weighted_kernel_cta.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_adam_split_weighted_kernel_cta.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.12/cmake-build/gen_embedding_backward_adam_split_weighted_kernel_cta.cu -o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_adam_split_weighted_kernel_cta.cu.o 2025-05-07T19:58:18.2274289Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:58:18.2277361Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:58:18.2278574Z ^ 2025-05-07T19:58:18.2278868Z 2025-05-07T19:58:18.2279368Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:58:18.2280058Z 2025-05-07T19:58:18.2281693Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:58:18.2284335Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:58:18.2285627Z ^ 2025-05-07T19:58:18.2286020Z 2025-05-07T19:58:18.2287456Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:58:18.2290061Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:58:18.2291309Z ^ 2025-05-07T19:58:18.2291565Z 2025-05-07T19:58:18.2292370Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:58:18.2293069Z 2025-05-07T19:58:18.2294716Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:58:18.2297438Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:58:18.2298655Z ^ 2025-05-07T19:58:18.2299059Z 2025-05-07T19:58:18.2300747Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:58:18.2304083Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:58:18.2305363Z ^ 2025-05-07T19:58:18.2305663Z 2025-05-07T19:58:18.2306160Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:58:18.2306851Z 2025-05-07T19:58:18.2308808Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:58:18.2311756Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:58:18.2313165Z ^ 2025-05-07T19:58:18.2313544Z 2025-05-07T19:58:18.2315218Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:58:18.2317902Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:58:18.2319089Z ^ 2025-05-07T19:58:18.2319349Z 2025-05-07T19:58:18.2319806Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:58:18.2320502Z 2025-05-07T19:58:18.2322175Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:58:18.2324868Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:58:18.2326074Z ^ 2025-05-07T19:58:18.2326460Z 2025-05-07T19:58:18.2328066Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:58:18.2330453Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:58:18.2331628Z ^ 2025-05-07T19:58:18.2331883Z 2025-05-07T19:58:18.2332531Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:58:18.2333192Z 2025-05-07T19:58:18.2334799Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:58:18.2337659Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:58:18.2339141Z ^ 2025-05-07T19:58:18.2339534Z 2025-05-07T19:58:20.8178497Z [281/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_sgd_split_weighted_kernel_warp.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_sgd_split_weighted_kernel_warp.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.12/cmake-build/gen_embedding_backward_sgd_split_weighted_kernel_warp.cu -o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_sgd_split_weighted_kernel_warp.cu.o 2025-05-07T19:58:20.8202963Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:58:20.8205783Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:58:20.8207035Z ^ 2025-05-07T19:58:20.8207256Z 2025-05-07T19:58:20.8207718Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:58:20.8208337Z 2025-05-07T19:58:20.8210006Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:58:20.8212834Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:58:20.8214095Z ^ 2025-05-07T19:58:20.8214476Z 2025-05-07T19:58:20.8216213Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:58:20.8219110Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:58:20.8220291Z ^ 2025-05-07T19:58:20.8220537Z 2025-05-07T19:58:20.8220989Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:58:20.8221579Z 2025-05-07T19:58:20.8223163Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:58:20.8225775Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:58:20.8227151Z ^ 2025-05-07T19:58:20.8227518Z 2025-05-07T19:58:20.8229117Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:58:20.8232091Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:58:20.8233518Z ^ 2025-05-07T19:58:20.8233776Z 2025-05-07T19:58:20.8234247Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:58:20.8234870Z 2025-05-07T19:58:20.8236503Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:58:20.8239073Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:58:20.8240310Z ^ 2025-05-07T19:58:20.8240718Z 2025-05-07T19:58:20.8242432Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:58:20.8244993Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:58:20.8246068Z ^ 2025-05-07T19:58:20.8246348Z 2025-05-07T19:58:20.8246797Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:58:20.8247475Z 2025-05-07T19:58:20.8249385Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:58:20.8252067Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:58:20.8253249Z ^ 2025-05-07T19:58:20.8253546Z 2025-05-07T19:58:20.8255179Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:58:20.8257765Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:58:20.8258937Z ^ 2025-05-07T19:58:20.8259185Z 2025-05-07T19:58:20.8259624Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:58:20.8260311Z 2025-05-07T19:58:20.8261943Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:58:20.8265066Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:58:20.8266158Z ^ 2025-05-07T19:58:20.8266534Z 2025-05-07T19:58:21.3921589Z [282/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_split_indice_weights_codegen_cuda.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_split_indice_weights_codegen_cuda.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.12/cmake-build/gen_embedding_backward_split_indice_weights_codegen_cuda.cu -o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_split_indice_weights_codegen_cuda.cu.o 2025-05-07T19:58:21.3945782Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:58:21.3948671Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:58:21.3949880Z ^ 2025-05-07T19:58:21.3950147Z 2025-05-07T19:58:21.3950619Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:58:21.3951325Z 2025-05-07T19:58:21.3953202Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:58:21.3956006Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:58:21.3957591Z ^ 2025-05-07T19:58:21.3957944Z 2025-05-07T19:58:21.3959622Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:58:21.3962267Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:58:21.3963649Z ^ 2025-05-07T19:58:21.3963900Z 2025-05-07T19:58:21.3964393Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:58:21.3965372Z 2025-05-07T19:58:21.3967299Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:58:21.3970479Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:58:21.3971733Z ^ 2025-05-07T19:58:21.3972162Z 2025-05-07T19:58:21.3974173Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:58:21.3976963Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:58:21.3978161Z ^ 2025-05-07T19:58:21.3978465Z 2025-05-07T19:58:21.3978929Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:58:21.3979565Z 2025-05-07T19:58:21.3981227Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:58:21.3984355Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:58:21.3985620Z ^ 2025-05-07T19:58:21.3985989Z 2025-05-07T19:58:21.3987695Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:58:21.3990364Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:58:21.3991683Z ^ 2025-05-07T19:58:21.3991941Z 2025-05-07T19:58:21.3992385Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:58:21.3992960Z 2025-05-07T19:58:21.3994775Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:58:21.3997609Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:58:21.3998785Z ^ 2025-05-07T19:58:21.3999138Z 2025-05-07T19:58:21.4000766Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:58:21.4003540Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:58:21.4004686Z ^ 2025-05-07T19:58:21.4005267Z 2025-05-07T19:58:21.4005724Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:58:21.4006323Z 2025-05-07T19:58:21.4007804Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:58:21.4010376Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:58:21.4011586Z ^ 2025-05-07T19:58:21.4011973Z 2025-05-07T19:58:23.8376924Z [283/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_rowwise_adagrad_split_weighted_kernel_cta.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_rowwise_adagrad_split_weighted_kernel_cta.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.12/cmake-build/gen_embedding_backward_rowwise_adagrad_split_weighted_kernel_cta.cu -o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_rowwise_adagrad_split_weighted_kernel_cta.cu.o 2025-05-07T19:58:23.8392534Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:58:23.8394050Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:58:23.8394718Z ^ 2025-05-07T19:58:23.8394909Z 2025-05-07T19:58:23.8395179Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:58:23.8395563Z 2025-05-07T19:58:23.8396482Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:58:23.8398156Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:58:23.8398864Z ^ 2025-05-07T19:58:23.8399081Z 2025-05-07T19:58:23.8400002Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:58:23.8401457Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:58:23.8402181Z ^ 2025-05-07T19:58:23.8402341Z 2025-05-07T19:58:23.8402631Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:58:23.8403016Z 2025-05-07T19:58:23.8404022Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:58:23.8405516Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:58:23.8406194Z ^ 2025-05-07T19:58:23.8406444Z 2025-05-07T19:58:23.8407344Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:58:23.8408834Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:58:23.8409507Z ^ 2025-05-07T19:58:23.8409697Z 2025-05-07T19:58:23.8409961Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:58:23.8410347Z 2025-05-07T19:58:23.8411278Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:58:23.8412743Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:58:23.8413454Z ^ 2025-05-07T19:58:23.8413672Z 2025-05-07T19:58:23.8414591Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:58:23.8416055Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:58:23.8416752Z ^ 2025-05-07T19:58:23.8416916Z 2025-05-07T19:58:23.8417177Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:58:23.8417578Z 2025-05-07T19:58:23.8418474Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:58:23.8419949Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:58:23.8420611Z ^ 2025-05-07T19:58:23.8420859Z 2025-05-07T19:58:23.8421746Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:58:23.8423268Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:58:23.8423934Z ^ 2025-05-07T19:58:23.8424116Z 2025-05-07T19:58:23.8424376Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:58:23.8424750Z 2025-05-07T19:58:23.8425654Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:58:23.8427287Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:58:23.8427980Z ^ 2025-05-07T19:58:23.8428181Z 2025-05-07T19:58:24.1436284Z [284/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_sgd_split_unweighted_nobag_cuda.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_sgd_split_unweighted_nobag_cuda.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.12/cmake-build/gen_embedding_backward_sgd_split_unweighted_nobag_cuda.cu -o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_sgd_split_unweighted_nobag_cuda.cu.o 2025-05-07T19:58:24.1459661Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:58:24.1462184Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:58:24.1463593Z ^ 2025-05-07T19:58:24.1463850Z 2025-05-07T19:58:24.1464269Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:58:24.1465196Z 2025-05-07T19:58:24.1466790Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:58:24.1469272Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:58:24.1470437Z ^ 2025-05-07T19:58:24.1470791Z 2025-05-07T19:58:24.1472529Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:58:24.1475239Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:58:24.1476424Z ^ 2025-05-07T19:58:24.1476679Z 2025-05-07T19:58:24.1477368Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:58:24.1477999Z 2025-05-07T19:58:24.1479513Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:58:24.1482037Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:58:24.1483131Z ^ 2025-05-07T19:58:24.1483511Z 2025-05-07T19:58:24.1485024Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:58:24.1487738Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:58:24.1488929Z ^ 2025-05-07T19:58:24.1489221Z 2025-05-07T19:58:24.1489663Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:58:24.1490332Z 2025-05-07T19:58:24.1491992Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:58:24.1494595Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:58:24.1495838Z ^ 2025-05-07T19:58:24.1496199Z 2025-05-07T19:58:24.1497781Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:58:24.1500376Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:58:24.1501551Z ^ 2025-05-07T19:58:24.1501796Z 2025-05-07T19:58:24.1502221Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:58:24.1502879Z 2025-05-07T19:58:24.1504521Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:58:24.1507207Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:58:24.1508603Z ^ 2025-05-07T19:58:24.1508992Z 2025-05-07T19:58:24.1510591Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:58:24.1513465Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:58:24.1514554Z ^ 2025-05-07T19:58:24.1514822Z 2025-05-07T19:58:24.1515245Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:58:24.1515853Z 2025-05-07T19:58:24.1517404Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:58:24.1520134Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:58:24.1521280Z ^ 2025-05-07T19:58:24.1521793Z 2025-05-07T19:58:25.1691478Z [285/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_adam_split_weighted_cuda.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_adam_split_weighted_cuda.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.12/cmake-build/gen_embedding_backward_adam_split_weighted_cuda.cu -o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_adam_split_weighted_cuda.cu.o 2025-05-07T19:58:25.1716021Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:58:25.1718975Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:58:25.1720198Z ^ 2025-05-07T19:58:25.1720454Z 2025-05-07T19:58:25.1720899Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:58:25.1721557Z 2025-05-07T19:58:25.1723197Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:58:25.1725841Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:58:25.1727236Z ^ 2025-05-07T19:58:25.1727544Z 2025-05-07T19:58:25.1729121Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:58:25.1732173Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:58:25.1733395Z ^ 2025-05-07T19:58:25.1733662Z 2025-05-07T19:58:25.1734152Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:58:25.1734882Z 2025-05-07T19:58:25.1736661Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:58:25.1739461Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:58:25.1740720Z ^ 2025-05-07T19:58:25.1741105Z 2025-05-07T19:58:25.1742778Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:58:25.1745578Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:58:25.1746783Z ^ 2025-05-07T19:58:25.1747082Z 2025-05-07T19:58:25.1747544Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:58:25.1748236Z 2025-05-07T19:58:25.1750065Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:58:25.1753061Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:58:25.1754341Z ^ 2025-05-07T19:58:25.1754736Z 2025-05-07T19:58:25.1756385Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:58:25.1758913Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:58:25.1760113Z ^ 2025-05-07T19:58:25.1760368Z 2025-05-07T19:58:25.1760816Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:58:25.1761532Z 2025-05-07T19:58:25.1763274Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:58:25.1766703Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:58:25.1768003Z ^ 2025-05-07T19:58:25.1768416Z 2025-05-07T19:58:25.1770117Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:58:25.1772878Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:58:25.1774282Z ^ 2025-05-07T19:58:25.1774583Z 2025-05-07T19:58:25.1775029Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:58:25.1775671Z 2025-05-07T19:58:25.1777545Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:58:25.1780244Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:58:25.1781373Z ^ 2025-05-07T19:58:25.1781730Z 2025-05-07T19:58:26.2147707Z [286/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_forward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_split_unweighted_gwd_kernel.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_split_unweighted_gwd_kernel.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.12/cmake-build/gen_embedding_forward_split_unweighted_gwd_kernel.cu -o CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_split_unweighted_gwd_kernel.cu.o 2025-05-07T19:58:26.2172123Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:58:26.2175042Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:58:26.2176254Z ^ 2025-05-07T19:58:26.2176520Z 2025-05-07T19:58:26.2176986Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:58:26.2177708Z 2025-05-07T19:58:26.2179411Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:58:26.2182499Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:58:26.2183653Z ^ 2025-05-07T19:58:26.2184003Z 2025-05-07T19:58:26.2185953Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:58:26.2188780Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:58:26.2189962Z ^ 2025-05-07T19:58:26.2190227Z 2025-05-07T19:58:26.2190696Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:58:26.2191380Z 2025-05-07T19:58:26.2193207Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:58:26.2196282Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:58:26.2197494Z ^ 2025-05-07T19:58:26.2197858Z 2025-05-07T19:58:26.2199178Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.12/cmake-build/gen_embedding_forward_split_unweighted_gwd_kernel.cu(231): warning #177-D: variable "offset_idx" was declared but never referenced 2025-05-07T19:58:26.2200896Z const auto offset_idx = idx * D_emb; 2025-05-07T19:58:26.2201347Z ^ 2025-05-07T19:58:26.2201563Z 2025-05-07T19:58:26.2202742Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:58:26.2205243Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:58:26.2206392Z ^ 2025-05-07T19:58:26.2206678Z 2025-05-07T19:58:26.2207147Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:58:26.2207774Z 2025-05-07T19:58:26.2209428Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:58:26.2212109Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:58:26.2213301Z ^ 2025-05-07T19:58:26.2213672Z 2025-05-07T19:58:26.2215068Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.12/cmake-build/gen_embedding_forward_split_unweighted_gwd_kernel.cu(231): warning #177-D: variable "offset_idx" was declared but never referenced 2025-05-07T19:58:26.2217400Z const auto offset_idx = idx * D_emb; 2025-05-07T19:58:26.2218016Z ^ 2025-05-07T19:58:26.2218298Z 2025-05-07T19:58:26.2220062Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:58:26.2222738Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:58:26.2223994Z ^ 2025-05-07T19:58:26.2224255Z 2025-05-07T19:58:26.2224835Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:58:26.2225521Z 2025-05-07T19:58:26.2227259Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:58:26.2230498Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:58:26.2231831Z ^ 2025-05-07T19:58:26.2232181Z 2025-05-07T19:58:26.2233588Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.12/cmake-build/gen_embedding_forward_split_unweighted_gwd_kernel.cu(231): warning #177-D: variable "offset_idx" was declared but never referenced 2025-05-07T19:58:26.2235415Z const auto offset_idx = idx * D_emb; 2025-05-07T19:58:26.2236012Z ^ 2025-05-07T19:58:26.2236296Z 2025-05-07T19:58:26.2238045Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:58:26.2240689Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:58:26.2241944Z ^ 2025-05-07T19:58:26.2242215Z 2025-05-07T19:58:26.2242682Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:58:26.2243391Z 2025-05-07T19:58:26.2245108Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:58:26.2247850Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:58:26.2249044Z ^ 2025-05-07T19:58:26.2249451Z 2025-05-07T19:58:26.2250867Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.12/cmake-build/gen_embedding_forward_split_unweighted_gwd_kernel.cu(231): warning #177-D: variable "offset_idx" was declared but never referenced 2025-05-07T19:58:26.2252741Z const auto offset_idx = idx * D_emb; 2025-05-07T19:58:26.2253318Z ^ 2025-05-07T19:58:26.2253590Z 2025-05-07T19:58:26.9367858Z [287/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_rowwise_adagrad_split_unweighted_kernel_warp.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_rowwise_adagrad_split_unweighted_kernel_warp.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.12/cmake-build/gen_embedding_backward_rowwise_adagrad_split_unweighted_kernel_warp.cu -o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_rowwise_adagrad_split_unweighted_kernel_warp.cu.o 2025-05-07T19:58:26.9380725Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:58:26.9382235Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:58:26.9382935Z ^ 2025-05-07T19:58:26.9383101Z 2025-05-07T19:58:26.9383365Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:58:26.9383767Z 2025-05-07T19:58:26.9384665Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:58:26.9386145Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:58:26.9386823Z ^ 2025-05-07T19:58:26.9387059Z 2025-05-07T19:58:26.9387950Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:58:26.9389423Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:58:26.9390092Z ^ 2025-05-07T19:58:26.9390251Z 2025-05-07T19:58:26.9390532Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:58:26.9390907Z 2025-05-07T19:58:26.9392017Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:58:26.9393591Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:58:26.9394285Z ^ 2025-05-07T19:58:26.9394498Z 2025-05-07T19:58:26.9395403Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:58:26.9396864Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:58:26.9397550Z ^ 2025-05-07T19:58:26.9397708Z 2025-05-07T19:58:26.9397968Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:58:26.9398397Z 2025-05-07T19:58:26.9399362Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:58:26.9400893Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:58:26.9401592Z ^ 2025-05-07T19:58:26.9401807Z 2025-05-07T19:58:26.9402819Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:58:26.9404261Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:58:26.9404939Z ^ 2025-05-07T19:58:26.9405093Z 2025-05-07T19:58:26.9405348Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:58:26.9405744Z 2025-05-07T19:58:26.9406634Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:58:26.9408265Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:58:26.9408928Z ^ 2025-05-07T19:58:26.9409163Z 2025-05-07T19:58:26.9410037Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:58:26.9411468Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:58:26.9412120Z ^ 2025-05-07T19:58:26.9412274Z 2025-05-07T19:58:26.9412553Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:58:26.9412924Z 2025-05-07T19:58:26.9413993Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:58:26.9415482Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:58:26.9416187Z ^ 2025-05-07T19:58:26.9416403Z 2025-05-07T19:58:29.9371571Z [288/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_adam_split_unweighted_nobag_kernel_warp.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_adam_split_unweighted_nobag_kernel_warp.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.12/cmake-build/gen_embedding_backward_adam_split_unweighted_nobag_kernel_warp.cu -o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_adam_split_unweighted_nobag_kernel_warp.cu.o 2025-05-07T19:58:29.9394452Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:58:29.9396812Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:58:29.9397951Z ^ 2025-05-07T19:58:29.9398166Z 2025-05-07T19:58:29.9398543Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:58:29.9399100Z 2025-05-07T19:58:29.9400578Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:58:29.9403238Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:58:29.9404427Z ^ 2025-05-07T19:58:29.9404805Z 2025-05-07T19:58:29.9406148Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:58:29.9408383Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:58:29.9409343Z ^ 2025-05-07T19:58:29.9409587Z 2025-05-07T19:58:29.9409987Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:58:29.9410639Z 2025-05-07T19:58:29.9411646Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:58:29.9413058Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:58:29.9413718Z ^ 2025-05-07T19:58:29.9413923Z 2025-05-07T19:58:29.9414781Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:58:29.9416256Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:58:29.9416896Z ^ 2025-05-07T19:58:29.9417040Z 2025-05-07T19:58:29.9417285Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:58:29.9417657Z 2025-05-07T19:58:29.9418613Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:58:29.9420026Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:58:29.9420658Z ^ 2025-05-07T19:58:29.9420873Z 2025-05-07T19:58:29.9421733Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:58:29.9423129Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:58:29.9423752Z ^ 2025-05-07T19:58:29.9423896Z 2025-05-07T19:58:29.9424158Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:58:29.9424516Z 2025-05-07T19:58:29.9425381Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:58:29.9426792Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:58:29.9427441Z ^ 2025-05-07T19:58:29.9427645Z 2025-05-07T19:58:29.9428502Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:58:29.9429904Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:58:29.9430538Z ^ 2025-05-07T19:58:29.9430678Z 2025-05-07T19:58:29.9430918Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:58:29.9431273Z 2025-05-07T19:58:29.9432341Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:58:29.9433746Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:58:29.9434395Z ^ 2025-05-07T19:58:29.9434638Z 2025-05-07T19:58:30.9138894Z [289/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_sgd_split_unweighted_cuda.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_sgd_split_unweighted_cuda.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.12/cmake-build/gen_embedding_backward_sgd_split_unweighted_cuda.cu -o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_sgd_split_unweighted_cuda.cu.o 2025-05-07T19:58:30.9161480Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:58:30.9164280Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:58:30.9165727Z ^ 2025-05-07T19:58:30.9165991Z 2025-05-07T19:58:30.9166419Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:58:30.9167082Z 2025-05-07T19:58:30.9168790Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:58:30.9171531Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:58:30.9172730Z ^ 2025-05-07T19:58:30.9173088Z 2025-05-07T19:58:30.9174745Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:58:30.9177526Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:58:30.9178702Z ^ 2025-05-07T19:58:30.9178947Z 2025-05-07T19:58:30.9179389Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:58:30.9180070Z 2025-05-07T19:58:30.9181771Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:58:30.9184425Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:58:30.9185589Z ^ 2025-05-07T19:58:30.9186089Z 2025-05-07T19:58:30.9187751Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:58:30.9190527Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:58:30.9191837Z ^ 2025-05-07T19:58:30.9192089Z 2025-05-07T19:58:30.9192528Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:58:30.9193170Z 2025-05-07T19:58:30.9194786Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:58:30.9197393Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:58:30.9198583Z ^ 2025-05-07T19:58:30.9198956Z 2025-05-07T19:58:30.9200632Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:58:30.9203263Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:58:30.9204435Z ^ 2025-05-07T19:58:30.9204691Z 2025-05-07T19:58:30.9205126Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:58:30.9205777Z 2025-05-07T19:58:30.9207290Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:58:30.9209944Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:58:30.9211078Z ^ 2025-05-07T19:58:30.9211456Z 2025-05-07T19:58:30.9212885Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:58:30.9215042Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:58:30.9215961Z ^ 2025-05-07T19:58:30.9216173Z 2025-05-07T19:58:30.9216546Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:58:30.9217088Z 2025-05-07T19:58:30.9218400Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:58:30.9220902Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:58:30.9222001Z ^ 2025-05-07T19:58:30.9222329Z 2025-05-07T19:58:32.7070164Z [290/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_adam_split_unweighted_nobag_cuda.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_adam_split_unweighted_nobag_cuda.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.12/cmake-build/gen_embedding_backward_adam_split_unweighted_nobag_cuda.cu -o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_adam_split_unweighted_nobag_cuda.cu.o 2025-05-07T19:58:32.7094231Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:58:32.7096878Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:58:32.7098061Z ^ 2025-05-07T19:58:32.7098316Z 2025-05-07T19:58:32.7098763Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:58:32.7099419Z 2025-05-07T19:58:32.7101112Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:58:32.7103834Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:58:32.7105019Z ^ 2025-05-07T19:58:32.7105663Z 2025-05-07T19:58:32.7107330Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:58:32.7110032Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:58:32.7111265Z ^ 2025-05-07T19:58:32.7111674Z 2025-05-07T19:58:32.7112171Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:58:32.7112832Z 2025-05-07T19:58:32.7114514Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:58:32.7117598Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:58:32.7118866Z ^ 2025-05-07T19:58:32.7119240Z 2025-05-07T19:58:32.7121070Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:58:32.7123824Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:58:32.7125021Z ^ 2025-05-07T19:58:32.7125318Z 2025-05-07T19:58:32.7125772Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:58:32.7126464Z 2025-05-07T19:58:32.7128183Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:58:32.7130910Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:58:32.7132117Z ^ 2025-05-07T19:58:32.7132484Z 2025-05-07T19:58:32.7134159Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:58:32.7136835Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:58:32.7138008Z ^ 2025-05-07T19:58:32.7138263Z 2025-05-07T19:58:32.7138724Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:58:32.7139548Z 2025-05-07T19:58:32.7141185Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:58:32.7143849Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:58:32.7145002Z ^ 2025-05-07T19:58:32.7145363Z 2025-05-07T19:58:32.7146899Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:58:32.7149472Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:58:32.7150608Z ^ 2025-05-07T19:58:32.7150857Z 2025-05-07T19:58:32.7151575Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:58:32.7152204Z 2025-05-07T19:58:32.7153826Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:58:32.7156400Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:58:32.7157544Z ^ 2025-05-07T19:58:32.7157888Z 2025-05-07T19:58:33.4804324Z [291/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_sgd_split_unweighted_nobag_kernel_warp.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_sgd_split_unweighted_nobag_kernel_warp.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.12/cmake-build/gen_embedding_backward_sgd_split_unweighted_nobag_kernel_warp.cu -o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_sgd_split_unweighted_nobag_kernel_warp.cu.o 2025-05-07T19:58:33.4829185Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:58:33.4832189Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:58:33.4833428Z ^ 2025-05-07T19:58:33.4833686Z 2025-05-07T19:58:33.4834186Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:58:33.4834894Z 2025-05-07T19:58:33.4836523Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:58:33.4839617Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:58:33.4841040Z ^ 2025-05-07T19:58:33.4841418Z 2025-05-07T19:58:33.4843083Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:58:33.4845852Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:58:33.4847090Z ^ 2025-05-07T19:58:33.4847478Z 2025-05-07T19:58:33.4847942Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:58:33.4848643Z 2025-05-07T19:58:33.4850273Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:58:33.4853043Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:58:33.4854234Z ^ 2025-05-07T19:58:33.4854619Z 2025-05-07T19:58:33.4856256Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:58:33.4859098Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:58:33.4860296Z ^ 2025-05-07T19:58:33.4860562Z 2025-05-07T19:58:33.4861052Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:58:33.4861681Z 2025-05-07T19:58:33.4863360Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:58:33.4866464Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:58:33.4867702Z ^ 2025-05-07T19:58:33.4868061Z 2025-05-07T19:58:33.4869881Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:58:33.4872776Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:58:33.4874157Z ^ 2025-05-07T19:58:33.4874413Z 2025-05-07T19:58:33.4874849Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:58:33.4875526Z 2025-05-07T19:58:33.4877231Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:58:33.4879953Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:58:33.4881211Z ^ 2025-05-07T19:58:33.4881589Z 2025-05-07T19:58:33.4883328Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:58:33.4886242Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:58:33.4887455Z ^ 2025-05-07T19:58:33.4887723Z 2025-05-07T19:58:33.4888207Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:58:33.4888887Z 2025-05-07T19:58:33.4890533Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:58:33.4893314Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:58:33.4895008Z ^ 2025-05-07T19:58:33.4895338Z 2025-05-07T19:58:35.0974928Z [292/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_adam_split_unweighted_nobag_kernel_cta.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_adam_split_unweighted_nobag_kernel_cta.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.12/cmake-build/gen_embedding_backward_adam_split_unweighted_nobag_kernel_cta.cu -o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_adam_split_unweighted_nobag_kernel_cta.cu.o 2025-05-07T19:58:35.0997429Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:58:35.0999889Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:58:35.1001329Z ^ 2025-05-07T19:58:35.1001583Z 2025-05-07T19:58:35.1001977Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:58:35.1002784Z 2025-05-07T19:58:35.1004425Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:58:35.1007170Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:58:35.1008214Z ^ 2025-05-07T19:58:35.1008570Z 2025-05-07T19:58:35.1010224Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:58:35.1012824Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:58:35.1014055Z ^ 2025-05-07T19:58:35.1014284Z 2025-05-07T19:58:35.1014715Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:58:35.1015483Z 2025-05-07T19:58:35.1017339Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:58:35.1019877Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:58:35.1020873Z ^ 2025-05-07T19:58:35.1021243Z 2025-05-07T19:58:35.1022858Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:58:35.1025287Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:58:35.1026433Z ^ 2025-05-07T19:58:35.1026663Z 2025-05-07T19:58:35.1027136Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:58:35.1027701Z 2025-05-07T19:58:35.1029259Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:58:35.1031865Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:58:35.1033070Z ^ 2025-05-07T19:58:35.1033423Z 2025-05-07T19:58:35.1034860Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:58:35.1037448Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:58:35.1038458Z ^ 2025-05-07T19:58:35.1038689Z 2025-05-07T19:58:35.1039114Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:58:35.1039815Z 2025-05-07T19:58:35.1041365Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:58:35.1043903Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:58:35.1045164Z ^ 2025-05-07T19:58:35.1045479Z 2025-05-07T19:58:35.1047094Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:58:35.1049545Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:58:35.1050682Z ^ 2025-05-07T19:58:35.1050907Z 2025-05-07T19:58:35.1051304Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:58:35.1051881Z 2025-05-07T19:58:35.1053501Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:58:35.1056050Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:58:35.1057146Z ^ 2025-05-07T19:58:35.1057473Z 2025-05-07T19:58:36.4203289Z [293/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_adam_split_unweighted_kernel_cta.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_adam_split_unweighted_kernel_cta.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.12/cmake-build/gen_embedding_backward_adam_split_unweighted_kernel_cta.cu -o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_adam_split_unweighted_kernel_cta.cu.o 2025-05-07T19:58:36.4227697Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:58:36.4230430Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:58:36.4231286Z ^ 2025-05-07T19:58:36.4231712Z 2025-05-07T19:58:36.4232087Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:58:36.4232743Z 2025-05-07T19:58:36.4234390Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:58:36.4236873Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:58:36.4238340Z ^ 2025-05-07T19:58:36.4238709Z 2025-05-07T19:58:36.4240595Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:58:36.4243549Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:58:36.4244820Z ^ 2025-05-07T19:58:36.4245075Z 2025-05-07T19:58:36.4245536Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:58:36.4246230Z 2025-05-07T19:58:36.4247970Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:58:36.4250495Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:58:36.4251545Z ^ 2025-05-07T19:58:36.4251848Z 2025-05-07T19:58:36.4253314Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:58:36.4255906Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:58:36.4257041Z ^ 2025-05-07T19:58:36.4257321Z 2025-05-07T19:58:36.4257779Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:58:36.4258460Z 2025-05-07T19:58:36.4259965Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:58:36.4262425Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:58:36.4263594Z ^ 2025-05-07T19:58:36.4263944Z 2025-05-07T19:58:36.4266431Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:58:36.4269129Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:58:36.4270322Z ^ 2025-05-07T19:58:36.4270586Z 2025-05-07T19:58:36.4271063Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:58:36.4271773Z 2025-05-07T19:58:36.4273330Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:58:36.4276506Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:58:36.4277668Z ^ 2025-05-07T19:58:36.4278049Z 2025-05-07T19:58:36.4279634Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:58:36.4282478Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:58:36.4283895Z ^ 2025-05-07T19:58:36.4284193Z 2025-05-07T19:58:36.4284664Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:58:36.4285540Z 2025-05-07T19:58:36.4287493Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:58:36.4290268Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:58:36.4291341Z ^ 2025-05-07T19:58:36.4291665Z 2025-05-07T19:58:38.7194081Z [294/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_adam_split_unweighted_kernel_warp.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_adam_split_unweighted_kernel_warp.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.12/cmake-build/gen_embedding_backward_adam_split_unweighted_kernel_warp.cu -o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_adam_split_unweighted_kernel_warp.cu.o 2025-05-07T19:58:38.7218150Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:58:38.7221080Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:58:38.7222364Z ^ 2025-05-07T19:58:38.7222651Z 2025-05-07T19:58:38.7223105Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:58:38.7223850Z 2025-05-07T19:58:38.7225465Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:58:38.7228504Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:58:38.7229725Z ^ 2025-05-07T19:58:38.7230378Z 2025-05-07T19:58:38.7232304Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:58:38.7234889Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:58:38.7236028Z ^ 2025-05-07T19:58:38.7236288Z 2025-05-07T19:58:38.7236734Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:58:38.7237427Z 2025-05-07T19:58:38.7239174Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:58:38.7241951Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:58:38.7243166Z ^ 2025-05-07T19:58:38.7243533Z 2025-05-07T19:58:38.7245261Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:58:38.7247963Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:58:38.7249165Z ^ 2025-05-07T19:58:38.7249426Z 2025-05-07T19:58:38.7249865Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:58:38.7250567Z 2025-05-07T19:58:38.7252279Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:58:38.7255051Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:58:38.7256250Z ^ 2025-05-07T19:58:38.7256624Z 2025-05-07T19:58:38.7258232Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:58:38.7260891Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:58:38.7262313Z ^ 2025-05-07T19:58:38.7262557Z 2025-05-07T19:58:38.7263051Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:58:38.7263753Z 2025-05-07T19:58:38.7265790Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:58:38.7268603Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:58:38.7269922Z ^ 2025-05-07T19:58:38.7270280Z 2025-05-07T19:58:38.7272068Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:58:38.7274968Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:58:38.7276173Z ^ 2025-05-07T19:58:38.7294559Z 2025-05-07T19:58:38.7295535Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:58:38.7296327Z 2025-05-07T19:58:38.7298094Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:58:38.7300932Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:58:38.7302112Z ^ 2025-05-07T19:58:38.7302530Z 2025-05-07T19:58:39.8772801Z [295/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_sgd_split_unweighted_kernel_warp.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_sgd_split_unweighted_kernel_warp.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.12/cmake-build/gen_embedding_backward_sgd_split_unweighted_kernel_warp.cu -o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_sgd_split_unweighted_kernel_warp.cu.o 2025-05-07T19:58:39.8796066Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:58:39.8798705Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:58:39.8800106Z ^ 2025-05-07T19:58:39.8800358Z 2025-05-07T19:58:39.8800799Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:58:39.8801476Z 2025-05-07T19:58:39.8803294Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:58:39.8806208Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:58:39.8807388Z ^ 2025-05-07T19:58:39.8807765Z 2025-05-07T19:58:39.8809324Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:58:39.8811859Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:58:39.8813012Z ^ 2025-05-07T19:58:39.8813308Z 2025-05-07T19:58:39.8813754Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:58:39.8814446Z 2025-05-07T19:58:39.8816132Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:58:39.8818662Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:58:39.8819807Z ^ 2025-05-07T19:58:39.8820148Z 2025-05-07T19:58:39.8821749Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:58:39.8824274Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:58:39.8825394Z ^ 2025-05-07T19:58:39.8825627Z 2025-05-07T19:58:39.8826075Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:58:39.8826752Z 2025-05-07T19:58:39.8828339Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:58:39.8830920Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:58:39.8832170Z ^ 2025-05-07T19:58:39.8832563Z 2025-05-07T19:58:39.8834132Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:58:39.8836952Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:58:39.8838074Z ^ 2025-05-07T19:58:39.8838350Z 2025-05-07T19:58:39.8838762Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:58:39.8839398Z 2025-05-07T19:58:39.8840963Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:58:39.8843495Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:58:39.8844749Z ^ 2025-05-07T19:58:39.8845079Z 2025-05-07T19:58:39.8846919Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:58:39.8849527Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:58:39.8850649Z ^ 2025-05-07T19:58:39.8850888Z 2025-05-07T19:58:39.8851313Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:58:39.8851990Z 2025-05-07T19:58:39.8853567Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:58:39.8856245Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:58:39.8857488Z ^ 2025-05-07T19:58:39.8857898Z 2025-05-07T19:58:43.2324081Z [296/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_rowwise_adagrad_split_unweighted_nobag_kernel_cta.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_rowwise_adagrad_split_unweighted_nobag_kernel_cta.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.12/cmake-build/gen_embedding_backward_rowwise_adagrad_split_unweighted_nobag_kernel_cta.cu -o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_rowwise_adagrad_split_unweighted_nobag_kernel_cta.cu.o 2025-05-07T19:58:43.2349851Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:58:43.2352889Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:58:43.2354132Z ^ 2025-05-07T19:58:43.2354699Z 2025-05-07T19:58:43.2355212Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:58:43.2355907Z 2025-05-07T19:58:43.2357601Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:58:43.2360453Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:58:43.2361736Z ^ 2025-05-07T19:58:43.2362120Z 2025-05-07T19:58:43.2363885Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:58:43.2366916Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:58:43.2367935Z ^ 2025-05-07T19:58:43.2368163Z 2025-05-07T19:58:43.2368566Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:58:43.2369200Z 2025-05-07T19:58:43.2370858Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:58:43.2373609Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:58:43.2375078Z ^ 2025-05-07T19:58:43.2375468Z 2025-05-07T19:58:43.2377272Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:58:43.2380095Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:58:43.2381376Z ^ 2025-05-07T19:58:43.2381644Z 2025-05-07T19:58:43.2382134Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:58:43.2382830Z 2025-05-07T19:58:43.2384639Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:58:43.2387631Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:58:43.2388718Z ^ 2025-05-07T19:58:43.2389036Z 2025-05-07T19:58:43.2390307Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:58:43.2392564Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:58:43.2393607Z ^ 2025-05-07T19:58:43.2393868Z 2025-05-07T19:58:43.2394290Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:58:43.2395101Z 2025-05-07T19:58:43.2396598Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:58:43.2399424Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:58:43.2400635Z ^ 2025-05-07T19:58:43.2401014Z 2025-05-07T19:58:43.2402730Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:58:43.2405466Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:58:43.2406681Z ^ 2025-05-07T19:58:43.2406937Z 2025-05-07T19:58:43.2407354Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:58:43.2407907Z 2025-05-07T19:58:43.2409385Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:58:43.2411688Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:58:43.2412707Z ^ 2025-05-07T19:58:43.2413041Z 2025-05-07T19:58:45.0035354Z [297/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_sgd_split_weighted_kernel_cta.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_sgd_split_weighted_kernel_cta.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.12/cmake-build/gen_embedding_backward_sgd_split_weighted_kernel_cta.cu -o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_sgd_split_weighted_kernel_cta.cu.o 2025-05-07T19:58:45.0059722Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:58:45.0062620Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:58:45.0063796Z ^ 2025-05-07T19:58:45.0064054Z 2025-05-07T19:58:45.0064479Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:58:45.0065381Z 2025-05-07T19:58:45.0067091Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:58:45.0069765Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:58:45.0070802Z ^ 2025-05-07T19:58:45.0071171Z 2025-05-07T19:58:45.0072905Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:58:45.0075580Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:58:45.0076711Z ^ 2025-05-07T19:58:45.0076995Z 2025-05-07T19:58:45.0077457Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:58:45.0078131Z 2025-05-07T19:58:45.0079727Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:58:45.0082526Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:58:45.0083631Z ^ 2025-05-07T19:58:45.0083984Z 2025-05-07T19:58:45.0085556Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:58:45.0088227Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:58:45.0089438Z ^ 2025-05-07T19:58:45.0089691Z 2025-05-07T19:58:45.0090113Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:58:45.0091064Z 2025-05-07T19:58:45.0092786Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:58:45.0095347Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:58:45.0096489Z ^ 2025-05-07T19:58:45.0096840Z 2025-05-07T19:58:45.0098162Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:58:45.0100689Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:58:45.0101864Z ^ 2025-05-07T19:58:45.0102104Z 2025-05-07T19:58:45.0102523Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:58:45.0103133Z 2025-05-07T19:58:45.0104850Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:58:45.0107390Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:58:45.0108513Z ^ 2025-05-07T19:58:45.0108891Z 2025-05-07T19:58:45.0110521Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:58:45.0113443Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:58:45.0114658Z ^ 2025-05-07T19:58:45.0114909Z 2025-05-07T19:58:45.0115338Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:58:45.0116046Z 2025-05-07T19:58:45.0117775Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:58:45.0120511Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:58:45.0121671Z ^ 2025-05-07T19:58:45.0122055Z 2025-05-07T19:58:45.6332524Z [298/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_adam_split_unweighted_cuda.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_adam_split_unweighted_cuda.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.12/cmake-build/gen_embedding_backward_adam_split_unweighted_cuda.cu -o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_adam_split_unweighted_cuda.cu.o 2025-05-07T19:58:45.6356696Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:58:45.6359623Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:58:45.6360824Z ^ 2025-05-07T19:58:45.6361142Z 2025-05-07T19:58:45.6361576Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:58:45.6362285Z 2025-05-07T19:58:45.6364031Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:58:45.6367066Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:58:45.6368326Z ^ 2025-05-07T19:58:45.6368707Z 2025-05-07T19:58:45.6370363Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:58:45.6373232Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:58:45.6374448Z ^ 2025-05-07T19:58:45.6374713Z 2025-05-07T19:58:45.6375162Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:58:45.6375814Z 2025-05-07T19:58:45.6377537Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:58:45.6380163Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:58:45.6381335Z ^ 2025-05-07T19:58:45.6381733Z 2025-05-07T19:58:45.6383337Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:58:45.6386027Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:58:45.6387456Z ^ 2025-05-07T19:58:45.6387704Z 2025-05-07T19:58:45.6388171Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:58:45.6388874Z 2025-05-07T19:58:45.6390593Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:58:45.6393502Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:58:45.6394763Z ^ 2025-05-07T19:58:45.6395140Z 2025-05-07T19:58:45.6397052Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:58:45.6399764Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:58:45.6402477Z ^ 2025-05-07T19:58:45.6402863Z 2025-05-07T19:58:45.6403332Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:58:45.6403983Z 2025-05-07T19:58:45.6405657Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:58:45.6408407Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:58:45.6409635Z ^ 2025-05-07T19:58:45.6410012Z 2025-05-07T19:58:45.6411715Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:58:45.6414394Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:58:45.6415574Z ^ 2025-05-07T19:58:45.6415835Z 2025-05-07T19:58:45.6416313Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:58:45.6416988Z 2025-05-07T19:58:45.6418686Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:58:45.6421566Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:58:45.6422756Z ^ 2025-05-07T19:58:45.6423145Z 2025-05-07T19:58:54.6159854Z [299/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_rowwise_adagrad_split_unweighted_kernel_cta.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_rowwise_adagrad_split_unweighted_kernel_cta.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.12/cmake-build/gen_embedding_backward_rowwise_adagrad_split_unweighted_kernel_cta.cu -o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_rowwise_adagrad_split_unweighted_kernel_cta.cu.o 2025-05-07T19:58:54.6181276Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:58:54.6183680Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:58:54.6184687Z ^ 2025-05-07T19:58:54.6184933Z 2025-05-07T19:58:54.6185373Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:58:54.6186035Z 2025-05-07T19:58:54.6187575Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:58:54.6190156Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:58:54.6191343Z ^ 2025-05-07T19:58:54.6191846Z 2025-05-07T19:58:54.6193406Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:58:54.6195976Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:58:54.6197176Z ^ 2025-05-07T19:58:54.6197430Z 2025-05-07T19:58:54.6197879Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:58:54.6198523Z 2025-05-07T19:58:54.6200108Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:58:54.6202685Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:58:54.6203790Z ^ 2025-05-07T19:58:54.6204433Z 2025-05-07T19:58:54.6205978Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:58:54.6208505Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:58:54.6209654Z ^ 2025-05-07T19:58:54.6209879Z 2025-05-07T19:58:54.6210279Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:58:54.6210879Z 2025-05-07T19:58:54.6212335Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:58:54.6214975Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:58:54.6216070Z ^ 2025-05-07T19:58:54.6216387Z 2025-05-07T19:58:54.6220529Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:58:54.6223165Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:58:54.6224287Z ^ 2025-05-07T19:58:54.6224548Z 2025-05-07T19:58:54.6224969Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:58:54.6225580Z 2025-05-07T19:58:54.6227053Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:58:54.6229505Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:58:54.6230363Z ^ 2025-05-07T19:58:54.6230631Z 2025-05-07T19:58:54.6232073Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:58:54.6234553Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:58:54.6235639Z ^ 2025-05-07T19:58:54.6235883Z 2025-05-07T19:58:54.6236320Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:58:54.6236886Z 2025-05-07T19:58:54.6238401Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:58:54.6240980Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:58:54.6242148Z ^ 2025-05-07T19:58:54.6242482Z 2025-05-07T19:58:59.1557486Z [300/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_lamb_split_weighted_kernel_warp.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_lamb_split_weighted_kernel_warp.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.12/cmake-build/gen_embedding_backward_lamb_split_weighted_kernel_warp.cu -o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_lamb_split_weighted_kernel_warp.cu.o 2025-05-07T19:58:59.1583585Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:58:59.1586632Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:58:59.1587962Z ^ 2025-05-07T19:58:59.1588254Z 2025-05-07T19:58:59.1588749Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:58:59.1589510Z 2025-05-07T19:58:59.1591284Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:58:59.1594099Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:58:59.1595422Z ^ 2025-05-07T19:58:59.1595828Z 2025-05-07T19:58:59.1597572Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:58:59.1600382Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:58:59.1601707Z ^ 2025-05-07T19:58:59.1601998Z 2025-05-07T19:58:59.1602509Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:58:59.1603238Z 2025-05-07T19:58:59.1605053Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:58:59.1608106Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:58:59.1609458Z ^ 2025-05-07T19:58:59.1609860Z 2025-05-07T19:58:59.1611635Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:58:59.1614425Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:58:59.1615726Z ^ 2025-05-07T19:58:59.1616046Z 2025-05-07T19:58:59.1616751Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:58:59.1617480Z 2025-05-07T19:58:59.1619318Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:58:59.1622345Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:58:59.1623698Z ^ 2025-05-07T19:58:59.1624098Z 2025-05-07T19:58:59.1625764Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:58:59.1628656Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:58:59.1629985Z ^ 2025-05-07T19:58:59.1630279Z 2025-05-07T19:58:59.1630806Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:58:59.1631710Z 2025-05-07T19:58:59.1633346Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:58:59.1636271Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:58:59.1637461Z ^ 2025-05-07T19:58:59.1637899Z 2025-05-07T19:58:59.1639670Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:58:59.1642467Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:58:59.1643729Z ^ 2025-05-07T19:58:59.1644022Z 2025-05-07T19:58:59.1644514Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:58:59.1645233Z 2025-05-07T19:58:59.1647081Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:58:59.1649992Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:58:59.1651180Z ^ 2025-05-07T19:58:59.1651586Z 2025-05-07T19:59:02.5508198Z [301/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_lamb_split_weighted_kernel_cta.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_lamb_split_weighted_kernel_cta.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.12/cmake-build/gen_embedding_backward_lamb_split_weighted_kernel_cta.cu -o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_lamb_split_weighted_kernel_cta.cu.o 2025-05-07T19:59:02.5534168Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:59:02.5537260Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:59:02.5538603Z ^ 2025-05-07T19:59:02.5538905Z 2025-05-07T19:59:02.5539433Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:59:02.5540175Z 2025-05-07T19:59:02.5541963Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:59:02.5544919Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:59:02.5546272Z ^ 2025-05-07T19:59:02.5546677Z 2025-05-07T19:59:02.5548463Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:59:02.5551371Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:59:02.5552914Z ^ 2025-05-07T19:59:02.5553205Z 2025-05-07T19:59:02.5553964Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:59:02.5554610Z 2025-05-07T19:59:02.5556435Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:59:02.5559355Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:59:02.5560663Z ^ 2025-05-07T19:59:02.5561097Z 2025-05-07T19:59:02.5562879Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:59:02.5566464Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:59:02.5567774Z ^ 2025-05-07T19:59:02.5568091Z 2025-05-07T19:59:02.5568585Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:59:02.5569304Z 2025-05-07T19:59:02.5571539Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:59:02.5574480Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:59:02.5575816Z ^ 2025-05-07T19:59:02.5576218Z 2025-05-07T19:59:02.5578017Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:59:02.5580921Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:59:02.5582248Z ^ 2025-05-07T19:59:02.5582537Z 2025-05-07T19:59:02.5583048Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:59:02.5583805Z 2025-05-07T19:59:02.5585612Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:59:02.5588557Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:59:02.5589889Z ^ 2025-05-07T19:59:02.5590326Z 2025-05-07T19:59:02.5592234Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:59:02.5595174Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:59:02.5596462Z ^ 2025-05-07T19:59:02.5596777Z 2025-05-07T19:59:02.5597272Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:59:02.5597993Z 2025-05-07T19:59:02.5599772Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:59:02.5602717Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:59:02.5604296Z ^ 2025-05-07T19:59:02.5604701Z 2025-05-07T19:59:03.0346968Z [302/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_sgd_split_unweighted_nobag_kernel_cta.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_sgd_split_unweighted_nobag_kernel_cta.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.12/cmake-build/gen_embedding_backward_sgd_split_unweighted_nobag_kernel_cta.cu -o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_sgd_split_unweighted_nobag_kernel_cta.cu.o 2025-05-07T19:59:03.0373206Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:59:03.0376317Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:59:03.0377658Z ^ 2025-05-07T19:59:03.0377947Z 2025-05-07T19:59:03.0378434Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:59:03.0379018Z 2025-05-07T19:59:03.0380846Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:59:03.0383764Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:59:03.0385060Z ^ 2025-05-07T19:59:03.0385464Z 2025-05-07T19:59:03.0387264Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:59:03.0390508Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:59:03.0391985Z ^ 2025-05-07T19:59:03.0392312Z 2025-05-07T19:59:03.0392832Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:59:03.0393554Z 2025-05-07T19:59:03.0395351Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:59:03.0398280Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:59:03.0399850Z ^ 2025-05-07T19:59:03.0400252Z 2025-05-07T19:59:03.0402023Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:59:03.0405178Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:59:03.0406483Z ^ 2025-05-07T19:59:03.0406800Z 2025-05-07T19:59:03.0407295Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:59:03.0408023Z 2025-05-07T19:59:03.0409844Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:59:03.0412765Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:59:03.0414093Z ^ 2025-05-07T19:59:03.0414499Z 2025-05-07T19:59:03.0416319Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:59:03.0419219Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:59:03.0420531Z ^ 2025-05-07T19:59:03.0420810Z 2025-05-07T19:59:03.0421303Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:59:03.0422061Z 2025-05-07T19:59:03.0423878Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:59:03.0426863Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:59:03.0428193Z ^ 2025-05-07T19:59:03.0428628Z 2025-05-07T19:59:03.0430387Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:59:03.0433471Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:59:03.0434764Z ^ 2025-05-07T19:59:03.0435071Z 2025-05-07T19:59:03.0435556Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:59:03.0436290Z 2025-05-07T19:59:03.0438085Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:59:03.0441193Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:59:03.0442480Z ^ 2025-05-07T19:59:03.0442885Z 2025-05-07T19:59:03.5157004Z [303/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_inference_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_inference.dir/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_nobag_int8_codegen_cuda.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_inference.dir/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_nobag_int8_codegen_cuda.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.12/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_nobag_int8_codegen_cuda.cu -o CMakeFiles/fbgemm_gpu_tbe_inference.dir/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_nobag_int8_codegen_cuda.cu.o 2025-05-07T19:59:03.5183623Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:59:03.5186780Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:59:03.5188110Z ^ 2025-05-07T19:59:03.5188409Z 2025-05-07T19:59:03.5188933Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:59:03.5189664Z 2025-05-07T19:59:03.5191612Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:59:03.5194590Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:59:03.5196182Z ^ 2025-05-07T19:59:03.5196590Z 2025-05-07T19:59:03.5198363Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.12/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_nobag_int8_codegen_cuda.cu(220): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:59:03.5200597Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:59:03.5201244Z ^ 2025-05-07T19:59:03.5201613Z 2025-05-07T19:59:03.5203346Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.12/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_nobag_int8_codegen_cuda.cu(237): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:59:03.5205799Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:59:03.5206446Z ^ 2025-05-07T19:59:03.5206795Z 2025-05-07T19:59:03.5208550Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.12/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_nobag_int8_codegen_cuda.cu(247): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:59:03.5210955Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:59:03.5211634Z ^ 2025-05-07T19:59:03.5211973Z 2025-05-07T19:59:03.5213772Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:59:03.5216647Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:59:03.5217979Z ^ 2025-05-07T19:59:03.5218241Z 2025-05-07T19:59:03.5218739Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:59:03.5219479Z 2025-05-07T19:59:03.5221287Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:59:03.5224213Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:59:03.5225519Z ^ 2025-05-07T19:59:03.5225952Z 2025-05-07T19:59:03.5227683Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.12/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_nobag_int8_codegen_cuda.cu(220): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:59:03.5229903Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:59:03.5230510Z ^ 2025-05-07T19:59:03.5230790Z 2025-05-07T19:59:03.5232690Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.12/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_nobag_int8_codegen_cuda.cu(237): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:59:03.5234932Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:59:03.5235562Z ^ 2025-05-07T19:59:03.5235893Z 2025-05-07T19:59:03.5237648Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.12/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_nobag_int8_codegen_cuda.cu(247): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:59:03.5239849Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:59:03.5240512Z ^ 2025-05-07T19:59:03.5240853Z 2025-05-07T19:59:03.5242715Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:59:03.5245785Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:59:03.5247129Z ^ 2025-05-07T19:59:03.5247421Z 2025-05-07T19:59:03.5247902Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:59:03.5248656Z 2025-05-07T19:59:03.5250445Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:59:03.5253368Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:59:03.5254814Z ^ 2025-05-07T19:59:03.5255259Z 2025-05-07T19:59:03.5256993Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.12/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_nobag_int8_codegen_cuda.cu(220): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:59:03.5259397Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:59:03.5260040Z ^ 2025-05-07T19:59:03.5260377Z 2025-05-07T19:59:03.5262144Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.12/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_nobag_int8_codegen_cuda.cu(237): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:59:03.5264326Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:59:03.5265301Z ^ 2025-05-07T19:59:03.5265643Z 2025-05-07T19:59:03.5267415Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.12/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_nobag_int8_codegen_cuda.cu(247): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:59:03.5269615Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:59:03.5270270Z ^ 2025-05-07T19:59:03.5270606Z 2025-05-07T19:59:03.5272455Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:59:03.5275373Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:59:03.5276688Z ^ 2025-05-07T19:59:03.5276976Z 2025-05-07T19:59:03.5277470Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:59:03.5278235Z 2025-05-07T19:59:03.5280031Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:59:03.5282906Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:59:03.5284225Z ^ 2025-05-07T19:59:03.5284653Z 2025-05-07T19:59:03.5286383Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.12/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_nobag_int8_codegen_cuda.cu(220): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:59:03.5288585Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:59:03.5289212Z ^ 2025-05-07T19:59:03.5289546Z 2025-05-07T19:59:03.5291234Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.12/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_nobag_int8_codegen_cuda.cu(237): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:59:03.5293724Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:59:03.5294386Z ^ 2025-05-07T19:59:03.5294721Z 2025-05-07T19:59:03.5296494Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.12/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_nobag_int8_codegen_cuda.cu(247): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:59:03.5298687Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:59:03.5299345Z ^ 2025-05-07T19:59:03.5299684Z 2025-05-07T19:59:03.5301458Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:59:03.5304515Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:59:03.5305494Z ^ 2025-05-07T19:59:03.5305789Z 2025-05-07T19:59:03.5306281Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:59:03.5307028Z 2025-05-07T19:59:03.5309098Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:59:03.5312098Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:59:03.5313413Z ^ 2025-05-07T19:59:03.5313817Z 2025-05-07T19:59:03.5315481Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.12/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_nobag_int8_codegen_cuda.cu(220): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:59:03.5317705Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:59:03.5318380Z ^ 2025-05-07T19:59:03.5318714Z 2025-05-07T19:59:03.5320482Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.12/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_nobag_int8_codegen_cuda.cu(237): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:59:03.5322672Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:59:03.5323332Z ^ 2025-05-07T19:59:03.5323673Z 2025-05-07T19:59:03.5325411Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.12/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_nobag_int8_codegen_cuda.cu(247): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:59:03.5327447Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:59:03.5328115Z ^ 2025-05-07T19:59:03.5328465Z 2025-05-07T19:59:03.7564585Z [304/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_lamb_split_unweighted_nobag_kernel_warp.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_lamb_split_unweighted_nobag_kernel_warp.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.12/cmake-build/gen_embedding_backward_lamb_split_unweighted_nobag_kernel_warp.cu -o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_lamb_split_unweighted_nobag_kernel_warp.cu.o 2025-05-07T19:59:03.7590509Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:59:03.7593807Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:59:03.7595172Z ^ 2025-05-07T19:59:03.7595466Z 2025-05-07T19:59:03.7595966Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:59:03.7596727Z 2025-05-07T19:59:03.7598356Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:59:03.7601296Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:59:03.7602549Z ^ 2025-05-07T19:59:03.7602983Z 2025-05-07T19:59:03.7604759Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:59:03.7607719Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:59:03.7608845Z ^ 2025-05-07T19:59:03.7609174Z 2025-05-07T19:59:03.7609695Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:59:03.7610408Z 2025-05-07T19:59:03.7612226Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:59:03.7615123Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:59:03.7616465Z ^ 2025-05-07T19:59:03.7616872Z 2025-05-07T19:59:03.7618661Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:59:03.7621881Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:59:03.7623199Z ^ 2025-05-07T19:59:03.7623490Z 2025-05-07T19:59:03.7623977Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:59:03.7624724Z 2025-05-07T19:59:03.7626519Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:59:03.7629465Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:59:03.7631052Z ^ 2025-05-07T19:59:03.7631623Z 2025-05-07T19:59:03.7633636Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:59:03.7636542Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:59:03.7637838Z ^ 2025-05-07T19:59:03.7638126Z 2025-05-07T19:59:03.7638646Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:59:03.7639369Z 2025-05-07T19:59:03.7641169Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:59:03.7644039Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:59:03.7645375Z ^ 2025-05-07T19:59:03.7645782Z 2025-05-07T19:59:03.7647591Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:59:03.7650489Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:59:03.7651822Z ^ 2025-05-07T19:59:03.7652108Z 2025-05-07T19:59:03.7652595Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:59:03.7653331Z 2025-05-07T19:59:03.7655155Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:59:03.7658264Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:59:03.7659621Z ^ 2025-05-07T19:59:03.7660034Z 2025-05-07T19:59:04.6545519Z [305/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_lars_sgd_split_weighted_kernel_warp.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_lars_sgd_split_weighted_kernel_warp.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.12/cmake-build/gen_embedding_backward_lars_sgd_split_weighted_kernel_warp.cu -o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_lars_sgd_split_weighted_kernel_warp.cu.o 2025-05-07T19:59:04.6568038Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:59:04.6570770Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:59:04.6571865Z ^ 2025-05-07T19:59:04.6572137Z 2025-05-07T19:59:04.6572604Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:59:04.6573162Z 2025-05-07T19:59:04.6574482Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:59:04.6576913Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:59:04.6578073Z ^ 2025-05-07T19:59:04.6578428Z 2025-05-07T19:59:04.6579900Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:59:04.6582264Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:59:04.6583234Z ^ 2025-05-07T19:59:04.6583456Z 2025-05-07T19:59:04.6583863Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:59:04.6584480Z 2025-05-07T19:59:04.6586008Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:59:04.6588774Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:59:04.6589794Z ^ 2025-05-07T19:59:04.6590088Z 2025-05-07T19:59:04.6591644Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:59:04.6594103Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:59:04.6595244Z ^ 2025-05-07T19:59:04.6595530Z 2025-05-07T19:59:04.6595948Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:59:04.6596766Z 2025-05-07T19:59:04.6598236Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:59:04.6600704Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:59:04.6602046Z ^ 2025-05-07T19:59:04.6602400Z 2025-05-07T19:59:04.6603960Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:59:04.6606512Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:59:04.6607507Z ^ 2025-05-07T19:59:04.6607755Z 2025-05-07T19:59:04.6608186Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:59:04.6608805Z 2025-05-07T19:59:04.6610372Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:59:04.6613034Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:59:04.6614215Z ^ 2025-05-07T19:59:04.6614566Z 2025-05-07T19:59:04.6616153Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:59:04.6618540Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:59:04.6619674Z ^ 2025-05-07T19:59:04.6619933Z 2025-05-07T19:59:04.6620306Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:59:04.6620910Z 2025-05-07T19:59:04.6622421Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:59:04.6624852Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:59:04.6626007Z ^ 2025-05-07T19:59:04.6626338Z 2025-05-07T19:59:04.8212073Z [306/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_lamb_split_weighted_cuda.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_lamb_split_weighted_cuda.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.12/cmake-build/gen_embedding_backward_lamb_split_weighted_cuda.cu -o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_lamb_split_weighted_cuda.cu.o 2025-05-07T19:59:04.8235753Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:59:04.8238567Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:59:04.8239746Z ^ 2025-05-07T19:59:04.8240006Z 2025-05-07T19:59:04.8240470Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:59:04.8241147Z 2025-05-07T19:59:04.8242821Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:59:04.8245510Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:59:04.8246714Z ^ 2025-05-07T19:59:04.8247075Z 2025-05-07T19:59:04.8248729Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:59:04.8251384Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:59:04.8252590Z ^ 2025-05-07T19:59:04.8252859Z 2025-05-07T19:59:04.8253319Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:59:04.8254219Z 2025-05-07T19:59:04.8255921Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:59:04.8258606Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:59:04.8259832Z ^ 2025-05-07T19:59:04.8260191Z 2025-05-07T19:59:04.8261884Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:59:04.8264555Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:59:04.8266201Z ^ 2025-05-07T19:59:04.8266470Z 2025-05-07T19:59:04.8266947Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:59:04.8267624Z 2025-05-07T19:59:04.8271322Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:59:04.8274202Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:59:04.8275402Z ^ 2025-05-07T19:59:04.8275802Z 2025-05-07T19:59:04.8277463Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:59:04.8280167Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:59:04.8281365Z ^ 2025-05-07T19:59:04.8281647Z 2025-05-07T19:59:04.8282105Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:59:04.8282786Z 2025-05-07T19:59:04.8284509Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:59:04.8287186Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:59:04.8288409Z ^ 2025-05-07T19:59:04.8288779Z 2025-05-07T19:59:04.8290467Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:59:04.8293177Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:59:04.8294348Z ^ 2025-05-07T19:59:04.8294632Z 2025-05-07T19:59:04.8295093Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:59:04.8295793Z 2025-05-07T19:59:04.8297480Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:59:04.8300195Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:59:04.8301389Z ^ 2025-05-07T19:59:04.8301778Z 2025-05-07T19:59:04.9975149Z [307/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_lamb_split_unweighted_nobag_cuda.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_lamb_split_unweighted_nobag_cuda.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.12/cmake-build/gen_embedding_backward_lamb_split_unweighted_nobag_cuda.cu -o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_lamb_split_unweighted_nobag_cuda.cu.o 2025-05-07T19:59:05.0000884Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:59:05.0003907Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:59:05.0005254Z ^ 2025-05-07T19:59:05.0005597Z 2025-05-07T19:59:05.0006100Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:59:05.0006691Z 2025-05-07T19:59:05.0008526Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:59:05.0011449Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:59:05.0012780Z ^ 2025-05-07T19:59:05.0013281Z 2025-05-07T19:59:05.0015084Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:59:05.0017990Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:59:05.0019527Z ^ 2025-05-07T19:59:05.0019818Z 2025-05-07T19:59:05.0020301Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:59:05.0021056Z 2025-05-07T19:59:05.0022875Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:59:05.0025802Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:59:05.0027111Z ^ 2025-05-07T19:59:05.0027543Z 2025-05-07T19:59:05.0029311Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:59:05.0032515Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:59:05.0033814Z ^ 2025-05-07T19:59:05.0034317Z 2025-05-07T19:59:05.0034812Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:59:05.0035538Z 2025-05-07T19:59:05.0037365Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:59:05.0040264Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:59:05.0041597Z ^ 2025-05-07T19:59:05.0042036Z 2025-05-07T19:59:05.0043850Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:59:05.0046747Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:59:05.0048077Z ^ 2025-05-07T19:59:05.0048367Z 2025-05-07T19:59:05.0048888Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:59:05.0049614Z 2025-05-07T19:59:05.0051415Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:59:05.0054372Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:59:05.0055706Z ^ 2025-05-07T19:59:05.0056143Z 2025-05-07T19:59:05.0057939Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:59:05.0060884Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:59:05.0062187Z ^ 2025-05-07T19:59:05.0062495Z 2025-05-07T19:59:05.0062981Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:59:05.0063717Z 2025-05-07T19:59:05.0065832Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:59:05.0068969Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:59:05.0070297Z ^ 2025-05-07T19:59:05.0070706Z 2025-05-07T19:59:05.0542759Z [308/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_sgd_split_unweighted_kernel_cta.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_sgd_split_unweighted_kernel_cta.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.12/cmake-build/gen_embedding_backward_sgd_split_unweighted_kernel_cta.cu -o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_sgd_split_unweighted_kernel_cta.cu.o 2025-05-07T19:59:05.0568941Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:59:05.0571980Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:59:05.0573289Z ^ 2025-05-07T19:59:05.0573617Z 2025-05-07T19:59:05.0574150Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:59:05.0574880Z 2025-05-07T19:59:05.0576651Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:59:05.0579623Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:59:05.0580991Z ^ 2025-05-07T19:59:05.0581400Z 2025-05-07T19:59:05.0583422Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:59:05.0586372Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:59:05.0587700Z ^ 2025-05-07T19:59:05.0587991Z 2025-05-07T19:59:05.0588480Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:59:05.0589234Z 2025-05-07T19:59:05.0591040Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:59:05.0594306Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:59:05.0595539Z ^ 2025-05-07T19:59:05.0595938Z 2025-05-07T19:59:05.0597972Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:59:05.0600854Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:59:05.0602181Z ^ 2025-05-07T19:59:05.0602459Z 2025-05-07T19:59:05.0602953Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:59:05.0603667Z 2025-05-07T19:59:05.0605466Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:59:05.0608399Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:59:05.0609706Z ^ 2025-05-07T19:59:05.0610100Z 2025-05-07T19:59:05.0611905Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:59:05.0614809Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:59:05.0616094Z ^ 2025-05-07T19:59:05.0616398Z 2025-05-07T19:59:05.0616889Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:59:05.0617627Z 2025-05-07T19:59:05.0619465Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:59:05.0622438Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:59:05.0623794Z ^ 2025-05-07T19:59:05.0624210Z 2025-05-07T19:59:05.0626039Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:59:05.0628977Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:59:05.0630344Z ^ 2025-05-07T19:59:05.0630644Z 2025-05-07T19:59:05.0631147Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:59:05.0632233Z 2025-05-07T19:59:05.0634040Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:59:05.0636983Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:59:05.0638289Z ^ 2025-05-07T19:59:05.0638721Z 2025-05-07T19:59:08.3612415Z [309/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_lamb_split_unweighted_kernel_warp.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_lamb_split_unweighted_kernel_warp.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.12/cmake-build/gen_embedding_backward_lamb_split_unweighted_kernel_warp.cu -o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_lamb_split_unweighted_kernel_warp.cu.o 2025-05-07T19:59:08.3633849Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:59:08.3636378Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:59:08.3637593Z ^ 2025-05-07T19:59:08.3637847Z 2025-05-07T19:59:08.3638306Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:59:08.3638967Z 2025-05-07T19:59:08.3640468Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:59:08.3643147Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:59:08.3644181Z ^ 2025-05-07T19:59:08.3644587Z 2025-05-07T19:59:08.3646071Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:59:08.3648540Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:59:08.3649605Z ^ 2025-05-07T19:59:08.3649871Z 2025-05-07T19:59:08.3650292Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:59:08.3651022Z 2025-05-07T19:59:08.3652470Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:59:08.3655126Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:59:08.3656256Z ^ 2025-05-07T19:59:08.3656600Z 2025-05-07T19:59:08.3658113Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:59:08.3660488Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:59:08.3661590Z ^ 2025-05-07T19:59:08.3661825Z 2025-05-07T19:59:08.3662241Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:59:08.3662865Z 2025-05-07T19:59:08.3664399Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:59:08.3667070Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:59:08.3668054Z ^ 2025-05-07T19:59:08.3668413Z 2025-05-07T19:59:08.3669935Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:59:08.3672486Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:59:08.3673535Z ^ 2025-05-07T19:59:08.3673820Z 2025-05-07T19:59:08.3674260Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:59:08.3674898Z 2025-05-07T19:59:08.3676370Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:59:08.3678783Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:59:08.3679889Z ^ 2025-05-07T19:59:08.3680218Z 2025-05-07T19:59:08.3681693Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:59:08.3684293Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:59:08.3685258Z ^ 2025-05-07T19:59:08.3685498Z 2025-05-07T19:59:08.3685926Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:59:08.3686476Z 2025-05-07T19:59:08.3687907Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:59:08.3690336Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:59:08.3691461Z ^ 2025-05-07T19:59:08.3692046Z 2025-05-07T19:59:09.5174366Z [310/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_lars_sgd_split_weighted_kernel_cta.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_lars_sgd_split_weighted_kernel_cta.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.12/cmake-build/gen_embedding_backward_lars_sgd_split_weighted_kernel_cta.cu -o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_lars_sgd_split_weighted_kernel_cta.cu.o 2025-05-07T19:59:09.5195783Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:59:09.5198437Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:59:09.5199595Z ^ 2025-05-07T19:59:09.5199849Z 2025-05-07T19:59:09.5200267Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:59:09.5201225Z 2025-05-07T19:59:09.5202898Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:59:09.5205478Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:59:09.5206569Z ^ 2025-05-07T19:59:09.5206941Z 2025-05-07T19:59:09.5208561Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:59:09.5211298Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:59:09.5212400Z ^ 2025-05-07T19:59:09.5212613Z 2025-05-07T19:59:09.5213022Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:59:09.5213632Z 2025-05-07T19:59:09.5215516Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:59:09.5218136Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:59:09.5219274Z ^ 2025-05-07T19:59:09.5219619Z 2025-05-07T19:59:09.5221070Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:59:09.5223622Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:59:09.5224745Z ^ 2025-05-07T19:59:09.5225004Z 2025-05-07T19:59:09.5225461Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:59:09.5226096Z 2025-05-07T19:59:09.5227706Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:59:09.5230280Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:59:09.5231657Z ^ 2025-05-07T19:59:09.5232024Z 2025-05-07T19:59:09.5233560Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:59:09.5236035Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:59:09.5237207Z ^ 2025-05-07T19:59:09.5237467Z 2025-05-07T19:59:09.5237935Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:59:09.5238584Z 2025-05-07T19:59:09.5240187Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:59:09.5242782Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:59:09.5243953Z ^ 2025-05-07T19:59:09.5245670Z 2025-05-07T19:59:09.5247103Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:59:09.5249666Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:59:09.5250782Z ^ 2025-05-07T19:59:09.5251065Z 2025-05-07T19:59:09.5251475Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:59:09.5252106Z 2025-05-07T19:59:09.5253697Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:59:09.5256438Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:59:09.5257623Z ^ 2025-05-07T19:59:09.5257983Z 2025-05-07T19:59:11.8192555Z [311/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_lamb_split_unweighted_nobag_kernel_cta.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_lamb_split_unweighted_nobag_kernel_cta.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.12/cmake-build/gen_embedding_backward_lamb_split_unweighted_nobag_kernel_cta.cu -o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_lamb_split_unweighted_nobag_kernel_cta.cu.o 2025-05-07T19:59:11.8214194Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:59:11.8216881Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:59:11.8217969Z ^ 2025-05-07T19:59:11.8218196Z 2025-05-07T19:59:11.8218653Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:59:11.8219258Z 2025-05-07T19:59:11.8220791Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:59:11.8223400Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:59:11.8224538Z ^ 2025-05-07T19:59:11.8225157Z 2025-05-07T19:59:11.8226607Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:59:11.8229232Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:59:11.8230324Z ^ 2025-05-07T19:59:11.8230571Z 2025-05-07T19:59:11.8230990Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:59:11.8231785Z 2025-05-07T19:59:11.8240683Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:59:11.8243396Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:59:11.8244532Z ^ 2025-05-07T19:59:11.8244932Z 2025-05-07T19:59:11.8246535Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:59:11.8249163Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:59:11.8250320Z ^ 2025-05-07T19:59:11.8250598Z 2025-05-07T19:59:11.8251007Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:59:11.8251634Z 2025-05-07T19:59:11.8253241Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:59:11.8255778Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:59:11.8256931Z ^ 2025-05-07T19:59:11.8257294Z 2025-05-07T19:59:11.8258831Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:59:11.8261326Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:59:11.8262453Z ^ 2025-05-07T19:59:11.8262715Z 2025-05-07T19:59:11.8263166Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:59:11.8263817Z 2025-05-07T19:59:11.8265695Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:59:11.8268544Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:59:11.8269682Z ^ 2025-05-07T19:59:11.8270024Z 2025-05-07T19:59:11.8271775Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:59:11.8274294Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:59:11.8275480Z ^ 2025-05-07T19:59:11.8275732Z 2025-05-07T19:59:11.8276418Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:59:11.8277074Z 2025-05-07T19:59:11.8278667Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:59:11.8281505Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:59:11.8282706Z ^ 2025-05-07T19:59:11.8283068Z 2025-05-07T19:59:12.0464146Z [312/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_lamb_split_unweighted_kernel_cta.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_lamb_split_unweighted_kernel_cta.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.12/cmake-build/gen_embedding_backward_lamb_split_unweighted_kernel_cta.cu -o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_lamb_split_unweighted_kernel_cta.cu.o 2025-05-07T19:59:12.0487127Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:59:12.0489753Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:59:12.0490921Z ^ 2025-05-07T19:59:12.0491187Z 2025-05-07T19:59:12.0491615Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:59:12.0492240Z 2025-05-07T19:59:12.0493840Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:59:12.0496754Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:59:12.0497928Z ^ 2025-05-07T19:59:12.0498327Z 2025-05-07T19:59:12.0499930Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:59:12.0502107Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:59:12.0503177Z ^ 2025-05-07T19:59:12.0503415Z 2025-05-07T19:59:12.0504057Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:59:12.0504646Z 2025-05-07T19:59:12.0506157Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:59:12.0508466Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:59:12.0509598Z ^ 2025-05-07T19:59:12.0509931Z 2025-05-07T19:59:12.0511161Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:59:12.0513287Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:59:12.0514323Z ^ 2025-05-07T19:59:12.0514605Z 2025-05-07T19:59:12.0515030Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:59:12.0515653Z 2025-05-07T19:59:12.0517127Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:59:12.0519329Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:59:12.0520204Z ^ 2025-05-07T19:59:12.0520533Z 2025-05-07T19:59:12.0522060Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:59:12.0524111Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:59:12.0525137Z ^ 2025-05-07T19:59:12.0525611Z 2025-05-07T19:59:12.0526017Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:59:12.0526621Z 2025-05-07T19:59:12.0528188Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:59:12.0530567Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:59:12.0531673Z ^ 2025-05-07T19:59:12.0532046Z 2025-05-07T19:59:12.0533180Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:59:12.0535767Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:59:12.0536853Z ^ 2025-05-07T19:59:12.0537119Z 2025-05-07T19:59:12.0537525Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:59:12.0538139Z 2025-05-07T19:59:12.0539622Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:59:12.0542007Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:59:12.0542949Z ^ 2025-05-07T19:59:12.0543374Z 2025-05-07T19:59:13.8167880Z [313/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_lamb_split_unweighted_cuda.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_lamb_split_unweighted_cuda.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.12/cmake-build/gen_embedding_backward_lamb_split_unweighted_cuda.cu -o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_lamb_split_unweighted_cuda.cu.o 2025-05-07T19:59:13.8180445Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:59:13.8181912Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:59:13.8182580Z ^ 2025-05-07T19:59:13.8182759Z 2025-05-07T19:59:13.8183141Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:59:13.8183513Z 2025-05-07T19:59:13.8184434Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:59:13.8185942Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:59:13.8186637Z ^ 2025-05-07T19:59:13.8186850Z 2025-05-07T19:59:13.8187725Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:59:13.8189239Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:59:13.8189913Z ^ 2025-05-07T19:59:13.8190069Z 2025-05-07T19:59:13.8190328Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:59:13.8190723Z 2025-05-07T19:59:13.8191858Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:59:13.8193310Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:59:13.8193967Z ^ 2025-05-07T19:59:13.8194203Z 2025-05-07T19:59:13.8195080Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:59:13.8196514Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:59:13.8197153Z ^ 2025-05-07T19:59:13.8197328Z 2025-05-07T19:59:13.8197578Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:59:13.8197942Z 2025-05-07T19:59:13.8198838Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:59:13.8200255Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:59:13.8200937Z ^ 2025-05-07T19:59:13.8201142Z 2025-05-07T19:59:13.8202006Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:59:13.8203486Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:59:13.8204141Z ^ 2025-05-07T19:59:13.8204289Z 2025-05-07T19:59:13.8204550Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:59:13.8204937Z 2025-05-07T19:59:13.8205815Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:59:13.8207273Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:59:13.8207986Z ^ 2025-05-07T19:59:13.8208211Z 2025-05-07T19:59:13.8209081Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:59:13.8210568Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:59:13.8211215Z ^ 2025-05-07T19:59:13.8211368Z 2025-05-07T19:59:13.8211639Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:59:13.8212004Z 2025-05-07T19:59:13.8225226Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:59:13.8226892Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:59:13.8227556Z ^ 2025-05-07T19:59:13.8227780Z 2025-05-07T19:59:20.6129634Z [314/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_lars_sgd_split_weighted_cuda.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_lars_sgd_split_weighted_cuda.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.12/cmake-build/gen_embedding_backward_lars_sgd_split_weighted_cuda.cu -o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_lars_sgd_split_weighted_cuda.cu.o 2025-05-07T19:59:20.6151952Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:59:20.6154906Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:59:20.6156032Z ^ 2025-05-07T19:59:20.6156294Z 2025-05-07T19:59:20.6156714Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:59:20.6157387Z 2025-05-07T19:59:20.6158998Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:59:20.6161601Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:59:20.6162683Z ^ 2025-05-07T19:59:20.6163045Z 2025-05-07T19:59:20.6165054Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:59:20.6167773Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:59:20.6168883Z ^ 2025-05-07T19:59:20.6169124Z 2025-05-07T19:59:20.6169588Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:59:20.6170227Z 2025-05-07T19:59:20.6171823Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:59:20.6174298Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:59:20.6175463Z ^ 2025-05-07T19:59:20.6175824Z 2025-05-07T19:59:20.6177402Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:59:20.6179935Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:59:20.6181033Z ^ 2025-05-07T19:59:20.6181286Z 2025-05-07T19:59:20.6181706Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:59:20.6182340Z 2025-05-07T19:59:20.6183843Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:59:20.6186451Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:59:20.6187847Z ^ 2025-05-07T19:59:20.6188216Z 2025-05-07T19:59:20.6189750Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:59:20.6192364Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:59:20.6193463Z ^ 2025-05-07T19:59:20.6193724Z 2025-05-07T19:59:20.6194186Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:59:20.6194812Z 2025-05-07T19:59:20.6196420Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:59:20.6199212Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:59:20.6200409Z ^ 2025-05-07T19:59:20.6200766Z 2025-05-07T19:59:20.6202511Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:59:20.6205042Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:59:20.6206343Z ^ 2025-05-07T19:59:20.6206634Z 2025-05-07T19:59:20.6207267Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:59:20.6207940Z 2025-05-07T19:59:20.6209438Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:59:20.6211947Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:59:20.6213074Z ^ 2025-05-07T19:59:20.6213436Z 2025-05-07T19:59:30.6521358Z [315/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_lars_sgd_split_unweighted_nobag_cuda.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_lars_sgd_split_unweighted_nobag_cuda.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.12/cmake-build/gen_embedding_backward_lars_sgd_split_unweighted_nobag_cuda.cu -o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_lars_sgd_split_unweighted_nobag_cuda.cu.o 2025-05-07T19:59:30.6544509Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:59:30.6547572Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:59:30.6548759Z ^ 2025-05-07T19:59:30.6549017Z 2025-05-07T19:59:30.6549478Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:59:30.6550173Z 2025-05-07T19:59:30.6552246Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:59:30.6554873Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:59:30.6556007Z ^ 2025-05-07T19:59:30.6556404Z 2025-05-07T19:59:30.6557890Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:59:30.6560447Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:59:30.6561465Z ^ 2025-05-07T19:59:30.6561738Z 2025-05-07T19:59:30.6562153Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:59:30.6562774Z 2025-05-07T19:59:30.6564525Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:59:30.6567461Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:59:30.6568671Z ^ 2025-05-07T19:59:30.6569050Z 2025-05-07T19:59:30.6570722Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:59:30.6573355Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:59:30.6574527Z ^ 2025-05-07T19:59:30.6574793Z 2025-05-07T19:59:30.6575258Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:59:30.6575974Z 2025-05-07T19:59:30.6577631Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:59:30.6580550Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:59:30.6581724Z ^ 2025-05-07T19:59:30.6582246Z 2025-05-07T19:59:30.6583559Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:59:30.6585892Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:59:30.6586864Z ^ 2025-05-07T19:59:30.6587273Z 2025-05-07T19:59:30.6587725Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:59:30.6588382Z 2025-05-07T19:59:30.6590075Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:59:30.6592985Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:59:30.6594122Z ^ 2025-05-07T19:59:30.6594486Z 2025-05-07T19:59:30.6595989Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:59:30.6598512Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:59:30.6599614Z ^ 2025-05-07T19:59:30.6599893Z 2025-05-07T19:59:30.6600279Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:59:30.6600970Z 2025-05-07T19:59:30.6602619Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:59:30.6605309Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:59:30.6606497Z ^ 2025-05-07T19:59:30.6606902Z 2025-05-07T19:59:30.7439585Z [316/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_lars_sgd_split_unweighted_nobag_kernel_warp.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_lars_sgd_split_unweighted_nobag_kernel_warp.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.12/cmake-build/gen_embedding_backward_lars_sgd_split_unweighted_nobag_kernel_warp.cu -o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_lars_sgd_split_unweighted_nobag_kernel_warp.cu.o 2025-05-07T19:59:30.7463607Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:59:30.7466571Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:59:30.7467767Z ^ 2025-05-07T19:59:30.7468033Z 2025-05-07T19:59:30.7468631Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:59:30.7469321Z 2025-05-07T19:59:30.7471057Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:59:30.7473765Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:59:30.7474985Z ^ 2025-05-07T19:59:30.7475364Z 2025-05-07T19:59:30.7477047Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:59:30.7479516Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:59:30.7480667Z ^ 2025-05-07T19:59:30.7480904Z 2025-05-07T19:59:30.7481320Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:59:30.7481940Z 2025-05-07T19:59:30.7483616Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:59:30.7486267Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:59:30.7487527Z ^ 2025-05-07T19:59:30.7487895Z 2025-05-07T19:59:30.7489585Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:59:30.7492095Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:59:30.7493410Z ^ 2025-05-07T19:59:30.7493705Z 2025-05-07T19:59:30.7494166Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:59:30.7494844Z 2025-05-07T19:59:30.7496571Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:59:30.7499242Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:59:30.7500352Z ^ 2025-05-07T19:59:30.7500721Z 2025-05-07T19:59:30.7502276Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:59:30.7505122Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:59:30.7506366Z ^ 2025-05-07T19:59:30.7506639Z 2025-05-07T19:59:30.7507217Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:59:30.7507936Z 2025-05-07T19:59:30.7509644Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:59:30.7512191Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:59:30.7513384Z ^ 2025-05-07T19:59:30.7513754Z 2025-05-07T19:59:30.7515325Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:59:30.7517940Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:59:30.7519030Z ^ 2025-05-07T19:59:30.7519286Z 2025-05-07T19:59:30.7519726Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:59:30.7520365Z 2025-05-07T19:59:30.7522027Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:59:30.7524379Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:59:30.7525544Z ^ 2025-05-07T19:59:30.7525874Z 2025-05-07T19:59:32.4855751Z [317/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_lars_sgd_split_unweighted_kernel_warp.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_lars_sgd_split_unweighted_kernel_warp.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.12/cmake-build/gen_embedding_backward_lars_sgd_split_unweighted_kernel_warp.cu -o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_lars_sgd_split_unweighted_kernel_warp.cu.o 2025-05-07T19:59:32.4880935Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:59:32.4883676Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:59:32.4884897Z ^ 2025-05-07T19:59:32.4885173Z 2025-05-07T19:59:32.4885664Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:59:32.4886469Z 2025-05-07T19:59:32.4888219Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:59:32.4891210Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:59:32.4892446Z ^ 2025-05-07T19:59:32.4892831Z 2025-05-07T19:59:32.4894560Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:59:32.4897395Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:59:32.4898635Z ^ 2025-05-07T19:59:32.4898903Z 2025-05-07T19:59:32.4899374Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:59:32.4900079Z 2025-05-07T19:59:32.4901770Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:59:32.4904619Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:59:32.4905848Z ^ 2025-05-07T19:59:32.4906233Z 2025-05-07T19:59:32.4907962Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:59:32.4910963Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:59:32.4912300Z ^ 2025-05-07T19:59:32.4912527Z 2025-05-07T19:59:32.4912984Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:59:32.4913673Z 2025-05-07T19:59:32.4915407Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:59:32.4918206Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:59:32.4919623Z ^ 2025-05-07T19:59:32.4920011Z 2025-05-07T19:59:32.4921815Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:59:32.4924532Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:59:32.4925667Z ^ 2025-05-07T19:59:32.4925953Z 2025-05-07T19:59:32.4926359Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:59:32.4927097Z 2025-05-07T19:59:32.4928746Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:59:32.4931507Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:59:32.4932727Z ^ 2025-05-07T19:59:32.4933099Z 2025-05-07T19:59:32.4934759Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:59:32.4937591Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:59:32.4938819Z ^ 2025-05-07T19:59:32.4939085Z 2025-05-07T19:59:32.4939714Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:59:32.4940434Z 2025-05-07T19:59:32.4942143Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:59:32.4945006Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:59:32.4946101Z ^ 2025-05-07T19:59:32.4946511Z 2025-05-07T19:59:33.0314103Z [318/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_lars_sgd_split_unweighted_nobag_kernel_cta.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_lars_sgd_split_unweighted_nobag_kernel_cta.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.12/cmake-build/gen_embedding_backward_lars_sgd_split_unweighted_nobag_kernel_cta.cu -o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_lars_sgd_split_unweighted_nobag_kernel_cta.cu.o 2025-05-07T19:59:33.0338940Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:59:33.0341860Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:59:33.0343123Z ^ 2025-05-07T19:59:33.0343389Z 2025-05-07T19:59:33.0343871Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:59:33.0344492Z 2025-05-07T19:59:33.0346131Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:59:33.0348961Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:59:33.0350253Z ^ 2025-05-07T19:59:33.0350638Z 2025-05-07T19:59:33.0352572Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:59:33.0354891Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:59:33.0355901Z ^ 2025-05-07T19:59:33.0356130Z 2025-05-07T19:59:33.0356580Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:59:33.0357194Z 2025-05-07T19:59:33.0358874Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:59:33.0361867Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:59:33.0363089Z ^ 2025-05-07T19:59:33.0363462Z 2025-05-07T19:59:33.0365478Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:59:33.0368189Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:59:33.0369409Z ^ 2025-05-07T19:59:33.0369664Z 2025-05-07T19:59:33.0370122Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:59:33.0371047Z 2025-05-07T19:59:33.0372777Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:59:33.0375620Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:59:33.0376816Z ^ 2025-05-07T19:59:33.0377209Z 2025-05-07T19:59:33.0378897Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:59:33.0381793Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:59:33.0382961Z ^ 2025-05-07T19:59:33.0383232Z 2025-05-07T19:59:33.0383711Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:59:33.0384389Z 2025-05-07T19:59:33.0386049Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:59:33.0388843Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:59:33.0390086Z ^ 2025-05-07T19:59:33.0390435Z 2025-05-07T19:59:33.0392238Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:59:33.0394805Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:59:33.0396006Z ^ 2025-05-07T19:59:33.0396263Z 2025-05-07T19:59:33.0396710Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:59:33.0397405Z 2025-05-07T19:59:33.0399271Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:59:33.0402029Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:59:33.0403235Z ^ 2025-05-07T19:59:33.0403605Z 2025-05-07T19:59:40.3161643Z [319/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_lars_sgd_split_unweighted_kernel_cta.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_lars_sgd_split_unweighted_kernel_cta.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.12/cmake-build/gen_embedding_backward_lars_sgd_split_unweighted_kernel_cta.cu -o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_lars_sgd_split_unweighted_kernel_cta.cu.o 2025-05-07T19:59:40.3179956Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:59:40.3181810Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:59:40.3182652Z ^ 2025-05-07T19:59:40.3182844Z 2025-05-07T19:59:40.3183212Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:59:40.3183685Z 2025-05-07T19:59:40.3184802Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:59:40.3186623Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:59:40.3187459Z ^ 2025-05-07T19:59:40.3187716Z 2025-05-07T19:59:40.3188791Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:59:40.3190609Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:59:40.3191572Z ^ 2025-05-07T19:59:40.3191801Z 2025-05-07T19:59:40.3192118Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:59:40.3192818Z 2025-05-07T19:59:40.3193962Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:59:40.3195768Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:59:40.3196595Z ^ 2025-05-07T19:59:40.3196845Z 2025-05-07T19:59:40.3197987Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:59:40.3199910Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:59:40.3200746Z ^ 2025-05-07T19:59:40.3200923Z 2025-05-07T19:59:40.3201236Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:59:40.3201721Z 2025-05-07T19:59:40.3203143Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:59:40.3205053Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:59:40.3205883Z ^ 2025-05-07T19:59:40.3206162Z 2025-05-07T19:59:40.3207449Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:59:40.3209335Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:59:40.3210161Z ^ 2025-05-07T19:59:40.3210373Z 2025-05-07T19:59:40.3210698Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:59:40.3211169Z 2025-05-07T19:59:40.3212352Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:59:40.3214206Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:59:40.3215077Z ^ 2025-05-07T19:59:40.3215347Z 2025-05-07T19:59:40.3216503Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:59:40.3218391Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:59:40.3219248Z ^ 2025-05-07T19:59:40.3219436Z 2025-05-07T19:59:40.3219760Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:59:40.3220255Z 2025-05-07T19:59:40.3221395Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:59:40.3223409Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:59:40.3224230Z ^ 2025-05-07T19:59:40.3224649Z 2025-05-07T19:59:44.3989901Z [320/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_lars_sgd_split_unweighted_cuda.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_lars_sgd_split_unweighted_cuda.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.12/cmake-build/gen_embedding_backward_lars_sgd_split_unweighted_cuda.cu -o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_lars_sgd_split_unweighted_cuda.cu.o 2025-05-07T19:59:44.4014258Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:59:44.4016995Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:59:44.4018221Z ^ 2025-05-07T19:59:44.4018483Z 2025-05-07T19:59:44.4018967Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:59:44.4019652Z 2025-05-07T19:59:44.4021372Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:59:44.4024138Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:59:44.4025337Z ^ 2025-05-07T19:59:44.4025731Z 2025-05-07T19:59:44.4027410Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:59:44.4030458Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:59:44.4031771Z ^ 2025-05-07T19:59:44.4032062Z 2025-05-07T19:59:44.4032521Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:59:44.4033284Z 2025-05-07T19:59:44.4035019Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:59:44.4037732Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:59:44.4038947Z ^ 2025-05-07T19:59:44.4039437Z 2025-05-07T19:59:44.4040997Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:59:44.4043629Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:59:44.4044922Z ^ 2025-05-07T19:59:44.4045180Z 2025-05-07T19:59:44.4045648Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:59:44.4046300Z 2025-05-07T19:59:44.4047975Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:59:44.4050866Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:59:44.4052021Z ^ 2025-05-07T19:59:44.4052375Z 2025-05-07T19:59:44.4053929Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:59:44.4056649Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:59:44.4057837Z ^ 2025-05-07T19:59:44.4058125Z 2025-05-07T19:59:44.4058582Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:59:44.4059260Z 2025-05-07T19:59:44.4060968Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:59:44.4063676Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:59:44.4065199Z ^ 2025-05-07T19:59:44.4065586Z 2025-05-07T19:59:44.4067321Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:59:44.4070072Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:59:44.4071255Z ^ 2025-05-07T19:59:44.4071612Z 2025-05-07T19:59:44.4072053Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:59:44.4072759Z 2025-05-07T19:59:44.4074490Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:59:44.4077397Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:59:44.4078612Z ^ 2025-05-07T19:59:44.4079022Z 2025-05-07T19:59:52.3712658Z [321/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_inference_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_inference.dir/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_int8_codegen_cuda.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_inference.dir/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_int8_codegen_cuda.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.12/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_int8_codegen_cuda.cu -o CMakeFiles/fbgemm_gpu_tbe_inference.dir/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_int8_codegen_cuda.cu.o 2025-05-07T19:59:52.3735571Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:59:52.3738257Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:59:52.3739453Z ^ 2025-05-07T19:59:52.3739718Z 2025-05-07T19:59:52.3740154Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:59:52.3740830Z 2025-05-07T19:59:52.3742436Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:59:52.3745012Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:59:52.3746534Z ^ 2025-05-07T19:59:52.3746897Z 2025-05-07T19:59:52.3748489Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.12/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_int8_codegen_cuda.cu(254): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:59:52.3750680Z if (output_d >= 0 && output_d < D && packed_bag_store_idx < num_packed_bags) { 2025-05-07T19:59:52.3751380Z ^ 2025-05-07T19:59:52.3751843Z 2025-05-07T19:59:52.3753288Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.12/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_int8_codegen_cuda.cu(259): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:59:52.3755181Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:59:52.3755882Z ^ 2025-05-07T19:59:52.3756143Z 2025-05-07T19:59:52.3757573Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.12/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_int8_codegen_cuda.cu(276): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:59:52.3759455Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:59:52.3760125Z ^ 2025-05-07T19:59:52.3760399Z 2025-05-07T19:59:52.3761828Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.12/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_int8_codegen_cuda.cu(288): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:59:52.3763666Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:59:52.3764193Z ^ 2025-05-07T19:59:52.3764443Z 2025-05-07T19:59:52.3766486Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:59:52.3769146Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:59:52.3770302Z ^ 2025-05-07T19:59:52.3770549Z 2025-05-07T19:59:52.3770973Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:59:52.3771607Z 2025-05-07T19:59:52.3773128Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:59:52.3775641Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:59:52.3776776Z ^ 2025-05-07T19:59:52.3777166Z 2025-05-07T19:59:52.3778619Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.12/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_int8_codegen_cuda.cu(254): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:59:52.3780676Z if (output_d >= 0 && output_d < D && packed_bag_store_idx < num_packed_bags) { 2025-05-07T19:59:52.3781434Z ^ 2025-05-07T19:59:52.3781713Z 2025-05-07T19:59:52.3783196Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.12/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_int8_codegen_cuda.cu(259): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:59:52.3785179Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:59:52.3785720Z ^ 2025-05-07T19:59:52.3785997Z 2025-05-07T19:59:52.3787473Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.12/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_int8_codegen_cuda.cu(276): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:59:52.3789556Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:59:52.3790117Z ^ 2025-05-07T19:59:52.3790389Z 2025-05-07T19:59:52.3792005Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.12/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_int8_codegen_cuda.cu(288): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:59:52.3793889Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:59:52.3794446Z ^ 2025-05-07T19:59:52.3794714Z 2025-05-07T19:59:52.3796400Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:59:52.3798920Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:59:52.3800265Z ^ 2025-05-07T19:59:52.3800542Z 2025-05-07T19:59:52.3800936Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:59:52.3801552Z 2025-05-07T19:59:52.3803291Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:59:52.3805883Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:59:52.3807078Z ^ 2025-05-07T19:59:52.3807441Z 2025-05-07T19:59:52.3809069Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.12/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_int8_codegen_cuda.cu(254): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:59:52.3811135Z if (output_d >= 0 && output_d < D && packed_bag_store_idx < num_packed_bags) { 2025-05-07T19:59:52.3811892Z ^ 2025-05-07T19:59:52.3812164Z 2025-05-07T19:59:52.3813519Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.12/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_int8_codegen_cuda.cu(259): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:59:52.3815387Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:59:52.3815928Z ^ 2025-05-07T19:59:52.3816199Z 2025-05-07T19:59:52.3817677Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.12/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_int8_codegen_cuda.cu(276): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:59:52.3819524Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:59:52.3820076Z ^ 2025-05-07T19:59:52.3820365Z 2025-05-07T19:59:52.3821785Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.12/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_int8_codegen_cuda.cu(288): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:59:52.3823695Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:59:52.3824177Z ^ 2025-05-07T19:59:52.3824467Z 2025-05-07T19:59:52.3826015Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:59:52.3828689Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:59:52.3829978Z ^ 2025-05-07T19:59:52.3830245Z 2025-05-07T19:59:52.3830709Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:59:52.3831709Z 2025-05-07T19:59:52.3833253Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:59:52.3835890Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:59:52.3837080Z ^ 2025-05-07T19:59:52.3837439Z 2025-05-07T19:59:52.3838910Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.12/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_int8_codegen_cuda.cu(254): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:59:52.3841016Z if (output_d >= 0 && output_d < D && packed_bag_store_idx < num_packed_bags) { 2025-05-07T19:59:52.3841943Z ^ 2025-05-07T19:59:52.3842215Z 2025-05-07T19:59:52.3843715Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.12/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_int8_codegen_cuda.cu(259): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:59:52.3845665Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:59:52.3846332Z ^ 2025-05-07T19:59:52.3846624Z 2025-05-07T19:59:52.3848158Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.12/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_int8_codegen_cuda.cu(276): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:59:52.3850149Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:59:52.3850663Z ^ 2025-05-07T19:59:52.3850944Z 2025-05-07T19:59:52.3852562Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.12/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_int8_codegen_cuda.cu(288): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:59:52.3854442Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:59:52.3854963Z ^ 2025-05-07T19:59:52.3855222Z 2025-05-07T19:59:52.3856754Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:59:52.3859251Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:59:52.3860402Z ^ 2025-05-07T19:59:52.3860646Z 2025-05-07T19:59:52.3861082Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:59:52.3861683Z 2025-05-07T19:59:52.3863244Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:59:52.3866117Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:59:52.3867299Z ^ 2025-05-07T19:59:52.3867652Z 2025-05-07T19:59:52.3869146Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.12/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_int8_codegen_cuda.cu(254): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:59:52.3871141Z if (output_d >= 0 && output_d < D && packed_bag_store_idx < num_packed_bags) { 2025-05-07T19:59:52.3872007Z ^ 2025-05-07T19:59:52.3872290Z 2025-05-07T19:59:52.3873686Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.12/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_int8_codegen_cuda.cu(259): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:59:52.3878579Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:59:52.3879148Z ^ 2025-05-07T19:59:52.3879403Z 2025-05-07T19:59:52.3880872Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.12/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_int8_codegen_cuda.cu(276): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:59:52.3882788Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:59:52.3883345Z ^ 2025-05-07T19:59:52.3883663Z 2025-05-07T19:59:52.3885152Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.12/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_int8_codegen_cuda.cu(288): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:59:52.3887414Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:59:52.3887922Z ^ 2025-05-07T19:59:52.3888222Z 2025-05-07T20:00:00.4718259Z [322/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_inference_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_inference.dir/gen_embedding_forward_quantized_split_nbit_kernel_weighted_int8_codegen_cuda.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_inference.dir/gen_embedding_forward_quantized_split_nbit_kernel_weighted_int8_codegen_cuda.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.12/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_weighted_int8_codegen_cuda.cu -o CMakeFiles/fbgemm_gpu_tbe_inference.dir/gen_embedding_forward_quantized_split_nbit_kernel_weighted_int8_codegen_cuda.cu.o 2025-05-07T20:00:00.4739597Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:00:00.4742078Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:00:00.4743503Z ^ 2025-05-07T20:00:00.4743775Z 2025-05-07T20:00:00.4744199Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:00:00.4744815Z 2025-05-07T20:00:00.4746367Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:00:00.4748862Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:00:00.4749990Z ^ 2025-05-07T20:00:00.4750389Z 2025-05-07T20:00:00.4752089Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.12/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_weighted_int8_codegen_cuda.cu(270): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T20:00:00.4754451Z if (output_d >= 0 && output_d < D && packed_bag_store_idx < num_packed_bags) { 2025-05-07T20:00:00.4755147Z ^ 2025-05-07T20:00:00.4755453Z 2025-05-07T20:00:00.4756978Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.12/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_weighted_int8_codegen_cuda.cu(275): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T20:00:00.4758795Z if (output_d >= 0 && output_d < D) { 2025-05-07T20:00:00.4759282Z ^ 2025-05-07T20:00:00.4759546Z 2025-05-07T20:00:00.4761118Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.12/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_weighted_int8_codegen_cuda.cu(292): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T20:00:00.4762945Z if (output_d >= 0 && output_d < D) { 2025-05-07T20:00:00.4763489Z ^ 2025-05-07T20:00:00.4763760Z 2025-05-07T20:00:00.4771543Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.12/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_weighted_int8_codegen_cuda.cu(304): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T20:00:00.4773490Z if (output_d >= 0 && output_d < D) { 2025-05-07T20:00:00.4774019Z ^ 2025-05-07T20:00:00.4774283Z 2025-05-07T20:00:00.4775818Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:00:00.4778382Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:00:00.4779551Z ^ 2025-05-07T20:00:00.4779821Z 2025-05-07T20:00:00.4780232Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:00:00.4780906Z 2025-05-07T20:00:00.4782469Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:00:00.4785002Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:00:00.4786106Z ^ 2025-05-07T20:00:00.4786450Z 2025-05-07T20:00:00.4787916Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.12/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_weighted_int8_codegen_cuda.cu(270): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T20:00:00.4789940Z if (output_d >= 0 && output_d < D && packed_bag_store_idx < num_packed_bags) { 2025-05-07T20:00:00.4790658Z ^ 2025-05-07T20:00:00.4791209Z 2025-05-07T20:00:00.4793026Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.12/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_weighted_int8_codegen_cuda.cu(275): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T20:00:00.4794855Z if (output_d >= 0 && output_d < D) { 2025-05-07T20:00:00.4795352Z ^ 2025-05-07T20:00:00.4795597Z 2025-05-07T20:00:00.4797018Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.12/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_weighted_int8_codegen_cuda.cu(292): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T20:00:00.4798799Z if (output_d >= 0 && output_d < D) { 2025-05-07T20:00:00.4799342Z ^ 2025-05-07T20:00:00.4799599Z 2025-05-07T20:00:00.4800938Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.12/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_weighted_int8_codegen_cuda.cu(304): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T20:00:00.4803017Z if (output_d >= 0 && output_d < D) { 2025-05-07T20:00:00.4803563Z ^ 2025-05-07T20:00:00.4803811Z 2025-05-07T20:00:00.4805478Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:00:00.4807968Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:00:00.4809063Z ^ 2025-05-07T20:00:00.4809338Z 2025-05-07T20:00:00.4809907Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:00:00.4810551Z 2025-05-07T20:00:00.4812025Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:00:00.4814474Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:00:00.4815584Z ^ 2025-05-07T20:00:00.4815950Z 2025-05-07T20:00:00.4817415Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.12/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_weighted_int8_codegen_cuda.cu(270): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T20:00:00.4819300Z if (output_d >= 0 && output_d < D && packed_bag_store_idx < num_packed_bags) { 2025-05-07T20:00:00.4820045Z ^ 2025-05-07T20:00:00.4820336Z 2025-05-07T20:00:00.4821811Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.12/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_weighted_int8_codegen_cuda.cu(275): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T20:00:00.4823646Z if (output_d >= 0 && output_d < D) { 2025-05-07T20:00:00.4824187Z ^ 2025-05-07T20:00:00.4824479Z 2025-05-07T20:00:00.4825861Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.12/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_weighted_int8_codegen_cuda.cu(292): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T20:00:00.4827648Z if (output_d >= 0 && output_d < D) { 2025-05-07T20:00:00.4828188Z ^ 2025-05-07T20:00:00.4828436Z 2025-05-07T20:00:00.4829919Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.12/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_weighted_int8_codegen_cuda.cu(304): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T20:00:00.4832056Z if (output_d >= 0 && output_d < D) { 2025-05-07T20:00:00.4832580Z ^ 2025-05-07T20:00:00.4833035Z 2025-05-07T20:00:00.4834758Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:00:00.4837314Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:00:00.4838458Z ^ 2025-05-07T20:00:00.4838738Z 2025-05-07T20:00:00.4839164Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:00:00.4839773Z 2025-05-07T20:00:00.4841306Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:00:00.4843987Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:00:00.4845167Z ^ 2025-05-07T20:00:00.4845556Z 2025-05-07T20:00:00.4847089Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.12/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_weighted_int8_codegen_cuda.cu(270): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T20:00:00.4849173Z if (output_d >= 0 && output_d < D && packed_bag_store_idx < num_packed_bags) { 2025-05-07T20:00:00.4849947Z ^ 2025-05-07T20:00:00.4850230Z 2025-05-07T20:00:00.4851811Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.12/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_weighted_int8_codegen_cuda.cu(275): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T20:00:00.4853725Z if (output_d >= 0 && output_d < D) { 2025-05-07T20:00:00.4854412Z ^ 2025-05-07T20:00:00.4854713Z 2025-05-07T20:00:00.4856125Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.12/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_weighted_int8_codegen_cuda.cu(292): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T20:00:00.4857893Z if (output_d >= 0 && output_d < D) { 2025-05-07T20:00:00.4858437Z ^ 2025-05-07T20:00:00.4858744Z 2025-05-07T20:00:00.4860133Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.12/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_weighted_int8_codegen_cuda.cu(304): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T20:00:00.4861939Z if (output_d >= 0 && output_d < D) { 2025-05-07T20:00:00.4862456Z ^ 2025-05-07T20:00:00.4862725Z 2025-05-07T20:00:00.4864220Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:00:00.4867074Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:00:00.4868165Z ^ 2025-05-07T20:00:00.4868414Z 2025-05-07T20:00:00.4868848Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:00:00.4869391Z 2025-05-07T20:00:00.4870906Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:00:00.4873523Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:00:00.4874669Z ^ 2025-05-07T20:00:00.4874992Z 2025-05-07T20:00:00.4876368Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.12/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_weighted_int8_codegen_cuda.cu(270): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T20:00:00.4878755Z if (output_d >= 0 && output_d < D && packed_bag_store_idx < num_packed_bags) { 2025-05-07T20:00:00.4879453Z ^ 2025-05-07T20:00:00.4879750Z 2025-05-07T20:00:00.4881120Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.12/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_weighted_int8_codegen_cuda.cu(275): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T20:00:00.4882955Z if (output_d >= 0 && output_d < D) { 2025-05-07T20:00:00.4883464Z ^ 2025-05-07T20:00:00.4883776Z 2025-05-07T20:00:00.4885149Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.12/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_weighted_int8_codegen_cuda.cu(292): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T20:00:00.4887204Z if (output_d >= 0 && output_d < D) { 2025-05-07T20:00:00.4887726Z ^ 2025-05-07T20:00:00.4887980Z 2025-05-07T20:00:00.4889567Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.12/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_weighted_int8_codegen_cuda.cu(304): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T20:00:00.4891361Z if (output_d >= 0 && output_d < D) { 2025-05-07T20:00:00.4891873Z ^ 2025-05-07T20:00:00.4892138Z 2025-05-07T20:00:01.5795842Z [323/608] : && /github/home/miniconda/envs/build_binary/bin/c++ -fPIC -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -L/github/home/miniconda/envs/build_binary/targets/x86_64-linux/lib -L/github/home/miniconda/envs/build_binary/targets/x86_64-linux/lib/stubs -s -shared -Wl,-soname,fbgemm_gpu_tbe_inference.so -o fbgemm_gpu_tbe_inference.so CMakeFiles/fbgemm_gpu_tbe_inference.dir/codegen/inference/embedding_forward_quantized_host_cpu.cpp.o CMakeFiles/fbgemm_gpu_tbe_inference.dir/gen_embedding_forward_quantized_unweighted_codegen_cpu.cpp.o CMakeFiles/fbgemm_gpu_tbe_inference.dir/gen_embedding_forward_quantized_weighted_codegen_cpu.cpp.o CMakeFiles/fbgemm_gpu_tbe_inference.dir/codegen/inference/embedding_forward_quantized_host.cpp.o CMakeFiles/fbgemm_gpu_tbe_inference.dir/codegen/inference/embedding_forward_quantized_split_lookup.cu.o CMakeFiles/fbgemm_gpu_tbe_inference.dir/gen_embedding_forward_quantized_split_nbit_kernel_weighted_fp32_codegen_cuda.cu.o CMakeFiles/fbgemm_gpu_tbe_inference.dir/gen_embedding_forward_quantized_split_nbit_kernel_weighted_fp16_codegen_cuda.cu.o CMakeFiles/fbgemm_gpu_tbe_inference.dir/gen_embedding_forward_quantized_split_nbit_kernel_weighted_fp8_codegen_cuda.cu.o CMakeFiles/fbgemm_gpu_tbe_inference.dir/gen_embedding_forward_quantized_split_nbit_kernel_weighted_int8_codegen_cuda.cu.o CMakeFiles/fbgemm_gpu_tbe_inference.dir/gen_embedding_forward_quantized_split_nbit_kernel_weighted_int4_codegen_cuda.cu.o CMakeFiles/fbgemm_gpu_tbe_inference.dir/gen_embedding_forward_quantized_split_nbit_kernel_weighted_int2_codegen_cuda.cu.o CMakeFiles/fbgemm_gpu_tbe_inference.dir/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_nobag_fp32_codegen_cuda.cu.o CMakeFiles/fbgemm_gpu_tbe_inference.dir/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_nobag_fp16_codegen_cuda.cu.o CMakeFiles/fbgemm_gpu_tbe_inference.dir/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_nobag_fp8_codegen_cuda.cu.o CMakeFiles/fbgemm_gpu_tbe_inference.dir/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_nobag_int8_codegen_cuda.cu.o CMakeFiles/fbgemm_gpu_tbe_inference.dir/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_nobag_int4_codegen_cuda.cu.o CMakeFiles/fbgemm_gpu_tbe_inference.dir/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_nobag_int2_codegen_cuda.cu.o CMakeFiles/fbgemm_gpu_tbe_inference.dir/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_fp32_codegen_cuda.cu.o CMakeFiles/fbgemm_gpu_tbe_inference.dir/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_fp16_codegen_cuda.cu.o CMakeFiles/fbgemm_gpu_tbe_inference.dir/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_fp8_codegen_cuda.cu.o CMakeFiles/fbgemm_gpu_tbe_inference.dir/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_int8_codegen_cuda.cu.o CMakeFiles/fbgemm_gpu_tbe_inference.dir/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_int4_codegen_cuda.cu.o CMakeFiles/fbgemm_gpu_tbe_inference.dir/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_int2_codegen_cuda.cu.o CMakeFiles/fbgemm_gpu_tbe_inference.dir/gen_embedding_forward_quantized_split_nbit_host_weighted_codegen_cuda.cu.o CMakeFiles/fbgemm_gpu_tbe_inference.dir/gen_embedding_forward_quantized_split_nbit_host_unweighted_nobag_codegen_cuda.cu.o CMakeFiles/fbgemm_gpu_tbe_inference.dir/gen_embedding_forward_quantized_split_nbit_host_unweighted_codegen_cuda.cu.o -L/lib/intel64 -L/lib/intel64_win -L/lib/win-x64 -Wl,-rpath,"\$ORIGIN" /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/lib/libc10.so /github/home/miniconda/envs/build_binary/targets/x86_64-linux/lib/libnvrtc.so /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/lib/libc10_cuda.so /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/nvidia/nccl/lib/libnccl.so.2 /github/home/miniconda/envs/build_binary/targets/x86_64-linux/lib/stubs/libcuda.so fbgemm.so fbgemm_gpu_tbe_cache.so fbgemm_gpu_config.so /github/home/miniconda/envs/build_binary/lib/stubs/libnvidia-ml.so -Wl,--no-as-needed,"/github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/lib/libtorch.so" -Wl,--as-needed asmjit.so /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/lib/libtorch.so -Wl,--no-as-needed,"/github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/lib/libtorch_cpu.so" -Wl,--as-needed -Wl,--no-as-needed,"/github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/lib/libtorch_cuda.so" -Wl,--as-needed /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/lib/libc10_cuda.so /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/lib/libc10.so /github/home/miniconda/envs/build_binary/targets/x86_64-linux/lib/libcudart.so -lcudadevrt -lcudart_static -ldl && : 2025-05-07T20:00:10.2489690Z [324/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_forward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_ssd_unweighted_kernel.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_ssd_unweighted_kernel.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.12/cmake-build/gen_embedding_forward_ssd_unweighted_kernel.cu -o CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_ssd_unweighted_kernel.cu.o 2025-05-07T20:00:10.2512691Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:00:10.2515302Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:00:10.2516452Z ^ 2025-05-07T20:00:10.2516709Z 2025-05-07T20:00:10.2517156Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:00:10.2517829Z 2025-05-07T20:00:10.2519510Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:00:10.2522149Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:00:10.2523297Z ^ 2025-05-07T20:00:10.2523674Z 2025-05-07T20:00:10.2525239Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:00:10.2527830Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:00:10.2528923Z ^ 2025-05-07T20:00:10.2529174Z 2025-05-07T20:00:10.2529622Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:00:10.2530264Z 2025-05-07T20:00:10.2531867Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:00:10.2534499Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:00:10.2535625Z ^ 2025-05-07T20:00:10.2535973Z 2025-05-07T20:00:10.2537537Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:00:10.2540001Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:00:10.2541120Z ^ 2025-05-07T20:00:10.2541371Z 2025-05-07T20:00:10.2541800Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:00:10.2542638Z 2025-05-07T20:00:10.2544226Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:00:10.2546796Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:00:10.2547923Z ^ 2025-05-07T20:00:10.2548283Z 2025-05-07T20:00:10.2549925Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:00:10.2552973Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:00:10.2554154Z ^ 2025-05-07T20:00:10.2554415Z 2025-05-07T20:00:10.2554909Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:00:10.2555541Z 2025-05-07T20:00:10.2557290Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:00:10.2559971Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:00:10.2561160Z ^ 2025-05-07T20:00:10.2561537Z 2025-05-07T20:00:10.2563201Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:00:10.2566107Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:00:10.2567273Z ^ 2025-05-07T20:00:10.2567537Z 2025-05-07T20:00:10.2567992Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:00:10.2568644Z 2025-05-07T20:00:10.2570259Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:00:10.2572853Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:00:10.2574048Z ^ 2025-05-07T20:00:10.2574419Z 2025-05-07T20:00:21.1450430Z [325/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_forward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_ssd_weighted_kernel.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_ssd_weighted_kernel.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.12/cmake-build/gen_embedding_forward_ssd_weighted_kernel.cu -o CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_ssd_weighted_kernel.cu.o 2025-05-07T20:00:21.1471761Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:00:21.1474497Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:00:21.1475967Z ^ 2025-05-07T20:00:21.1476285Z 2025-05-07T20:00:21.1476712Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:00:21.1477371Z 2025-05-07T20:00:21.1478984Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:00:21.1481585Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:00:21.1482753Z ^ 2025-05-07T20:00:21.1483095Z 2025-05-07T20:00:21.1484683Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:00:21.1487222Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:00:21.1488402Z ^ 2025-05-07T20:00:21.1488676Z 2025-05-07T20:00:21.1489139Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:00:21.1489779Z 2025-05-07T20:00:21.1491278Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:00:21.1493861Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:00:21.1495008Z ^ 2025-05-07T20:00:21.1495389Z 2025-05-07T20:00:21.1496979Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:00:21.1499596Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:00:21.1501031Z ^ 2025-05-07T20:00:21.1501286Z 2025-05-07T20:00:21.1501755Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:00:21.1502398Z 2025-05-07T20:00:21.1503792Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:00:21.1506008Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:00:21.1507076Z ^ 2025-05-07T20:00:21.1507408Z 2025-05-07T20:00:21.1508954Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:00:21.1511987Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:00:21.1513141Z ^ 2025-05-07T20:00:21.1513587Z 2025-05-07T20:00:21.1514042Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:00:21.1514627Z 2025-05-07T20:00:21.1516190Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:00:21.1518898Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:00:21.1520075Z ^ 2025-05-07T20:00:21.1520458Z 2025-05-07T20:00:21.1522090Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:00:21.1524608Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:00:21.1525998Z ^ 2025-05-07T20:00:21.1526241Z 2025-05-07T20:00:21.1526690Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:00:21.1527351Z 2025-05-07T20:00:21.1528895Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:00:21.1531320Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:00:21.1532434Z ^ 2025-05-07T20:00:21.1532807Z 2025-05-07T20:00:28.2873040Z [326/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_partial_rowwise_lamb_split_weighted_cuda.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_partial_rowwise_lamb_split_weighted_cuda.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.12/cmake-build/gen_embedding_backward_partial_rowwise_lamb_split_weighted_cuda.cu -o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_partial_rowwise_lamb_split_weighted_cuda.cu.o 2025-05-07T20:00:28.2890111Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:00:28.2892050Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:00:28.2892925Z ^ 2025-05-07T20:00:28.2893120Z 2025-05-07T20:00:28.2893478Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:00:28.2893982Z 2025-05-07T20:00:28.2895139Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:00:28.2897047Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:00:28.2897929Z ^ 2025-05-07T20:00:28.2898229Z 2025-05-07T20:00:28.2899368Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:00:28.2901274Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:00:28.2902122Z ^ 2025-05-07T20:00:28.2902366Z 2025-05-07T20:00:28.2902692Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:00:28.2903166Z 2025-05-07T20:00:28.2904467Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:00:28.2906351Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:00:28.2907241Z ^ 2025-05-07T20:00:28.2907640Z 2025-05-07T20:00:28.2908833Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:00:28.2910695Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:00:28.2911732Z ^ 2025-05-07T20:00:28.2911921Z 2025-05-07T20:00:28.2912244Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:00:28.2912739Z 2025-05-07T20:00:28.2913931Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:00:28.2915937Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:00:28.2916785Z ^ 2025-05-07T20:00:28.2917080Z 2025-05-07T20:00:28.2918331Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:00:28.2920227Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:00:28.2921036Z ^ 2025-05-07T20:00:28.2921247Z 2025-05-07T20:00:28.2921568Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:00:28.2922056Z 2025-05-07T20:00:28.2923306Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:00:28.2925215Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:00:28.2926108Z ^ 2025-05-07T20:00:28.2926373Z 2025-05-07T20:00:28.2927687Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:00:28.2929566Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:00:28.2930411Z ^ 2025-05-07T20:00:28.2930614Z 2025-05-07T20:00:28.2930945Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:00:28.2931440Z 2025-05-07T20:00:28.2932582Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:00:28.2934506Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:00:28.2935360Z ^ 2025-05-07T20:00:28.2935645Z 2025-05-07T20:00:28.7488012Z [327/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_partial_rowwise_lamb_split_unweighted_nobag_cuda.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_partial_rowwise_lamb_split_unweighted_nobag_cuda.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.12/cmake-build/gen_embedding_backward_partial_rowwise_lamb_split_unweighted_nobag_cuda.cu -o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_partial_rowwise_lamb_split_unweighted_nobag_cuda.cu.o 2025-05-07T20:00:28.7509606Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:00:28.7512459Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:00:28.7513594Z ^ 2025-05-07T20:00:28.7513840Z 2025-05-07T20:00:28.7514262Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:00:28.7514870Z 2025-05-07T20:00:28.7516398Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:00:28.7518883Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:00:28.7519984Z ^ 2025-05-07T20:00:28.7520313Z 2025-05-07T20:00:28.7521806Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:00:28.7524265Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:00:28.7525351Z ^ 2025-05-07T20:00:28.7525584Z 2025-05-07T20:00:28.7526044Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:00:28.7526630Z 2025-05-07T20:00:28.7528160Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:00:28.7530903Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:00:28.7532073Z ^ 2025-05-07T20:00:28.7532426Z 2025-05-07T20:00:28.7533968Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:00:28.7536502Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:00:28.7537625Z ^ 2025-05-07T20:00:28.7538073Z 2025-05-07T20:00:28.7538467Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:00:28.7539083Z 2025-05-07T20:00:28.7540584Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:00:28.7543098Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:00:28.7544241Z ^ 2025-05-07T20:00:28.7544567Z 2025-05-07T20:00:28.7546035Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:00:28.7548536Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:00:28.7549610Z ^ 2025-05-07T20:00:28.7549847Z 2025-05-07T20:00:28.7550235Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:00:28.7550853Z 2025-05-07T20:00:28.7552481Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:00:28.7554906Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:00:28.7555988Z ^ 2025-05-07T20:00:28.7556334Z 2025-05-07T20:00:28.7557798Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:00:28.7560274Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:00:28.7561343Z ^ 2025-05-07T20:00:28.7561591Z 2025-05-07T20:00:28.7561986Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:00:28.7562549Z 2025-05-07T20:00:28.7564094Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:00:28.7572870Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:00:28.7573934Z ^ 2025-05-07T20:00:28.7574266Z 2025-05-07T20:00:32.9207625Z [328/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_none_split_unweighted_nobag_kernel_warp.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_none_split_unweighted_nobag_kernel_warp.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.12/cmake-build/gen_embedding_backward_none_split_unweighted_nobag_kernel_warp.cu -o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_none_split_unweighted_nobag_kernel_warp.cu.o 2025-05-07T20:00:32.9229697Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:00:32.9232571Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:00:32.9233653Z ^ 2025-05-07T20:00:32.9233918Z 2025-05-07T20:00:32.9234352Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:00:32.9234972Z 2025-05-07T20:00:32.9236469Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:00:32.9238813Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:00:32.9239837Z ^ 2025-05-07T20:00:32.9240200Z 2025-05-07T20:00:32.9241852Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:00:32.9244255Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:00:32.9245667Z ^ 2025-05-07T20:00:32.9245944Z 2025-05-07T20:00:32.9246325Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:00:32.9246863Z 2025-05-07T20:00:32.9248301Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:00:32.9250796Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:00:32.9251958Z ^ 2025-05-07T20:00:32.9252321Z 2025-05-07T20:00:32.9254033Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:00:32.9256897Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:00:32.9258111Z ^ 2025-05-07T20:00:32.9258363Z 2025-05-07T20:00:32.9258814Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:00:32.9259531Z 2025-05-07T20:00:32.9261142Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:00:32.9263641Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:00:32.9265189Z ^ 2025-05-07T20:00:32.9265562Z 2025-05-07T20:00:32.9266915Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:00:32.9269102Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:00:32.9269966Z ^ 2025-05-07T20:00:32.9270212Z 2025-05-07T20:00:32.9270615Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:00:32.9271183Z 2025-05-07T20:00:32.9272828Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:00:32.9275173Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:00:32.9276306Z ^ 2025-05-07T20:00:32.9276643Z 2025-05-07T20:00:32.9278191Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:00:32.9280712Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:00:32.9281788Z ^ 2025-05-07T20:00:32.9282028Z 2025-05-07T20:00:32.9282457Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:00:32.9283100Z 2025-05-07T20:00:32.9284561Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:00:32.9287115Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:00:32.9288538Z ^ 2025-05-07T20:00:32.9288916Z 2025-05-07T20:00:33.3236522Z [329/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_none_split_weighted_kernel_warp.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_none_split_weighted_kernel_warp.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.12/cmake-build/gen_embedding_backward_none_split_weighted_kernel_warp.cu -o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_none_split_weighted_kernel_warp.cu.o 2025-05-07T20:00:33.3259413Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:00:33.3262148Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:00:33.3263370Z ^ 2025-05-07T20:00:33.3263627Z 2025-05-07T20:00:33.3264076Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:00:33.3265063Z 2025-05-07T20:00:33.3266683Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:00:33.3269338Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:00:33.3270508Z ^ 2025-05-07T20:00:33.3270913Z 2025-05-07T20:00:33.3272634Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:00:33.3275475Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:00:33.3276477Z ^ 2025-05-07T20:00:33.3276704Z 2025-05-07T20:00:33.3277135Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:00:33.3277714Z 2025-05-07T20:00:33.3279147Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:00:33.3281612Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:00:33.3282981Z ^ 2025-05-07T20:00:33.3283341Z 2025-05-07T20:00:33.3284855Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:00:33.3287777Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:00:33.3288849Z ^ 2025-05-07T20:00:33.3289234Z 2025-05-07T20:00:33.3289638Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:00:33.3290144Z 2025-05-07T20:00:33.3291894Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:00:33.3294385Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:00:33.3295485Z ^ 2025-05-07T20:00:33.3295846Z 2025-05-07T20:00:33.3297469Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:00:33.3300053Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:00:33.3301245Z ^ 2025-05-07T20:00:33.3301500Z 2025-05-07T20:00:33.3301981Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:00:33.3302622Z 2025-05-07T20:00:33.3304268Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:00:33.3306935Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:00:33.3308110Z ^ 2025-05-07T20:00:33.3308453Z 2025-05-07T20:00:33.3310102Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:00:33.3312930Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:00:33.3314396Z ^ 2025-05-07T20:00:33.3314687Z 2025-05-07T20:00:33.3315110Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:00:33.3315711Z 2025-05-07T20:00:33.3317116Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:00:33.3319827Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:00:33.3321065Z ^ 2025-05-07T20:00:33.3321397Z 2025-05-07T20:00:35.9668886Z [330/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_partial_rowwise_lamb_split_unweighted_cuda.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_partial_rowwise_lamb_split_unweighted_cuda.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.12/cmake-build/gen_embedding_backward_partial_rowwise_lamb_split_unweighted_cuda.cu -o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_partial_rowwise_lamb_split_unweighted_cuda.cu.o 2025-05-07T20:00:35.9691330Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:00:35.9694102Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:00:35.9695240Z ^ 2025-05-07T20:00:35.9695503Z 2025-05-07T20:00:35.9695964Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:00:35.9696644Z 2025-05-07T20:00:35.9698204Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:00:35.9700951Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:00:35.9702426Z ^ 2025-05-07T20:00:35.9702806Z 2025-05-07T20:00:35.9704567Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:00:35.9707278Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:00:35.9708652Z ^ 2025-05-07T20:00:35.9708917Z 2025-05-07T20:00:35.9709453Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:00:35.9710163Z 2025-05-07T20:00:35.9711948Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:00:35.9714998Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:00:35.9716464Z ^ 2025-05-07T20:00:35.9716836Z 2025-05-07T20:00:35.9718552Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:00:35.9721424Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:00:35.9722634Z ^ 2025-05-07T20:00:35.9722890Z 2025-05-07T20:00:35.9723339Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:00:35.9724010Z 2025-05-07T20:00:35.9725668Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:00:35.9727980Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:00:35.9729043Z ^ 2025-05-07T20:00:35.9729374Z 2025-05-07T20:00:35.9730917Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:00:35.9733365Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:00:35.9734471Z ^ 2025-05-07T20:00:35.9734746Z 2025-05-07T20:00:35.9735205Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:00:35.9735808Z 2025-05-07T20:00:35.9737361Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:00:35.9739832Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:00:35.9741018Z ^ 2025-05-07T20:00:35.9741371Z 2025-05-07T20:00:35.9742974Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:00:35.9745277Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:00:35.9746535Z ^ 2025-05-07T20:00:35.9746778Z 2025-05-07T20:00:35.9747196Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:00:35.9748026Z 2025-05-07T20:00:35.9749579Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:00:35.9752483Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:00:35.9753674Z ^ 2025-05-07T20:00:35.9754071Z 2025-05-07T20:00:36.5838368Z [331/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_none_split_unweighted_kernel_warp.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_none_split_unweighted_kernel_warp.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.12/cmake-build/gen_embedding_backward_none_split_unweighted_kernel_warp.cu -o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_none_split_unweighted_kernel_warp.cu.o 2025-05-07T20:00:36.5858318Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:00:36.5860679Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:00:36.5861698Z ^ 2025-05-07T20:00:36.5861936Z 2025-05-07T20:00:36.5862318Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:00:36.5862908Z 2025-05-07T20:00:36.5864459Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:00:36.5866920Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:00:36.5867889Z ^ 2025-05-07T20:00:36.5868215Z 2025-05-07T20:00:36.5869661Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:00:36.5871923Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:00:36.5872999Z ^ 2025-05-07T20:00:36.5873242Z 2025-05-07T20:00:36.5873709Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:00:36.5874265Z 2025-05-07T20:00:36.5875810Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:00:36.5877951Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:00:36.5878945Z ^ 2025-05-07T20:00:36.5879430Z 2025-05-07T20:00:36.5881020Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:00:36.5883362Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:00:36.5884422Z ^ 2025-05-07T20:00:36.5884663Z 2025-05-07T20:00:36.5885096Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:00:36.5885676Z 2025-05-07T20:00:36.5887011Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:00:36.5889109Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:00:36.5890101Z ^ 2025-05-07T20:00:36.5890457Z 2025-05-07T20:00:36.5892048Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:00:36.5894449Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:00:36.5895460Z ^ 2025-05-07T20:00:36.5895658Z 2025-05-07T20:00:36.5896051Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:00:36.5896660Z 2025-05-07T20:00:36.5897937Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:00:36.5900020Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:00:36.5900941Z ^ 2025-05-07T20:00:36.5901281Z 2025-05-07T20:00:36.5902745Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:00:36.5905202Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:00:36.5906115Z ^ 2025-05-07T20:00:36.5906356Z 2025-05-07T20:00:36.5906728Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:00:36.5907239Z 2025-05-07T20:00:36.5908565Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:00:36.5910768Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:00:36.5911919Z ^ 2025-05-07T20:00:36.5912259Z 2025-05-07T20:00:43.9680500Z [332/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_none_split_unweighted_nobag_cuda.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_none_split_unweighted_nobag_cuda.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.12/cmake-build/gen_embedding_backward_none_split_unweighted_nobag_cuda.cu -o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_none_split_unweighted_nobag_cuda.cu.o 2025-05-07T20:00:43.9704182Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:00:43.9706942Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:00:43.9708367Z ^ 2025-05-07T20:00:43.9708625Z 2025-05-07T20:00:43.9709057Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:00:43.9709760Z 2025-05-07T20:00:43.9711513Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:00:43.9714171Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:00:43.9715417Z ^ 2025-05-07T20:00:43.9740027Z 2025-05-07T20:00:43.9741834Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:00:43.9744990Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:00:43.9746218Z ^ 2025-05-07T20:00:43.9746488Z 2025-05-07T20:00:43.9747072Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:00:43.9747760Z 2025-05-07T20:00:43.9749384Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:00:43.9752632Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:00:43.9753896Z ^ 2025-05-07T20:00:43.9754296Z 2025-05-07T20:00:43.9756032Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:00:43.9759077Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:00:43.9760204Z ^ 2025-05-07T20:00:43.9760464Z 2025-05-07T20:00:43.9760923Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:00:43.9761609Z 2025-05-07T20:00:43.9763307Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:00:43.9766217Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:00:43.9767414Z ^ 2025-05-07T20:00:43.9767820Z 2025-05-07T20:00:43.9769445Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:00:43.9772224Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:00:43.9773438Z ^ 2025-05-07T20:00:43.9773703Z 2025-05-07T20:00:43.9774097Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:00:43.9774732Z 2025-05-07T20:00:43.9776436Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:00:43.9779371Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:00:43.9780607Z ^ 2025-05-07T20:00:43.9780978Z 2025-05-07T20:00:43.9782666Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:00:43.9785345Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:00:43.9786581Z ^ 2025-05-07T20:00:43.9786848Z 2025-05-07T20:00:43.9787305Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:00:43.9788200Z 2025-05-07T20:00:43.9789877Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:00:43.9792729Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:00:43.9794059Z ^ 2025-05-07T20:00:43.9794449Z 2025-05-07T20:00:47.4696280Z [333/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_none_split_weighted_cuda.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_none_split_weighted_cuda.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.12/cmake-build/gen_embedding_backward_none_split_weighted_cuda.cu -o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_none_split_weighted_cuda.cu.o 2025-05-07T20:00:47.4719789Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:00:47.4722923Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:00:47.4724096Z ^ 2025-05-07T20:00:47.4724377Z 2025-05-07T20:00:47.4724811Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:00:47.4725477Z 2025-05-07T20:00:47.4727261Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:00:47.4730062Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:00:47.4731504Z ^ 2025-05-07T20:00:47.4731869Z 2025-05-07T20:00:47.4733801Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:00:47.4736372Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:00:47.4737585Z ^ 2025-05-07T20:00:47.4737828Z 2025-05-07T20:00:47.4738308Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:00:47.4738988Z 2025-05-07T20:00:47.4740858Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:00:47.4743690Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:00:47.4744843Z ^ 2025-05-07T20:00:47.4745242Z 2025-05-07T20:00:47.4746904Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:00:47.4749664Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:00:47.4750849Z ^ 2025-05-07T20:00:47.4751143Z 2025-05-07T20:00:47.4751761Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:00:47.4752453Z 2025-05-07T20:00:47.4754386Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:00:47.4757047Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:00:47.4758330Z ^ 2025-05-07T20:00:47.4758728Z 2025-05-07T20:00:47.4760494Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:00:47.4763305Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:00:47.4764542Z ^ 2025-05-07T20:00:47.4765070Z 2025-05-07T20:00:47.4765534Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:00:47.4766474Z 2025-05-07T20:00:47.4768232Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:00:47.4771072Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:00:47.4772313Z ^ 2025-05-07T20:00:47.4772722Z 2025-05-07T20:00:47.4774463Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:00:47.4777292Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:00:47.4778676Z ^ 2025-05-07T20:00:47.4778973Z 2025-05-07T20:00:47.4779426Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:00:47.4780121Z 2025-05-07T20:00:47.4781997Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:00:47.4784860Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:00:47.4786129Z ^ 2025-05-07T20:00:47.4786507Z 2025-05-07T20:00:47.9908953Z [334/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_none_split_unweighted_cuda.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_none_split_unweighted_cuda.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.12/cmake-build/gen_embedding_backward_none_split_unweighted_cuda.cu -o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_none_split_unweighted_cuda.cu.o 2025-05-07T20:00:47.9930701Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:00:47.9933437Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:00:47.9934509Z ^ 2025-05-07T20:00:47.9934774Z 2025-05-07T20:00:47.9935206Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:00:47.9935833Z 2025-05-07T20:00:47.9937787Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:00:47.9940428Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:00:47.9941740Z ^ 2025-05-07T20:00:47.9942113Z 2025-05-07T20:00:47.9943663Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:00:47.9946206Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:00:47.9947521Z ^ 2025-05-07T20:00:47.9947781Z 2025-05-07T20:00:47.9948250Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:00:47.9948923Z 2025-05-07T20:00:47.9950442Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:00:47.9953271Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:00:47.9954425Z ^ 2025-05-07T20:00:47.9954807Z 2025-05-07T20:00:47.9956362Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:00:47.9958857Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:00:47.9959973Z ^ 2025-05-07T20:00:47.9960264Z 2025-05-07T20:00:47.9960708Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:00:47.9961362Z 2025-05-07T20:00:47.9963028Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:00:47.9965867Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:00:47.9966961Z ^ 2025-05-07T20:00:47.9967299Z 2025-05-07T20:00:47.9968813Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:00:47.9971068Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:00:47.9972423Z ^ 2025-05-07T20:00:47.9972682Z 2025-05-07T20:00:47.9973123Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:00:47.9973778Z 2025-05-07T20:00:47.9975330Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:00:47.9977760Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:00:47.9978801Z ^ 2025-05-07T20:00:47.9979191Z 2025-05-07T20:00:47.9980718Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:00:47.9983475Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:00:47.9984553Z ^ 2025-05-07T20:00:47.9984801Z 2025-05-07T20:00:47.9987643Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:00:47.9988427Z 2025-05-07T20:00:47.9989970Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:00:47.9992850Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:00:47.9994031Z ^ 2025-05-07T20:00:47.9994384Z 2025-05-07T20:00:53.7329130Z [335/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_vbe_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_rowwise_adagrad_split_weighted_vbe_meta.cpp.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_rowwise_adagrad_split_weighted_vbe_meta.cpp.o.d -o CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_rowwise_adagrad_split_weighted_vbe_meta.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.12/cmake-build/gen_embedding_backward_rowwise_adagrad_split_weighted_vbe_meta.cpp 2025-05-07T20:00:53.7349484Z clang-16: warning: argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T20:00:57.1551583Z [336/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_vbe_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_rowwise_adagrad_ssd_weighted_vbe_meta.cpp.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_rowwise_adagrad_ssd_weighted_vbe_meta.cpp.o.d -o CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_rowwise_adagrad_ssd_weighted_vbe_meta.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.12/cmake-build/gen_embedding_backward_rowwise_adagrad_ssd_weighted_vbe_meta.cpp 2025-05-07T20:00:57.1572088Z clang-16: warning: argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T20:00:57.8977680Z [337/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_vbe_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_rowwise_adagrad_split_unweighted_vbe_meta.cpp.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_rowwise_adagrad_split_unweighted_vbe_meta.cpp.o.d -o CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_rowwise_adagrad_split_unweighted_vbe_meta.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.12/cmake-build/gen_embedding_backward_rowwise_adagrad_split_unweighted_vbe_meta.cpp 2025-05-07T20:00:57.8999384Z clang-16: warning: argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T20:01:00.4043059Z [338/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_none_split_weighted_kernel_cta.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_none_split_weighted_kernel_cta.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.12/cmake-build/gen_embedding_backward_none_split_weighted_kernel_cta.cu -o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_none_split_weighted_kernel_cta.cu.o 2025-05-07T20:01:00.4066385Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:01:00.4069087Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:01:00.4073888Z ^ 2025-05-07T20:01:00.4074162Z 2025-05-07T20:01:00.4074647Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:01:00.4075336Z 2025-05-07T20:01:00.4077175Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:01:00.4080013Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:01:00.4081240Z ^ 2025-05-07T20:01:00.4081632Z 2025-05-07T20:01:00.4083623Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:01:00.4086357Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:01:00.4087431Z ^ 2025-05-07T20:01:00.4087830Z 2025-05-07T20:01:00.4088274Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:01:00.4088938Z 2025-05-07T20:01:00.4090563Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:01:00.4093097Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:01:00.4094114Z ^ 2025-05-07T20:01:00.4094421Z 2025-05-07T20:01:00.4095825Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:01:00.4098145Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:01:00.4099337Z ^ 2025-05-07T20:01:00.4099600Z 2025-05-07T20:01:00.4100089Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:01:00.4100774Z 2025-05-07T20:01:00.4102479Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:01:00.4105103Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:01:00.4106217Z ^ 2025-05-07T20:01:00.4106560Z 2025-05-07T20:01:00.4108167Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:01:00.4110829Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:01:00.4112182Z ^ 2025-05-07T20:01:00.4112455Z 2025-05-07T20:01:00.4112906Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:01:00.4113591Z 2025-05-07T20:01:00.4115245Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:01:00.4118375Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:01:00.4119585Z ^ 2025-05-07T20:01:00.4119964Z 2025-05-07T20:01:00.4121635Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:01:00.4124287Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:01:00.4125440Z ^ 2025-05-07T20:01:00.4125875Z 2025-05-07T20:01:00.4126320Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:01:00.4127046Z 2025-05-07T20:01:00.4128639Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:01:00.4131525Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:01:00.4132776Z ^ 2025-05-07T20:01:00.4133144Z 2025-05-07T20:01:03.0783430Z [339/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_vbe_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_rowwise_adagrad_ssd_unweighted_vbe_meta.cpp.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_rowwise_adagrad_ssd_unweighted_vbe_meta.cpp.o.d -o CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_rowwise_adagrad_ssd_unweighted_vbe_meta.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.12/cmake-build/gen_embedding_backward_rowwise_adagrad_ssd_unweighted_vbe_meta.cpp 2025-05-07T20:01:03.0804488Z clang-16: warning: argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T20:01:04.2774145Z [340/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_none_split_unweighted_kernel_cta.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_none_split_unweighted_kernel_cta.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.12/cmake-build/gen_embedding_backward_none_split_unweighted_kernel_cta.cu -o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_none_split_unweighted_kernel_cta.cu.o 2025-05-07T20:01:04.2797800Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:01:04.2800608Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:01:04.2801844Z ^ 2025-05-07T20:01:04.2802154Z 2025-05-07T20:01:04.2802624Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:01:04.2803312Z 2025-05-07T20:01:04.2805011Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:01:04.2807770Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:01:04.2809019Z ^ 2025-05-07T20:01:04.2809387Z 2025-05-07T20:01:04.2811159Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:01:04.2813928Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:01:04.2815110Z ^ 2025-05-07T20:01:04.2815683Z 2025-05-07T20:01:04.2816133Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:01:04.2816814Z 2025-05-07T20:01:04.2818492Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:01:04.2821256Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:01:04.2822473Z ^ 2025-05-07T20:01:04.2822845Z 2025-05-07T20:01:04.2824549Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:01:04.2827414Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:01:04.2828637Z ^ 2025-05-07T20:01:04.2828902Z 2025-05-07T20:01:04.2829311Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:01:04.2829834Z 2025-05-07T20:01:04.2831756Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:01:04.2834524Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:01:04.2835781Z ^ 2025-05-07T20:01:04.2836286Z 2025-05-07T20:01:04.2838018Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:01:04.2840866Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:01:04.2842131Z ^ 2025-05-07T20:01:04.2842412Z 2025-05-07T20:01:04.2842914Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:01:04.2843576Z 2025-05-07T20:01:04.2845387Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:01:04.2848316Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:01:04.2849559Z ^ 2025-05-07T20:01:04.2849936Z 2025-05-07T20:01:04.2851666Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:01:04.2854501Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:01:04.2855646Z ^ 2025-05-07T20:01:04.2855899Z 2025-05-07T20:01:04.2856302Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:01:04.2856951Z 2025-05-07T20:01:04.2858715Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:01:04.2861349Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:01:04.2862758Z ^ 2025-05-07T20:01:04.2863141Z 2025-05-07T20:01:04.3420272Z [341/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_none_split_unweighted_nobag_kernel_cta.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_none_split_unweighted_nobag_kernel_cta.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.12/cmake-build/gen_embedding_backward_none_split_unweighted_nobag_kernel_cta.cu -o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_none_split_unweighted_nobag_kernel_cta.cu.o 2025-05-07T20:01:04.3443306Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:01:04.3445960Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:01:04.3447106Z ^ 2025-05-07T20:01:04.3447353Z 2025-05-07T20:01:04.3447790Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:01:04.3448459Z 2025-05-07T20:01:04.3450021Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:01:04.3452335Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:01:04.3453352Z ^ 2025-05-07T20:01:04.3453753Z 2025-05-07T20:01:04.3455439Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:01:04.3458239Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:01:04.3459340Z ^ 2025-05-07T20:01:04.3459629Z 2025-05-07T20:01:04.3460058Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:01:04.3460681Z 2025-05-07T20:01:04.3462323Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:01:04.3465366Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:01:04.3466783Z ^ 2025-05-07T20:01:04.3467139Z 2025-05-07T20:01:04.3468761Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:01:04.3471719Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:01:04.3472886Z ^ 2025-05-07T20:01:04.3473126Z 2025-05-07T20:01:04.3473576Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:01:04.3474263Z 2025-05-07T20:01:04.3476024Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:01:04.3478625Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:01:04.3479790Z ^ 2025-05-07T20:01:04.3480177Z 2025-05-07T20:01:04.3481663Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:01:04.3483769Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:01:04.3484783Z ^ 2025-05-07T20:01:04.3485002Z 2025-05-07T20:01:04.3485409Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:01:04.3485923Z 2025-05-07T20:01:04.3487250Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:01:04.3489510Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:01:04.3490582Z ^ 2025-05-07T20:01:04.3490925Z 2025-05-07T20:01:04.3492521Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:01:04.3494782Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:01:04.3495877Z ^ 2025-05-07T20:01:04.3496077Z 2025-05-07T20:01:04.3496437Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:01:04.3497083Z 2025-05-07T20:01:04.3498646Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:01:04.3501254Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:01:04.3502265Z ^ 2025-05-07T20:01:04.3502565Z 2025-05-07T20:01:05.6124157Z [342/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_rowwise_adagrad_with_counter_split_weighted_cuda.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_rowwise_adagrad_with_counter_split_weighted_cuda.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.12/cmake-build/gen_embedding_backward_rowwise_adagrad_with_counter_split_weighted_cuda.cu -o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_rowwise_adagrad_with_counter_split_weighted_cuda.cu.o 2025-05-07T20:01:05.6147233Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:01:05.6149961Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:01:05.6150999Z ^ 2025-05-07T20:01:05.6151264Z 2025-05-07T20:01:05.6151838Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:01:05.6152527Z 2025-05-07T20:01:05.6154402Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:01:05.6157436Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:01:05.6159026Z ^ 2025-05-07T20:01:05.6159362Z 2025-05-07T20:01:05.6161036Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:01:05.6163677Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:01:05.6165309Z ^ 2025-05-07T20:01:05.6165610Z 2025-05-07T20:01:05.6166109Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:01:05.6166790Z 2025-05-07T20:01:05.6168815Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:01:05.6171370Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:01:05.6172898Z ^ 2025-05-07T20:01:05.6173242Z 2025-05-07T20:01:05.6174752Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:01:05.6177800Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:01:05.6179221Z ^ 2025-05-07T20:01:05.6179505Z 2025-05-07T20:01:05.6179933Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:01:05.6180663Z 2025-05-07T20:01:05.6182580Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:01:05.6185517Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:01:05.6186790Z ^ 2025-05-07T20:01:05.6187220Z 2025-05-07T20:01:05.6189012Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:01:05.6192067Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:01:05.6193437Z ^ 2025-05-07T20:01:05.6193714Z 2025-05-07T20:01:05.6194214Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:01:05.6194904Z 2025-05-07T20:01:05.6196790Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:01:05.6199600Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:01:05.6200817Z ^ 2025-05-07T20:01:05.6201170Z 2025-05-07T20:01:05.6202692Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:01:05.6205139Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:01:05.6206511Z ^ 2025-05-07T20:01:05.6206776Z 2025-05-07T20:01:05.6207238Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:01:05.6207960Z 2025-05-07T20:01:05.6209623Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:01:05.6211777Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:01:05.6212771Z ^ 2025-05-07T20:01:05.6213101Z 2025-05-07T20:01:06.3692720Z [343/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_vbe_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_rowwise_adagrad_with_counter_split_weighted_vbe_meta.cpp.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_rowwise_adagrad_with_counter_split_weighted_vbe_meta.cpp.o.d -o CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_rowwise_adagrad_with_counter_split_weighted_vbe_meta.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.12/cmake-build/gen_embedding_backward_rowwise_adagrad_with_counter_split_weighted_vbe_meta.cpp 2025-05-07T20:01:06.3711817Z clang-16: warning: argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T20:01:07.1520361Z [344/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_vbe_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_rowwise_adagrad_with_counter_split_unweighted_vbe_meta.cpp.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_rowwise_adagrad_with_counter_split_unweighted_vbe_meta.cpp.o.d -o CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_rowwise_adagrad_with_counter_split_unweighted_vbe_meta.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.12/cmake-build/gen_embedding_backward_rowwise_adagrad_with_counter_split_unweighted_vbe_meta.cpp 2025-05-07T20:01:07.1536833Z clang-16: warning: argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T20:01:07.8229462Z [345/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_rowwise_adagrad_with_counter_split_unweighted_nobag_cuda.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_rowwise_adagrad_with_counter_split_unweighted_nobag_cuda.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.12/cmake-build/gen_embedding_backward_rowwise_adagrad_with_counter_split_unweighted_nobag_cuda.cu -o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_rowwise_adagrad_with_counter_split_unweighted_nobag_cuda.cu.o 2025-05-07T20:01:07.8254048Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:01:07.8256883Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:01:07.8258147Z ^ 2025-05-07T20:01:07.8258668Z 2025-05-07T20:01:07.8259072Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:01:07.8259793Z 2025-05-07T20:01:07.8261459Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:01:07.8264570Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:01:07.8266199Z ^ 2025-05-07T20:01:07.8266591Z 2025-05-07T20:01:07.8268299Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:01:07.8271452Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:01:07.8272654Z ^ 2025-05-07T20:01:07.8272918Z 2025-05-07T20:01:07.8273268Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:01:07.8273959Z 2025-05-07T20:01:07.8275746Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:01:07.8278572Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:01:07.8279819Z ^ 2025-05-07T20:01:07.8280201Z 2025-05-07T20:01:07.8281936Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:01:07.8284764Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:01:07.8286034Z ^ 2025-05-07T20:01:07.8286301Z 2025-05-07T20:01:07.8286763Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:01:07.8287477Z 2025-05-07T20:01:07.8289228Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:01:07.8292073Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:01:07.8293299Z ^ 2025-05-07T20:01:07.8293723Z 2025-05-07T20:01:07.8295457Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:01:07.8298044Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:01:07.8299211Z ^ 2025-05-07T20:01:07.8299478Z 2025-05-07T20:01:07.8299971Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:01:07.8300646Z 2025-05-07T20:01:07.8302509Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:01:07.8305306Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:01:07.8306728Z ^ 2025-05-07T20:01:07.8307092Z 2025-05-07T20:01:07.8308777Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:01:07.8311688Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:01:07.8312708Z ^ 2025-05-07T20:01:07.8312930Z 2025-05-07T20:01:07.8313309Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:01:07.8313874Z 2025-05-07T20:01:07.8315285Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:01:07.8317400Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:01:07.8318378Z ^ 2025-05-07T20:01:07.8318697Z 2025-05-07T20:01:09.3936884Z [346/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_vbe_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_sgd_split_weighted_vbe_meta.cpp.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_sgd_split_weighted_vbe_meta.cpp.o.d -o CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_sgd_split_weighted_vbe_meta.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.12/cmake-build/gen_embedding_backward_sgd_split_weighted_vbe_meta.cpp 2025-05-07T20:01:09.3958021Z clang-16: warning: argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T20:01:11.9878985Z [347/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_partial_rowwise_lamb_split_weighted_kernel_warp.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_partial_rowwise_lamb_split_weighted_kernel_warp.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.12/cmake-build/gen_embedding_backward_partial_rowwise_lamb_split_weighted_kernel_warp.cu -o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_partial_rowwise_lamb_split_weighted_kernel_warp.cu.o 2025-05-07T20:01:11.9899450Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:01:11.9901772Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:01:11.9903028Z ^ 2025-05-07T20:01:11.9903256Z 2025-05-07T20:01:11.9903646Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:01:11.9904188Z 2025-05-07T20:01:11.9905593Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:01:11.9907858Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:01:11.9909109Z ^ 2025-05-07T20:01:11.9909410Z 2025-05-07T20:01:11.9910818Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:01:11.9913472Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:01:11.9914485Z ^ 2025-05-07T20:01:11.9914711Z 2025-05-07T20:01:11.9915105Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:01:11.9915672Z 2025-05-07T20:01:11.9917321Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:01:11.9919596Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:01:11.9920834Z ^ 2025-05-07T20:01:11.9921248Z 2025-05-07T20:01:11.9922590Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:01:11.9924915Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:01:11.9926121Z ^ 2025-05-07T20:01:11.9926350Z 2025-05-07T20:01:11.9926744Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:01:11.9927394Z 2025-05-07T20:01:11.9928848Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:01:11.9931045Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:01:11.9932063Z ^ 2025-05-07T20:01:11.9932359Z 2025-05-07T20:01:11.9933718Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:01:11.9936077Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:01:11.9937159Z ^ 2025-05-07T20:01:11.9937379Z 2025-05-07T20:01:11.9937807Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:01:11.9938394Z 2025-05-07T20:01:11.9939796Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:01:11.9942468Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:01:11.9943458Z ^ 2025-05-07T20:01:11.9943800Z 2025-05-07T20:01:11.9945209Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:01:11.9947482Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:01:11.9948828Z ^ 2025-05-07T20:01:11.9949089Z 2025-05-07T20:01:11.9949457Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:01:11.9949999Z 2025-05-07T20:01:11.9951520Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:01:11.9953816Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:01:11.9954842Z ^ 2025-05-07T20:01:11.9955170Z 2025-05-07T20:01:12.0381083Z [348/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_vbe_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_sgd_split_unweighted_vbe_meta.cpp.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_sgd_split_unweighted_vbe_meta.cpp.o.d -o CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_sgd_split_unweighted_vbe_meta.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.12/cmake-build/gen_embedding_backward_sgd_split_unweighted_vbe_meta.cpp 2025-05-07T20:01:12.0398405Z clang-16: warning: argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T20:01:13.3430806Z [349/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_vbe_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_dense_split_weighted_vbe_meta.cpp.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_dense_split_weighted_vbe_meta.cpp.o.d -o CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_dense_split_weighted_vbe_meta.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.12/cmake-build/gen_embedding_backward_dense_split_weighted_vbe_meta.cpp 2025-05-07T20:01:13.3451505Z clang-16: warning: argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T20:01:13.9540060Z [350/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_vbe_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_dense_split_unweighted_vbe_meta.cpp.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_dense_split_unweighted_vbe_meta.cpp.o.d -o CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_dense_split_unweighted_vbe_meta.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.12/cmake-build/gen_embedding_backward_dense_split_unweighted_vbe_meta.cpp 2025-05-07T20:01:13.9559831Z clang-16: warning: argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T20:01:14.8054273Z [351/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_vbe_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_adam_split_weighted_vbe_meta.cpp.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_adam_split_weighted_vbe_meta.cpp.o.d -o CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_adam_split_weighted_vbe_meta.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.12/cmake-build/gen_embedding_backward_adam_split_weighted_vbe_meta.cpp 2025-05-07T20:01:14.8076242Z clang-16: warning: argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T20:01:15.4940464Z [352/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_vbe_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_adam_split_unweighted_vbe_meta.cpp.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_adam_split_unweighted_vbe_meta.cpp.o.d -o CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_adam_split_unweighted_vbe_meta.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.12/cmake-build/gen_embedding_backward_adam_split_unweighted_vbe_meta.cpp 2025-05-07T20:01:15.4956913Z clang-16: warning: argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T20:01:17.7396689Z [353/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_forward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_split_weighted_vbe_kernel.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_split_weighted_vbe_kernel.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.12/cmake-build/gen_embedding_forward_split_weighted_vbe_kernel.cu -o CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_split_weighted_vbe_kernel.cu.o 2025-05-07T20:01:17.7418568Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:01:17.7421277Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:01:17.7422401Z ^ 2025-05-07T20:01:17.7422672Z 2025-05-07T20:01:17.7423140Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:01:17.7423827Z 2025-05-07T20:01:17.7425232Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:01:17.7427909Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:01:17.7428997Z ^ 2025-05-07T20:01:17.7429398Z 2025-05-07T20:01:17.7431010Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:01:17.7433818Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:01:17.7434883Z ^ 2025-05-07T20:01:17.7435141Z 2025-05-07T20:01:17.7435596Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:01:17.7436326Z 2025-05-07T20:01:17.7438105Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:01:17.7440828Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:01:17.7442010Z ^ 2025-05-07T20:01:17.7442352Z 2025-05-07T20:01:17.7444183Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:01:17.7446814Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:01:17.7447907Z ^ 2025-05-07T20:01:17.7448147Z 2025-05-07T20:01:17.7448522Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:01:17.7449095Z 2025-05-07T20:01:17.7450581Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:01:17.7453394Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:01:17.7454674Z ^ 2025-05-07T20:01:17.7455004Z 2025-05-07T20:01:17.7456685Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:01:17.7459599Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:01:17.7460675Z ^ 2025-05-07T20:01:17.7460924Z 2025-05-07T20:01:17.7461319Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:01:17.7461880Z 2025-05-07T20:01:17.7463472Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:01:17.7466518Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:01:17.7467718Z ^ 2025-05-07T20:01:17.7468070Z 2025-05-07T20:01:17.7469798Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:01:17.7472656Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:01:17.7473688Z ^ 2025-05-07T20:01:17.7473899Z 2025-05-07T20:01:17.7474266Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:01:17.7474842Z 2025-05-07T20:01:17.7476111Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:01:17.7478464Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:01:17.7479645Z ^ 2025-05-07T20:01:17.7480245Z 2025-05-07T20:01:18.5339811Z [354/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_partial_rowwise_lamb_split_unweighted_nobag_kernel_warp.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_partial_rowwise_lamb_split_unweighted_nobag_kernel_warp.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.12/cmake-build/gen_embedding_backward_partial_rowwise_lamb_split_unweighted_nobag_kernel_warp.cu -o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_partial_rowwise_lamb_split_unweighted_nobag_kernel_warp.cu.o 2025-05-07T20:01:18.5363658Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:01:18.5366558Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:01:18.5367768Z ^ 2025-05-07T20:01:18.5368317Z 2025-05-07T20:01:18.5368765Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:01:18.5369459Z 2025-05-07T20:01:18.5371154Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:01:18.5373901Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:01:18.5375121Z ^ 2025-05-07T20:01:18.5375495Z 2025-05-07T20:01:18.5377193Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:01:18.5380129Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:01:18.5381344Z ^ 2025-05-07T20:01:18.5381612Z 2025-05-07T20:01:18.5382090Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:01:18.5382768Z 2025-05-07T20:01:18.5384526Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:01:18.5387253Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:01:18.5388425Z ^ 2025-05-07T20:01:18.5388986Z 2025-05-07T20:01:18.5390659Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:01:18.5393450Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:01:18.5394536Z ^ 2025-05-07T20:01:18.5394830Z 2025-05-07T20:01:18.5395232Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:01:18.5395885Z 2025-05-07T20:01:18.5397637Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:01:18.5400478Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:01:18.5401701Z ^ 2025-05-07T20:01:18.5402066Z 2025-05-07T20:01:18.5403687Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:01:18.5406288Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:01:18.5407362Z ^ 2025-05-07T20:01:18.5407603Z 2025-05-07T20:01:18.5408022Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:01:18.5408585Z 2025-05-07T20:01:18.5410216Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:01:18.5412814Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:01:18.5414196Z ^ 2025-05-07T20:01:18.5414574Z 2025-05-07T20:01:18.5416263Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:01:18.5418963Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:01:18.5420088Z ^ 2025-05-07T20:01:18.5420356Z 2025-05-07T20:01:18.5420795Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:01:18.5421472Z 2025-05-07T20:01:18.5423155Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:01:18.5425979Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:01:18.5427185Z ^ 2025-05-07T20:01:18.5427545Z 2025-05-07T20:01:21.6143422Z [355/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_forward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_split_unweighted_vbe_kernel.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_split_unweighted_vbe_kernel.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.12/cmake-build/gen_embedding_forward_split_unweighted_vbe_kernel.cu -o CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_split_unweighted_vbe_kernel.cu.o 2025-05-07T20:01:21.6167300Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:01:21.6170172Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:01:21.6171345Z ^ 2025-05-07T20:01:21.6171599Z 2025-05-07T20:01:21.6172049Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:01:21.6172699Z 2025-05-07T20:01:21.6174315Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:01:21.6177018Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:01:21.6178373Z ^ 2025-05-07T20:01:21.6178759Z 2025-05-07T20:01:21.6180577Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:01:21.6183746Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:01:21.6184974Z ^ 2025-05-07T20:01:21.6185237Z 2025-05-07T20:01:21.6185716Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:01:21.6186419Z 2025-05-07T20:01:21.6188290Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:01:21.6191138Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:01:21.6192551Z ^ 2025-05-07T20:01:21.6192925Z 2025-05-07T20:01:21.6194614Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:01:21.6197430Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:01:21.6198617Z ^ 2025-05-07T20:01:21.6198861Z 2025-05-07T20:01:21.6199312Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:01:21.6199980Z 2025-05-07T20:01:21.6201885Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:01:21.6204775Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:01:21.6206021Z ^ 2025-05-07T20:01:21.6206411Z 2025-05-07T20:01:21.6208130Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:01:21.6210988Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:01:21.6212251Z ^ 2025-05-07T20:01:21.6212482Z 2025-05-07T20:01:21.6212935Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:01:21.6213651Z 2025-05-07T20:01:21.6215403Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:01:21.6218495Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:01:21.6219715Z ^ 2025-05-07T20:01:21.6220105Z 2025-05-07T20:01:21.6222059Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:01:21.6224866Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:01:21.6226046Z ^ 2025-05-07T20:01:21.6226482Z 2025-05-07T20:01:21.6226944Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:01:21.6227628Z 2025-05-07T20:01:21.6229423Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:01:21.6232511Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:01:21.6233767Z ^ 2025-05-07T20:01:21.6234137Z 2025-05-07T20:01:22.1201686Z [356/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_partial_rowwise_adam_split_weighted_cuda.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_partial_rowwise_adam_split_weighted_cuda.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.12/cmake-build/gen_embedding_backward_partial_rowwise_adam_split_weighted_cuda.cu -o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_partial_rowwise_adam_split_weighted_cuda.cu.o 2025-05-07T20:01:22.1226255Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:01:22.1229088Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:01:22.1230431Z ^ 2025-05-07T20:01:22.1230686Z 2025-05-07T20:01:22.1231165Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:01:22.1232025Z 2025-05-07T20:01:22.1233710Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:01:22.1236670Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:01:22.1237799Z ^ 2025-05-07T20:01:22.1238149Z 2025-05-07T20:01:22.1239849Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:01:22.1242495Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:01:22.1243694Z ^ 2025-05-07T20:01:22.1243960Z 2025-05-07T20:01:22.1244510Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:01:22.1245205Z 2025-05-07T20:01:22.1246963Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:01:22.1249920Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:01:22.1251144Z ^ 2025-05-07T20:01:22.1251542Z 2025-05-07T20:01:22.1253219Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:01:22.1256013Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:01:22.1257224Z ^ 2025-05-07T20:01:22.1257489Z 2025-05-07T20:01:22.1257961Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:01:22.1258661Z 2025-05-07T20:01:22.1260398Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:01:22.1263189Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:01:22.1264455Z ^ 2025-05-07T20:01:22.1265170Z 2025-05-07T20:01:22.1266822Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:01:22.1269506Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:01:22.1270942Z ^ 2025-05-07T20:01:22.1271195Z 2025-05-07T20:01:22.1271734Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:01:22.1272351Z 2025-05-07T20:01:22.1273990Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:01:22.1276614Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:01:22.1277780Z ^ 2025-05-07T20:01:22.1278139Z 2025-05-07T20:01:22.1279810Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:01:22.1282609Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:01:22.1283790Z ^ 2025-05-07T20:01:22.1284075Z 2025-05-07T20:01:22.1284617Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:01:22.1285316Z 2025-05-07T20:01:22.1286940Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:01:22.1289544Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:01:22.1291092Z ^ 2025-05-07T20:01:22.1291483Z 2025-05-07T20:01:22.5022460Z [357/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_partial_rowwise_adam_split_unweighted_nobag_cuda.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_partial_rowwise_adam_split_unweighted_nobag_cuda.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.12/cmake-build/gen_embedding_backward_partial_rowwise_adam_split_unweighted_nobag_cuda.cu -o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_partial_rowwise_adam_split_unweighted_nobag_cuda.cu.o 2025-05-07T20:01:22.5047061Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:01:22.5049830Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:01:22.5051312Z ^ 2025-05-07T20:01:22.5051609Z 2025-05-07T20:01:22.5052071Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:01:22.5052754Z 2025-05-07T20:01:22.5054724Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:01:22.5057518Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:01:22.5058773Z ^ 2025-05-07T20:01:22.5059155Z 2025-05-07T20:01:22.5060995Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:01:22.5063795Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:01:22.5065348Z ^ 2025-05-07T20:01:22.5065616Z 2025-05-07T20:01:22.5066068Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:01:22.5066649Z 2025-05-07T20:01:22.5068205Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:01:22.5070970Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:01:22.5072345Z ^ 2025-05-07T20:01:22.5072757Z 2025-05-07T20:01:22.5074350Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:01:22.5077001Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:01:22.5078132Z ^ 2025-05-07T20:01:22.5078383Z 2025-05-07T20:01:22.5078856Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:01:22.5079522Z 2025-05-07T20:01:22.5081149Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:01:22.5084014Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:01:22.5085303Z ^ 2025-05-07T20:01:22.5085675Z 2025-05-07T20:01:22.5087366Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:01:22.5090512Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:01:22.5091655Z ^ 2025-05-07T20:01:22.5091920Z 2025-05-07T20:01:22.5092376Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:01:22.5093066Z 2025-05-07T20:01:22.5094785Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:01:22.5097557Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:01:22.5099077Z ^ 2025-05-07T20:01:22.5099446Z 2025-05-07T20:01:22.5102596Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:01:22.5105408Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:01:22.5106596Z ^ 2025-05-07T20:01:22.5106859Z 2025-05-07T20:01:22.5107343Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:01:22.5107946Z 2025-05-07T20:01:22.5109809Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:01:22.5112800Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:01:22.5114099Z ^ 2025-05-07T20:01:22.5114448Z 2025-05-07T20:01:23.5707335Z [358/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_rowwise_adagrad_with_counter_split_unweighted_cuda.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_rowwise_adagrad_with_counter_split_unweighted_cuda.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.12/cmake-build/gen_embedding_backward_rowwise_adagrad_with_counter_split_unweighted_cuda.cu -o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_rowwise_adagrad_with_counter_split_unweighted_cuda.cu.o 2025-05-07T20:01:23.5733813Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:01:23.5736665Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:01:23.5738110Z ^ 2025-05-07T20:01:23.5738392Z 2025-05-07T20:01:23.5738867Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:01:23.5739570Z 2025-05-07T20:01:23.5741346Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:01:23.5744277Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:01:23.5745483Z ^ 2025-05-07T20:01:23.5745846Z 2025-05-07T20:01:23.5747572Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:01:23.5750256Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:01:23.5751617Z ^ 2025-05-07T20:01:23.5751879Z 2025-05-07T20:01:23.5752367Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:01:23.5753049Z 2025-05-07T20:01:23.5754767Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:01:23.5757547Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:01:23.5758806Z ^ 2025-05-07T20:01:23.5759188Z 2025-05-07T20:01:23.5760897Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:01:23.5763698Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:01:23.5765160Z ^ 2025-05-07T20:01:23.5765439Z 2025-05-07T20:01:23.5765905Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:01:23.5766772Z 2025-05-07T20:01:23.5768523Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:01:23.5771395Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:01:23.5772581Z ^ 2025-05-07T20:01:23.5772937Z 2025-05-07T20:01:23.5774564Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:01:23.5777087Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:01:23.5778123Z ^ 2025-05-07T20:01:23.5778355Z 2025-05-07T20:01:23.5778727Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:01:23.5779492Z 2025-05-07T20:01:23.5781242Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:01:23.5784400Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:01:23.5785652Z ^ 2025-05-07T20:01:23.5786057Z 2025-05-07T20:01:23.5787947Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:01:23.5790841Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:01:23.5792265Z ^ 2025-05-07T20:01:23.5792555Z 2025-05-07T20:01:23.5793030Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:01:23.5793739Z 2025-05-07T20:01:23.5795535Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:01:23.5798283Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:01:23.5799551Z ^ 2025-05-07T20:01:23.5799933Z 2025-05-07T20:01:29.8826865Z [359/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_partial_rowwise_adam_split_unweighted_cuda.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_partial_rowwise_adam_split_unweighted_cuda.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.12/cmake-build/gen_embedding_backward_partial_rowwise_adam_split_unweighted_cuda.cu -o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_partial_rowwise_adam_split_unweighted_cuda.cu.o 2025-05-07T20:01:29.8850377Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:01:29.8853291Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:01:29.8854548Z ^ 2025-05-07T20:01:29.8854818Z 2025-05-07T20:01:29.8855432Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:01:29.8856049Z 2025-05-07T20:01:29.8857703Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:01:29.8860446Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:01:29.8861692Z ^ 2025-05-07T20:01:29.8862062Z 2025-05-07T20:01:29.8863753Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:01:29.8866775Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:01:29.8867962Z ^ 2025-05-07T20:01:29.8868211Z 2025-05-07T20:01:29.8868702Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:01:29.8869372Z 2025-05-07T20:01:29.8871105Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:01:29.8874067Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:01:29.8875307Z ^ 2025-05-07T20:01:29.8875689Z 2025-05-07T20:01:29.8877272Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:01:29.8880032Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:01:29.8881195Z ^ 2025-05-07T20:01:29.8881722Z 2025-05-07T20:01:29.8882162Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:01:29.8882829Z 2025-05-07T20:01:29.8884538Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:01:29.8887312Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:01:29.8888552Z ^ 2025-05-07T20:01:29.8888925Z 2025-05-07T20:01:29.8890698Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:01:29.8893642Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:01:29.8894887Z ^ 2025-05-07T20:01:29.8895145Z 2025-05-07T20:01:29.8895613Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:01:29.8896341Z 2025-05-07T20:01:29.8898180Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:01:29.8900975Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:01:29.8902194Z ^ 2025-05-07T20:01:29.8902668Z 2025-05-07T20:01:29.8904381Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:01:29.8907175Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:01:29.8908385Z ^ 2025-05-07T20:01:29.8908676Z 2025-05-07T20:01:29.8909142Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:01:29.8909835Z 2025-05-07T20:01:29.8911769Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:01:29.8914587Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:01:29.8915823Z ^ 2025-05-07T20:01:29.8916197Z 2025-05-07T20:01:30.1922333Z [360/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_partial_rowwise_lamb_split_unweighted_kernel_warp.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_partial_rowwise_lamb_split_unweighted_kernel_warp.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.12/cmake-build/gen_embedding_backward_partial_rowwise_lamb_split_unweighted_kernel_warp.cu -o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_partial_rowwise_lamb_split_unweighted_kernel_warp.cu.o 2025-05-07T20:01:30.1944579Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:01:30.1947001Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:01:30.1948100Z ^ 2025-05-07T20:01:30.1948356Z 2025-05-07T20:01:30.1948785Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:01:30.1949397Z 2025-05-07T20:01:30.1950873Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:01:30.1953366Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:01:30.1954475Z ^ 2025-05-07T20:01:30.1954824Z 2025-05-07T20:01:30.1956299Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:01:30.1958660Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:01:30.1959735Z ^ 2025-05-07T20:01:30.1959995Z 2025-05-07T20:01:30.1960465Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:01:30.1961059Z 2025-05-07T20:01:30.1962517Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:01:30.1965204Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:01:30.1966317Z ^ 2025-05-07T20:01:30.1966697Z 2025-05-07T20:01:30.1968080Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:01:30.1970688Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:01:30.1971731Z ^ 2025-05-07T20:01:30.1972019Z 2025-05-07T20:01:30.1972446Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:01:30.1973048Z 2025-05-07T20:01:30.1974636Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:01:30.1977066Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:01:30.1978401Z ^ 2025-05-07T20:01:30.1978931Z 2025-05-07T20:01:30.1980427Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:01:30.1982716Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:01:30.1983671Z ^ 2025-05-07T20:01:30.1983880Z 2025-05-07T20:01:30.1984244Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:01:30.1984799Z 2025-05-07T20:01:30.1986307Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:01:30.1988746Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:01:30.1989776Z ^ 2025-05-07T20:01:30.1990121Z 2025-05-07T20:01:30.1991697Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:01:30.1993915Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:01:30.1994862Z ^ 2025-05-07T20:01:30.1995100Z 2025-05-07T20:01:30.1995460Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:01:30.1995993Z 2025-05-07T20:01:30.1997352Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:01:30.1999653Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:01:30.2000679Z ^ 2025-05-07T20:01:30.2001036Z 2025-05-07T20:01:33.3627958Z [361/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_gwd_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward_gwd.dir/gen_embedding_backward_rowwise_adagrad_split_weighted_gwd_cuda.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward_gwd.dir/gen_embedding_backward_rowwise_adagrad_split_weighted_gwd_cuda.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.12/cmake-build/gen_embedding_backward_rowwise_adagrad_split_weighted_gwd_cuda.cu -o CMakeFiles/fbgemm_gpu_tbe_training_backward_gwd.dir/gen_embedding_backward_rowwise_adagrad_split_weighted_gwd_cuda.cu.o 2025-05-07T20:01:33.3648711Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:01:33.3651071Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:01:33.3652063Z ^ 2025-05-07T20:01:33.3652284Z 2025-05-07T20:01:33.3652657Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:01:33.3653242Z 2025-05-07T20:01:33.3654627Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:01:33.3656852Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:01:33.3657854Z ^ 2025-05-07T20:01:33.3658174Z 2025-05-07T20:01:33.3659588Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:01:33.3661804Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:01:33.3662741Z ^ 2025-05-07T20:01:33.3662977Z 2025-05-07T20:01:33.3663341Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:01:33.3663888Z 2025-05-07T20:01:33.3665595Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:01:33.3667765Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:01:33.3669020Z ^ 2025-05-07T20:01:33.3669364Z 2025-05-07T20:01:33.3670738Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:01:33.3673025Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:01:33.3674018Z ^ 2025-05-07T20:01:33.3674241Z 2025-05-07T20:01:33.3674648Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:01:33.3675198Z 2025-05-07T20:01:33.3676712Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:01:33.3678926Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:01:33.3680043Z ^ 2025-05-07T20:01:33.3680355Z 2025-05-07T20:01:33.3681697Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:01:33.3683986Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:01:33.3684974Z ^ 2025-05-07T20:01:33.3685213Z 2025-05-07T20:01:33.3685551Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:01:33.3686082Z 2025-05-07T20:01:33.3687484Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:01:33.3689829Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:01:33.3690953Z ^ 2025-05-07T20:01:33.3691306Z 2025-05-07T20:01:33.3692863Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:01:33.3695444Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:01:33.3696557Z ^ 2025-05-07T20:01:33.3696798Z 2025-05-07T20:01:33.3697227Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:01:33.3697907Z 2025-05-07T20:01:33.3699517Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:01:33.3702048Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:01:33.3703169Z ^ 2025-05-07T20:01:33.3703542Z 2025-05-07T20:01:35.7474982Z [362/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_forward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_ssd_weighted_vbe_kernel.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_ssd_weighted_vbe_kernel.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.12/cmake-build/gen_embedding_forward_ssd_weighted_vbe_kernel.cu -o CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_ssd_weighted_vbe_kernel.cu.o 2025-05-07T20:01:35.7500791Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:01:35.7503805Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:01:35.7505144Z ^ 2025-05-07T20:01:35.7505450Z 2025-05-07T20:01:35.7505957Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:01:35.7506735Z 2025-05-07T20:01:35.7508562Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:01:35.7511750Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:01:35.7513075Z ^ 2025-05-07T20:01:35.7513521Z 2025-05-07T20:01:35.7515305Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:01:35.7518241Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:01:35.7519541Z ^ 2025-05-07T20:01:35.7519848Z 2025-05-07T20:01:35.7520381Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:01:35.7521113Z 2025-05-07T20:01:35.7522918Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:01:35.7526028Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:01:35.7527381Z ^ 2025-05-07T20:01:35.7527787Z 2025-05-07T20:01:35.7529563Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:01:35.7532495Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:01:35.7533878Z ^ 2025-05-07T20:01:35.7534174Z 2025-05-07T20:01:35.7534680Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:01:35.7535398Z 2025-05-07T20:01:35.7537357Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:01:35.7540291Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:01:35.7541632Z ^ 2025-05-07T20:01:35.7541984Z 2025-05-07T20:01:35.7543883Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:01:35.7546779Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:01:35.7548091Z ^ 2025-05-07T20:01:35.7548376Z 2025-05-07T20:01:35.7548892Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:01:35.7549616Z 2025-05-07T20:01:35.7551657Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:01:35.7554605Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:01:35.7555913Z ^ 2025-05-07T20:01:35.7556348Z 2025-05-07T20:01:35.7558124Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:01:35.7561047Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:01:35.7562340Z ^ 2025-05-07T20:01:35.7562648Z 2025-05-07T20:01:35.7563147Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:01:35.7563876Z 2025-05-07T20:01:35.7565984Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:01:35.7568844Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:01:35.7570184Z ^ 2025-05-07T20:01:35.7570585Z 2025-05-07T20:01:46.6760418Z [363/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_partial_rowwise_lamb_split_weighted_kernel_cta.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_partial_rowwise_lamb_split_weighted_kernel_cta.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.12/cmake-build/gen_embedding_backward_partial_rowwise_lamb_split_weighted_kernel_cta.cu -o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_partial_rowwise_lamb_split_weighted_kernel_cta.cu.o 2025-05-07T20:01:46.6785406Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:01:46.6788448Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:01:46.6789913Z ^ 2025-05-07T20:01:46.6790200Z 2025-05-07T20:01:46.6790666Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:01:46.6791549Z 2025-05-07T20:01:46.6793289Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:01:46.6796172Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:01:46.6797240Z ^ 2025-05-07T20:01:46.6797609Z 2025-05-07T20:01:46.6799238Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:01:46.6802045Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:01:46.6803608Z ^ 2025-05-07T20:01:46.6803879Z 2025-05-07T20:01:46.6804371Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:01:46.6805066Z 2025-05-07T20:01:46.6806970Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:01:46.6809689Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:01:46.6810874Z ^ 2025-05-07T20:01:46.6811234Z 2025-05-07T20:01:46.6813038Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:01:46.6815794Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:01:46.6816976Z ^ 2025-05-07T20:01:46.6817441Z 2025-05-07T20:01:46.6817900Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:01:46.6818580Z 2025-05-07T20:01:46.6820270Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:01:46.6823182Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:01:46.6824452Z ^ 2025-05-07T20:01:46.6824827Z 2025-05-07T20:01:46.6826580Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:01:46.6829328Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:01:46.6844166Z ^ 2025-05-07T20:01:46.6844472Z 2025-05-07T20:01:46.6844950Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:01:46.6845637Z 2025-05-07T20:01:46.6847332Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:01:46.6850156Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:01:46.6851395Z ^ 2025-05-07T20:01:46.6851815Z 2025-05-07T20:01:46.6853553Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:01:46.6856333Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:01:46.6857546Z ^ 2025-05-07T20:01:46.6857849Z 2025-05-07T20:01:46.6858315Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:01:46.6859031Z 2025-05-07T20:01:46.6860825Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:01:46.6863927Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:01:46.6865509Z ^ 2025-05-07T20:01:46.6865899Z 2025-05-07T20:01:52.3286800Z [364/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_gwd_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward_gwd.dir/gen_embedding_backward_rowwise_adagrad_split_unweighted_gwd_cuda.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward_gwd.dir/gen_embedding_backward_rowwise_adagrad_split_unweighted_gwd_cuda.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.12/cmake-build/gen_embedding_backward_rowwise_adagrad_split_unweighted_gwd_cuda.cu -o CMakeFiles/fbgemm_gpu_tbe_training_backward_gwd.dir/gen_embedding_backward_rowwise_adagrad_split_unweighted_gwd_cuda.cu.o 2025-05-07T20:01:52.3311604Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:01:52.3314655Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:01:52.3315953Z ^ 2025-05-07T20:01:52.3316272Z 2025-05-07T20:01:52.3316763Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:01:52.3317500Z 2025-05-07T20:01:52.3319314Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:01:52.3322262Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:01:52.3323624Z ^ 2025-05-07T20:01:52.3324313Z 2025-05-07T20:01:52.3325928Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:01:52.3328870Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:01:52.3330181Z ^ 2025-05-07T20:01:52.3330474Z 2025-05-07T20:01:52.3330966Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:01:52.3331725Z 2025-05-07T20:01:52.3333521Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:01:52.3336575Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:01:52.3337916Z ^ 2025-05-07T20:01:52.3338324Z 2025-05-07T20:01:52.3340228Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:01:52.3343101Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:01:52.3344350Z ^ 2025-05-07T20:01:52.3344642Z 2025-05-07T20:01:52.3345171Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:01:52.3346007Z 2025-05-07T20:01:52.3347793Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:01:52.3350732Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:01:52.3352247Z ^ 2025-05-07T20:01:52.3352657Z 2025-05-07T20:01:52.3354415Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:01:52.3357325Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:01:52.3358652Z ^ 2025-05-07T20:01:52.3358951Z 2025-05-07T20:01:52.3359442Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:01:52.3360197Z 2025-05-07T20:01:52.3361884Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:01:52.3365068Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:01:52.3366386Z ^ 2025-05-07T20:01:52.3366800Z 2025-05-07T20:01:52.3368602Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:01:52.3371453Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:01:52.3372780Z ^ 2025-05-07T20:01:52.3373234Z 2025-05-07T20:01:52.3373986Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:01:52.3374723Z 2025-05-07T20:01:52.3376511Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:01:52.3379451Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:01:52.3380677Z ^ 2025-05-07T20:01:52.3380991Z 2025-05-07T20:02:00.9629999Z [365/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_partial_rowwise_lamb_split_unweighted_nobag_kernel_cta.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_partial_rowwise_lamb_split_unweighted_nobag_kernel_cta.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.12/cmake-build/gen_embedding_backward_partial_rowwise_lamb_split_unweighted_nobag_kernel_cta.cu -o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_partial_rowwise_lamb_split_unweighted_nobag_kernel_cta.cu.o 2025-05-07T20:02:00.9654441Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:02:00.9657073Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:02:00.9658277Z ^ 2025-05-07T20:02:00.9658547Z 2025-05-07T20:02:00.9659031Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:02:00.9659742Z 2025-05-07T20:02:00.9661224Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:02:00.9664286Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:02:00.9665725Z ^ 2025-05-07T20:02:00.9666120Z 2025-05-07T20:02:00.9667819Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:02:00.9670646Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:02:00.9672149Z ^ 2025-05-07T20:02:00.9672405Z 2025-05-07T20:02:00.9672856Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:02:00.9673515Z 2025-05-07T20:02:00.9675327Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:02:00.9678019Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:02:00.9679240Z ^ 2025-05-07T20:02:00.9679615Z 2025-05-07T20:02:00.9681618Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:02:00.9684349Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:02:00.9685502Z ^ 2025-05-07T20:02:00.9685747Z 2025-05-07T20:02:00.9686166Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:02:00.9686799Z 2025-05-07T20:02:00.9688326Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:02:00.9691014Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:02:00.9692179Z ^ 2025-05-07T20:02:00.9692560Z 2025-05-07T20:02:00.9694320Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:02:00.9697045Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:02:00.9698281Z ^ 2025-05-07T20:02:00.9698540Z 2025-05-07T20:02:00.9699017Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:02:00.9699701Z 2025-05-07T20:02:00.9701309Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:02:00.9703874Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:02:00.9704979Z ^ 2025-05-07T20:02:00.9705299Z 2025-05-07T20:02:00.9706849Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:02:00.9709816Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:02:00.9711002Z ^ 2025-05-07T20:02:00.9711301Z 2025-05-07T20:02:00.9711898Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:02:00.9712536Z 2025-05-07T20:02:00.9714166Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:02:00.9716853Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:02:00.9718243Z ^ 2025-05-07T20:02:00.9718606Z 2025-05-07T20:02:04.0830386Z [366/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_rowwise_adagrad_with_counter_split_weighted_kernel_warp.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_rowwise_adagrad_with_counter_split_weighted_kernel_warp.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.12/cmake-build/gen_embedding_backward_rowwise_adagrad_with_counter_split_weighted_kernel_warp.cu -o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_rowwise_adagrad_with_counter_split_weighted_kernel_warp.cu.o 2025-05-07T20:02:04.0857157Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:02:04.0860259Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:02:04.0861867Z ^ 2025-05-07T20:02:04.0862167Z 2025-05-07T20:02:04.0862696Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:02:04.0863420Z 2025-05-07T20:02:04.0865576Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:02:04.0868540Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:02:04.0869856Z ^ 2025-05-07T20:02:04.0870292Z 2025-05-07T20:02:04.0872367Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:02:04.0875308Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:02:04.0876613Z ^ 2025-05-07T20:02:04.0877113Z 2025-05-07T20:02:04.0877606Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:02:04.0878338Z 2025-05-07T20:02:04.0880166Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:02:04.0883219Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:02:04.0884569Z ^ 2025-05-07T20:02:04.0884971Z 2025-05-07T20:02:04.0886745Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:02:04.0889546Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:02:04.0890867Z ^ 2025-05-07T20:02:04.0891149Z 2025-05-07T20:02:04.0891640Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:02:04.0892400Z 2025-05-07T20:02:04.0894166Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:02:04.0897125Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:02:04.0898449Z ^ 2025-05-07T20:02:04.0898882Z 2025-05-07T20:02:04.0900664Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:02:04.0903562Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:02:04.0904860Z ^ 2025-05-07T20:02:04.0905166Z 2025-05-07T20:02:04.0905679Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:02:04.0906436Z 2025-05-07T20:02:04.0908248Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:02:04.0911549Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:02:04.0912864Z ^ 2025-05-07T20:02:04.0913266Z 2025-05-07T20:02:04.0915069Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:02:04.0917950Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:02:04.0919280Z ^ 2025-05-07T20:02:04.0919567Z 2025-05-07T20:02:04.0920090Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:02:04.0920939Z 2025-05-07T20:02:04.0922734Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:02:04.0925775Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:02:04.0927119Z ^ 2025-05-07T20:02:04.0927529Z 2025-05-07T20:02:06.3795049Z [367/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_partial_rowwise_lamb_split_unweighted_kernel_cta.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_partial_rowwise_lamb_split_unweighted_kernel_cta.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.12/cmake-build/gen_embedding_backward_partial_rowwise_lamb_split_unweighted_kernel_cta.cu -o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_partial_rowwise_lamb_split_unweighted_kernel_cta.cu.o 2025-05-07T20:02:06.3821135Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:02:06.3824165Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:02:06.3825474Z ^ 2025-05-07T20:02:06.3825767Z 2025-05-07T20:02:06.3826287Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:02:06.3827015Z 2025-05-07T20:02:06.3828838Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:02:06.3832259Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:02:06.3833604Z ^ 2025-05-07T20:02:06.3834009Z 2025-05-07T20:02:06.3835944Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:02:06.3838860Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:02:06.3840185Z ^ 2025-05-07T20:02:06.3840476Z 2025-05-07T20:02:06.3840964Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:02:06.3841816Z 2025-05-07T20:02:06.3843618Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:02:06.3846580Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:02:06.3847894Z ^ 2025-05-07T20:02:06.3848301Z 2025-05-07T20:02:06.3850102Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:02:06.3853009Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:02:06.3854359Z ^ 2025-05-07T20:02:06.3854651Z 2025-05-07T20:02:06.3855173Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:02:06.3855916Z 2025-05-07T20:02:06.3857730Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:02:06.3860677Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:02:06.3862021Z ^ 2025-05-07T20:02:06.3862433Z 2025-05-07T20:02:06.3864226Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:02:06.3867425Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:02:06.3868731Z ^ 2025-05-07T20:02:06.3869048Z 2025-05-07T20:02:06.3869764Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:02:06.3870486Z 2025-05-07T20:02:06.3872405Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:02:06.3875315Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:02:06.3876653Z ^ 2025-05-07T20:02:06.3877061Z 2025-05-07T20:02:06.3878874Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:02:06.3881914Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:02:06.3883255Z ^ 2025-05-07T20:02:06.3883543Z 2025-05-07T20:02:06.3884034Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:02:06.3884792Z 2025-05-07T20:02:06.3886718Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:02:06.3889676Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:02:06.3890978Z ^ 2025-05-07T20:02:06.3891403Z 2025-05-07T20:02:09.9245455Z [368/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_forward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_ssd_unweighted_vbe_kernel.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_ssd_unweighted_vbe_kernel.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.12/cmake-build/gen_embedding_forward_ssd_unweighted_vbe_kernel.cu -o CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_ssd_unweighted_vbe_kernel.cu.o 2025-05-07T20:02:09.9270850Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:02:09.9273867Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:02:09.9275185Z ^ 2025-05-07T20:02:09.9275517Z 2025-05-07T20:02:09.9276016Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:02:09.9277057Z 2025-05-07T20:02:09.9278858Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:02:09.9281980Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:02:09.9283334Z ^ 2025-05-07T20:02:09.9283740Z 2025-05-07T20:02:09.9285636Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:02:09.9288655Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:02:09.9289994Z ^ 2025-05-07T20:02:09.9290287Z 2025-05-07T20:02:09.9290784Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:02:09.9291552Z 2025-05-07T20:02:09.9293345Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:02:09.9296273Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:02:09.9297583Z ^ 2025-05-07T20:02:09.9298020Z 2025-05-07T20:02:09.9299812Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:02:09.9302740Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:02:09.9304053Z ^ 2025-05-07T20:02:09.9304370Z 2025-05-07T20:02:09.9304866Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:02:09.9305600Z 2025-05-07T20:02:09.9307469Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:02:09.9310419Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:02:09.9311943Z ^ 2025-05-07T20:02:09.9312349Z 2025-05-07T20:02:09.9314134Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:02:09.9317277Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:02:09.9318588Z ^ 2025-05-07T20:02:09.9318867Z 2025-05-07T20:02:09.9319368Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:02:09.9320122Z 2025-05-07T20:02:09.9321936Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:02:09.9324887Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:02:09.9326196Z ^ 2025-05-07T20:02:09.9329209Z 2025-05-07T20:02:09.9330998Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:02:09.9334019Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:02:09.9335310Z ^ 2025-05-07T20:02:09.9335597Z 2025-05-07T20:02:09.9336105Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:02:09.9336825Z 2025-05-07T20:02:09.9338570Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:02:09.9341619Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:02:09.9342965Z ^ 2025-05-07T20:02:09.9343379Z 2025-05-07T20:02:18.1753393Z [369/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_gwd_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward_gwd.dir/gen_embedding_backward_rowwise_adagrad_split_weighted_gwd_kernel_warp.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward_gwd.dir/gen_embedding_backward_rowwise_adagrad_split_weighted_gwd_kernel_warp.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.12/cmake-build/gen_embedding_backward_rowwise_adagrad_split_weighted_gwd_kernel_warp.cu -o CMakeFiles/fbgemm_gpu_tbe_training_backward_gwd.dir/gen_embedding_backward_rowwise_adagrad_split_weighted_gwd_kernel_warp.cu.o 2025-05-07T20:02:18.1775657Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:02:18.1778316Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:02:18.1779309Z ^ 2025-05-07T20:02:18.1779515Z 2025-05-07T20:02:18.1779931Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:02:18.1780719Z 2025-05-07T20:02:18.1782201Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:02:18.1784673Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:02:18.1785948Z ^ 2025-05-07T20:02:18.1786280Z 2025-05-07T20:02:18.1787690Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:02:18.1790185Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:02:18.1791273Z ^ 2025-05-07T20:02:18.1791667Z 2025-05-07T20:02:18.1792086Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:02:18.1792703Z 2025-05-07T20:02:18.1794166Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:02:18.1796547Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:02:18.1797691Z ^ 2025-05-07T20:02:18.1797993Z 2025-05-07T20:02:18.1799452Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:02:18.1801896Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:02:18.1803037Z ^ 2025-05-07T20:02:18.1803281Z 2025-05-07T20:02:18.1803729Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:02:18.1804372Z 2025-05-07T20:02:18.1805899Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:02:18.1808417Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:02:18.1809774Z ^ 2025-05-07T20:02:18.1810157Z 2025-05-07T20:02:18.1811707Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:02:18.1814151Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:02:18.1815216Z ^ 2025-05-07T20:02:18.1815487Z 2025-05-07T20:02:18.1815894Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:02:18.1816466Z 2025-05-07T20:02:18.1817820Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:02:18.1820274Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:02:18.1821370Z ^ 2025-05-07T20:02:18.1821714Z 2025-05-07T20:02:18.1823295Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:02:18.1825797Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:02:18.1826948Z ^ 2025-05-07T20:02:18.1827177Z 2025-05-07T20:02:18.1827667Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:02:18.1828301Z 2025-05-07T20:02:18.1829821Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:02:18.1832504Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:02:18.1833613Z ^ 2025-05-07T20:02:18.1833971Z 2025-05-07T20:02:19.3361466Z [370/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_gwd_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward_gwd.dir/gen_embedding_backward_rowwise_adagrad_split_weighted_vbe_gwd_cuda.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward_gwd.dir/gen_embedding_backward_rowwise_adagrad_split_weighted_vbe_gwd_cuda.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.12/cmake-build/gen_embedding_backward_rowwise_adagrad_split_weighted_vbe_gwd_cuda.cu -o CMakeFiles/fbgemm_gpu_tbe_training_backward_gwd.dir/gen_embedding_backward_rowwise_adagrad_split_weighted_vbe_gwd_cuda.cu.o 2025-05-07T20:02:19.3385900Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:02:19.3388827Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:02:19.3389977Z ^ 2025-05-07T20:02:19.3390234Z 2025-05-07T20:02:19.3390697Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:02:19.3391684Z 2025-05-07T20:02:19.3393459Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:02:19.3396123Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:02:19.3397367Z ^ 2025-05-07T20:02:19.3397777Z 2025-05-07T20:02:19.3399578Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:02:19.3402382Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:02:19.3403599Z ^ 2025-05-07T20:02:19.3403900Z 2025-05-07T20:02:19.3404364Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:02:19.3405068Z 2025-05-07T20:02:19.3406818Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:02:19.3409707Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:02:19.3411101Z ^ 2025-05-07T20:02:19.3411473Z 2025-05-07T20:02:19.3413148Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:02:19.3415876Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:02:19.3417086Z ^ 2025-05-07T20:02:19.3417345Z 2025-05-07T20:02:19.3417806Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:02:19.3418483Z 2025-05-07T20:02:19.3420317Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:02:19.3422910Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:02:19.3424101Z ^ 2025-05-07T20:02:19.3424482Z 2025-05-07T20:02:19.3426074Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:02:19.3428622Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:02:19.3429931Z ^ 2025-05-07T20:02:19.3430216Z 2025-05-07T20:02:19.3430681Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:02:19.3431344Z 2025-05-07T20:02:19.3433297Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:02:19.3435923Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:02:19.3437260Z ^ 2025-05-07T20:02:19.3437639Z 2025-05-07T20:02:19.3439455Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:02:19.3442190Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:02:19.3443303Z ^ 2025-05-07T20:02:19.3443547Z 2025-05-07T20:02:19.3443968Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:02:19.3444630Z 2025-05-07T20:02:19.3446189Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:02:19.3448892Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:02:19.3450025Z ^ 2025-05-07T20:02:19.3450405Z 2025-05-07T20:02:19.5700283Z [371/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_rowwise_adagrad_with_counter_split_unweighted_kernel_warp.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_rowwise_adagrad_with_counter_split_unweighted_kernel_warp.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.12/cmake-build/gen_embedding_backward_rowwise_adagrad_with_counter_split_unweighted_kernel_warp.cu -o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_rowwise_adagrad_with_counter_split_unweighted_kernel_warp.cu.o 2025-05-07T20:02:19.5725315Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:02:19.5728254Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:02:19.5729533Z ^ 2025-05-07T20:02:19.5729810Z 2025-05-07T20:02:19.5730284Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:02:19.5731102Z 2025-05-07T20:02:19.5732772Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:02:19.5735616Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:02:19.5736838Z ^ 2025-05-07T20:02:19.5737207Z 2025-05-07T20:02:19.5738854Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:02:19.5741648Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:02:19.5742856Z ^ 2025-05-07T20:02:19.5743114Z 2025-05-07T20:02:19.5743561Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:02:19.5744252Z 2025-05-07T20:02:19.5745988Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:02:19.5748763Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:02:19.5750009Z ^ 2025-05-07T20:02:19.5750397Z 2025-05-07T20:02:19.5752362Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:02:19.5755166Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:02:19.5756388Z ^ 2025-05-07T20:02:19.5756662Z 2025-05-07T20:02:19.5757160Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:02:19.5757805Z 2025-05-07T20:02:19.5759349Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:02:19.5761945Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:02:19.5763139Z ^ 2025-05-07T20:02:19.5763627Z 2025-05-07T20:02:19.5765556Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:02:19.5768129Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:02:19.5769449Z ^ 2025-05-07T20:02:19.5769760Z 2025-05-07T20:02:19.5770216Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:02:19.5770913Z 2025-05-07T20:02:19.5772713Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:02:19.5775751Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:02:19.5777023Z ^ 2025-05-07T20:02:19.5777416Z 2025-05-07T20:02:19.5779157Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:02:19.5781926Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:02:19.5783035Z ^ 2025-05-07T20:02:19.5783285Z 2025-05-07T20:02:19.5783718Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:02:19.5784385Z 2025-05-07T20:02:19.5786000Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:02:19.5788564Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:02:19.5789765Z ^ 2025-05-07T20:02:19.5790133Z 2025-05-07T20:02:22.6055305Z [372/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_gwd_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward_gwd.dir/gen_embedding_backward_rowwise_adagrad_split_unweighted_vbe_gwd_cuda.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward_gwd.dir/gen_embedding_backward_rowwise_adagrad_split_unweighted_vbe_gwd_cuda.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.12/cmake-build/gen_embedding_backward_rowwise_adagrad_split_unweighted_vbe_gwd_cuda.cu -o CMakeFiles/fbgemm_gpu_tbe_training_backward_gwd.dir/gen_embedding_backward_rowwise_adagrad_split_unweighted_vbe_gwd_cuda.cu.o 2025-05-07T20:02:22.6080323Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:02:22.6083182Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:02:22.6084329Z ^ 2025-05-07T20:02:22.6084607Z 2025-05-07T20:02:22.6085055Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:02:22.6085675Z 2025-05-07T20:02:22.6087255Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:02:22.6089883Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:02:22.6091034Z ^ 2025-05-07T20:02:22.6091373Z 2025-05-07T20:02:22.6092912Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:02:22.6095602Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:02:22.6096784Z ^ 2025-05-07T20:02:22.6097073Z 2025-05-07T20:02:22.6097518Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:02:22.6098180Z 2025-05-07T20:02:22.6099892Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:02:22.6102601Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:02:22.6104124Z ^ 2025-05-07T20:02:22.6104454Z 2025-05-07T20:02:22.6106033Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:02:22.6108715Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:02:22.6109961Z ^ 2025-05-07T20:02:22.6110227Z 2025-05-07T20:02:22.6110635Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:02:22.6111313Z 2025-05-07T20:02:22.6113182Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:02:22.6116178Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:02:22.6117568Z ^ 2025-05-07T20:02:22.6117970Z 2025-05-07T20:02:22.6119730Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:02:22.6122616Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:02:22.6123881Z ^ 2025-05-07T20:02:22.6124180Z 2025-05-07T20:02:22.6124818Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:02:22.6125474Z 2025-05-07T20:02:22.6127386Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:02:22.6130079Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:02:22.6131305Z ^ 2025-05-07T20:02:22.6131811Z 2025-05-07T20:02:22.6133397Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:02:22.6136211Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:02:22.6137457Z ^ 2025-05-07T20:02:22.6137728Z 2025-05-07T20:02:22.6138193Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:02:22.6138918Z 2025-05-07T20:02:22.6140647Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:02:22.6143458Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:02:22.6144632Z ^ 2025-05-07T20:02:22.6145013Z 2025-05-07T20:02:27.8741211Z [373/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_rowwise_adagrad_with_counter_split_unweighted_nobag_kernel_warp.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_rowwise_adagrad_with_counter_split_unweighted_nobag_kernel_warp.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.12/cmake-build/gen_embedding_backward_rowwise_adagrad_with_counter_split_unweighted_nobag_kernel_warp.cu -o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_rowwise_adagrad_with_counter_split_unweighted_nobag_kernel_warp.cu.o 2025-05-07T20:02:27.8766411Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:02:27.8769144Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:02:27.8770246Z ^ 2025-05-07T20:02:27.8770531Z 2025-05-07T20:02:27.8771004Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:02:27.8771659Z 2025-05-07T20:02:27.8773330Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:02:27.8776062Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:02:27.8777250Z ^ 2025-05-07T20:02:27.8777607Z 2025-05-07T20:02:27.8779195Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:02:27.8781836Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:02:27.8782989Z ^ 2025-05-07T20:02:27.8783242Z 2025-05-07T20:02:27.8783684Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:02:27.8784597Z 2025-05-07T20:02:27.8786312Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:02:27.8789036Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:02:27.8790249Z ^ 2025-05-07T20:02:27.8790633Z 2025-05-07T20:02:27.8792346Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:02:27.8795031Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:02:27.8796338Z ^ 2025-05-07T20:02:27.8796600Z 2025-05-07T20:02:27.8797088Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:02:27.8797732Z 2025-05-07T20:02:27.8799415Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:02:27.8801972Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:02:27.8803134Z ^ 2025-05-07T20:02:27.8803471Z 2025-05-07T20:02:27.8808421Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:02:27.8811252Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:02:27.8812387Z ^ 2025-05-07T20:02:27.8812619Z 2025-05-07T20:02:27.8813027Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:02:27.8813661Z 2025-05-07T20:02:27.8815241Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:02:27.8817762Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:02:27.8818844Z ^ 2025-05-07T20:02:27.8819186Z 2025-05-07T20:02:27.8820764Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:02:27.8823228Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:02:27.8824300Z ^ 2025-05-07T20:02:27.8824524Z 2025-05-07T20:02:27.8824958Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:02:27.8825520Z 2025-05-07T20:02:27.8827090Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:02:27.8829585Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:02:27.8830774Z ^ 2025-05-07T20:02:27.8831124Z 2025-05-07T20:02:45.2748414Z [374/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_gwd_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward_gwd.dir/gen_embedding_backward_rowwise_adagrad_split_unweighted_gwd_kernel_warp.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward_gwd.dir/gen_embedding_backward_rowwise_adagrad_split_unweighted_gwd_kernel_warp.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.12/cmake-build/gen_embedding_backward_rowwise_adagrad_split_unweighted_gwd_kernel_warp.cu -o CMakeFiles/fbgemm_gpu_tbe_training_backward_gwd.dir/gen_embedding_backward_rowwise_adagrad_split_unweighted_gwd_kernel_warp.cu.o 2025-05-07T20:02:45.2775900Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:02:45.2778604Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:02:45.2779977Z ^ 2025-05-07T20:02:45.2780262Z 2025-05-07T20:02:45.2780729Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:02:45.2781416Z 2025-05-07T20:02:45.2783122Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:02:45.2785923Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:02:45.2787182Z ^ 2025-05-07T20:02:45.2787559Z 2025-05-07T20:02:45.2789372Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:02:45.2792770Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:02:45.2793986Z ^ 2025-05-07T20:02:45.2794253Z 2025-05-07T20:02:45.2794721Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:02:45.2795400Z 2025-05-07T20:02:45.2796957Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:02:45.2799689Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:02:45.2800821Z ^ 2025-05-07T20:02:45.2801352Z 2025-05-07T20:02:45.2802867Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:02:45.2805664Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:02:45.2806879Z ^ 2025-05-07T20:02:45.2807144Z 2025-05-07T20:02:45.2807631Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:02:45.2808297Z 2025-05-07T20:02:45.2809972Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:02:45.2812867Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:02:45.2814129Z ^ 2025-05-07T20:02:45.2814522Z 2025-05-07T20:02:45.2816234Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:02:45.2818972Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:02:45.2820042Z ^ 2025-05-07T20:02:45.2820317Z 2025-05-07T20:02:45.2820887Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:02:45.2821594Z 2025-05-07T20:02:45.2823497Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:02:45.2826037Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:02:45.2827223Z ^ 2025-05-07T20:02:45.2827594Z 2025-05-07T20:02:45.2829326Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:02:45.2832187Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:02:45.2833409Z ^ 2025-05-07T20:02:45.2833674Z 2025-05-07T20:02:45.2834160Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:02:45.2834849Z 2025-05-07T20:02:45.2836572Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:02:45.2839748Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:02:45.2840976Z ^ 2025-05-07T20:02:45.2841366Z 2025-05-07T20:02:46.1391183Z [375/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_gwd_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward_gwd.dir/gen_embedding_backward_rowwise_adagrad_split_weighted_gwd_kernel_cta.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward_gwd.dir/gen_embedding_backward_rowwise_adagrad_split_weighted_gwd_kernel_cta.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.12/cmake-build/gen_embedding_backward_rowwise_adagrad_split_weighted_gwd_kernel_cta.cu -o CMakeFiles/fbgemm_gpu_tbe_training_backward_gwd.dir/gen_embedding_backward_rowwise_adagrad_split_weighted_gwd_kernel_cta.cu.o 2025-05-07T20:02:46.1411539Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:02:46.1414052Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:02:46.1415164Z ^ 2025-05-07T20:02:46.1415453Z 2025-05-07T20:02:46.1415902Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:02:46.1416523Z 2025-05-07T20:02:46.1418049Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:02:46.1420466Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:02:46.1421917Z ^ 2025-05-07T20:02:46.1422244Z 2025-05-07T20:02:46.1423746Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:02:46.1426236Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:02:46.1427339Z ^ 2025-05-07T20:02:46.1427608Z 2025-05-07T20:02:46.1428042Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:02:46.1428671Z 2025-05-07T20:02:46.1430158Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:02:46.1432901Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:02:46.1434008Z ^ 2025-05-07T20:02:46.1434381Z 2025-05-07T20:02:46.1435968Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:02:46.1438382Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:02:46.1439474Z ^ 2025-05-07T20:02:46.1439778Z 2025-05-07T20:02:46.1440284Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:02:46.1440941Z 2025-05-07T20:02:46.1442413Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:02:46.1444875Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:02:46.1445990Z ^ 2025-05-07T20:02:46.1446349Z 2025-05-07T20:02:46.1447805Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:02:46.1450398Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:02:46.1451556Z ^ 2025-05-07T20:02:46.1451781Z 2025-05-07T20:02:46.1452206Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:02:46.1452891Z 2025-05-07T20:02:46.1454362Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:02:46.1456802Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:02:46.1457920Z ^ 2025-05-07T20:02:46.1458297Z 2025-05-07T20:02:46.1459778Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:02:46.1462155Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:02:46.1463406Z ^ 2025-05-07T20:02:46.1463657Z 2025-05-07T20:02:46.1464108Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:02:46.1465020Z 2025-05-07T20:02:46.1466505Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:02:46.1468924Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:02:46.1470054Z ^ 2025-05-07T20:02:46.1470421Z 2025-05-07T20:02:48.2561936Z [376/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_rowwise_adagrad_with_counter_split_weighted_kernel_cta.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_rowwise_adagrad_with_counter_split_weighted_kernel_cta.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.12/cmake-build/gen_embedding_backward_rowwise_adagrad_with_counter_split_weighted_kernel_cta.cu -o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_rowwise_adagrad_with_counter_split_weighted_kernel_cta.cu.o 2025-05-07T20:02:48.2574820Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:02:48.2576255Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:02:48.2576932Z ^ 2025-05-07T20:02:48.2577088Z 2025-05-07T20:02:48.2577368Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:02:48.2577827Z 2025-05-07T20:02:48.2578710Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:02:48.2580165Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:02:48.2580820Z ^ 2025-05-07T20:02:48.2581056Z 2025-05-07T20:02:48.2581920Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:02:48.2583527Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:02:48.2584232Z ^ 2025-05-07T20:02:48.2584416Z 2025-05-07T20:02:48.2584668Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:02:48.2585034Z 2025-05-07T20:02:48.2586052Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:02:48.2587481Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:02:48.2588155Z ^ 2025-05-07T20:02:48.2588364Z 2025-05-07T20:02:48.2589319Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:02:48.2590731Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:02:48.2591520Z ^ 2025-05-07T20:02:48.2591681Z 2025-05-07T20:02:48.2591956Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:02:48.2592320Z 2025-05-07T20:02:48.2593189Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:02:48.2594628Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:02:48.2595272Z ^ 2025-05-07T20:02:48.2595508Z 2025-05-07T20:02:48.2596374Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:02:48.2597803Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:02:48.2598433Z ^ 2025-05-07T20:02:48.2598607Z 2025-05-07T20:02:48.2598854Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:02:48.2599219Z 2025-05-07T20:02:48.2600109Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:02:48.2601519Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:02:48.2602192Z ^ 2025-05-07T20:02:48.2602396Z 2025-05-07T20:02:48.2603343Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:02:48.2604742Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:02:48.2605394Z ^ 2025-05-07T20:02:48.2605538Z 2025-05-07T20:02:48.2605786Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:02:48.2606177Z 2025-05-07T20:02:48.2607050Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:02:48.2608563Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:02:48.2609216Z ^ 2025-05-07T20:02:48.2609454Z 2025-05-07T20:02:55.5936277Z [377/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_vbe_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_rowwise_adagrad_split_weighted_vbe_cuda.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_rowwise_adagrad_split_weighted_vbe_cuda.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.12/cmake-build/gen_embedding_backward_rowwise_adagrad_split_weighted_vbe_cuda.cu -o CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_rowwise_adagrad_split_weighted_vbe_cuda.cu.o 2025-05-07T20:02:55.5953448Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:02:55.5955649Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:02:55.5956467Z ^ 2025-05-07T20:02:55.5956664Z 2025-05-07T20:02:55.5957023Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:02:55.5957491Z 2025-05-07T20:02:55.5958612Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:02:55.5960445Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:02:55.5961301Z ^ 2025-05-07T20:02:55.5961750Z 2025-05-07T20:02:55.5962872Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:02:55.5964953Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:02:55.5965966Z ^ 2025-05-07T20:02:55.5966161Z 2025-05-07T20:02:55.5966644Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:02:55.5967129Z 2025-05-07T20:02:55.5968333Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:02:55.5970406Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:02:55.5971326Z ^ 2025-05-07T20:02:55.5971610Z 2025-05-07T20:02:55.5972785Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:02:55.5974723Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:02:55.5975616Z ^ 2025-05-07T20:02:55.5975819Z 2025-05-07T20:02:55.5976147Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:02:55.5976679Z 2025-05-07T20:02:55.5977873Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:02:55.5979830Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:02:55.5980706Z ^ 2025-05-07T20:02:55.5981004Z 2025-05-07T20:02:55.5982158Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:02:55.5984049Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:02:55.5984886Z ^ 2025-05-07T20:02:55.5985080Z 2025-05-07T20:02:55.5985428Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:02:55.5985907Z 2025-05-07T20:02:55.5987078Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:02:55.5989168Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:02:55.5990058Z ^ 2025-05-07T20:02:55.5990446Z 2025-05-07T20:02:55.5991731Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:02:55.5993571Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:02:55.5994567Z ^ 2025-05-07T20:02:55.5994769Z 2025-05-07T20:02:55.5995080Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:02:55.5995728Z 2025-05-07T20:02:55.5996841Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:02:55.5998778Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:02:55.5999624Z ^ 2025-05-07T20:02:55.5999877Z 2025-05-07T20:02:56.5531606Z [378/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_rowwise_adagrad_with_counter_split_unweighted_nobag_kernel_cta.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_rowwise_adagrad_with_counter_split_unweighted_nobag_kernel_cta.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.12/cmake-build/gen_embedding_backward_rowwise_adagrad_with_counter_split_unweighted_nobag_kernel_cta.cu -o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_rowwise_adagrad_with_counter_split_unweighted_nobag_kernel_cta.cu.o 2025-05-07T20:02:56.5557384Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:02:56.5560281Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:02:56.5561465Z ^ 2025-05-07T20:02:56.5561729Z 2025-05-07T20:02:56.5562186Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:02:56.5562897Z 2025-05-07T20:02:56.5564633Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:02:56.5568082Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:02:56.5569269Z ^ 2025-05-07T20:02:56.5569679Z 2025-05-07T20:02:56.5571445Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:02:56.5574020Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:02:56.5575226Z ^ 2025-05-07T20:02:56.5575493Z 2025-05-07T20:02:56.5576105Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:02:56.5576811Z 2025-05-07T20:02:56.5578746Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:02:56.5581646Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:02:56.5582954Z ^ 2025-05-07T20:02:56.5583339Z 2025-05-07T20:02:56.5585050Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:02:56.5587870Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:02:56.5589129Z ^ 2025-05-07T20:02:56.5589402Z 2025-05-07T20:02:56.5589877Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:02:56.5590615Z 2025-05-07T20:02:56.5592569Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:02:56.5595479Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:02:56.5596635Z ^ 2025-05-07T20:02:56.5597037Z 2025-05-07T20:02:56.5598782Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:02:56.5601646Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:02:56.5602904Z ^ 2025-05-07T20:02:56.5603157Z 2025-05-07T20:02:56.5603600Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:02:56.5604290Z 2025-05-07T20:02:56.5606016Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:02:56.5608821Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:02:56.5610116Z ^ 2025-05-07T20:02:56.5610507Z 2025-05-07T20:02:56.5612165Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:02:56.5615074Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:02:56.5616350Z ^ 2025-05-07T20:02:56.5616622Z 2025-05-07T20:02:56.5617226Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:02:56.5617946Z 2025-05-07T20:02:56.5619763Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:02:56.5622736Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:02:56.5624066Z ^ 2025-05-07T20:02:56.5624459Z 2025-05-07T20:02:57.9274140Z [379/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_gwd_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward_gwd.dir/gen_embedding_backward_rowwise_adagrad_split_unweighted_vbe_gwd_kernel_warp.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward_gwd.dir/gen_embedding_backward_rowwise_adagrad_split_unweighted_vbe_gwd_kernel_warp.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.12/cmake-build/gen_embedding_backward_rowwise_adagrad_split_unweighted_vbe_gwd_kernel_warp.cu -o CMakeFiles/fbgemm_gpu_tbe_training_backward_gwd.dir/gen_embedding_backward_rowwise_adagrad_split_unweighted_vbe_gwd_kernel_warp.cu.o 2025-05-07T20:02:57.9296876Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:02:57.9299629Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:02:57.9301062Z ^ 2025-05-07T20:02:57.9301336Z 2025-05-07T20:02:57.9301895Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:02:57.9302583Z 2025-05-07T20:02:57.9304532Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:02:57.9307236Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:02:57.9308393Z ^ 2025-05-07T20:02:57.9308730Z 2025-05-07T20:02:57.9310423Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:02:57.9313220Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:02:57.9314376Z ^ 2025-05-07T20:02:57.9314619Z 2025-05-07T20:02:57.9315081Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:02:57.9315755Z 2025-05-07T20:02:57.9317446Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:02:57.9320146Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:02:57.9321318Z ^ 2025-05-07T20:02:57.9321616Z 2025-05-07T20:02:57.9323040Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:02:57.9325774Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:02:57.9326953Z ^ 2025-05-07T20:02:57.9327224Z 2025-05-07T20:02:57.9327636Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:02:57.9328235Z 2025-05-07T20:02:57.9329869Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:02:57.9332526Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:02:57.9333684Z ^ 2025-05-07T20:02:57.9334059Z 2025-05-07T20:02:57.9335887Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:02:57.9338395Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:02:57.9339597Z ^ 2025-05-07T20:02:57.9339838Z 2025-05-07T20:02:57.9340265Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:02:57.9340875Z 2025-05-07T20:02:57.9342615Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:02:57.9345540Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:02:57.9346694Z ^ 2025-05-07T20:02:57.9347058Z 2025-05-07T20:02:57.9348795Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:02:57.9351653Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:02:57.9352761Z ^ 2025-05-07T20:02:57.9353022Z 2025-05-07T20:02:57.9353466Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:02:57.9354135Z 2025-05-07T20:02:57.9355964Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:02:57.9358659Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:02:57.9359871Z ^ 2025-05-07T20:02:57.9360250Z 2025-05-07T20:02:58.3757801Z [380/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_rowwise_adagrad_with_counter_split_unweighted_kernel_cta.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_rowwise_adagrad_with_counter_split_unweighted_kernel_cta.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.12/cmake-build/gen_embedding_backward_rowwise_adagrad_with_counter_split_unweighted_kernel_cta.cu -o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_rowwise_adagrad_with_counter_split_unweighted_kernel_cta.cu.o 2025-05-07T20:02:58.3781900Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:02:58.3784953Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:02:58.3786120Z ^ 2025-05-07T20:02:58.3786385Z 2025-05-07T20:02:58.3786811Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:02:58.3787465Z 2025-05-07T20:02:58.3789318Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:02:58.3792389Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:02:58.3793653Z ^ 2025-05-07T20:02:58.3794038Z 2025-05-07T20:02:58.3795850Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:02:58.3798596Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:02:58.3799826Z ^ 2025-05-07T20:02:58.3800104Z 2025-05-07T20:02:58.3800586Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:02:58.3801252Z 2025-05-07T20:02:58.3802969Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:02:58.3805780Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:02:58.3806994Z ^ 2025-05-07T20:02:58.3807344Z 2025-05-07T20:02:58.3809041Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:02:58.3811857Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:02:58.3812951Z ^ 2025-05-07T20:02:58.3813184Z 2025-05-07T20:02:58.3813610Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:02:58.3814198Z 2025-05-07T20:02:58.3815694Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:02:58.3818402Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:02:58.3819536Z ^ 2025-05-07T20:02:58.3819887Z 2025-05-07T20:02:58.3821523Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:02:58.3824203Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:02:58.3825587Z ^ 2025-05-07T20:02:58.3826038Z 2025-05-07T20:02:58.3826501Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:02:58.3827190Z 2025-05-07T20:02:58.3828889Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:02:58.3832030Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:02:58.3833201Z ^ 2025-05-07T20:02:58.3833579Z 2025-05-07T20:02:58.3835158Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:02:58.3837901Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:02:58.3839062Z ^ 2025-05-07T20:02:58.3839336Z 2025-05-07T20:02:58.3839776Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:02:58.3840407Z 2025-05-07T20:02:58.3842025Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:02:58.3844885Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:02:58.3846119Z ^ 2025-05-07T20:02:58.3846499Z 2025-05-07T20:03:00.5307618Z [381/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_gwd_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward_gwd.dir/gen_embedding_backward_rowwise_adagrad_split_weighted_vbe_gwd_kernel_warp.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward_gwd.dir/gen_embedding_backward_rowwise_adagrad_split_weighted_vbe_gwd_kernel_warp.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.12/cmake-build/gen_embedding_backward_rowwise_adagrad_split_weighted_vbe_gwd_kernel_warp.cu -o CMakeFiles/fbgemm_gpu_tbe_training_backward_gwd.dir/gen_embedding_backward_rowwise_adagrad_split_weighted_vbe_gwd_kernel_warp.cu.o 2025-05-07T20:03:00.5332835Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:03:00.5335396Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:03:00.5336502Z ^ 2025-05-07T20:03:00.5337039Z 2025-05-07T20:03:00.5337464Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:03:00.5338081Z 2025-05-07T20:03:00.5339520Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:03:00.5341964Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:03:00.5343253Z ^ 2025-05-07T20:03:00.5343624Z 2025-05-07T20:03:00.5345070Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:03:00.5347560Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:03:00.5348640Z ^ 2025-05-07T20:03:00.5348887Z 2025-05-07T20:03:00.5349314Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:03:00.5349878Z 2025-05-07T20:03:00.5351719Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:03:00.5354322Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:03:00.5355420Z ^ 2025-05-07T20:03:00.5355736Z 2025-05-07T20:03:00.5357002Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:03:00.5358925Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:03:00.5360031Z ^ 2025-05-07T20:03:00.5360268Z 2025-05-07T20:03:00.5360679Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:03:00.5361258Z 2025-05-07T20:03:00.5362837Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:03:00.5365766Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:03:00.5366886Z ^ 2025-05-07T20:03:00.5367228Z 2025-05-07T20:03:00.5368756Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:03:00.5371373Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:03:00.5372452Z ^ 2025-05-07T20:03:00.5372690Z 2025-05-07T20:03:00.5373303Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:03:00.5373907Z 2025-05-07T20:03:00.5375369Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:03:00.5377961Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:03:00.5379089Z ^ 2025-05-07T20:03:00.5379428Z 2025-05-07T20:03:00.5380936Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:03:00.5383417Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:03:00.5384448Z ^ 2025-05-07T20:03:00.5384707Z 2025-05-07T20:03:00.5385105Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:03:00.5385715Z 2025-05-07T20:03:00.5387220Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:03:00.5389699Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:03:00.5390804Z ^ 2025-05-07T20:03:00.5391152Z 2025-05-07T20:03:01.2070069Z [382/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_vbe_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_rowwise_adagrad_ssd_weighted_vbe_cuda.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_rowwise_adagrad_ssd_weighted_vbe_cuda.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.12/cmake-build/gen_embedding_backward_rowwise_adagrad_ssd_weighted_vbe_cuda.cu -o CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_rowwise_adagrad_ssd_weighted_vbe_cuda.cu.o 2025-05-07T20:03:01.2088695Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:03:01.2091494Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:03:01.2092762Z ^ 2025-05-07T20:03:01.2092992Z 2025-05-07T20:03:01.2093458Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:03:01.2094178Z 2025-05-07T20:03:01.2095868Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:03:01.2098019Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:03:01.2098929Z ^ 2025-05-07T20:03:01.2099248Z 2025-05-07T20:03:01.2100425Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:03:01.2102541Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:03:01.2103477Z ^ 2025-05-07T20:03:01.2103718Z 2025-05-07T20:03:01.2104075Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:03:01.2104741Z 2025-05-07T20:03:01.2106119Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:03:01.2108149Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:03:01.2109093Z ^ 2025-05-07T20:03:01.2109395Z 2025-05-07T20:03:01.2110664Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:03:01.2113099Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:03:01.2114005Z ^ 2025-05-07T20:03:01.2114204Z 2025-05-07T20:03:01.2114550Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:03:01.2115059Z 2025-05-07T20:03:01.2116302Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:03:01.2118560Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:03:01.2119553Z ^ 2025-05-07T20:03:01.2119894Z 2025-05-07T20:03:01.2121170Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:03:01.2123221Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:03:01.2124104Z ^ 2025-05-07T20:03:01.2124351Z 2025-05-07T20:03:01.2124712Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:03:01.2125201Z 2025-05-07T20:03:01.2126495Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:03:01.2128445Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:03:01.2129373Z ^ 2025-05-07T20:03:01.2129687Z 2025-05-07T20:03:01.2130861Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:03:01.2132940Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:03:01.2134127Z ^ 2025-05-07T20:03:01.2134323Z 2025-05-07T20:03:01.2134663Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:03:01.2135261Z 2025-05-07T20:03:01.2136609Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:03:01.2138748Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:03:01.2139796Z ^ 2025-05-07T20:03:01.2140158Z 2025-05-07T20:03:02.4666881Z [383/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_vbe_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_rowwise_adagrad_split_unweighted_vbe_cuda.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_rowwise_adagrad_split_unweighted_vbe_cuda.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.12/cmake-build/gen_embedding_backward_rowwise_adagrad_split_unweighted_vbe_cuda.cu -o CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_rowwise_adagrad_split_unweighted_vbe_cuda.cu.o 2025-05-07T20:03:02.4691110Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:03:02.4693733Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:03:02.4694928Z ^ 2025-05-07T20:03:02.4695154Z 2025-05-07T20:03:02.4695545Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:03:02.4696134Z 2025-05-07T20:03:02.4697526Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:03:02.4700168Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:03:02.4701372Z ^ 2025-05-07T20:03:02.4701738Z 2025-05-07T20:03:02.4703415Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:03:02.4705933Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:03:02.4707546Z ^ 2025-05-07T20:03:02.4707890Z 2025-05-07T20:03:02.4708537Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:03:02.4709418Z 2025-05-07T20:03:02.4711813Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:03:02.4715430Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:03:02.4716838Z ^ 2025-05-07T20:03:02.4717265Z 2025-05-07T20:03:02.4719421Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:03:02.4723039Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:03:02.4724557Z ^ 2025-05-07T20:03:02.4724925Z 2025-05-07T20:03:02.4725516Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:03:02.4726562Z 2025-05-07T20:03:02.4728723Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:03:02.4732197Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:03:02.4733704Z ^ 2025-05-07T20:03:02.4734142Z 2025-05-07T20:03:02.4736226Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:03:02.4739536Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:03:02.4741056Z ^ 2025-05-07T20:03:02.4741374Z 2025-05-07T20:03:02.4741932Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:03:02.4742779Z 2025-05-07T20:03:02.4744801Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:03:02.4748176Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:03:02.4749671Z ^ 2025-05-07T20:03:02.4750159Z 2025-05-07T20:03:02.4752304Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:03:02.4755649Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:03:02.4757089Z ^ 2025-05-07T20:03:02.4757441Z 2025-05-07T20:03:02.4757980Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:03:02.4758806Z 2025-05-07T20:03:02.4760930Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:03:02.4764520Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:03:02.4766382Z ^ 2025-05-07T20:03:02.4766836Z 2025-05-07T20:03:07.4576830Z [384/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_vbe_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_rowwise_adagrad_ssd_unweighted_vbe_cuda.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_rowwise_adagrad_ssd_unweighted_vbe_cuda.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.12/cmake-build/gen_embedding_backward_rowwise_adagrad_ssd_unweighted_vbe_cuda.cu -o CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_rowwise_adagrad_ssd_unweighted_vbe_cuda.cu.o 2025-05-07T20:03:07.4598475Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:03:07.4601050Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:03:07.4602213Z ^ 2025-05-07T20:03:07.4602564Z 2025-05-07T20:03:07.4603005Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:03:07.4603607Z 2025-05-07T20:03:07.4605208Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:03:07.4607813Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:03:07.4608930Z ^ 2025-05-07T20:03:07.4609322Z 2025-05-07T20:03:07.4610903Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:03:07.4626416Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:03:07.4627474Z ^ 2025-05-07T20:03:07.4627942Z 2025-05-07T20:03:07.4628335Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:03:07.4628919Z 2025-05-07T20:03:07.4630308Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:03:07.4632908Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:03:07.4633948Z ^ 2025-05-07T20:03:07.4634292Z 2025-05-07T20:03:07.4635863Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:03:07.4638409Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:03:07.4639483Z ^ 2025-05-07T20:03:07.4639709Z 2025-05-07T20:03:07.4640127Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:03:07.4640715Z 2025-05-07T20:03:07.4642659Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:03:07.4645221Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:03:07.4646335Z ^ 2025-05-07T20:03:07.4646836Z 2025-05-07T20:03:07.4648284Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:03:07.4650668Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:03:07.4651777Z ^ 2025-05-07T20:03:07.4652039Z 2025-05-07T20:03:07.4652457Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:03:07.4653093Z 2025-05-07T20:03:07.4654754Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:03:07.4657244Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:03:07.4658365Z ^ 2025-05-07T20:03:07.4658714Z 2025-05-07T20:03:07.4660273Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:03:07.4662743Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:03:07.4663907Z ^ 2025-05-07T20:03:07.4664159Z 2025-05-07T20:03:07.4664632Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:03:07.4665504Z 2025-05-07T20:03:07.4667104Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:03:07.4669492Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:03:07.4670819Z ^ 2025-05-07T20:03:07.4671197Z 2025-05-07T20:03:07.4781099Z [385/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_dense_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward_dense.dir/gen_embedding_backward_dense_split_weighted_meta.cpp.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward_dense.dir/gen_embedding_backward_dense_split_weighted_meta.cpp.o.d -o CMakeFiles/fbgemm_gpu_tbe_training_backward_dense.dir/gen_embedding_backward_dense_split_weighted_meta.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.12/cmake-build/gen_embedding_backward_dense_split_weighted_meta.cpp 2025-05-07T20:03:07.4799641Z clang-16: warning: argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T20:03:09.9221897Z [386/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_dense_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward_dense.dir/gen_embedding_backward_dense_split_unweighted_nobag_meta.cpp.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward_dense.dir/gen_embedding_backward_dense_split_unweighted_nobag_meta.cpp.o.d -o CMakeFiles/fbgemm_gpu_tbe_training_backward_dense.dir/gen_embedding_backward_dense_split_unweighted_nobag_meta.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.12/cmake-build/gen_embedding_backward_dense_split_unweighted_nobag_meta.cpp 2025-05-07T20:03:09.9244204Z clang-16: warning: argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T20:03:10.2731222Z [387/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_dense_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward_dense.dir/gen_embedding_backward_dense_split_unweighted_meta.cpp.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward_dense.dir/gen_embedding_backward_dense_split_unweighted_meta.cpp.o.d -o CMakeFiles/fbgemm_gpu_tbe_training_backward_dense.dir/gen_embedding_backward_dense_split_unweighted_meta.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.12/cmake-build/gen_embedding_backward_dense_split_unweighted_meta.cpp 2025-05-07T20:03:10.2753023Z clang-16: warning: argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T20:03:11.2380645Z [388/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_gwd_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward_gwd.dir/gen_embedding_backward_rowwise_adagrad_split_unweighted_gwd_kernel_cta.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward_gwd.dir/gen_embedding_backward_rowwise_adagrad_split_unweighted_gwd_kernel_cta.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.12/cmake-build/gen_embedding_backward_rowwise_adagrad_split_unweighted_gwd_kernel_cta.cu -o CMakeFiles/fbgemm_gpu_tbe_training_backward_gwd.dir/gen_embedding_backward_rowwise_adagrad_split_unweighted_gwd_kernel_cta.cu.o 2025-05-07T20:03:11.2393435Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:03:11.2394964Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:03:11.2395638Z ^ 2025-05-07T20:03:11.2395803Z 2025-05-07T20:03:11.2396084Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:03:11.2396453Z 2025-05-07T20:03:11.2397334Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:03:11.2398777Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:03:11.2399435Z ^ 2025-05-07T20:03:11.2399671Z 2025-05-07T20:03:11.2400543Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:03:11.2401971Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:03:11.2402610Z ^ 2025-05-07T20:03:11.2402786Z 2025-05-07T20:03:11.2403039Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:03:11.2403400Z 2025-05-07T20:03:11.2404293Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:03:11.2405718Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:03:11.2406468Z ^ 2025-05-07T20:03:11.2406675Z 2025-05-07T20:03:11.2407572Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:03:11.2408976Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:03:11.2409649Z ^ 2025-05-07T20:03:11.2409802Z 2025-05-07T20:03:11.2410055Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:03:11.2410438Z 2025-05-07T20:03:11.2411316Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:03:11.2412800Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:03:11.2413447Z ^ 2025-05-07T20:03:11.2413674Z 2025-05-07T20:03:11.2414586Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:03:11.2416013Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:03:11.2416652Z ^ 2025-05-07T20:03:11.2416821Z 2025-05-07T20:03:11.2417110Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:03:11.2417482Z 2025-05-07T20:03:11.2418351Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:03:11.2419799Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:03:11.2420475Z ^ 2025-05-07T20:03:11.2420685Z 2025-05-07T20:03:11.2421551Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:03:11.2422973Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:03:11.2423635Z ^ 2025-05-07T20:03:11.2423788Z 2025-05-07T20:03:11.2424040Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:03:11.2424436Z 2025-05-07T20:03:11.2425313Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:03:11.2426756Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:03:11.2427405Z ^ 2025-05-07T20:03:11.2427612Z 2025-05-07T20:03:18.0054425Z [389/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_vbe_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_rowwise_adagrad_with_counter_split_weighted_vbe_cuda.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_rowwise_adagrad_with_counter_split_weighted_vbe_cuda.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.12/cmake-build/gen_embedding_backward_rowwise_adagrad_with_counter_split_weighted_vbe_cuda.cu -o CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_rowwise_adagrad_with_counter_split_weighted_vbe_cuda.cu.o 2025-05-07T20:03:18.0078279Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:03:18.0080944Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:03:18.0082061Z ^ 2025-05-07T20:03:18.0082367Z 2025-05-07T20:03:18.0082835Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:03:18.0083519Z 2025-05-07T20:03:18.0085221Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:03:18.0087628Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:03:18.0088934Z ^ 2025-05-07T20:03:18.0089344Z 2025-05-07T20:03:18.0091015Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:03:18.0093633Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:03:18.0094784Z ^ 2025-05-07T20:03:18.0095038Z 2025-05-07T20:03:18.0095448Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:03:18.0096041Z 2025-05-07T20:03:18.0098148Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:03:18.0100735Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:03:18.0101941Z ^ 2025-05-07T20:03:18.0102362Z 2025-05-07T20:03:18.0103929Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:03:18.0106468Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:03:18.0107869Z ^ 2025-05-07T20:03:18.0108173Z 2025-05-07T20:03:18.0108598Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:03:18.0109175Z 2025-05-07T20:03:18.0111036Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:03:18.0113661Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:03:18.0114824Z ^ 2025-05-07T20:03:18.0115151Z 2025-05-07T20:03:18.0116896Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:03:18.0119619Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:03:18.0120784Z ^ 2025-05-07T20:03:18.0121042Z 2025-05-07T20:03:18.0121475Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:03:18.0122108Z 2025-05-07T20:03:18.0123878Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:03:18.0126491Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:03:18.0127861Z ^ 2025-05-07T20:03:18.0128269Z 2025-05-07T20:03:18.0129851Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:03:18.0132385Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:03:18.0133517Z ^ 2025-05-07T20:03:18.0133769Z 2025-05-07T20:03:18.0134289Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:03:18.0134901Z 2025-05-07T20:03:18.0136556Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:03:18.0138949Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:03:18.0140148Z ^ 2025-05-07T20:03:18.0140544Z 2025-05-07T20:03:23.1501912Z [390/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_vbe_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_sgd_split_weighted_vbe_cuda.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_sgd_split_weighted_vbe_cuda.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.12/cmake-build/gen_embedding_backward_sgd_split_weighted_vbe_cuda.cu -o CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_sgd_split_weighted_vbe_cuda.cu.o 2025-05-07T20:03:23.1527153Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:03:23.1529792Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:03:23.1530983Z ^ 2025-05-07T20:03:23.1531240Z 2025-05-07T20:03:23.1531682Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:03:23.1532328Z 2025-05-07T20:03:23.1534017Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:03:23.1536660Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:03:23.1537818Z ^ 2025-05-07T20:03:23.1538121Z 2025-05-07T20:03:23.1539496Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:03:23.1542060Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:03:23.1543394Z ^ 2025-05-07T20:03:23.1543621Z 2025-05-07T20:03:23.1544028Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:03:23.1544607Z 2025-05-07T20:03:23.1546235Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:03:23.1548640Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:03:23.1549728Z ^ 2025-05-07T20:03:23.1550082Z 2025-05-07T20:03:23.1551751Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:03:23.1554123Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:03:23.1555224Z ^ 2025-05-07T20:03:23.1555487Z 2025-05-07T20:03:23.1555983Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:03:23.1556633Z 2025-05-07T20:03:23.1558229Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:03:23.1560711Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:03:23.1561814Z ^ 2025-05-07T20:03:23.1562173Z 2025-05-07T20:03:23.1563802Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:03:23.1566611Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:03:23.1567790Z ^ 2025-05-07T20:03:23.1568054Z 2025-05-07T20:03:23.1568511Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:03:23.1569211Z 2025-05-07T20:03:23.1570714Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:03:23.1573132Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:03:23.1574096Z ^ 2025-05-07T20:03:23.1574410Z 2025-05-07T20:03:23.1575889Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:03:23.1578244Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:03:23.1579401Z ^ 2025-05-07T20:03:23.1579672Z 2025-05-07T20:03:23.1580053Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:03:23.1580645Z 2025-05-07T20:03:23.1582285Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:03:23.1584789Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:03:23.1585901Z ^ 2025-05-07T20:03:23.1586227Z 2025-05-07T20:03:24.1366142Z [391/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_dense_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward_dense.dir/gen_embedding_backward_split_dense.cpp.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward_dense.dir/gen_embedding_backward_split_dense.cpp.o.d -o CMakeFiles/fbgemm_gpu_tbe_training_backward_dense.dir/gen_embedding_backward_split_dense.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.12/cmake-build/gen_embedding_backward_split_dense.cpp 2025-05-07T20:03:24.1393083Z clang-16: warning: argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T20:03:26.2440388Z [392/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_vbe_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_rowwise_adagrad_split_weighted_vbe_kernel_warp.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_rowwise_adagrad_split_weighted_vbe_kernel_warp.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.12/cmake-build/gen_embedding_backward_rowwise_adagrad_split_weighted_vbe_kernel_warp.cu -o CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_rowwise_adagrad_split_weighted_vbe_kernel_warp.cu.o 2025-05-07T20:03:26.2458718Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:03:26.2460713Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:03:26.2461723Z ^ 2025-05-07T20:03:26.2461933Z 2025-05-07T20:03:26.2462321Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:03:26.2463090Z 2025-05-07T20:03:26.2464407Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:03:26.2467149Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:03:26.2468120Z ^ 2025-05-07T20:03:26.2468447Z 2025-05-07T20:03:26.2469670Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:03:26.2471891Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:03:26.2472794Z ^ 2025-05-07T20:03:26.2473071Z 2025-05-07T20:03:26.2473423Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:03:26.2473969Z 2025-05-07T20:03:26.2475306Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:03:26.2477399Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:03:26.2478353Z ^ 2025-05-07T20:03:26.2478649Z 2025-05-07T20:03:26.2479903Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:03:26.2481950Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:03:26.2483033Z ^ 2025-05-07T20:03:26.2483279Z 2025-05-07T20:03:26.2483809Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:03:26.2484299Z 2025-05-07T20:03:26.2485585Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:03:26.2487571Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:03:26.2488526Z ^ 2025-05-07T20:03:26.2488816Z 2025-05-07T20:03:26.2490076Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:03:26.2492166Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:03:26.2493072Z ^ 2025-05-07T20:03:26.2493271Z 2025-05-07T20:03:26.2493611Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:03:26.2494147Z 2025-05-07T20:03:26.2495451Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:03:26.2497432Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:03:26.2498321Z ^ 2025-05-07T20:03:26.2498636Z 2025-05-07T20:03:26.2499926Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:03:26.2501867Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:03:26.2502728Z ^ 2025-05-07T20:03:26.2502956Z 2025-05-07T20:03:26.2503303Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:03:26.2503793Z 2025-05-07T20:03:26.2505025Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:03:26.2507130Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:03:26.2508264Z ^ 2025-05-07T20:03:26.2508554Z 2025-05-07T20:03:27.3705281Z [393/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_vbe_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_rowwise_adagrad_with_counter_split_unweighted_vbe_cuda.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_rowwise_adagrad_with_counter_split_unweighted_vbe_cuda.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.12/cmake-build/gen_embedding_backward_rowwise_adagrad_with_counter_split_unweighted_vbe_cuda.cu -o CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_rowwise_adagrad_with_counter_split_unweighted_vbe_cuda.cu.o 2025-05-07T20:03:27.3729306Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:03:27.3731983Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:03:27.3733207Z ^ 2025-05-07T20:03:27.3733447Z 2025-05-07T20:03:27.3733892Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:03:27.3734594Z 2025-05-07T20:03:27.3736201Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:03:27.3738761Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:03:27.3739884Z ^ 2025-05-07T20:03:27.3740248Z 2025-05-07T20:03:27.3741852Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:03:27.3744466Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:03:27.3745663Z ^ 2025-05-07T20:03:27.3745897Z 2025-05-07T20:03:27.3746385Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:03:27.3747067Z 2025-05-07T20:03:27.3748715Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:03:27.3751428Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:03:27.3753009Z ^ 2025-05-07T20:03:27.3753354Z 2025-05-07T20:03:27.3754984Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:03:27.3757798Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:03:27.3758963Z ^ 2025-05-07T20:03:27.3759260Z 2025-05-07T20:03:27.3759730Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:03:27.3760378Z 2025-05-07T20:03:27.3762016Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:03:27.3764993Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:03:27.3766530Z ^ 2025-05-07T20:03:27.3766910Z 2025-05-07T20:03:27.3768587Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:03:27.3771406Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:03:27.3772602Z ^ 2025-05-07T20:03:27.3772848Z 2025-05-07T20:03:27.3773352Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:03:27.3774062Z 2025-05-07T20:03:27.3775868Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:03:27.3778760Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:03:27.3779917Z ^ 2025-05-07T20:03:27.3780283Z 2025-05-07T20:03:27.3781843Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:03:27.3784447Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:03:27.3785645Z ^ 2025-05-07T20:03:27.3785898Z 2025-05-07T20:03:27.3786347Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:03:27.3786984Z 2025-05-07T20:03:27.3788693Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:03:27.3791270Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:03:27.3792578Z ^ 2025-05-07T20:03:27.3792922Z 2025-05-07T20:03:28.2211353Z [394/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_vbe_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_rowwise_adagrad_ssd_weighted_vbe_kernel_warp.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_rowwise_adagrad_ssd_weighted_vbe_kernel_warp.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.12/cmake-build/gen_embedding_backward_rowwise_adagrad_ssd_weighted_vbe_kernel_warp.cu -o CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_rowwise_adagrad_ssd_weighted_vbe_kernel_warp.cu.o 2025-05-07T20:03:28.2234173Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:03:28.2236845Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:03:28.2238116Z ^ 2025-05-07T20:03:28.2238402Z 2025-05-07T20:03:28.2238845Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:03:28.2239505Z 2025-05-07T20:03:28.2241051Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:03:28.2243763Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:03:28.2244999Z ^ 2025-05-07T20:03:28.2245349Z 2025-05-07T20:03:28.2246957Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:03:28.2249454Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:03:28.2250575Z ^ 2025-05-07T20:03:28.2250836Z 2025-05-07T20:03:28.2251270Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:03:28.2251936Z 2025-05-07T20:03:28.2253523Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:03:28.2256417Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:03:28.2257705Z ^ 2025-05-07T20:03:28.2258095Z 2025-05-07T20:03:28.2259616Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:03:28.2262233Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:03:28.2263307Z ^ 2025-05-07T20:03:28.2263534Z 2025-05-07T20:03:28.2263986Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:03:28.2265072Z 2025-05-07T20:03:28.2266614Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:03:28.2269183Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:03:28.2270547Z ^ 2025-05-07T20:03:28.2270892Z 2025-05-07T20:03:28.2272514Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:03:28.2275317Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:03:28.2276322Z ^ 2025-05-07T20:03:28.2276559Z 2025-05-07T20:03:28.2276964Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:03:28.2277581Z 2025-05-07T20:03:28.2279053Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:03:28.2281569Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:03:28.2282656Z ^ 2025-05-07T20:03:28.2282999Z 2025-05-07T20:03:28.2284527Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:03:28.2286988Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:03:28.2288163Z ^ 2025-05-07T20:03:28.2288421Z 2025-05-07T20:03:28.2288871Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:03:28.2289448Z 2025-05-07T20:03:28.2291125Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:03:28.2293634Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:03:28.2294791Z ^ 2025-05-07T20:03:28.2295164Z 2025-05-07T20:03:29.2907399Z [395/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_vbe_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_rowwise_adagrad_split_unweighted_vbe_kernel_warp.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_rowwise_adagrad_split_unweighted_vbe_kernel_warp.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.12/cmake-build/gen_embedding_backward_rowwise_adagrad_split_unweighted_vbe_kernel_warp.cu -o CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_rowwise_adagrad_split_unweighted_vbe_kernel_warp.cu.o 2025-05-07T20:03:29.2931947Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:03:29.2934786Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:03:29.2936011Z ^ 2025-05-07T20:03:29.2936315Z 2025-05-07T20:03:29.2936816Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:03:29.2937491Z 2025-05-07T20:03:29.2939173Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:03:29.2942002Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:03:29.2943218Z ^ 2025-05-07T20:03:29.2943587Z 2025-05-07T20:03:29.2945271Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:03:29.2948054Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:03:29.2949274Z ^ 2025-05-07T20:03:29.2949543Z 2025-05-07T20:03:29.2950257Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:03:29.2950914Z 2025-05-07T20:03:29.2952744Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:03:29.2955552Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:03:29.2956795Z ^ 2025-05-07T20:03:29.2957358Z 2025-05-07T20:03:29.2958960Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:03:29.2961762Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:03:29.2962970Z ^ 2025-05-07T20:03:29.2963232Z 2025-05-07T20:03:29.2963681Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:03:29.2964368Z 2025-05-07T20:03:29.2966581Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:03:29.2969289Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:03:29.2970501Z ^ 2025-05-07T20:03:29.2970877Z 2025-05-07T20:03:29.2972705Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:03:29.2975357Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:03:29.2976544Z ^ 2025-05-07T20:03:29.2976804Z 2025-05-07T20:03:29.2977291Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:03:29.2977938Z 2025-05-07T20:03:29.2979585Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:03:29.2982306Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:03:29.2983511Z ^ 2025-05-07T20:03:29.2983900Z 2025-05-07T20:03:29.2985844Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:03:29.2988628Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:03:29.2989842Z ^ 2025-05-07T20:03:29.2990147Z 2025-05-07T20:03:29.2990589Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:03:29.2991233Z 2025-05-07T20:03:29.2993093Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:03:29.2995837Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:03:29.2997424Z ^ 2025-05-07T20:03:29.2997783Z 2025-05-07T20:03:33.1357017Z [396/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_gwd_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward_gwd.dir/gen_embedding_backward_rowwise_adagrad_split_weighted_vbe_gwd_kernel_cta.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward_gwd.dir/gen_embedding_backward_rowwise_adagrad_split_weighted_vbe_gwd_kernel_cta.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.12/cmake-build/gen_embedding_backward_rowwise_adagrad_split_weighted_vbe_gwd_kernel_cta.cu -o CMakeFiles/fbgemm_gpu_tbe_training_backward_gwd.dir/gen_embedding_backward_rowwise_adagrad_split_weighted_vbe_gwd_kernel_cta.cu.o 2025-05-07T20:03:33.1377896Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:03:33.1380578Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:03:33.1381761Z ^ 2025-05-07T20:03:33.1382033Z 2025-05-07T20:03:33.1382486Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:03:33.1383173Z 2025-05-07T20:03:33.1384855Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:03:33.1387358Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:03:33.1388457Z ^ 2025-05-07T20:03:33.1388880Z 2025-05-07T20:03:33.1390318Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:03:33.1393187Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:03:33.1394270Z ^ 2025-05-07T20:03:33.1394531Z 2025-05-07T20:03:33.1394979Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:03:33.1395584Z 2025-05-07T20:03:33.1397472Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:03:33.1399912Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:03:33.1401293Z ^ 2025-05-07T20:03:33.1401684Z 2025-05-07T20:03:33.1403162Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:03:33.1405770Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:03:33.1406908Z ^ 2025-05-07T20:03:33.1407194Z 2025-05-07T20:03:33.1407644Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:03:33.1408302Z 2025-05-07T20:03:33.1409912Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:03:33.1412495Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:03:33.1413588Z ^ 2025-05-07T20:03:33.1413953Z 2025-05-07T20:03:33.1415385Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:03:33.1417682Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:03:33.1418753Z ^ 2025-05-07T20:03:33.1419035Z 2025-05-07T20:03:33.1419487Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:03:33.1420070Z 2025-05-07T20:03:33.1421524Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:03:33.1423894Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:03:33.1424979Z ^ 2025-05-07T20:03:33.1425329Z 2025-05-07T20:03:33.1426773Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:03:33.1429123Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:03:33.1430225Z ^ 2025-05-07T20:03:33.1430500Z 2025-05-07T20:03:33.1430929Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:03:33.1431712Z 2025-05-07T20:03:33.1433204Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:03:33.1435949Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:03:33.1437079Z ^ 2025-05-07T20:03:33.1437432Z 2025-05-07T20:03:34.5562869Z [397/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_vbe_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_rowwise_adagrad_ssd_unweighted_vbe_kernel_warp.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_rowwise_adagrad_ssd_unweighted_vbe_kernel_warp.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.12/cmake-build/gen_embedding_backward_rowwise_adagrad_ssd_unweighted_vbe_kernel_warp.cu -o CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_rowwise_adagrad_ssd_unweighted_vbe_kernel_warp.cu.o 2025-05-07T20:03:34.5588538Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:03:34.5591777Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:03:34.5593004Z ^ 2025-05-07T20:03:34.5593271Z 2025-05-07T20:03:34.5593723Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:03:34.5594425Z 2025-05-07T20:03:34.5596124Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:03:34.5599297Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:03:34.5600560Z ^ 2025-05-07T20:03:34.5600971Z 2025-05-07T20:03:34.5602659Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:03:34.5605445Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:03:34.5606628Z ^ 2025-05-07T20:03:34.5606929Z 2025-05-07T20:03:34.5607379Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:03:34.5608354Z 2025-05-07T20:03:34.5610183Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:03:34.5613144Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:03:34.5614433Z ^ 2025-05-07T20:03:34.5614826Z 2025-05-07T20:03:34.5616503Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:03:34.5619072Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:03:34.5620296Z ^ 2025-05-07T20:03:34.5620565Z 2025-05-07T20:03:34.5620981Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:03:34.5621698Z 2025-05-07T20:03:34.5623432Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:03:34.5626217Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:03:34.5627444Z ^ 2025-05-07T20:03:34.5627850Z 2025-05-07T20:03:34.5629560Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:03:34.5632502Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:03:34.5633732Z ^ 2025-05-07T20:03:34.5634008Z 2025-05-07T20:03:34.5634516Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:03:34.5635179Z 2025-05-07T20:03:34.5636822Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:03:34.5639622Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:03:34.5640879Z ^ 2025-05-07T20:03:34.5641261Z 2025-05-07T20:03:34.5642952Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:03:34.5645654Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:03:34.5647029Z ^ 2025-05-07T20:03:34.5647296Z 2025-05-07T20:03:34.5647748Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:03:34.5648466Z 2025-05-07T20:03:34.5650141Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:03:34.5652895Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:03:34.5654096Z ^ 2025-05-07T20:03:34.5654479Z 2025-05-07T20:03:34.7708377Z [398/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_gwd_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward_gwd.dir/gen_embedding_backward_rowwise_adagrad_split_unweighted_vbe_gwd_kernel_cta.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward_gwd.dir/gen_embedding_backward_rowwise_adagrad_split_unweighted_vbe_gwd_kernel_cta.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.12/cmake-build/gen_embedding_backward_rowwise_adagrad_split_unweighted_vbe_gwd_kernel_cta.cu -o CMakeFiles/fbgemm_gpu_tbe_training_backward_gwd.dir/gen_embedding_backward_rowwise_adagrad_split_unweighted_vbe_gwd_kernel_cta.cu.o 2025-05-07T20:03:34.7733893Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:03:34.7736751Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:03:34.7738142Z ^ 2025-05-07T20:03:34.7738410Z 2025-05-07T20:03:34.7739167Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:03:34.7739856Z 2025-05-07T20:03:34.7741599Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:03:34.7744415Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:03:34.7745658Z ^ 2025-05-07T20:03:34.7746021Z 2025-05-07T20:03:34.7747682Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:03:34.7750455Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:03:34.7751797Z ^ 2025-05-07T20:03:34.7752086Z 2025-05-07T20:03:34.7752551Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:03:34.7753247Z 2025-05-07T20:03:34.7755126Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:03:34.7757903Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:03:34.7759080Z ^ 2025-05-07T20:03:34.7759459Z 2025-05-07T20:03:34.7761239Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:03:34.7764037Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:03:34.7765517Z ^ 2025-05-07T20:03:34.7765781Z 2025-05-07T20:03:34.7766243Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:03:34.7766946Z 2025-05-07T20:03:34.7768703Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:03:34.7771588Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:03:34.7772855Z ^ 2025-05-07T20:03:34.7773260Z 2025-05-07T20:03:34.7775022Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:03:34.7777955Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:03:34.7779182Z ^ 2025-05-07T20:03:34.7779497Z 2025-05-07T20:03:34.7779978Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:03:34.7780701Z 2025-05-07T20:03:34.7782546Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:03:34.7785293Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:03:34.7786616Z ^ 2025-05-07T20:03:34.7786936Z 2025-05-07T20:03:34.7788607Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:03:34.7791367Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:03:34.7792703Z ^ 2025-05-07T20:03:34.7792964Z 2025-05-07T20:03:34.7793409Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:03:34.7794120Z 2025-05-07T20:03:34.7795835Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:03:34.7798777Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:03:34.7799990Z ^ 2025-05-07T20:03:34.7800391Z 2025-05-07T20:03:39.7701556Z [399/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_vbe_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_sgd_split_weighted_vbe_kernel_warp.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_sgd_split_weighted_vbe_kernel_warp.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.12/cmake-build/gen_embedding_backward_sgd_split_weighted_vbe_kernel_warp.cu -o CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_sgd_split_weighted_vbe_kernel_warp.cu.o 2025-05-07T20:03:39.7725960Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:03:39.7729061Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:03:39.7730248Z ^ 2025-05-07T20:03:39.7730513Z 2025-05-07T20:03:39.7730988Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:03:39.7731794Z 2025-05-07T20:03:39.7733556Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:03:39.7736365Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:03:39.7737821Z ^ 2025-05-07T20:03:39.7738214Z 2025-05-07T20:03:39.7739835Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:03:39.7742732Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:03:39.7743967Z ^ 2025-05-07T20:03:39.7744265Z 2025-05-07T20:03:39.7744731Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:03:39.7745424Z 2025-05-07T20:03:39.7747199Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:03:39.7750027Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:03:39.7751310Z ^ 2025-05-07T20:03:39.7751902Z 2025-05-07T20:03:39.7753606Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:03:39.7756133Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:03:39.7757336Z ^ 2025-05-07T20:03:39.7757603Z 2025-05-07T20:03:39.7758035Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:03:39.7758729Z 2025-05-07T20:03:39.7760396Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:03:39.7763135Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:03:39.7764333Z ^ 2025-05-07T20:03:39.7765002Z 2025-05-07T20:03:39.7766704Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:03:39.7769393Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:03:39.7770552Z ^ 2025-05-07T20:03:39.7770824Z 2025-05-07T20:03:39.7771288Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:03:39.7771983Z 2025-05-07T20:03:39.7773717Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:03:39.7776710Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:03:39.7777878Z ^ 2025-05-07T20:03:39.7778243Z 2025-05-07T20:03:39.7779890Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:03:39.7782580Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:03:39.7783777Z ^ 2025-05-07T20:03:39.7784204Z 2025-05-07T20:03:39.7784656Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:03:39.7785355Z 2025-05-07T20:03:39.7787016Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:03:39.7791291Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:03:39.7792665Z ^ 2025-05-07T20:03:39.7793050Z 2025-05-07T20:03:47.8442219Z [400/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_vbe_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_sgd_split_unweighted_vbe_kernel_warp.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_sgd_split_unweighted_vbe_kernel_warp.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.12/cmake-build/gen_embedding_backward_sgd_split_unweighted_vbe_kernel_warp.cu -o CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_sgd_split_unweighted_vbe_kernel_warp.cu.o 2025-05-07T20:03:47.8466610Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:03:47.8469412Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:03:47.8470638Z ^ 2025-05-07T20:03:47.8470896Z 2025-05-07T20:03:47.8471351Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:03:47.8472100Z 2025-05-07T20:03:47.8473841Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:03:47.8476925Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:03:47.8478138Z ^ 2025-05-07T20:03:47.8478500Z 2025-05-07T20:03:47.8480328Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:03:47.8483043Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:03:47.8484238Z ^ 2025-05-07T20:03:47.8484494Z 2025-05-07T20:03:47.8485073Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:03:47.8485740Z 2025-05-07T20:03:47.8487476Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:03:47.8490082Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:03:47.8491290Z ^ 2025-05-07T20:03:47.8491654Z 2025-05-07T20:03:47.8493171Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:03:47.8495827Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:03:47.8496991Z ^ 2025-05-07T20:03:47.8497230Z 2025-05-07T20:03:47.8497682Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:03:47.8498374Z 2025-05-07T20:03:47.8500118Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:03:47.8502618Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:03:47.8503629Z ^ 2025-05-07T20:03:47.8503926Z 2025-05-07T20:03:47.8505298Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:03:47.8507486Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:03:47.8508747Z ^ 2025-05-07T20:03:47.8508959Z 2025-05-07T20:03:47.8509369Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:03:47.8509931Z 2025-05-07T20:03:47.8511765Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:03:47.8514382Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:03:47.8515566Z ^ 2025-05-07T20:03:47.8515915Z 2025-05-07T20:03:47.8517427Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:03:47.8519909Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:03:47.8521085Z ^ 2025-05-07T20:03:47.8521323Z 2025-05-07T20:03:47.8521789Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:03:47.8522373Z 2025-05-07T20:03:47.8523830Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:03:47.8526292Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:03:47.8527581Z ^ 2025-05-07T20:03:47.8527935Z 2025-05-07T20:03:48.7158913Z [401/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_split_host_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward_split_host.dir/gen_embedding_backward_split_adagrad.cpp.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward_split_host.dir/gen_embedding_backward_split_adagrad.cpp.o.d -o CMakeFiles/fbgemm_gpu_tbe_training_backward_split_host.dir/gen_embedding_backward_split_adagrad.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.12/cmake-build/gen_embedding_backward_split_adagrad.cpp 2025-05-07T20:03:48.7177800Z clang-16: warning: argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T20:03:49.1637807Z [402/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_vbe_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_sgd_split_unweighted_vbe_cuda.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_sgd_split_unweighted_vbe_cuda.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.12/cmake-build/gen_embedding_backward_sgd_split_unweighted_vbe_cuda.cu -o CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_sgd_split_unweighted_vbe_cuda.cu.o 2025-05-07T20:03:49.1661884Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:03:49.1664602Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:03:49.1665984Z ^ 2025-05-07T20:03:49.1666237Z 2025-05-07T20:03:49.1666736Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:03:49.1667343Z 2025-05-07T20:03:49.1669057Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:03:49.1672057Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:03:49.1673319Z ^ 2025-05-07T20:03:49.1673714Z 2025-05-07T20:03:49.1675661Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:03:49.1678720Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:03:49.1679912Z ^ 2025-05-07T20:03:49.1680175Z 2025-05-07T20:03:49.1680631Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:03:49.1681341Z 2025-05-07T20:03:49.1683042Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:03:49.1685782Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:03:49.1687215Z ^ 2025-05-07T20:03:49.1687622Z 2025-05-07T20:03:49.1689445Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:03:49.1692252Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:03:49.1693463Z ^ 2025-05-07T20:03:49.1693715Z 2025-05-07T20:03:49.1694184Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:03:49.1694825Z 2025-05-07T20:03:49.1696670Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:03:49.1699512Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:03:49.1700771Z ^ 2025-05-07T20:03:49.1701152Z 2025-05-07T20:03:49.1702871Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:03:49.1705573Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:03:49.1706829Z ^ 2025-05-07T20:03:49.1707099Z 2025-05-07T20:03:49.1707603Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:03:49.1708289Z 2025-05-07T20:03:49.1709986Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:03:49.1712981Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:03:49.1714239Z ^ 2025-05-07T20:03:49.1714625Z 2025-05-07T20:03:49.1716351Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:03:49.1719173Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:03:49.1720399Z ^ 2025-05-07T20:03:49.1720699Z 2025-05-07T20:03:49.1721166Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:03:49.1721868Z 2025-05-07T20:03:49.1723802Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:03:49.1726474Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:03:49.1727691Z ^ 2025-05-07T20:03:49.1728046Z 2025-05-07T20:03:49.6173493Z [403/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_vbe_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_dense_split_weighted_vbe_kernel_warp.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_dense_split_weighted_vbe_kernel_warp.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.12/cmake-build/gen_embedding_backward_dense_split_weighted_vbe_kernel_warp.cu -o CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_dense_split_weighted_vbe_kernel_warp.cu.o 2025-05-07T20:03:49.6198367Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:03:49.6201250Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:03:49.6202568Z ^ 2025-05-07T20:03:49.6202857Z 2025-05-07T20:03:49.6203347Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:03:49.6204080Z 2025-05-07T20:03:49.6205767Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:03:49.6208967Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:03:49.6210298Z ^ 2025-05-07T20:03:49.6210731Z 2025-05-07T20:03:49.6212548Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:03:49.6215150Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:03:49.6216381Z ^ 2025-05-07T20:03:49.6216663Z 2025-05-07T20:03:49.6217167Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:03:49.6218003Z 2025-05-07T20:03:49.6219735Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:03:49.6222586Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:03:49.6223922Z ^ 2025-05-07T20:03:49.6224301Z 2025-05-07T20:03:49.6226000Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:03:49.6228836Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:03:49.6230042Z ^ 2025-05-07T20:03:49.6230304Z 2025-05-07T20:03:49.6230752Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:03:49.6231428Z 2025-05-07T20:03:49.6233309Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:03:49.6236069Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:03:49.6237240Z ^ 2025-05-07T20:03:49.6237558Z 2025-05-07T20:03:49.6239252Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:03:49.6241978Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:03:49.6243182Z ^ 2025-05-07T20:03:49.6243456Z 2025-05-07T20:03:49.6243945Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:03:49.6244630Z 2025-05-07T20:03:49.6246349Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:03:49.6249137Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:03:49.6250347Z ^ 2025-05-07T20:03:49.6250755Z 2025-05-07T20:03:49.6252369Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:03:49.6255171Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:03:49.6256471Z ^ 2025-05-07T20:03:49.6256753Z 2025-05-07T20:03:49.6257210Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:03:49.6257889Z 2025-05-07T20:03:49.6259581Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:03:49.6262237Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:03:49.6263459Z ^ 2025-05-07T20:03:49.6263831Z 2025-05-07T20:03:51.4906213Z [404/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_vbe_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_dense_split_weighted_vbe_cuda.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_dense_split_weighted_vbe_cuda.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.12/cmake-build/gen_embedding_backward_dense_split_weighted_vbe_cuda.cu -o CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_dense_split_weighted_vbe_cuda.cu.o 2025-05-07T20:03:51.4927943Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:03:51.4930617Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:03:51.4931816Z ^ 2025-05-07T20:03:51.4932074Z 2025-05-07T20:03:51.4932513Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:03:51.4933556Z 2025-05-07T20:03:51.4935163Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:03:51.4937709Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:03:51.4938752Z ^ 2025-05-07T20:03:51.4939106Z 2025-05-07T20:03:51.4940721Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:03:51.4943185Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:03:51.4944592Z ^ 2025-05-07T20:03:51.4944854Z 2025-05-07T20:03:51.4945301Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:03:51.4945871Z 2025-05-07T20:03:51.4947485Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:03:51.4950012Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:03:51.4951109Z ^ 2025-05-07T20:03:51.4951435Z 2025-05-07T20:03:51.4953305Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:03:51.4955853Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:03:51.4957040Z ^ 2025-05-07T20:03:51.4957309Z 2025-05-07T20:03:51.4957734Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:03:51.4958391Z 2025-05-07T20:03:51.4960048Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:03:51.4962542Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:03:51.4963644Z ^ 2025-05-07T20:03:51.4963993Z 2025-05-07T20:03:51.4966010Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:03:51.4968435Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:03:51.4969522Z ^ 2025-05-07T20:03:51.4969777Z 2025-05-07T20:03:51.4970183Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:03:51.4970837Z 2025-05-07T20:03:51.4972371Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:03:51.4974876Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:03:51.4975933Z ^ 2025-05-07T20:03:51.4976594Z 2025-05-07T20:03:51.4978163Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:03:51.4980637Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:03:51.4981719Z ^ 2025-05-07T20:03:51.4982003Z 2025-05-07T20:03:51.4982407Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:03:51.4982984Z 2025-05-07T20:03:51.4984572Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:03:51.4987379Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:03:51.4988530Z ^ 2025-05-07T20:03:51.4988890Z 2025-05-07T20:03:52.3185028Z [405/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_split_host_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward_split_host.dir/gen_embedding_backward_split_sgd.cpp.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward_split_host.dir/gen_embedding_backward_split_sgd.cpp.o.d -o CMakeFiles/fbgemm_gpu_tbe_training_backward_split_host.dir/gen_embedding_backward_split_sgd.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.12/cmake-build/gen_embedding_backward_split_sgd.cpp 2025-05-07T20:03:52.3205108Z clang-16: warning: argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T20:03:54.2624159Z [406/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_vbe_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_dense_split_unweighted_vbe_kernel_warp.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_dense_split_unweighted_vbe_kernel_warp.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.12/cmake-build/gen_embedding_backward_dense_split_unweighted_vbe_kernel_warp.cu -o CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_dense_split_unweighted_vbe_kernel_warp.cu.o 2025-05-07T20:03:54.2648092Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:03:54.2650725Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:03:54.2651887Z ^ 2025-05-07T20:03:54.2652151Z 2025-05-07T20:03:54.2652565Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:03:54.2653231Z 2025-05-07T20:03:54.2654975Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:03:54.2657651Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:03:54.2658819Z ^ 2025-05-07T20:03:54.2659218Z 2025-05-07T20:03:54.2660820Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:03:54.2663438Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:03:54.2664591Z ^ 2025-05-07T20:03:54.2665140Z 2025-05-07T20:03:54.2665613Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:03:54.2666244Z 2025-05-07T20:03:54.2667853Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:03:54.2670921Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:03:54.2672375Z ^ 2025-05-07T20:03:54.2672741Z 2025-05-07T20:03:54.2674231Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:03:54.2676858Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:03:54.2677998Z ^ 2025-05-07T20:03:54.2678249Z 2025-05-07T20:03:54.2678849Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:03:54.2679481Z 2025-05-07T20:03:54.2680927Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:03:54.2685838Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:03:54.2687078Z ^ 2025-05-07T20:03:54.2687432Z 2025-05-07T20:03:54.2688991Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:03:54.2691701Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:03:54.2692861Z ^ 2025-05-07T20:03:54.2693116Z 2025-05-07T20:03:54.2693603Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:03:54.2694220Z 2025-05-07T20:03:54.2695817Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:03:54.2698416Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:03:54.2699885Z ^ 2025-05-07T20:03:54.2700265Z 2025-05-07T20:03:54.2701871Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:03:54.2704561Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:03:54.2705774Z ^ 2025-05-07T20:03:54.2706047Z 2025-05-07T20:03:54.2706491Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:03:54.2707165Z 2025-05-07T20:03:54.2708850Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:03:54.2711683Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:03:54.2712860Z ^ 2025-05-07T20:03:54.2713216Z 2025-05-07T20:03:55.6428933Z [407/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_split_host_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward_split_host.dir/gen_embedding_backward_split_lars_sgd.cpp.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward_split_host.dir/gen_embedding_backward_split_lars_sgd.cpp.o.d -o CMakeFiles/fbgemm_gpu_tbe_training_backward_split_host.dir/gen_embedding_backward_split_lars_sgd.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.12/cmake-build/gen_embedding_backward_split_lars_sgd.cpp 2025-05-07T20:03:55.6450281Z clang-16: warning: argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T20:03:56.3675617Z [408/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_split_host_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward_split_host.dir/gen_embedding_backward_split_lamb.cpp.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward_split_host.dir/gen_embedding_backward_split_lamb.cpp.o.d -o CMakeFiles/fbgemm_gpu_tbe_training_backward_split_host.dir/gen_embedding_backward_split_lamb.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.12/cmake-build/gen_embedding_backward_split_lamb.cpp 2025-05-07T20:03:56.3693720Z clang-16: warning: argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T20:03:56.8079617Z [409/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_vbe_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_dense_split_unweighted_vbe_cuda.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_dense_split_unweighted_vbe_cuda.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.12/cmake-build/gen_embedding_backward_dense_split_unweighted_vbe_cuda.cu -o CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_dense_split_unweighted_vbe_cuda.cu.o 2025-05-07T20:03:56.8103299Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:03:56.8105782Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:03:56.8106965Z ^ 2025-05-07T20:03:56.8107229Z 2025-05-07T20:03:56.8107665Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:03:56.8108271Z 2025-05-07T20:03:56.8109859Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:03:56.8112902Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:03:56.8114722Z ^ 2025-05-07T20:03:56.8115121Z 2025-05-07T20:03:56.8116892Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:03:56.8119723Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:03:56.8121047Z ^ 2025-05-07T20:03:56.8121310Z 2025-05-07T20:03:56.8121744Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:03:56.8122419Z 2025-05-07T20:03:56.8123932Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:03:56.8126769Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:03:56.8127954Z ^ 2025-05-07T20:03:56.8128331Z 2025-05-07T20:03:56.8130087Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:03:56.8132747Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:03:56.8133932Z ^ 2025-05-07T20:03:56.8134177Z 2025-05-07T20:03:56.8134708Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:03:56.8135339Z 2025-05-07T20:03:56.8136857Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:03:56.8139494Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:03:56.8140734Z ^ 2025-05-07T20:03:56.8141090Z 2025-05-07T20:03:56.8142829Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:03:56.8145636Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:03:56.8146851Z ^ 2025-05-07T20:03:56.8147121Z 2025-05-07T20:03:56.8147546Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:03:56.8148230Z 2025-05-07T20:03:56.8149896Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:03:56.8152668Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:03:56.8153899Z ^ 2025-05-07T20:03:56.8154272Z 2025-05-07T20:03:56.8156016Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:03:56.8158831Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:03:56.8160255Z ^ 2025-05-07T20:03:56.8160518Z 2025-05-07T20:03:56.8161000Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:03:56.8161780Z 2025-05-07T20:03:56.8163527Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:03:56.8166495Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:03:56.8167625Z ^ 2025-05-07T20:03:56.8168008Z 2025-05-07T20:03:58.9041590Z [410/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_split_host_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward_split_host.dir/gen_embedding_backward_split_rowwise_adagrad.cpp.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward_split_host.dir/gen_embedding_backward_split_rowwise_adagrad.cpp.o.d -o CMakeFiles/fbgemm_gpu_tbe_training_backward_split_host.dir/gen_embedding_backward_split_rowwise_adagrad.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.12/cmake-build/gen_embedding_backward_split_rowwise_adagrad.cpp 2025-05-07T20:03:58.9059485Z clang-16: warning: argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T20:03:59.6302666Z [411/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_vbe_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_rowwise_adagrad_split_weighted_vbe_kernel_cta.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_rowwise_adagrad_split_weighted_vbe_kernel_cta.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.12/cmake-build/gen_embedding_backward_rowwise_adagrad_split_weighted_vbe_kernel_cta.cu -o CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_rowwise_adagrad_split_weighted_vbe_kernel_cta.cu.o 2025-05-07T20:03:59.6326142Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:03:59.6328776Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:03:59.6329859Z ^ 2025-05-07T20:03:59.6330085Z 2025-05-07T20:03:59.6330507Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:03:59.6331127Z 2025-05-07T20:03:59.6332797Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:03:59.6335502Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:03:59.6336703Z ^ 2025-05-07T20:03:59.6337107Z 2025-05-07T20:03:59.6338778Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:03:59.6341496Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:03:59.6342702Z ^ 2025-05-07T20:03:59.6342968Z 2025-05-07T20:03:59.6343420Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:03:59.6344029Z 2025-05-07T20:03:59.6345789Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:03:59.6348485Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:03:59.6349654Z ^ 2025-05-07T20:03:59.6350170Z 2025-05-07T20:03:59.6351968Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:03:59.6354527Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:03:59.6355696Z ^ 2025-05-07T20:03:59.6355959Z 2025-05-07T20:03:59.6356399Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:03:59.6357076Z 2025-05-07T20:03:59.6358689Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:03:59.6361766Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:03:59.6363006Z ^ 2025-05-07T20:03:59.6363545Z 2025-05-07T20:03:59.6365741Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:03:59.6368402Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:03:59.6369720Z ^ 2025-05-07T20:03:59.6369977Z 2025-05-07T20:03:59.6370440Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:03:59.6371083Z 2025-05-07T20:03:59.6372912Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:03:59.6375645Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:03:59.6376801Z ^ 2025-05-07T20:03:59.6377183Z 2025-05-07T20:03:59.6378844Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:03:59.6381500Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:03:59.6382668Z ^ 2025-05-07T20:03:59.6382963Z 2025-05-07T20:03:59.6383399Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:03:59.6384083Z 2025-05-07T20:03:59.6385766Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:03:59.6388505Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:03:59.6389722Z ^ 2025-05-07T20:03:59.6390102Z 2025-05-07T20:04:00.6059063Z [412/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_split_host_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward_split_host.dir/gen_embedding_backward_split_adam.cpp.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward_split_host.dir/gen_embedding_backward_split_adam.cpp.o.d -o CMakeFiles/fbgemm_gpu_tbe_training_backward_split_host.dir/gen_embedding_backward_split_adam.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.12/cmake-build/gen_embedding_backward_split_adam.cpp 2025-05-07T20:04:00.6078861Z clang-16: warning: argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T20:04:01.3545987Z [413/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_vbe_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_rowwise_adagrad_ssd_weighted_vbe_kernel_cta.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_rowwise_adagrad_ssd_weighted_vbe_kernel_cta.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.12/cmake-build/gen_embedding_backward_rowwise_adagrad_ssd_weighted_vbe_kernel_cta.cu -o CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_rowwise_adagrad_ssd_weighted_vbe_kernel_cta.cu.o 2025-05-07T20:04:01.3571034Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:04:01.3573893Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:04:01.3575148Z ^ 2025-05-07T20:04:01.3575634Z 2025-05-07T20:04:01.3576103Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:04:01.3576817Z 2025-05-07T20:04:01.3578567Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:04:01.3581434Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:04:01.3582683Z ^ 2025-05-07T20:04:01.3583093Z 2025-05-07T20:04:01.3584812Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:04:01.3587518Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:04:01.3588757Z ^ 2025-05-07T20:04:01.3589033Z 2025-05-07T20:04:01.3589527Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:04:01.3590230Z 2025-05-07T20:04:01.3592122Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:04:01.3594833Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:04:01.3596115Z ^ 2025-05-07T20:04:01.3596668Z 2025-05-07T20:04:01.3598354Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:04:01.3601259Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:04:01.3602551Z ^ 2025-05-07T20:04:01.3602825Z 2025-05-07T20:04:01.3603304Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:04:01.3604186Z 2025-05-07T20:04:01.3605950Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:04:01.3608579Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:04:01.3609767Z ^ 2025-05-07T20:04:01.3610145Z 2025-05-07T20:04:01.3611898Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:04:01.3614839Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:04:01.3616090Z ^ 2025-05-07T20:04:01.3616363Z 2025-05-07T20:04:01.3616846Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:04:01.3617489Z 2025-05-07T20:04:01.3619180Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:04:01.3622012Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:04:01.3623349Z ^ 2025-05-07T20:04:01.3623754Z 2025-05-07T20:04:01.3625483Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:04:01.3628258Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:04:01.3629476Z ^ 2025-05-07T20:04:01.3629770Z 2025-05-07T20:04:01.3630234Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:04:01.3630932Z 2025-05-07T20:04:01.3632739Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:04:01.3635534Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:04:01.3636795Z ^ 2025-05-07T20:04:01.3637174Z 2025-05-07T20:04:01.9212019Z [414/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_split_host_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward_split_host.dir/gen_embedding_backward_split_partial_rowwise_adam.cpp.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward_split_host.dir/gen_embedding_backward_split_partial_rowwise_adam.cpp.o.d -o CMakeFiles/fbgemm_gpu_tbe_training_backward_split_host.dir/gen_embedding_backward_split_partial_rowwise_adam.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.12/cmake-build/gen_embedding_backward_split_partial_rowwise_adam.cpp 2025-05-07T20:04:01.9233035Z clang-16: warning: argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T20:04:02.6925996Z [415/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_split_host_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward_split_host.dir/gen_embedding_backward_split_approx_sgd.cpp.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward_split_host.dir/gen_embedding_backward_split_approx_sgd.cpp.o.d -o CMakeFiles/fbgemm_gpu_tbe_training_backward_split_host.dir/gen_embedding_backward_split_approx_sgd.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.12/cmake-build/gen_embedding_backward_split_approx_sgd.cpp 2025-05-07T20:04:02.6945919Z clang-16: warning: argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T20:04:04.7399369Z [416/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_vbe_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_sgd_split_weighted_vbe_kernel_cta.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_sgd_split_weighted_vbe_kernel_cta.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.12/cmake-build/gen_embedding_backward_sgd_split_weighted_vbe_kernel_cta.cu -o CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_sgd_split_weighted_vbe_kernel_cta.cu.o 2025-05-07T20:04:04.7421276Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:04:04.7424509Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:04:04.7425577Z ^ 2025-05-07T20:04:04.7425881Z 2025-05-07T20:04:04.7426354Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:04:04.7426971Z 2025-05-07T20:04:04.7428560Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:04:04.7431273Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:04:04.7432664Z ^ 2025-05-07T20:04:04.7433041Z 2025-05-07T20:04:04.7434875Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:04:04.7437645Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:04:04.7438898Z ^ 2025-05-07T20:04:04.7439154Z 2025-05-07T20:04:04.7439635Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:04:04.7440313Z 2025-05-07T20:04:04.7441945Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:04:04.7444693Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:04:04.7445935Z ^ 2025-05-07T20:04:04.7446321Z 2025-05-07T20:04:04.7448054Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:04:04.7451022Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:04:04.7452223Z ^ 2025-05-07T20:04:04.7452517Z 2025-05-07T20:04:04.7452993Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:04:04.7453694Z 2025-05-07T20:04:04.7455482Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:04:04.7458297Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:04:04.7459534Z ^ 2025-05-07T20:04:04.7460006Z 2025-05-07T20:04:04.7461728Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:04:04.7464486Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:04:04.7466171Z ^ 2025-05-07T20:04:04.7466448Z 2025-05-07T20:04:04.7466917Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:04:04.7467635Z 2025-05-07T20:04:04.7469377Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:04:04.7472357Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:04:04.7473548Z ^ 2025-05-07T20:04:04.7473966Z 2025-05-07T20:04:04.7475649Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:04:04.7478442Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:04:04.7479674Z ^ 2025-05-07T20:04:04.7479964Z 2025-05-07T20:04:04.7480431Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:04:04.7481132Z 2025-05-07T20:04:04.7482855Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:04:04.7485680Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:04:04.7486895Z ^ 2025-05-07T20:04:04.7487268Z 2025-05-07T20:04:04.8223552Z [417/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_split_host_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward_split_host.dir/gen_embedding_backward_split_approx_rowwise_adagrad.cpp.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward_split_host.dir/gen_embedding_backward_split_approx_rowwise_adagrad.cpp.o.d -o CMakeFiles/fbgemm_gpu_tbe_training_backward_split_host.dir/gen_embedding_backward_split_approx_rowwise_adagrad.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.12/cmake-build/gen_embedding_backward_split_approx_rowwise_adagrad.cpp 2025-05-07T20:04:04.8244308Z clang-16: warning: argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T20:04:05.3456984Z [418/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_split_host_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward_split_host.dir/gen_embedding_backward_split_approx_rowwise_adagrad_with_counter.cpp.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward_split_host.dir/gen_embedding_backward_split_approx_rowwise_adagrad_with_counter.cpp.o.d -o CMakeFiles/fbgemm_gpu_tbe_training_backward_split_host.dir/gen_embedding_backward_split_approx_rowwise_adagrad_with_counter.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.12/cmake-build/gen_embedding_backward_split_approx_rowwise_adagrad_with_counter.cpp 2025-05-07T20:04:05.3476723Z clang-16: warning: argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T20:04:05.3762775Z [419/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_vbe_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_rowwise_adagrad_split_unweighted_vbe_kernel_cta.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_rowwise_adagrad_split_unweighted_vbe_kernel_cta.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.12/cmake-build/gen_embedding_backward_rowwise_adagrad_split_unweighted_vbe_kernel_cta.cu -o CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_rowwise_adagrad_split_unweighted_vbe_kernel_cta.cu.o 2025-05-07T20:04:05.3786148Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:04:05.3788728Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:04:05.3789862Z ^ 2025-05-07T20:04:05.3790143Z 2025-05-07T20:04:05.3790574Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:04:05.3791225Z 2025-05-07T20:04:05.3793036Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:04:05.3795514Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:04:05.3796699Z ^ 2025-05-07T20:04:05.3797063Z 2025-05-07T20:04:05.3798624Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:04:05.3801487Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:04:05.3802650Z ^ 2025-05-07T20:04:05.3802891Z 2025-05-07T20:04:05.3803318Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:04:05.3803939Z 2025-05-07T20:04:05.3805621Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:04:05.3808205Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:04:05.3809348Z ^ 2025-05-07T20:04:05.3809819Z 2025-05-07T20:04:05.3811398Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:04:05.3814124Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:04:05.3815287Z ^ 2025-05-07T20:04:05.3815566Z 2025-05-07T20:04:05.3815993Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:04:05.3816633Z 2025-05-07T20:04:05.3818203Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:04:05.3820858Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:04:05.3821960Z ^ 2025-05-07T20:04:05.3822293Z 2025-05-07T20:04:05.3823818Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:04:05.3826317Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:04:05.3827450Z ^ 2025-05-07T20:04:05.3827699Z 2025-05-07T20:04:05.3828124Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:04:05.3828795Z 2025-05-07T20:04:05.3830367Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:04:05.3833043Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:04:05.3834167Z ^ 2025-05-07T20:04:05.3834539Z 2025-05-07T20:04:05.3836086Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:04:05.3838709Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:04:05.3839827Z ^ 2025-05-07T20:04:05.3840083Z 2025-05-07T20:04:05.3840523Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:04:05.3841181Z 2025-05-07T20:04:05.3842792Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:04:05.3845435Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:04:05.3846579Z ^ 2025-05-07T20:04:05.3846941Z 2025-05-07T20:04:05.6930828Z [420/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_vbe_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_adam_split_weighted_vbe_kernel_warp.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_adam_split_weighted_vbe_kernel_warp.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.12/cmake-build/gen_embedding_backward_adam_split_weighted_vbe_kernel_warp.cu -o CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_adam_split_weighted_vbe_kernel_warp.cu.o 2025-05-07T20:04:05.6947294Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:04:05.6949184Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:04:05.6950059Z ^ 2025-05-07T20:04:05.6950270Z 2025-05-07T20:04:05.6950583Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:04:05.6951081Z 2025-05-07T20:04:05.6952401Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:04:05.6954263Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:04:05.6955334Z ^ 2025-05-07T20:04:05.6955624Z 2025-05-07T20:04:05.6956742Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:04:05.6958558Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:04:05.6959371Z ^ 2025-05-07T20:04:05.6959557Z 2025-05-07T20:04:05.6959911Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:04:05.6960392Z 2025-05-07T20:04:05.6961545Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:04:05.6963548Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:04:05.6964390Z ^ 2025-05-07T20:04:05.6964648Z 2025-05-07T20:04:05.6966205Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:04:05.6968061Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:04:05.6968902Z ^ 2025-05-07T20:04:05.6969088Z 2025-05-07T20:04:05.6969518Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:04:05.6970020Z 2025-05-07T20:04:05.6971150Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:04:05.6973031Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:04:05.6973852Z ^ 2025-05-07T20:04:05.6974127Z 2025-05-07T20:04:05.6975431Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:04:05.6977248Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:04:05.6978105Z ^ 2025-05-07T20:04:05.6978315Z 2025-05-07T20:04:05.6978674Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:04:05.6979142Z 2025-05-07T20:04:05.6980285Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:04:05.6982167Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:04:05.6983038Z ^ 2025-05-07T20:04:05.6983291Z 2025-05-07T20:04:05.6984413Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:04:05.6986435Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:04:05.6987302Z ^ 2025-05-07T20:04:05.6987669Z 2025-05-07T20:04:05.6987984Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:04:05.6988439Z 2025-05-07T20:04:05.6989609Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:04:05.6991440Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:04:05.6992438Z ^ 2025-05-07T20:04:05.6992691Z 2025-05-07T20:04:07.7447540Z [421/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_split_host_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward_split_host.dir/gen_embedding_backward_split_approx_rowwise_adagrad_with_weight_decay.cpp.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward_split_host.dir/gen_embedding_backward_split_approx_rowwise_adagrad_with_weight_decay.cpp.o.d -o CMakeFiles/fbgemm_gpu_tbe_training_backward_split_host.dir/gen_embedding_backward_split_approx_rowwise_adagrad_with_weight_decay.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.12/cmake-build/gen_embedding_backward_split_approx_rowwise_adagrad_with_weight_decay.cpp 2025-05-07T20:04:07.7468613Z clang-16: warning: argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T20:04:07.8068289Z [422/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_split_host_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward_split_host.dir/gen_embedding_backward_adagrad_split_weighted_meta.cpp.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward_split_host.dir/gen_embedding_backward_adagrad_split_weighted_meta.cpp.o.d -o CMakeFiles/fbgemm_gpu_tbe_training_backward_split_host.dir/gen_embedding_backward_adagrad_split_weighted_meta.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.12/cmake-build/gen_embedding_backward_adagrad_split_weighted_meta.cpp 2025-05-07T20:04:07.8089947Z clang-16: warning: argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T20:04:08.2706976Z [423/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_split_host_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward_split_host.dir/gen_embedding_backward_split_rowwise_adagrad_with_weight_decay.cpp.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward_split_host.dir/gen_embedding_backward_split_rowwise_adagrad_with_weight_decay.cpp.o.d -o CMakeFiles/fbgemm_gpu_tbe_training_backward_split_host.dir/gen_embedding_backward_split_rowwise_adagrad_with_weight_decay.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.12/cmake-build/gen_embedding_backward_split_rowwise_adagrad_with_weight_decay.cpp 2025-05-07T20:04:08.2726554Z clang-16: warning: argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T20:04:08.5124420Z [424/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_split_host_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward_split_host.dir/gen_embedding_backward_adagrad_split_unweighted_meta.cpp.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward_split_host.dir/gen_embedding_backward_adagrad_split_unweighted_meta.cpp.o.d -o CMakeFiles/fbgemm_gpu_tbe_training_backward_split_host.dir/gen_embedding_backward_adagrad_split_unweighted_meta.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.12/cmake-build/gen_embedding_backward_adagrad_split_unweighted_meta.cpp 2025-05-07T20:04:08.5143403Z clang-16: warning: argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T20:04:08.7133317Z [425/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_split_host_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward_split_host.dir/gen_embedding_backward_split_none.cpp.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward_split_host.dir/gen_embedding_backward_split_none.cpp.o.d -o CMakeFiles/fbgemm_gpu_tbe_training_backward_split_host.dir/gen_embedding_backward_split_none.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.12/cmake-build/gen_embedding_backward_split_none.cpp 2025-05-07T20:04:08.7152090Z clang-16: warning: argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T20:04:09.0999474Z [426/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_split_host_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward_split_host.dir/gen_embedding_backward_split_rowwise_weighted_adagrad.cpp.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward_split_host.dir/gen_embedding_backward_split_rowwise_weighted_adagrad.cpp.o.d -o CMakeFiles/fbgemm_gpu_tbe_training_backward_split_host.dir/gen_embedding_backward_split_rowwise_weighted_adagrad.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.12/cmake-build/gen_embedding_backward_split_rowwise_weighted_adagrad.cpp 2025-05-07T20:04:09.1015780Z clang-16: warning: argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T20:04:09.2009554Z [427/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_vbe_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_rowwise_adagrad_ssd_unweighted_vbe_kernel_cta.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_rowwise_adagrad_ssd_unweighted_vbe_kernel_cta.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.12/cmake-build/gen_embedding_backward_rowwise_adagrad_ssd_unweighted_vbe_kernel_cta.cu -o CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_rowwise_adagrad_ssd_unweighted_vbe_kernel_cta.cu.o 2025-05-07T20:04:09.2022115Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:04:09.2023571Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:04:09.2024248Z ^ 2025-05-07T20:04:09.2024400Z 2025-05-07T20:04:09.2024736Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:04:09.2025105Z 2025-05-07T20:04:09.2025989Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:04:09.2027419Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:04:09.2028097Z ^ 2025-05-07T20:04:09.2028307Z 2025-05-07T20:04:09.2029179Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:04:09.2030614Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:04:09.2031249Z ^ 2025-05-07T20:04:09.2031425Z 2025-05-07T20:04:09.2031845Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:04:09.2032213Z 2025-05-07T20:04:09.2033119Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:04:09.2034540Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:04:09.2035211Z ^ 2025-05-07T20:04:09.2035427Z 2025-05-07T20:04:09.2036325Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:04:09.2037816Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:04:09.2038481Z ^ 2025-05-07T20:04:09.2038635Z 2025-05-07T20:04:09.2038891Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:04:09.2039285Z 2025-05-07T20:04:09.2040165Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:04:09.2041604Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:04:09.2044936Z ^ 2025-05-07T20:04:09.2045153Z 2025-05-07T20:04:09.2046089Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:04:09.2047543Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:04:09.2048211Z ^ 2025-05-07T20:04:09.2048371Z 2025-05-07T20:04:09.2048627Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:04:09.2049024Z 2025-05-07T20:04:09.2049955Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:04:09.2051399Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:04:09.2052057Z ^ 2025-05-07T20:04:09.2052296Z 2025-05-07T20:04:09.2053163Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:04:09.2054604Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:04:09.2055249Z ^ 2025-05-07T20:04:09.2055409Z 2025-05-07T20:04:09.2055692Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:04:09.2056063Z 2025-05-07T20:04:09.2056944Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:04:09.2058391Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:04:09.2059077Z ^ 2025-05-07T20:04:09.2059280Z 2025-05-07T20:04:09.5450725Z [428/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_split_host_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward_split_host.dir/gen_embedding_backward_split_partial_rowwise_lamb.cpp.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward_split_host.dir/gen_embedding_backward_split_partial_rowwise_lamb.cpp.o.d -o CMakeFiles/fbgemm_gpu_tbe_training_backward_split_host.dir/gen_embedding_backward_split_partial_rowwise_lamb.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.12/cmake-build/gen_embedding_backward_split_partial_rowwise_lamb.cpp 2025-05-07T20:04:09.5470219Z clang-16: warning: argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T20:04:09.6328764Z [429/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_split_host_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward_split_host.dir/gen_embedding_backward_rowwise_adagrad_split_weighted_meta.cpp.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward_split_host.dir/gen_embedding_backward_rowwise_adagrad_split_weighted_meta.cpp.o.d -o CMakeFiles/fbgemm_gpu_tbe_training_backward_split_host.dir/gen_embedding_backward_rowwise_adagrad_split_weighted_meta.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.12/cmake-build/gen_embedding_backward_rowwise_adagrad_split_weighted_meta.cpp 2025-05-07T20:04:09.6349292Z clang-16: warning: argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T20:04:10.4541160Z [430/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_split_host_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward_split_host.dir/gen_embedding_backward_rowwise_adagrad_split_unweighted_meta.cpp.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward_split_host.dir/gen_embedding_backward_rowwise_adagrad_split_unweighted_meta.cpp.o.d -o CMakeFiles/fbgemm_gpu_tbe_training_backward_split_host.dir/gen_embedding_backward_rowwise_adagrad_split_unweighted_meta.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.12/cmake-build/gen_embedding_backward_rowwise_adagrad_split_unweighted_meta.cpp 2025-05-07T20:04:10.4563360Z clang-16: warning: argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T20:04:10.8083708Z [431/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_split_host_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward_split_host.dir/gen_embedding_backward_sgd_split_weighted_meta.cpp.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward_split_host.dir/gen_embedding_backward_sgd_split_weighted_meta.cpp.o.d -o CMakeFiles/fbgemm_gpu_tbe_training_backward_split_host.dir/gen_embedding_backward_sgd_split_weighted_meta.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.12/cmake-build/gen_embedding_backward_sgd_split_weighted_meta.cpp 2025-05-07T20:04:10.8104134Z clang-16: warning: argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T20:04:11.7903629Z [432/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_vbe_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_adam_split_weighted_vbe_kernel_cta.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_adam_split_weighted_vbe_kernel_cta.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.12/cmake-build/gen_embedding_backward_adam_split_weighted_vbe_kernel_cta.cu -o CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_adam_split_weighted_vbe_kernel_cta.cu.o 2025-05-07T20:04:11.7925129Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:04:11.7927823Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:04:11.7928973Z ^ 2025-05-07T20:04:11.7929589Z 2025-05-07T20:04:11.7930015Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:04:11.7930595Z 2025-05-07T20:04:11.7932108Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:04:11.7934684Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:04:11.7935841Z ^ 2025-05-07T20:04:11.7936220Z 2025-05-07T20:04:11.7937721Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:04:11.7940502Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:04:11.7941676Z ^ 2025-05-07T20:04:11.7941934Z 2025-05-07T20:04:11.7942353Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:04:11.7942978Z 2025-05-07T20:04:11.7944496Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:04:11.7946931Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:04:11.7948016Z ^ 2025-05-07T20:04:11.7948516Z 2025-05-07T20:04:11.7950036Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:04:11.7952707Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:04:11.7953738Z ^ 2025-05-07T20:04:11.7954021Z 2025-05-07T20:04:11.7954459Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:04:11.7955044Z 2025-05-07T20:04:11.7956640Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:04:11.7959148Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:04:11.7960215Z ^ 2025-05-07T20:04:11.7960555Z 2025-05-07T20:04:11.7961979Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:04:11.7964553Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:04:11.7965981Z ^ 2025-05-07T20:04:11.7966226Z 2025-05-07T20:04:11.7966684Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:04:11.7967312Z 2025-05-07T20:04:11.7968868Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:04:11.7971422Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:04:11.7972772Z ^ 2025-05-07T20:04:11.7973139Z 2025-05-07T20:04:11.7974706Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:04:11.7977292Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:04:11.7978414Z ^ 2025-05-07T20:04:11.7978654Z 2025-05-07T20:04:11.7979114Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:04:11.7979759Z 2025-05-07T20:04:11.7981333Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:04:11.7984271Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:04:11.7985447Z ^ 2025-05-07T20:04:11.7985801Z 2025-05-07T20:04:12.1248940Z [433/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_split_host_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward_split_host.dir/gen_embedding_backward_sgd_split_unweighted_meta.cpp.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward_split_host.dir/gen_embedding_backward_sgd_split_unweighted_meta.cpp.o.d -o CMakeFiles/fbgemm_gpu_tbe_training_backward_split_host.dir/gen_embedding_backward_sgd_split_unweighted_meta.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.12/cmake-build/gen_embedding_backward_sgd_split_unweighted_meta.cpp 2025-05-07T20:04:12.1267492Z clang-16: warning: argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T20:04:13.9427421Z [434/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_split_host_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward_split_host.dir/gen_embedding_backward_adam_split_weighted_meta.cpp.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward_split_host.dir/gen_embedding_backward_adam_split_weighted_meta.cpp.o.d -o CMakeFiles/fbgemm_gpu_tbe_training_backward_split_host.dir/gen_embedding_backward_adam_split_weighted_meta.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.12/cmake-build/gen_embedding_backward_adam_split_weighted_meta.cpp 2025-05-07T20:04:13.9448022Z clang-16: warning: argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T20:04:14.2280319Z [435/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_split_host_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward_split_host.dir/gen_embedding_backward_adam_split_unweighted_meta.cpp.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward_split_host.dir/gen_embedding_backward_adam_split_unweighted_meta.cpp.o.d -o CMakeFiles/fbgemm_gpu_tbe_training_backward_split_host.dir/gen_embedding_backward_adam_split_unweighted_meta.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.12/cmake-build/gen_embedding_backward_adam_split_unweighted_meta.cpp 2025-05-07T20:04:14.2300261Z clang-16: warning: argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T20:04:14.3462310Z [436/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_split_host_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward_split_host.dir/gen_embedding_backward_lamb_split_weighted_meta.cpp.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward_split_host.dir/gen_embedding_backward_lamb_split_weighted_meta.cpp.o.d -o CMakeFiles/fbgemm_gpu_tbe_training_backward_split_host.dir/gen_embedding_backward_lamb_split_weighted_meta.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.12/cmake-build/gen_embedding_backward_lamb_split_weighted_meta.cpp 2025-05-07T20:04:14.3482706Z clang-16: warning: argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T20:04:14.4472220Z [437/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_vbe_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_rowwise_adagrad_with_counter_split_weighted_vbe_kernel_warp.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_rowwise_adagrad_with_counter_split_weighted_vbe_kernel_warp.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.12/cmake-build/gen_embedding_backward_rowwise_adagrad_with_counter_split_weighted_vbe_kernel_warp.cu -o CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_rowwise_adagrad_with_counter_split_weighted_vbe_kernel_warp.cu.o 2025-05-07T20:04:14.4496820Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:04:14.4499961Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:04:14.4501078Z ^ 2025-05-07T20:04:14.4501344Z 2025-05-07T20:04:14.4501781Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:04:14.4502482Z 2025-05-07T20:04:14.4504155Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:04:14.4506810Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:04:14.4508034Z ^ 2025-05-07T20:04:14.4508408Z 2025-05-07T20:04:14.4510144Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:04:14.4513013Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:04:14.4514242Z ^ 2025-05-07T20:04:14.4514518Z 2025-05-07T20:04:14.4514994Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:04:14.4515690Z 2025-05-07T20:04:14.4517382Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:04:14.4519879Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:04:14.4521047Z ^ 2025-05-07T20:04:14.4521434Z 2025-05-07T20:04:14.4523055Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:04:14.4525498Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:04:14.4526831Z ^ 2025-05-07T20:04:14.4527099Z 2025-05-07T20:04:14.4527526Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:04:14.4528174Z 2025-05-07T20:04:14.4529817Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:04:14.4532428Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:04:14.4533672Z ^ 2025-05-07T20:04:14.4534037Z 2025-05-07T20:04:14.4535858Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:04:14.4538586Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:04:14.4539833Z ^ 2025-05-07T20:04:14.4540209Z 2025-05-07T20:04:14.4540669Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:04:14.4541402Z 2025-05-07T20:04:14.4543121Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:04:14.4545902Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:04:14.4547038Z ^ 2025-05-07T20:04:14.4547409Z 2025-05-07T20:04:14.4548888Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:04:14.4551698Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:04:14.4552853Z ^ 2025-05-07T20:04:14.4553126Z 2025-05-07T20:04:14.4553569Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:04:14.4554266Z 2025-05-07T20:04:14.4555997Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:04:14.4558695Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:04:14.4559915Z ^ 2025-05-07T20:04:14.4560276Z 2025-05-07T20:04:14.6403981Z [438/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_split_host_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward_split_host.dir/gen_embedding_backward_lamb_split_unweighted_meta.cpp.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward_split_host.dir/gen_embedding_backward_lamb_split_unweighted_meta.cpp.o.d -o CMakeFiles/fbgemm_gpu_tbe_training_backward_split_host.dir/gen_embedding_backward_lamb_split_unweighted_meta.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.12/cmake-build/gen_embedding_backward_lamb_split_unweighted_meta.cpp 2025-05-07T20:04:14.6424515Z clang-16: warning: argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T20:04:14.8328486Z [439/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_split_host_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward_split_host.dir/gen_embedding_backward_lars_sgd_split_weighted_meta.cpp.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward_split_host.dir/gen_embedding_backward_lars_sgd_split_weighted_meta.cpp.o.d -o CMakeFiles/fbgemm_gpu_tbe_training_backward_split_host.dir/gen_embedding_backward_lars_sgd_split_weighted_meta.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.12/cmake-build/gen_embedding_backward_lars_sgd_split_weighted_meta.cpp 2025-05-07T20:04:14.8349066Z clang-16: warning: argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T20:04:15.3461145Z [440/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_vbe_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_adam_split_unweighted_vbe_kernel_warp.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_adam_split_unweighted_vbe_kernel_warp.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.12/cmake-build/gen_embedding_backward_adam_split_unweighted_vbe_kernel_warp.cu -o CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_adam_split_unweighted_vbe_kernel_warp.cu.o 2025-05-07T20:04:15.3483844Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:04:15.3486455Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:04:15.3487598Z ^ 2025-05-07T20:04:15.3487854Z 2025-05-07T20:04:15.3488298Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:04:15.3488921Z 2025-05-07T20:04:15.3490396Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:04:15.3492907Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:04:15.3494115Z ^ 2025-05-07T20:04:15.3494483Z 2025-05-07T20:04:15.3495997Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:04:15.3498513Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:04:15.3499926Z ^ 2025-05-07T20:04:15.3500206Z 2025-05-07T20:04:15.3500611Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:04:15.3501386Z 2025-05-07T20:04:15.3502938Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:04:15.3505459Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:04:15.3506540Z ^ 2025-05-07T20:04:15.3506883Z 2025-05-07T20:04:15.3508397Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:04:15.3511196Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:04:15.3512482Z ^ 2025-05-07T20:04:15.3512742Z 2025-05-07T20:04:15.3513378Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:04:15.3514021Z 2025-05-07T20:04:15.3515526Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:04:15.3518272Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:04:15.3519402Z ^ 2025-05-07T20:04:15.3519746Z 2025-05-07T20:04:15.3521437Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:04:15.3523918Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:04:15.3524996Z ^ 2025-05-07T20:04:15.3525235Z 2025-05-07T20:04:15.3525670Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:04:15.3526273Z 2025-05-07T20:04:15.3527818Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:04:15.3530356Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:04:15.3531500Z ^ 2025-05-07T20:04:15.3531858Z 2025-05-07T20:04:15.3533375Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:04:15.3535880Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:04:15.3536980Z ^ 2025-05-07T20:04:15.3537255Z 2025-05-07T20:04:15.3537648Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:04:15.3538246Z 2025-05-07T20:04:15.3539834Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:04:15.3542356Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:04:15.3543691Z ^ 2025-05-07T20:04:15.3544045Z 2025-05-07T20:04:15.7452905Z [441/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_index_select_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_tbe_index_select.dir/codegen/training/index_select/batch_index_select_dim0_ops.cpp.o -MF CMakeFiles/fbgemm_gpu_tbe_index_select.dir/codegen/training/index_select/batch_index_select_dim0_ops.cpp.o.d -o CMakeFiles/fbgemm_gpu_tbe_index_select.dir/codegen/training/index_select/batch_index_select_dim0_ops.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/codegen/training/index_select/batch_index_select_dim0_ops.cpp 2025-05-07T20:04:15.7471958Z clang-16: warning: argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T20:04:16.7034405Z [442/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_split_host_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward_split_host.dir/gen_embedding_backward_partial_rowwise_adam_split_weighted_meta.cpp.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward_split_host.dir/gen_embedding_backward_partial_rowwise_adam_split_weighted_meta.cpp.o.d -o CMakeFiles/fbgemm_gpu_tbe_training_backward_split_host.dir/gen_embedding_backward_partial_rowwise_adam_split_weighted_meta.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.12/cmake-build/gen_embedding_backward_partial_rowwise_adam_split_weighted_meta.cpp 2025-05-07T20:04:16.7052608Z clang-16: warning: argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T20:04:17.1515793Z [443/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_split_host_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward_split_host.dir/gen_embedding_backward_partial_rowwise_adam_split_unweighted_meta.cpp.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward_split_host.dir/gen_embedding_backward_partial_rowwise_adam_split_unweighted_meta.cpp.o.d -o CMakeFiles/fbgemm_gpu_tbe_training_backward_split_host.dir/gen_embedding_backward_partial_rowwise_adam_split_unweighted_meta.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.12/cmake-build/gen_embedding_backward_partial_rowwise_adam_split_unweighted_meta.cpp 2025-05-07T20:04:17.1536641Z clang-16: warning: argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T20:04:17.4437569Z [444/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_split_host_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward_split_host.dir/gen_embedding_backward_partial_rowwise_lamb_split_weighted_meta.cpp.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward_split_host.dir/gen_embedding_backward_partial_rowwise_lamb_split_weighted_meta.cpp.o.d -o CMakeFiles/fbgemm_gpu_tbe_training_backward_split_host.dir/gen_embedding_backward_partial_rowwise_lamb_split_weighted_meta.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.12/cmake-build/gen_embedding_backward_partial_rowwise_lamb_split_weighted_meta.cpp 2025-05-07T20:04:17.4458957Z clang-16: warning: argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T20:04:17.8871963Z [445/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_split_host_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward_split_host.dir/gen_embedding_backward_partial_rowwise_lamb_split_unweighted_meta.cpp.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward_split_host.dir/gen_embedding_backward_partial_rowwise_lamb_split_unweighted_meta.cpp.o.d -o CMakeFiles/fbgemm_gpu_tbe_training_backward_split_host.dir/gen_embedding_backward_partial_rowwise_lamb_split_unweighted_meta.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.12/cmake-build/gen_embedding_backward_partial_rowwise_lamb_split_unweighted_meta.cpp 2025-05-07T20:04:17.8889627Z clang-16: warning: argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T20:04:18.0145080Z [446/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_split_host_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward_split_host.dir/gen_embedding_backward_split_rowwise_adagrad_with_counter.cpp.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward_split_host.dir/gen_embedding_backward_split_rowwise_adagrad_with_counter.cpp.o.d -o CMakeFiles/fbgemm_gpu_tbe_training_backward_split_host.dir/gen_embedding_backward_split_rowwise_adagrad_with_counter.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.12/cmake-build/gen_embedding_backward_split_rowwise_adagrad_with_counter.cpp 2025-05-07T20:04:18.0166432Z clang-16: warning: argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T20:04:18.0775479Z [447/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_split_host_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward_split_host.dir/gen_embedding_backward_lars_sgd_split_unweighted_meta.cpp.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward_split_host.dir/gen_embedding_backward_lars_sgd_split_unweighted_meta.cpp.o.d -o CMakeFiles/fbgemm_gpu_tbe_training_backward_split_host.dir/gen_embedding_backward_lars_sgd_split_unweighted_meta.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.12/cmake-build/gen_embedding_backward_lars_sgd_split_unweighted_meta.cpp 2025-05-07T20:04:18.0798110Z clang-16: warning: argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T20:04:18.0948031Z [448/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_vbe_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_adam_split_weighted_vbe_cuda.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_adam_split_weighted_vbe_cuda.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.12/cmake-build/gen_embedding_backward_adam_split_weighted_vbe_cuda.cu -o CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_adam_split_weighted_vbe_cuda.cu.o 2025-05-07T20:04:18.0971233Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:04:18.0973979Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:04:18.0975475Z ^ 2025-05-07T20:04:18.0975738Z 2025-05-07T20:04:18.0976205Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:04:18.0976918Z 2025-05-07T20:04:18.0978626Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:04:18.0981368Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:04:18.0982560Z ^ 2025-05-07T20:04:18.0982922Z 2025-05-07T20:04:18.0984364Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:04:18.0987159Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:04:18.0988303Z ^ 2025-05-07T20:04:18.0988586Z 2025-05-07T20:04:18.0989178Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:04:18.0989843Z 2025-05-07T20:04:18.0991657Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:04:18.0994314Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:04:18.0995657Z ^ 2025-05-07T20:04:18.0996021Z 2025-05-07T20:04:18.0997641Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:04:18.1000286Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:04:18.1001474Z ^ 2025-05-07T20:04:18.1001731Z 2025-05-07T20:04:18.1002166Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:04:18.1002835Z 2025-05-07T20:04:18.1004492Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:04:18.1007247Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:04:18.1008396Z ^ 2025-05-07T20:04:18.1008763Z 2025-05-07T20:04:18.1010432Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:04:18.1013177Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:04:18.1014331Z ^ 2025-05-07T20:04:18.1014591Z 2025-05-07T20:04:18.1015046Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:04:18.1015670Z 2025-05-07T20:04:18.1017241Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:04:18.1019879Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:04:18.1021233Z ^ 2025-05-07T20:04:18.1021606Z 2025-05-07T20:04:18.1023178Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:04:18.1025744Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:04:18.1026965Z ^ 2025-05-07T20:04:18.1027229Z 2025-05-07T20:04:18.1027690Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:04:18.1028517Z 2025-05-07T20:04:18.1030180Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:04:18.1033352Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:04:18.1034554Z ^ 2025-05-07T20:04:18.1035058Z 2025-05-07T20:04:18.1052491Z [449/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_split_host_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward_split_host.dir/gen_embedding_backward_none_split_weighted_meta.cpp.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward_split_host.dir/gen_embedding_backward_none_split_weighted_meta.cpp.o.d -o CMakeFiles/fbgemm_gpu_tbe_training_backward_split_host.dir/gen_embedding_backward_none_split_weighted_meta.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.12/cmake-build/gen_embedding_backward_none_split_weighted_meta.cpp 2025-05-07T20:04:18.1073015Z clang-16: warning: argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T20:04:18.2767017Z [450/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_split_host_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward_split_host.dir/gen_embedding_backward_none_split_unweighted_meta.cpp.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward_split_host.dir/gen_embedding_backward_none_split_unweighted_meta.cpp.o.d -o CMakeFiles/fbgemm_gpu_tbe_training_backward_split_host.dir/gen_embedding_backward_none_split_unweighted_meta.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.12/cmake-build/gen_embedding_backward_none_split_unweighted_meta.cpp 2025-05-07T20:04:18.2785923Z clang-16: warning: argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T20:04:18.5658022Z [451/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_vbe_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_dense_split_unweighted_vbe_kernel_cta.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_dense_split_unweighted_vbe_kernel_cta.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.12/cmake-build/gen_embedding_backward_dense_split_unweighted_vbe_kernel_cta.cu -o CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_dense_split_unweighted_vbe_kernel_cta.cu.o 2025-05-07T20:04:18.5683070Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:04:18.5686199Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:04:18.5687431Z ^ 2025-05-07T20:04:18.5687681Z 2025-05-07T20:04:18.5688136Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:04:18.5688831Z 2025-05-07T20:04:18.5690608Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:04:18.5693329Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:04:18.5694549Z ^ 2025-05-07T20:04:18.5695069Z 2025-05-07T20:04:18.5696806Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:04:18.5699542Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:04:18.5700716Z ^ 2025-05-07T20:04:18.5701011Z 2025-05-07T20:04:18.5701472Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:04:18.5702156Z 2025-05-07T20:04:18.5703845Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:04:18.5706510Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:04:18.5707736Z ^ 2025-05-07T20:04:18.5708108Z 2025-05-07T20:04:18.5709832Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:04:18.5712649Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:04:18.5713851Z ^ 2025-05-07T20:04:18.5714108Z 2025-05-07T20:04:18.5714586Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:04:18.5715260Z 2025-05-07T20:04:18.5716982Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:04:18.5719677Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:04:18.5721032Z ^ 2025-05-07T20:04:18.5721422Z 2025-05-07T20:04:18.5723079Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:04:18.5725804Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:04:18.5726947Z ^ 2025-05-07T20:04:18.5727228Z 2025-05-07T20:04:18.5727682Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:04:18.5728350Z 2025-05-07T20:04:18.5730091Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:04:18.5733033Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:04:18.5734162Z ^ 2025-05-07T20:04:18.5734519Z 2025-05-07T20:04:18.5736448Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:04:18.5739225Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:04:18.5740491Z ^ 2025-05-07T20:04:18.5740756Z 2025-05-07T20:04:18.5741268Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:04:18.5741934Z 2025-05-07T20:04:18.5743571Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:04:18.5746433Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:04:18.5747562Z ^ 2025-05-07T20:04:18.5747944Z 2025-05-07T20:04:18.6729924Z [452/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_split_host_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward_split_host.dir/gen_embedding_backward_rowwise_adagrad_with_counter_split_weighted_meta.cpp.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward_split_host.dir/gen_embedding_backward_rowwise_adagrad_with_counter_split_weighted_meta.cpp.o.d -o CMakeFiles/fbgemm_gpu_tbe_training_backward_split_host.dir/gen_embedding_backward_rowwise_adagrad_with_counter_split_weighted_meta.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.12/cmake-build/gen_embedding_backward_rowwise_adagrad_with_counter_split_weighted_meta.cpp 2025-05-07T20:04:18.6750381Z clang-16: warning: argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T20:04:18.6882919Z [453/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_split_host_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward_split_host.dir/gen_embedding_backward_rowwise_adagrad_with_counter_split_unweighted_meta.cpp.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward_split_host.dir/gen_embedding_backward_rowwise_adagrad_with_counter_split_unweighted_meta.cpp.o.d -o CMakeFiles/fbgemm_gpu_tbe_training_backward_split_host.dir/gen_embedding_backward_rowwise_adagrad_with_counter_split_unweighted_meta.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.12/cmake-build/gen_embedding_backward_rowwise_adagrad_with_counter_split_unweighted_meta.cpp 2025-05-07T20:04:18.6903847Z clang-16: warning: argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T20:04:19.0789752Z [454/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_vbe_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_adam_split_unweighted_vbe_kernel_cta.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_adam_split_unweighted_vbe_kernel_cta.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.12/cmake-build/gen_embedding_backward_adam_split_unweighted_vbe_kernel_cta.cu -o CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_adam_split_unweighted_vbe_kernel_cta.cu.o 2025-05-07T20:04:19.0813361Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:04:19.0816433Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:04:19.0817599Z ^ 2025-05-07T20:04:19.0817832Z 2025-05-07T20:04:19.0818268Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:04:19.0818819Z 2025-05-07T20:04:19.0820359Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:04:19.0822959Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:04:19.0824036Z ^ 2025-05-07T20:04:19.0824404Z 2025-05-07T20:04:19.0825899Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:04:19.0828549Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:04:19.0829633Z ^ 2025-05-07T20:04:19.0829914Z 2025-05-07T20:04:19.0830355Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:04:19.0830982Z 2025-05-07T20:04:19.0832796Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:04:19.0835403Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:04:19.0836821Z ^ 2025-05-07T20:04:19.0837168Z 2025-05-07T20:04:19.0838739Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:04:19.0841326Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:04:19.0842564Z ^ 2025-05-07T20:04:19.0842831Z 2025-05-07T20:04:19.0843300Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:04:19.0843985Z 2025-05-07T20:04:19.0845771Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:04:19.0848311Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:04:19.0849624Z ^ 2025-05-07T20:04:19.0850024Z 2025-05-07T20:04:19.0851717Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:04:19.0854464Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:04:19.0855650Z ^ 2025-05-07T20:04:19.0855922Z 2025-05-07T20:04:19.0856340Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:04:19.0856954Z 2025-05-07T20:04:19.0858455Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:04:19.0860964Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:04:19.0862121Z ^ 2025-05-07T20:04:19.0862447Z 2025-05-07T20:04:19.0864108Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:04:19.0867166Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:04:19.0868261Z ^ 2025-05-07T20:04:19.0868516Z 2025-05-07T20:04:19.0868941Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:04:19.0869603Z 2025-05-07T20:04:19.0871243Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:04:19.0874146Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:04:19.0875338Z ^ 2025-05-07T20:04:19.0875712Z 2025-05-07T20:04:19.6985813Z [455/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_vbe_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_dense_split_weighted_vbe_kernel_cta.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_dense_split_weighted_vbe_kernel_cta.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.12/cmake-build/gen_embedding_backward_dense_split_weighted_vbe_kernel_cta.cu -o CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_dense_split_weighted_vbe_kernel_cta.cu.o 2025-05-07T20:04:19.7009822Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:04:19.7012634Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:04:19.7013900Z ^ 2025-05-07T20:04:19.7014127Z 2025-05-07T20:04:19.7014564Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:04:19.7015190Z 2025-05-07T20:04:19.7016802Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:04:19.7019509Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:04:19.7020710Z ^ 2025-05-07T20:04:19.7021096Z 2025-05-07T20:04:19.7022680Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:04:19.7025254Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:04:19.7026395Z ^ 2025-05-07T20:04:19.7026686Z 2025-05-07T20:04:19.7027126Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:04:19.7027989Z 2025-05-07T20:04:19.7029655Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:04:19.7032542Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:04:19.7033673Z ^ 2025-05-07T20:04:19.7034009Z 2025-05-07T20:04:19.7035499Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:04:19.7037818Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:04:19.7039036Z ^ 2025-05-07T20:04:19.7039253Z 2025-05-07T20:04:19.7039620Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:04:19.7040256Z 2025-05-07T20:04:19.7041832Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:04:19.7044339Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:04:19.7045472Z ^ 2025-05-07T20:04:19.7045835Z 2025-05-07T20:04:19.7047595Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:04:19.7050284Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:04:19.7051446Z ^ 2025-05-07T20:04:19.7051697Z 2025-05-07T20:04:19.7052204Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:04:19.7052856Z 2025-05-07T20:04:19.7054438Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:04:19.7057060Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:04:19.7058231Z ^ 2025-05-07T20:04:19.7058590Z 2025-05-07T20:04:19.7060203Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:04:19.7062948Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:04:19.7064128Z ^ 2025-05-07T20:04:19.7064367Z 2025-05-07T20:04:19.7065095Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:04:19.7065821Z 2025-05-07T20:04:19.7067487Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:04:19.7070105Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:04:19.7071283Z ^ 2025-05-07T20:04:19.7071744Z 2025-05-07T20:04:19.7742436Z [456/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_vbe_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_sgd_split_unweighted_vbe_kernel_cta.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_sgd_split_unweighted_vbe_kernel_cta.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.12/cmake-build/gen_embedding_backward_sgd_split_unweighted_vbe_kernel_cta.cu -o CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_sgd_split_unweighted_vbe_kernel_cta.cu.o 2025-05-07T20:04:19.7765550Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:04:19.7768027Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:04:19.7769173Z ^ 2025-05-07T20:04:19.7769416Z 2025-05-07T20:04:19.7769825Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:04:19.7770409Z 2025-05-07T20:04:19.7772049Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:04:19.7774823Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:04:19.7776012Z ^ 2025-05-07T20:04:19.7776374Z 2025-05-07T20:04:19.7778127Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:04:19.7780976Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:04:19.7782124Z ^ 2025-05-07T20:04:19.7782381Z 2025-05-07T20:04:19.7782817Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:04:19.7783494Z 2025-05-07T20:04:19.7785143Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:04:19.7787812Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:04:19.7789041Z ^ 2025-05-07T20:04:19.7789586Z 2025-05-07T20:04:19.7791247Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:04:19.7794033Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:04:19.7795360Z ^ 2025-05-07T20:04:19.7795626Z 2025-05-07T20:04:19.7796067Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:04:19.7796732Z 2025-05-07T20:04:19.7798292Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:04:19.7801162Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:04:19.7802396Z ^ 2025-05-07T20:04:19.7802753Z 2025-05-07T20:04:19.7804432Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:04:19.7807098Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:04:19.7808295Z ^ 2025-05-07T20:04:19.7808554Z 2025-05-07T20:04:19.7809002Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:04:19.7809682Z 2025-05-07T20:04:19.7811406Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:04:19.7814045Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:04:19.7815196Z ^ 2025-05-07T20:04:19.7815540Z 2025-05-07T20:04:19.7817206Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:04:19.7819828Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:04:19.7821019Z ^ 2025-05-07T20:04:19.7821265Z 2025-05-07T20:04:19.7821730Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:04:19.7822411Z 2025-05-07T20:04:19.7824090Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:04:19.7826939Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:04:19.7828134Z ^ 2025-05-07T20:04:19.7828542Z 2025-05-07T20:04:21.3244533Z [457/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_split_host_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward_split_host.dir/gen_embedding_backward_ssd_rowwise_adagrad.cpp.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward_split_host.dir/gen_embedding_backward_ssd_rowwise_adagrad.cpp.o.d -o CMakeFiles/fbgemm_gpu_tbe_training_backward_split_host.dir/gen_embedding_backward_ssd_rowwise_adagrad.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.12/cmake-build/gen_embedding_backward_ssd_rowwise_adagrad.cpp 2025-05-07T20:04:21.3262718Z clang-16: warning: argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T20:04:21.9694918Z [458/608] : && /github/home/miniconda/envs/build_binary/bin/c++ -fPIC -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -L/github/home/miniconda/envs/build_binary/targets/x86_64-linux/lib -L/github/home/miniconda/envs/build_binary/targets/x86_64-linux/lib/stubs -s -shared -Wl,-soname,fbgemm_gpu_tbe_training_backward_split_host.so -o fbgemm_gpu_tbe_training_backward_split_host.so CMakeFiles/fbgemm_gpu_tbe_training_backward_split_host.dir/gen_embedding_backward_split_adagrad.cpp.o CMakeFiles/fbgemm_gpu_tbe_training_backward_split_host.dir/gen_embedding_backward_split_rowwise_adagrad.cpp.o CMakeFiles/fbgemm_gpu_tbe_training_backward_split_host.dir/gen_embedding_backward_split_sgd.cpp.o CMakeFiles/fbgemm_gpu_tbe_training_backward_split_host.dir/gen_embedding_backward_split_adam.cpp.o CMakeFiles/fbgemm_gpu_tbe_training_backward_split_host.dir/gen_embedding_backward_split_lamb.cpp.o CMakeFiles/fbgemm_gpu_tbe_training_backward_split_host.dir/gen_embedding_backward_split_lars_sgd.cpp.o CMakeFiles/fbgemm_gpu_tbe_training_backward_split_host.dir/gen_embedding_backward_split_partial_rowwise_adam.cpp.o CMakeFiles/fbgemm_gpu_tbe_training_backward_split_host.dir/gen_embedding_backward_split_partial_rowwise_lamb.cpp.o CMakeFiles/fbgemm_gpu_tbe_training_backward_split_host.dir/gen_embedding_backward_split_none.cpp.o CMakeFiles/fbgemm_gpu_tbe_training_backward_split_host.dir/gen_embedding_backward_split_rowwise_adagrad_with_counter.cpp.o CMakeFiles/fbgemm_gpu_tbe_training_backward_split_host.dir/gen_embedding_backward_split_approx_sgd.cpp.o CMakeFiles/fbgemm_gpu_tbe_training_backward_split_host.dir/gen_embedding_backward_split_approx_rowwise_adagrad.cpp.o CMakeFiles/fbgemm_gpu_tbe_training_backward_split_host.dir/gen_embedding_backward_split_approx_rowwise_adagrad_with_counter.cpp.o CMakeFiles/fbgemm_gpu_tbe_training_backward_split_host.dir/gen_embedding_backward_split_approx_rowwise_adagrad_with_weight_decay.cpp.o CMakeFiles/fbgemm_gpu_tbe_training_backward_split_host.dir/gen_embedding_backward_split_rowwise_adagrad_with_weight_decay.cpp.o CMakeFiles/fbgemm_gpu_tbe_training_backward_split_host.dir/gen_embedding_backward_split_rowwise_weighted_adagrad.cpp.o CMakeFiles/fbgemm_gpu_tbe_training_backward_split_host.dir/gen_embedding_backward_ssd_rowwise_adagrad.cpp.o CMakeFiles/fbgemm_gpu_tbe_training_backward_split_host.dir/gen_embedding_backward_adagrad_split_weighted_meta.cpp.o CMakeFiles/fbgemm_gpu_tbe_training_backward_split_host.dir/gen_embedding_backward_adagrad_split_unweighted_meta.cpp.o CMakeFiles/fbgemm_gpu_tbe_training_backward_split_host.dir/gen_embedding_backward_rowwise_adagrad_split_weighted_meta.cpp.o CMakeFiles/fbgemm_gpu_tbe_training_backward_split_host.dir/gen_embedding_backward_rowwise_adagrad_split_unweighted_meta.cpp.o CMakeFiles/fbgemm_gpu_tbe_training_backward_split_host.dir/gen_embedding_backward_sgd_split_weighted_meta.cpp.o CMakeFiles/fbgemm_gpu_tbe_training_backward_split_host.dir/gen_embedding_backward_sgd_split_unweighted_meta.cpp.o CMakeFiles/fbgemm_gpu_tbe_training_backward_split_host.dir/gen_embedding_backward_adam_split_weighted_meta.cpp.o CMakeFiles/fbgemm_gpu_tbe_training_backward_split_host.dir/gen_embedding_backward_adam_split_unweighted_meta.cpp.o CMakeFiles/fbgemm_gpu_tbe_training_backward_split_host.dir/gen_embedding_backward_lamb_split_weighted_meta.cpp.o CMakeFiles/fbgemm_gpu_tbe_training_backward_split_host.dir/gen_embedding_backward_lamb_split_unweighted_meta.cpp.o CMakeFiles/fbgemm_gpu_tbe_training_backward_split_host.dir/gen_embedding_backward_lars_sgd_split_weighted_meta.cpp.o CMakeFiles/fbgemm_gpu_tbe_training_backward_split_host.dir/gen_embedding_backward_lars_sgd_split_unweighted_meta.cpp.o CMakeFiles/fbgemm_gpu_tbe_training_backward_split_host.dir/gen_embedding_backward_partial_rowwise_adam_split_weighted_meta.cpp.o CMakeFiles/fbgemm_gpu_tbe_training_backward_split_host.dir/gen_embedding_backward_partial_rowwise_adam_split_unweighted_meta.cpp.o CMakeFiles/fbgemm_gpu_tbe_training_backward_split_host.dir/gen_embedding_backward_partial_rowwise_lamb_split_weighted_meta.cpp.o CMakeFiles/fbgemm_gpu_tbe_training_backward_split_host.dir/gen_embedding_backward_partial_rowwise_lamb_split_unweighted_meta.cpp.o CMakeFiles/fbgemm_gpu_tbe_training_backward_split_host.dir/gen_embedding_backward_none_split_weighted_meta.cpp.o CMakeFiles/fbgemm_gpu_tbe_training_backward_split_host.dir/gen_embedding_backward_none_split_unweighted_meta.cpp.o CMakeFiles/fbgemm_gpu_tbe_training_backward_split_host.dir/gen_embedding_backward_rowwise_adagrad_with_counter_split_weighted_meta.cpp.o CMakeFiles/fbgemm_gpu_tbe_training_backward_split_host.dir/gen_embedding_backward_rowwise_adagrad_with_counter_split_unweighted_meta.cpp.o -L/lib/intel64 -L/lib/intel64_win -L/lib/win-x64 -Wl,-rpath,"\$ORIGIN" /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/lib/libc10.so /github/home/miniconda/envs/build_binary/targets/x86_64-linux/lib/libnvrtc.so /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/lib/libc10_cuda.so /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/nvidia/nccl/lib/libnccl.so.2 /github/home/miniconda/envs/build_binary/targets/x86_64-linux/lib/stubs/libcuda.so fbgemm_gpu_config.so fbgemm_gpu_tbe_utils.so /github/home/miniconda/envs/build_binary/lib/stubs/libnvidia-ml.so -Wl,--no-as-needed,"/github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/lib/libtorch.so" -Wl,--as-needed /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/lib/libtorch.so -Wl,--no-as-needed,"/github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/lib/libtorch_cpu.so" -Wl,--as-needed -Wl,--no-as-needed,"/github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/lib/libtorch_cuda.so" -Wl,--as-needed /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/lib/libc10_cuda.so /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/lib/libc10.so /github/home/miniconda/envs/build_binary/targets/x86_64-linux/lib/libcudart.so && : 2025-05-07T20:04:24.8399735Z [459/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_py_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_py.dir/src/memory_utils/memory_utils.cpp.o -MF CMakeFiles/fbgemm_gpu_py.dir/src/memory_utils/memory_utils.cpp.o.d -o CMakeFiles/fbgemm_gpu_py.dir/src/memory_utils/memory_utils.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/memory_utils/memory_utils.cpp 2025-05-07T20:04:24.8417271Z clang-16: warning: argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T20:04:25.9980925Z [460/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_vbe_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_adam_split_unweighted_vbe_cuda.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_adam_split_unweighted_vbe_cuda.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.12/cmake-build/gen_embedding_backward_adam_split_unweighted_vbe_cuda.cu -o CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_adam_split_unweighted_vbe_cuda.cu.o 2025-05-07T20:04:26.0004365Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:04:26.0007124Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:04:26.0008297Z ^ 2025-05-07T20:04:26.0008548Z 2025-05-07T20:04:26.0009007Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:04:26.0009718Z 2025-05-07T20:04:26.0011462Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:04:26.0014297Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:04:26.0015532Z ^ 2025-05-07T20:04:26.0015933Z 2025-05-07T20:04:26.0017655Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:04:26.0020469Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:04:26.0021696Z ^ 2025-05-07T20:04:26.0021976Z 2025-05-07T20:04:26.0022444Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:04:26.0023131Z 2025-05-07T20:04:26.0024911Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:04:26.0027712Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:04:26.0028995Z ^ 2025-05-07T20:04:26.0029376Z 2025-05-07T20:04:26.0031154Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:04:26.0034248Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:04:26.0035675Z ^ 2025-05-07T20:04:26.0035944Z 2025-05-07T20:04:26.0036443Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:04:26.0037142Z 2025-05-07T20:04:26.0038877Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:04:26.0041704Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:04:26.0045860Z ^ 2025-05-07T20:04:26.0046256Z 2025-05-07T20:04:26.0047974Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:04:26.0050893Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:04:26.0052122Z ^ 2025-05-07T20:04:26.0052414Z 2025-05-07T20:04:26.0052888Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:04:26.0053590Z 2025-05-07T20:04:26.0055449Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:04:26.0058278Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:04:26.0059558Z ^ 2025-05-07T20:04:26.0059940Z 2025-05-07T20:04:26.0061502Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:04:26.0064202Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:04:26.0065589Z ^ 2025-05-07T20:04:26.0065840Z 2025-05-07T20:04:26.0066287Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:04:26.0066962Z 2025-05-07T20:04:26.0068503Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:04:26.0071334Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:04:26.0072644Z ^ 2025-05-07T20:04:26.0073022Z 2025-05-07T20:04:26.0886504Z [461/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_vbe_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_rowwise_adagrad_with_counter_split_unweighted_vbe_kernel_warp.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_rowwise_adagrad_with_counter_split_unweighted_vbe_kernel_warp.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.12/cmake-build/gen_embedding_backward_rowwise_adagrad_with_counter_split_unweighted_vbe_kernel_warp.cu -o CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_rowwise_adagrad_with_counter_split_unweighted_vbe_kernel_warp.cu.o 2025-05-07T20:04:26.0911116Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:04:26.0914069Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:04:26.0915206Z ^ 2025-05-07T20:04:26.0915460Z 2025-05-07T20:04:26.0915888Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:04:26.0916559Z 2025-05-07T20:04:26.0918236Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:04:26.0920490Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:04:26.0921570Z ^ 2025-05-07T20:04:26.0921893Z 2025-05-07T20:04:26.0923492Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:04:26.0926099Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:04:26.0927310Z ^ 2025-05-07T20:04:26.0927581Z 2025-05-07T20:04:26.0928083Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:04:26.0928786Z 2025-05-07T20:04:26.0930375Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:04:26.0933384Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:04:26.0934629Z ^ 2025-05-07T20:04:26.0934974Z 2025-05-07T20:04:26.0936637Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:04:26.0939377Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:04:26.0940603Z ^ 2025-05-07T20:04:26.0940862Z 2025-05-07T20:04:26.0941327Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:04:26.0942174Z 2025-05-07T20:04:26.0943784Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:04:26.0946635Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:04:26.0947876Z ^ 2025-05-07T20:04:26.0948244Z 2025-05-07T20:04:26.0949949Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:04:26.0953109Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:04:26.0954330Z ^ 2025-05-07T20:04:26.0954581Z 2025-05-07T20:04:26.0955050Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:04:26.0955748Z 2025-05-07T20:04:26.0957493Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:04:26.0960198Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:04:26.0961405Z ^ 2025-05-07T20:04:26.0961791Z 2025-05-07T20:04:26.0963520Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:04:26.0966520Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:04:26.0967735Z ^ 2025-05-07T20:04:26.0968008Z 2025-05-07T20:04:26.0968469Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:04:26.0969154Z 2025-05-07T20:04:26.0970919Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:04:26.0973703Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:04:26.0974932Z ^ 2025-05-07T20:04:26.0975300Z 2025-05-07T20:04:26.2416402Z [462/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_py_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_py.dir/src/memory_utils/memory_utils_ops.cpp.o -MF CMakeFiles/fbgemm_gpu_py.dir/src/memory_utils/memory_utils_ops.cpp.o.d -o CMakeFiles/fbgemm_gpu_py.dir/src/memory_utils/memory_utils_ops.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/memory_utils/memory_utils_ops.cpp 2025-05-07T20:04:26.2435585Z clang-16: warning: argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T20:04:26.8697240Z [463/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_py_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_py.dir/src/tbe/eeg/eeg_models.cpp.o -MF CMakeFiles/fbgemm_gpu_py.dir/src/tbe/eeg/eeg_models.cpp.o.d -o CMakeFiles/fbgemm_gpu_py.dir/src/tbe/eeg/eeg_models.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/tbe/eeg/eeg_models.cpp 2025-05-07T20:04:26.8713394Z clang-16: warning: argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T20:04:26.9260543Z [464/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_py_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_py.dir/src/merge_pooled_embedding_ops/merge_pooled_embedding_ops_cpu.cpp.o -MF CMakeFiles/fbgemm_gpu_py.dir/src/merge_pooled_embedding_ops/merge_pooled_embedding_ops_cpu.cpp.o.d -o CMakeFiles/fbgemm_gpu_py.dir/src/merge_pooled_embedding_ops/merge_pooled_embedding_ops_cpu.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/merge_pooled_embedding_ops/merge_pooled_embedding_ops_cpu.cpp 2025-05-07T20:04:26.9279051Z clang-16: warning: argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T20:04:28.4250198Z [465/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_index_select_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_tbe_index_select.dir/codegen/training/index_select/batch_index_select_dim0_cpu_host.cpp.o -MF CMakeFiles/fbgemm_gpu_tbe_index_select.dir/codegen/training/index_select/batch_index_select_dim0_cpu_host.cpp.o.d -o CMakeFiles/fbgemm_gpu_tbe_index_select.dir/codegen/training/index_select/batch_index_select_dim0_cpu_host.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/codegen/training/index_select/batch_index_select_dim0_cpu_host.cpp 2025-05-07T20:04:28.4270845Z clang-16: warning: argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T20:04:28.5690089Z [466/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_py_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_py.dir/src/permute_pooled_embedding_ops/permute_pooled_embedding_function.cpp.o -MF CMakeFiles/fbgemm_gpu_py.dir/src/permute_pooled_embedding_ops/permute_pooled_embedding_function.cpp.o.d -o CMakeFiles/fbgemm_gpu_py.dir/src/permute_pooled_embedding_ops/permute_pooled_embedding_function.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/permute_pooled_embedding_ops/permute_pooled_embedding_function.cpp 2025-05-07T20:04:28.5710477Z clang-16: warning: argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T20:04:28.8416385Z [467/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_py_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_py.dir/src/layout_transform_ops/layout_transform_ops_cpu.cpp.o -MF CMakeFiles/fbgemm_gpu_py.dir/src/layout_transform_ops/layout_transform_ops_cpu.cpp.o.d -o CMakeFiles/fbgemm_gpu_py.dir/src/layout_transform_ops/layout_transform_ops_cpu.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/layout_transform_ops/layout_transform_ops_cpu.cpp 2025-05-07T20:04:28.8434090Z clang-16: warning: argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T20:04:29.2858064Z [468/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_py_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_py.dir/src/permute_multi_embedding_ops/permute_multi_embedding_function.cpp.o -MF CMakeFiles/fbgemm_gpu_py.dir/src/permute_multi_embedding_ops/permute_multi_embedding_function.cpp.o.d -o CMakeFiles/fbgemm_gpu_py.dir/src/permute_multi_embedding_ops/permute_multi_embedding_function.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/permute_multi_embedding_ops/permute_multi_embedding_function.cpp 2025-05-07T20:04:29.2876136Z clang-16: warning: argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T20:04:30.2109233Z [469/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_index_select_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_tbe_index_select.dir/codegen/training/index_select/batch_index_select_dim0_host.cpp.o -MF CMakeFiles/fbgemm_gpu_tbe_index_select.dir/codegen/training/index_select/batch_index_select_dim0_host.cpp.o.d -o CMakeFiles/fbgemm_gpu_tbe_index_select.dir/codegen/training/index_select/batch_index_select_dim0_host.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/codegen/training/index_select/batch_index_select_dim0_host.cpp 2025-05-07T20:04:30.2127866Z clang-16: warning: argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T20:04:30.2759502Z [470/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_py_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_py.dir/src/jagged_tensor_ops/jagged_tensor_ops_meta.cpp.o -MF CMakeFiles/fbgemm_gpu_py.dir/src/jagged_tensor_ops/jagged_tensor_ops_meta.cpp.o.d -o CMakeFiles/fbgemm_gpu_py.dir/src/jagged_tensor_ops/jagged_tensor_ops_meta.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/jagged_tensor_ops/jagged_tensor_ops_meta.cpp 2025-05-07T20:04:30.2775915Z clang-16: warning: argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T20:04:30.9993587Z [471/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_embedding_inplace_ops_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_embedding_inplace_ops.dir/src/embedding_inplace_ops/embedding_inplace_update_cpu.cpp.o -MF CMakeFiles/fbgemm_gpu_embedding_inplace_ops.dir/src/embedding_inplace_ops/embedding_inplace_update_cpu.cpp.o.d -o CMakeFiles/fbgemm_gpu_embedding_inplace_ops.dir/src/embedding_inplace_ops/embedding_inplace_update_cpu.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/embedding_inplace_ops/embedding_inplace_update_cpu.cpp 2025-05-07T20:04:31.0011677Z clang-16: warning: argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T20:04:31.2556206Z [472/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_py_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_py.dir/src/quantize_ops/quantize_ops_cpu.cpp.o -MF CMakeFiles/fbgemm_gpu_py.dir/src/quantize_ops/quantize_ops_cpu.cpp.o.d -o CMakeFiles/fbgemm_gpu_py.dir/src/quantize_ops/quantize_ops_cpu.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/quantize_ops/quantize_ops_cpu.cpp 2025-05-07T20:04:31.2573046Z clang-16: warning: argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T20:04:31.5203607Z [473/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_embedding_inplace_ops_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_embedding_inplace_ops.dir/src/embedding_inplace_ops/embedding_inplace_update_gpu.cpp.o -MF CMakeFiles/fbgemm_gpu_embedding_inplace_ops.dir/src/embedding_inplace_ops/embedding_inplace_update_gpu.cpp.o.d -o CMakeFiles/fbgemm_gpu_embedding_inplace_ops.dir/src/embedding_inplace_ops/embedding_inplace_update_gpu.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/embedding_inplace_ops/embedding_inplace_update_gpu.cpp 2025-05-07T20:04:31.5223089Z clang-16: warning: argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T20:04:32.4266007Z [474/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_py_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_py.dir/src/permute_pooled_embedding_ops/permute_pooled_embedding_ops_cpu.cpp.o -MF CMakeFiles/fbgemm_gpu_py.dir/src/permute_pooled_embedding_ops/permute_pooled_embedding_ops_cpu.cpp.o.d -o CMakeFiles/fbgemm_gpu_py.dir/src/permute_pooled_embedding_ops/permute_pooled_embedding_ops_cpu.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/permute_pooled_embedding_ops/permute_pooled_embedding_ops_cpu.cpp 2025-05-07T20:04:32.4286522Z clang-16: warning: argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T20:04:32.4556056Z [475/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_py_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_py.dir/src/quantize_ops/quantize_ops_meta.cpp.o -MF CMakeFiles/fbgemm_gpu_py.dir/src/quantize_ops/quantize_ops_meta.cpp.o.d -o CMakeFiles/fbgemm_gpu_py.dir/src/quantize_ops/quantize_ops_meta.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/quantize_ops/quantize_ops_meta.cpp 2025-05-07T20:04:32.4574787Z clang-16: warning: argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T20:04:32.5277739Z [476/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_py_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_py.dir/src/input_combine_ops/input_combine_cpu.cpp.o -MF CMakeFiles/fbgemm_gpu_py.dir/src/input_combine_ops/input_combine_cpu.cpp.o.d -o CMakeFiles/fbgemm_gpu_py.dir/src/input_combine_ops/input_combine_cpu.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/input_combine_ops/input_combine_cpu.cpp 2025-05-07T20:04:32.5296673Z clang-16: warning: argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T20:04:34.9128661Z [477/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_py_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_py.dir/src/permute_pooled_embedding_ops/permute_pooled_embedding_ops_split_cpu.cpp.o -MF CMakeFiles/fbgemm_gpu_py.dir/src/permute_pooled_embedding_ops/permute_pooled_embedding_ops_split_cpu.cpp.o.d -o CMakeFiles/fbgemm_gpu_py.dir/src/permute_pooled_embedding_ops/permute_pooled_embedding_ops_split_cpu.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/permute_pooled_embedding_ops/permute_pooled_embedding_ops_split_cpu.cpp 2025-05-07T20:04:34.9147613Z clang-16: warning: argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T20:04:35.1238086Z [478/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_py_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_py.dir/src/sparse_ops/sparse_async_batched_cumsum.cpp.o -MF CMakeFiles/fbgemm_gpu_py.dir/src/sparse_ops/sparse_async_batched_cumsum.cpp.o.d -o CMakeFiles/fbgemm_gpu_py.dir/src/sparse_ops/sparse_async_batched_cumsum.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/sparse_ops/sparse_async_batched_cumsum.cpp 2025-05-07T20:04:35.1256503Z clang-16: warning: argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T20:04:36.7951576Z [479/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_py_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_py.dir/src/permute_multi_embedding_ops/permute_multi_embedding_ops_cpu.cpp.o -MF CMakeFiles/fbgemm_gpu_py.dir/src/permute_multi_embedding_ops/permute_multi_embedding_ops_cpu.cpp.o.d -o CMakeFiles/fbgemm_gpu_py.dir/src/permute_multi_embedding_ops/permute_multi_embedding_ops_cpu.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/permute_multi_embedding_ops/permute_multi_embedding_ops_cpu.cpp 2025-05-07T20:04:36.7971546Z clang-16: warning: argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T20:04:37.2521465Z [480/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_py_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_py.dir/src/sparse_ops/sparse_ops_meta.cpp.o -MF CMakeFiles/fbgemm_gpu_py.dir/src/sparse_ops/sparse_ops_meta.cpp.o.d -o CMakeFiles/fbgemm_gpu_py.dir/src/sparse_ops/sparse_ops_meta.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/sparse_ops/sparse_ops_meta.cpp 2025-05-07T20:04:37.2537898Z clang-16: warning: argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T20:04:37.6603936Z [481/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_py_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_py.dir/src/topology_utils.cpp.o -MF CMakeFiles/fbgemm_gpu_py.dir/src/topology_utils.cpp.o.d -o CMakeFiles/fbgemm_gpu_py.dir/src/topology_utils.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/topology_utils.cpp 2025-05-07T20:04:37.6621859Z clang-16: warning: argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T20:04:40.3233321Z [482/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_py_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_py.dir/src/layout_transform_ops/layout_transform_ops_gpu.cpp.o -MF CMakeFiles/fbgemm_gpu_py.dir/src/layout_transform_ops/layout_transform_ops_gpu.cpp.o.d -o CMakeFiles/fbgemm_gpu_py.dir/src/layout_transform_ops/layout_transform_ops_gpu.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/layout_transform_ops/layout_transform_ops_gpu.cpp 2025-05-07T20:04:40.3251098Z clang-16: warning: argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T20:04:41.1137442Z [483/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_py_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_py.dir/src/metric_ops/metric_ops_host.cpp.o -MF CMakeFiles/fbgemm_gpu_py.dir/src/metric_ops/metric_ops_host.cpp.o.d -o CMakeFiles/fbgemm_gpu_py.dir/src/metric_ops/metric_ops_host.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/metric_ops/metric_ops_host.cpp 2025-05-07T20:04:41.1153704Z clang-16: warning: argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T20:04:41.4811506Z [484/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_py_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_py.dir/src/quantize_ops/quantize_ops_gpu.cpp.o -MF CMakeFiles/fbgemm_gpu_py.dir/src/quantize_ops/quantize_ops_gpu.cpp.o.d -o CMakeFiles/fbgemm_gpu_py.dir/src/quantize_ops/quantize_ops_gpu.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/quantize_ops/quantize_ops_gpu.cpp 2025-05-07T20:04:41.4827315Z clang-16: warning: argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T20:04:41.5497739Z [485/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_dense_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward_dense.dir/gen_embedding_backward_dense_split_weighted_kernel_warp.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward_dense.dir/gen_embedding_backward_dense_split_weighted_kernel_warp.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.12/cmake-build/gen_embedding_backward_dense_split_weighted_kernel_warp.cu -o CMakeFiles/fbgemm_gpu_tbe_training_backward_dense.dir/gen_embedding_backward_dense_split_weighted_kernel_warp.cu.o 2025-05-07T20:04:41.5515565Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:04:41.5517720Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:04:41.5518597Z ^ 2025-05-07T20:04:41.5518989Z 2025-05-07T20:04:41.5519333Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:04:41.5519861Z 2025-05-07T20:04:41.5521032Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:04:41.5522976Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:04:41.5523823Z ^ 2025-05-07T20:04:41.5524090Z 2025-05-07T20:04:41.5525276Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:04:41.5527154Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:04:41.5528032Z ^ 2025-05-07T20:04:41.5528223Z 2025-05-07T20:04:41.5528599Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:04:41.5529099Z 2025-05-07T20:04:41.5530371Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:04:41.5532355Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:04:41.5533253Z ^ 2025-05-07T20:04:41.5533529Z 2025-05-07T20:04:41.5534715Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:04:41.5536865Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:04:41.5537918Z ^ 2025-05-07T20:04:41.5538120Z 2025-05-07T20:04:41.5538466Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:04:41.5539002Z 2025-05-07T20:04:41.5540259Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:04:41.5542297Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:04:41.5543190Z ^ 2025-05-07T20:04:41.5543461Z 2025-05-07T20:04:41.5544678Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:04:41.5546740Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:04:41.5547617Z ^ 2025-05-07T20:04:41.5547812Z 2025-05-07T20:04:41.5548387Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:04:41.5548883Z 2025-05-07T20:04:41.5550101Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:04:41.5553045Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:04:41.5554013Z ^ 2025-05-07T20:04:41.5554303Z 2025-05-07T20:04:41.5555491Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:04:41.5557624Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:04:41.5558475Z ^ 2025-05-07T20:04:41.5558711Z 2025-05-07T20:04:41.5559036Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:04:41.5559512Z 2025-05-07T20:04:41.5560712Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:04:41.5575252Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:04:41.5576193Z ^ 2025-05-07T20:04:41.5576474Z 2025-05-07T20:04:41.5848282Z [486/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_py_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_py.dir/src/input_combine_ops/input_combine_gpu.cpp.o -MF CMakeFiles/fbgemm_gpu_py.dir/src/input_combine_ops/input_combine_gpu.cpp.o.d -o CMakeFiles/fbgemm_gpu_py.dir/src/input_combine_ops/input_combine_gpu.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/input_combine_ops/input_combine_gpu.cpp 2025-05-07T20:04:41.5862455Z clang-16: warning: argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T20:04:42.1602197Z [487/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_py_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_py.dir/src/tbe/eeg/eeg_utils.cpp.o -MF CMakeFiles/fbgemm_gpu_py.dir/src/tbe/eeg/eeg_utils.cpp.o.d -o CMakeFiles/fbgemm_gpu_py.dir/src/tbe/eeg/eeg_utils.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/tbe/eeg/eeg_utils.cpp 2025-05-07T20:04:42.1616695Z clang-16: warning: argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T20:04:42.4021686Z [488/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_py_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_py.dir/src/tbe/eeg/indices_estimator_ops.cpp.o -MF CMakeFiles/fbgemm_gpu_py.dir/src/tbe/eeg/indices_estimator_ops.cpp.o.d -o CMakeFiles/fbgemm_gpu_py.dir/src/tbe/eeg/indices_estimator_ops.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/tbe/eeg/indices_estimator_ops.cpp 2025-05-07T20:04:42.4041155Z clang-16: warning: argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T20:04:43.6546635Z [489/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_py_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_py.dir/src/merge_pooled_embedding_ops/merge_pooled_embedding_ops_gpu.cpp.o -MF CMakeFiles/fbgemm_gpu_py.dir/src/merge_pooled_embedding_ops/merge_pooled_embedding_ops_gpu.cpp.o.d -o CMakeFiles/fbgemm_gpu_py.dir/src/merge_pooled_embedding_ops/merge_pooled_embedding_ops_gpu.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/merge_pooled_embedding_ops/merge_pooled_embedding_ops_gpu.cpp 2025-05-07T20:04:43.6566576Z clang-16: warning: argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T20:04:43.9582562Z [490/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_py_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_py.dir/src/tbe/eeg/indices_estimator.cpp.o -MF CMakeFiles/fbgemm_gpu_py.dir/src/tbe/eeg/indices_estimator.cpp.o.d -o CMakeFiles/fbgemm_gpu_py.dir/src/tbe/eeg/indices_estimator.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/tbe/eeg/indices_estimator.cpp 2025-05-07T20:04:43.9602162Z clang-16: warning: argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T20:04:44.3734256Z [491/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_py_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_py.dir/src/tbe/eeg/indices_generator_ops.cpp.o -MF CMakeFiles/fbgemm_gpu_py.dir/src/tbe/eeg/indices_generator_ops.cpp.o.d -o CMakeFiles/fbgemm_gpu_py.dir/src/tbe/eeg/indices_generator_ops.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/tbe/eeg/indices_generator_ops.cpp 2025-05-07T20:04:44.3753443Z clang-16: warning: argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T20:04:44.7765453Z [492/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_dense_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward_dense.dir/gen_embedding_backward_dense_split_unweighted_nobag_cuda.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward_dense.dir/gen_embedding_backward_dense_split_unweighted_nobag_cuda.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.12/cmake-build/gen_embedding_backward_dense_split_unweighted_nobag_cuda.cu -o CMakeFiles/fbgemm_gpu_tbe_training_backward_dense.dir/gen_embedding_backward_dense_split_unweighted_nobag_cuda.cu.o 2025-05-07T20:04:44.7789568Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:04:44.7792433Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:04:44.7793564Z ^ 2025-05-07T20:04:44.7793863Z 2025-05-07T20:04:44.7794303Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:04:44.7794972Z 2025-05-07T20:04:44.7796668Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:04:44.7799418Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:04:44.7800639Z ^ 2025-05-07T20:04:44.7801008Z 2025-05-07T20:04:44.7802667Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:04:44.7805666Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:04:44.7806828Z ^ 2025-05-07T20:04:44.7807067Z 2025-05-07T20:04:44.7807517Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:04:44.7808215Z 2025-05-07T20:04:44.7809900Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:04:44.7812528Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:04:44.7813959Z ^ 2025-05-07T20:04:44.7814337Z 2025-05-07T20:04:44.7815953Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:04:44.7818922Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:04:44.7820135Z ^ 2025-05-07T20:04:44.7820402Z 2025-05-07T20:04:44.7820894Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:04:44.7821551Z 2025-05-07T20:04:44.7823132Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:04:44.7825709Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:04:44.7826894Z ^ 2025-05-07T20:04:44.7827266Z 2025-05-07T20:04:44.7828944Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:04:44.7831589Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:04:44.7832876Z ^ 2025-05-07T20:04:44.7833132Z 2025-05-07T20:04:44.7833576Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:04:44.7834249Z 2025-05-07T20:04:44.7835899Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:04:44.7838354Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:04:44.7839582Z ^ 2025-05-07T20:04:44.7839951Z 2025-05-07T20:04:44.7841694Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:04:44.7844406Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:04:44.7845610Z ^ 2025-05-07T20:04:44.7845865Z 2025-05-07T20:04:44.7846349Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:04:44.7847030Z 2025-05-07T20:04:44.7848728Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:04:44.7850981Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:04:44.7851912Z ^ 2025-05-07T20:04:44.7852234Z 2025-05-07T20:04:44.8838334Z [493/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_dense_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward_dense.dir/gen_embedding_backward_dense_split_weighted_cuda.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward_dense.dir/gen_embedding_backward_dense_split_weighted_cuda.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.12/cmake-build/gen_embedding_backward_dense_split_weighted_cuda.cu -o CMakeFiles/fbgemm_gpu_tbe_training_backward_dense.dir/gen_embedding_backward_dense_split_weighted_cuda.cu.o 2025-05-07T20:04:44.8862646Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:04:44.8865414Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:04:44.8866581Z ^ 2025-05-07T20:04:44.8866844Z 2025-05-07T20:04:44.8867256Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:04:44.8867908Z 2025-05-07T20:04:44.8869602Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:04:44.8872399Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:04:44.8873765Z ^ 2025-05-07T20:04:44.8874166Z 2025-05-07T20:04:44.8875812Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:04:44.8878522Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:04:44.8879644Z ^ 2025-05-07T20:04:44.8879907Z 2025-05-07T20:04:44.8880395Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:04:44.8881058Z 2025-05-07T20:04:44.8882668Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:04:44.8885570Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:04:44.8886794Z ^ 2025-05-07T20:04:44.8887170Z 2025-05-07T20:04:44.8888857Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:04:44.8891513Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:04:44.8892720Z ^ 2025-05-07T20:04:44.8893145Z 2025-05-07T20:04:44.8893720Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:04:44.8894417Z 2025-05-07T20:04:44.8896070Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:04:44.8898816Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:04:44.8900020Z ^ 2025-05-07T20:04:44.8900404Z 2025-05-07T20:04:44.8902127Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:04:44.8904868Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:04:44.8906084Z ^ 2025-05-07T20:04:44.8906367Z 2025-05-07T20:04:44.8907009Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:04:44.8907667Z 2025-05-07T20:04:44.8909106Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:04:44.8911594Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:04:44.8912909Z ^ 2025-05-07T20:04:44.8913435Z 2025-05-07T20:04:44.8915128Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:04:44.8917880Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:04:44.8919076Z ^ 2025-05-07T20:04:44.8919501Z 2025-05-07T20:04:44.8919952Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:04:44.8920634Z 2025-05-07T20:04:44.8922377Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:04:44.8924916Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:04:44.8926124Z ^ 2025-05-07T20:04:44.8926497Z 2025-05-07T20:04:44.9570919Z [494/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_py_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_py.dir/src/tbe/eeg/indices_generator.cpp.o -MF CMakeFiles/fbgemm_gpu_py.dir/src/tbe/eeg/indices_generator.cpp.o.d -o CMakeFiles/fbgemm_gpu_py.dir/src/tbe/eeg/indices_generator.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/tbe/eeg/indices_generator.cpp 2025-05-07T20:04:44.9590500Z clang-16: warning: argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T20:04:46.0806577Z [495/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_py_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_py.dir/src/permute_pooled_embedding_ops/permute_pooled_embedding_ops_gpu.cpp.o -MF CMakeFiles/fbgemm_gpu_py.dir/src/permute_pooled_embedding_ops/permute_pooled_embedding_ops_gpu.cpp.o.d -o CMakeFiles/fbgemm_gpu_py.dir/src/permute_pooled_embedding_ops/permute_pooled_embedding_ops_gpu.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/permute_pooled_embedding_ops/permute_pooled_embedding_ops_gpu.cpp 2025-05-07T20:04:46.0827828Z clang-16: warning: argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T20:04:46.3942270Z [496/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_py_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_py.dir/src/jagged_tensor_ops/jagged_tensor_ops_autograd.cpp.o -MF CMakeFiles/fbgemm_gpu_py.dir/src/jagged_tensor_ops/jagged_tensor_ops_autograd.cpp.o.d -o CMakeFiles/fbgemm_gpu_py.dir/src/jagged_tensor_ops/jagged_tensor_ops_autograd.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/jagged_tensor_ops/jagged_tensor_ops_autograd.cpp 2025-05-07T20:04:46.3962226Z clang-16: warning: argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T20:04:47.6080076Z [497/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_py_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_py.dir/src/permute_pooled_embedding_ops/permute_pooled_embedding_ops_split_gpu.cpp.o -MF CMakeFiles/fbgemm_gpu_py.dir/src/permute_pooled_embedding_ops/permute_pooled_embedding_ops_split_gpu.cpp.o.d -o CMakeFiles/fbgemm_gpu_py.dir/src/permute_pooled_embedding_ops/permute_pooled_embedding_ops_split_gpu.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/permute_pooled_embedding_ops/permute_pooled_embedding_ops_split_gpu.cpp 2025-05-07T20:04:47.6099092Z clang-16: warning: argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T20:04:47.9841554Z [498/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_py_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_py.dir/src/intraining_embedding_pruning_ops/intraining_embedding_pruning_gpu.cpp.o -MF CMakeFiles/fbgemm_gpu_py.dir/src/intraining_embedding_pruning_ops/intraining_embedding_pruning_gpu.cpp.o.d -o CMakeFiles/fbgemm_gpu_py.dir/src/intraining_embedding_pruning_ops/intraining_embedding_pruning_gpu.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/intraining_embedding_pruning_ops/intraining_embedding_pruning_gpu.cpp 2025-05-07T20:04:47.9860560Z clang-16: warning: argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T20:04:50.1522900Z [499/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_dense_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward_dense.dir/gen_embedding_backward_dense_split_unweighted_nobag_kernel_warp.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward_dense.dir/gen_embedding_backward_dense_split_unweighted_nobag_kernel_warp.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.12/cmake-build/gen_embedding_backward_dense_split_unweighted_nobag_kernel_warp.cu -o CMakeFiles/fbgemm_gpu_tbe_training_backward_dense.dir/gen_embedding_backward_dense_split_unweighted_nobag_kernel_warp.cu.o 2025-05-07T20:04:50.1549216Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:04:50.1552281Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:04:50.1553478Z ^ 2025-05-07T20:04:50.1553759Z 2025-05-07T20:04:50.1554258Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:04:50.1554927Z 2025-05-07T20:04:50.1556210Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:04:50.1558689Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:04:50.1559829Z ^ 2025-05-07T20:04:50.1560185Z 2025-05-07T20:04:50.1561835Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:04:50.1564436Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:04:50.1566046Z ^ 2025-05-07T20:04:50.1566311Z 2025-05-07T20:04:50.1566747Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:04:50.1567461Z 2025-05-07T20:04:50.1569106Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:04:50.1571841Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:04:50.1573071Z ^ 2025-05-07T20:04:50.1573466Z 2025-05-07T20:04:50.1575045Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:04:50.1577896Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:04:50.1579096Z ^ 2025-05-07T20:04:50.1579496Z 2025-05-07T20:04:50.1579979Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:04:50.1580646Z 2025-05-07T20:04:50.1582295Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:04:50.1585073Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:04:50.1586302Z ^ 2025-05-07T20:04:50.1586672Z 2025-05-07T20:04:50.1588349Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:04:50.1591026Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:04:50.1592386Z ^ 2025-05-07T20:04:50.1592674Z 2025-05-07T20:04:50.1593116Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:04:50.1593912Z 2025-05-07T20:04:50.1595827Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:04:50.1598286Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:04:50.1599548Z ^ 2025-05-07T20:04:50.1599926Z 2025-05-07T20:04:50.1601598Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:04:50.1604294Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:04:50.1605490Z ^ 2025-05-07T20:04:50.1605763Z 2025-05-07T20:04:50.1606232Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:04:50.1606944Z 2025-05-07T20:04:50.1608694Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:04:50.1611362Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:04:50.1612447Z ^ 2025-05-07T20:04:50.1612946Z 2025-05-07T20:04:50.2778875Z [500/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_py_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_py.dir/src/sparse_ops/sparse_ops_gpu.cpp.o -MF CMakeFiles/fbgemm_gpu_py.dir/src/sparse_ops/sparse_ops_gpu.cpp.o.d -o CMakeFiles/fbgemm_gpu_py.dir/src/sparse_ops/sparse_ops_gpu.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/sparse_ops/sparse_ops_gpu.cpp 2025-05-07T20:04:50.2797854Z clang-16: warning: argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T20:04:53.5299955Z [501/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_dense_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward_dense.dir/gen_embedding_backward_dense_split_unweighted_kernel_warp.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward_dense.dir/gen_embedding_backward_dense_split_unweighted_kernel_warp.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.12/cmake-build/gen_embedding_backward_dense_split_unweighted_kernel_warp.cu -o CMakeFiles/fbgemm_gpu_tbe_training_backward_dense.dir/gen_embedding_backward_dense_split_unweighted_kernel_warp.cu.o 2025-05-07T20:04:53.5324308Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:04:53.5327067Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:04:53.5328016Z ^ 2025-05-07T20:04:53.5328268Z 2025-05-07T20:04:53.5328661Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:04:53.5329238Z 2025-05-07T20:04:53.5330982Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:04:53.5333584Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:04:53.5334789Z ^ 2025-05-07T20:04:53.5335186Z 2025-05-07T20:04:53.5336821Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:04:53.5339615Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:04:53.5340834Z ^ 2025-05-07T20:04:53.5341106Z 2025-05-07T20:04:53.5341476Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:04:53.5342118Z 2025-05-07T20:04:53.5343845Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:04:53.5346497Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:04:53.5347688Z ^ 2025-05-07T20:04:53.5348047Z 2025-05-07T20:04:53.5349724Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:04:53.5352497Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:04:53.5353727Z ^ 2025-05-07T20:04:53.5354001Z 2025-05-07T20:04:53.5354486Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:04:53.5355218Z 2025-05-07T20:04:53.5356859Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:04:53.5359953Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:04:53.5361188Z ^ 2025-05-07T20:04:53.5361592Z 2025-05-07T20:04:53.5363230Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:04:53.5366184Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:04:53.5367576Z ^ 2025-05-07T20:04:53.5367871Z 2025-05-07T20:04:53.5368341Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:04:53.5369046Z 2025-05-07T20:04:53.5370918Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:04:53.5373685Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:04:53.5374947Z ^ 2025-05-07T20:04:53.5375323Z 2025-05-07T20:04:53.5377139Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:04:53.5379933Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:04:53.5381157Z ^ 2025-05-07T20:04:53.5381420Z 2025-05-07T20:04:53.5381876Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:04:53.5382580Z 2025-05-07T20:04:53.5384310Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:04:53.5387074Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:04:53.5388222Z ^ 2025-05-07T20:04:53.5388622Z 2025-05-07T20:04:54.9367124Z [502/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_forward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_split_unweighted_vbe_gwd_kernel.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_split_unweighted_vbe_gwd_kernel.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.12/cmake-build/gen_embedding_forward_split_unweighted_vbe_gwd_kernel.cu -o CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_split_unweighted_vbe_gwd_kernel.cu.o 2025-05-07T20:04:54.9389887Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:04:54.9392773Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:04:54.9393986Z ^ 2025-05-07T20:04:54.9394257Z 2025-05-07T20:04:54.9394877Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:04:54.9395621Z 2025-05-07T20:04:54.9397253Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:04:54.9399868Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:04:54.9401048Z ^ 2025-05-07T20:04:54.9401429Z 2025-05-07T20:04:54.9403059Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:04:54.9405694Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:04:54.9406845Z ^ 2025-05-07T20:04:54.9407102Z 2025-05-07T20:04:54.9407581Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:04:54.9408266Z 2025-05-07T20:04:54.9410031Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:04:54.9412754Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:04:54.9413999Z ^ 2025-05-07T20:04:54.9414372Z 2025-05-07T20:04:54.9415816Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.12/cmake-build/gen_embedding_forward_split_unweighted_vbe_gwd_kernel.cu(236): warning #177-D: variable "offset_idx" was declared but never referenced 2025-05-07T20:04:54.9417649Z const auto offset_idx = idx * D_emb; 2025-05-07T20:04:54.9418243Z ^ 2025-05-07T20:04:54.9418507Z 2025-05-07T20:04:54.9420172Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:04:54.9423206Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:04:54.9424440Z ^ 2025-05-07T20:04:54.9424726Z 2025-05-07T20:04:54.9425199Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:04:54.9425900Z 2025-05-07T20:04:54.9427650Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:04:54.9430553Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:04:54.9431969Z ^ 2025-05-07T20:04:54.9432348Z 2025-05-07T20:04:54.9433883Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.12/cmake-build/gen_embedding_forward_split_unweighted_vbe_gwd_kernel.cu(236): warning #177-D: variable "offset_idx" was declared but never referenced 2025-05-07T20:04:54.9435746Z const auto offset_idx = idx * D_emb; 2025-05-07T20:04:54.9436349Z ^ 2025-05-07T20:04:54.9436623Z 2025-05-07T20:04:54.9438328Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:04:54.9441312Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:04:54.9442499Z ^ 2025-05-07T20:04:54.9442757Z 2025-05-07T20:04:54.9443223Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:04:54.9443960Z 2025-05-07T20:04:54.9445679Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:04:54.9448509Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:04:54.9449747Z ^ 2025-05-07T20:04:54.9450130Z 2025-05-07T20:04:54.9451607Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.12/cmake-build/gen_embedding_forward_split_unweighted_vbe_gwd_kernel.cu(236): warning #177-D: variable "offset_idx" was declared but never referenced 2025-05-07T20:04:54.9453473Z const auto offset_idx = idx * D_emb; 2025-05-07T20:04:54.9454067Z ^ 2025-05-07T20:04:54.9454344Z 2025-05-07T20:04:54.9456094Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:04:54.9458857Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:04:54.9459964Z ^ 2025-05-07T20:04:54.9460188Z 2025-05-07T20:04:54.9460602Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:04:54.9461315Z 2025-05-07T20:04:54.9463042Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:04:54.9466048Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:04:54.9467468Z ^ 2025-05-07T20:04:54.9467827Z 2025-05-07T20:04:54.9469037Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.12/cmake-build/gen_embedding_forward_split_unweighted_vbe_gwd_kernel.cu(236): warning #177-D: variable "offset_idx" was declared but never referenced 2025-05-07T20:04:54.9470901Z const auto offset_idx = idx * D_emb; 2025-05-07T20:04:54.9471455Z ^ 2025-05-07T20:04:54.9471845Z 2025-05-07T20:04:56.2479656Z [503/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_vbe_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_rowwise_adagrad_with_counter_split_unweighted_vbe_kernel_cta.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_rowwise_adagrad_with_counter_split_unweighted_vbe_kernel_cta.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.12/cmake-build/gen_embedding_backward_rowwise_adagrad_with_counter_split_unweighted_vbe_kernel_cta.cu -o CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_rowwise_adagrad_with_counter_split_unweighted_vbe_kernel_cta.cu.o 2025-05-07T20:04:56.2492250Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:04:56.2493667Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:04:56.2494310Z ^ 2025-05-07T20:04:56.2494453Z 2025-05-07T20:04:56.2494703Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:04:56.2495077Z 2025-05-07T20:04:56.2495942Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:04:56.2497463Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:04:56.2498097Z ^ 2025-05-07T20:04:56.2498310Z 2025-05-07T20:04:56.2499164Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:04:56.2500561Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:04:56.2501177Z ^ 2025-05-07T20:04:56.2501373Z 2025-05-07T20:04:56.2501633Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:04:56.2501989Z 2025-05-07T20:04:56.2502851Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:04:56.2504291Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:04:56.2504938Z ^ 2025-05-07T20:04:56.2505133Z 2025-05-07T20:04:56.2505991Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:04:56.2507422Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:04:56.2508064Z ^ 2025-05-07T20:04:56.2508206Z 2025-05-07T20:04:56.2508446Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:04:56.2508799Z 2025-05-07T20:04:56.2509679Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:04:56.2511071Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:04:56.2511842Z ^ 2025-05-07T20:04:56.2512044Z 2025-05-07T20:04:56.2512913Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:04:56.2514290Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:04:56.2514922Z ^ 2025-05-07T20:04:56.2515062Z 2025-05-07T20:04:56.2515315Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:04:56.2515675Z 2025-05-07T20:04:56.2516537Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:04:56.2517939Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:04:56.2518571Z ^ 2025-05-07T20:04:56.2518780Z 2025-05-07T20:04:56.2519637Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:04:56.2521156Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:04:56.2521775Z ^ 2025-05-07T20:04:56.2521929Z 2025-05-07T20:04:56.2522168Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:04:56.2522519Z 2025-05-07T20:04:56.2523398Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:04:56.2524798Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:04:56.2525480Z ^ 2025-05-07T20:04:56.2525677Z 2025-05-07T20:04:58.1567521Z [504/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_vbe_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_rowwise_adagrad_with_counter_split_weighted_vbe_kernel_cta.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_rowwise_adagrad_with_counter_split_weighted_vbe_kernel_cta.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.12/cmake-build/gen_embedding_backward_rowwise_adagrad_with_counter_split_weighted_vbe_kernel_cta.cu -o CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_rowwise_adagrad_with_counter_split_weighted_vbe_kernel_cta.cu.o 2025-05-07T20:04:58.1591477Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:04:58.1594361Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:04:58.1595733Z ^ 2025-05-07T20:04:58.1595972Z 2025-05-07T20:04:58.1596408Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:04:58.1597022Z 2025-05-07T20:04:58.1598675Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:04:58.1601472Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:04:58.1602665Z ^ 2025-05-07T20:04:58.1603050Z 2025-05-07T20:04:58.1604605Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:04:58.1606838Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:04:58.1607800Z ^ 2025-05-07T20:04:58.1608054Z 2025-05-07T20:04:58.1608546Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:04:58.1609134Z 2025-05-07T20:04:58.1610556Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:04:58.1613160Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:04:58.1614314Z ^ 2025-05-07T20:04:58.1614672Z 2025-05-07T20:04:58.1616218Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:04:58.1618908Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:04:58.1620074Z ^ 2025-05-07T20:04:58.1620321Z 2025-05-07T20:04:58.1620753Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:04:58.1621424Z 2025-05-07T20:04:58.1623169Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:04:58.1625986Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:04:58.1627089Z ^ 2025-05-07T20:04:58.1627441Z 2025-05-07T20:04:58.1629022Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:04:58.1631786Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:04:58.1632932Z ^ 2025-05-07T20:04:58.1633174Z 2025-05-07T20:04:58.1633633Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:04:58.1634485Z 2025-05-07T20:04:58.1636134Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:04:58.1638805Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:04:58.1639973Z ^ 2025-05-07T20:04:58.1640324Z 2025-05-07T20:04:58.1641967Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:04:58.1644776Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:04:58.1645991Z ^ 2025-05-07T20:04:58.1646245Z 2025-05-07T20:04:58.1646708Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:04:58.1647424Z 2025-05-07T20:04:58.1649103Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:04:58.1651849Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:04:58.1653155Z ^ 2025-05-07T20:04:58.1653502Z 2025-05-07T20:04:58.7844004Z [505/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_py_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_py.dir/src/sparse_ops/sparse_ops_cpu.cpp.o -MF CMakeFiles/fbgemm_gpu_py.dir/src/sparse_ops/sparse_ops_cpu.cpp.o.d -o CMakeFiles/fbgemm_gpu_py.dir/src/sparse_ops/sparse_ops_cpu.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/sparse_ops/sparse_ops_cpu.cpp 2025-05-07T20:04:58.7862835Z clang-16: warning: argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T20:05:00.5834530Z [506/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_dense_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward_dense.dir/gen_embedding_backward_dense_split_unweighted_cuda.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward_dense.dir/gen_embedding_backward_dense_split_unweighted_cuda.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.12/cmake-build/gen_embedding_backward_dense_split_unweighted_cuda.cu -o CMakeFiles/fbgemm_gpu_tbe_training_backward_dense.dir/gen_embedding_backward_dense_split_unweighted_cuda.cu.o 2025-05-07T20:05:00.5858286Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:05:00.5860990Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:05:00.5862147Z ^ 2025-05-07T20:05:00.5862412Z 2025-05-07T20:05:00.5862847Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:05:00.5863512Z 2025-05-07T20:05:00.5865618Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:05:00.5868242Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:05:00.5869395Z ^ 2025-05-07T20:05:00.5869760Z 2025-05-07T20:05:00.5871417Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:05:00.5874188Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:05:00.5875318Z ^ 2025-05-07T20:05:00.5875571Z 2025-05-07T20:05:00.5876023Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:05:00.5876678Z 2025-05-07T20:05:00.5878324Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:05:00.5881138Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:05:00.5882277Z ^ 2025-05-07T20:05:00.5882639Z 2025-05-07T20:05:00.5884321Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:05:00.5887012Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:05:00.5888153Z ^ 2025-05-07T20:05:00.5888536Z 2025-05-07T20:05:00.5888992Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:05:00.5889651Z 2025-05-07T20:05:00.5891325Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:05:00.5895150Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:05:00.5896344Z ^ 2025-05-07T20:05:00.5896720Z 2025-05-07T20:05:00.5898573Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:05:00.5901334Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:05:00.5902536Z ^ 2025-05-07T20:05:00.5902812Z 2025-05-07T20:05:00.5903270Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:05:00.5903940Z 2025-05-07T20:05:00.5905665Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:05:00.5908407Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:05:00.5909623Z ^ 2025-05-07T20:05:00.5909992Z 2025-05-07T20:05:00.5911818Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:05:00.5914378Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:05:00.5915549Z ^ 2025-05-07T20:05:00.5915795Z 2025-05-07T20:05:00.5916236Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:05:00.5916927Z 2025-05-07T20:05:00.5918628Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:05:00.5921359Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:05:00.5922503Z ^ 2025-05-07T20:05:00.5922886Z 2025-05-07T20:05:08.0400379Z [507/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_dense_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward_dense.dir/gen_embedding_backward_dense_split_weighted_kernel_cta.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward_dense.dir/gen_embedding_backward_dense_split_weighted_kernel_cta.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.12/cmake-build/gen_embedding_backward_dense_split_weighted_kernel_cta.cu -o CMakeFiles/fbgemm_gpu_tbe_training_backward_dense.dir/gen_embedding_backward_dense_split_weighted_kernel_cta.cu.o 2025-05-07T20:05:08.0421172Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:05:08.0423225Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:05:08.0424185Z ^ 2025-05-07T20:05:08.0424424Z 2025-05-07T20:05:08.0424825Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:05:08.0425438Z 2025-05-07T20:05:08.0426966Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:05:08.0429490Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:05:08.0430573Z ^ 2025-05-07T20:05:08.0430908Z 2025-05-07T20:05:08.0432615Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:05:08.0435102Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:05:08.0436430Z ^ 2025-05-07T20:05:08.0436656Z 2025-05-07T20:05:08.0437080Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:05:08.0437716Z 2025-05-07T20:05:08.0439307Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:05:08.0441798Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:05:08.0442875Z ^ 2025-05-07T20:05:08.0443209Z 2025-05-07T20:05:08.0444777Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:05:08.0447417Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:05:08.0448540Z ^ 2025-05-07T20:05:08.0448789Z 2025-05-07T20:05:08.0449338Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:05:08.0449968Z 2025-05-07T20:05:08.0451463Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:05:08.0453831Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:05:08.0454995Z ^ 2025-05-07T20:05:08.0455334Z 2025-05-07T20:05:08.0456798Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:05:08.0459196Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:05:08.0460255Z ^ 2025-05-07T20:05:08.0460498Z 2025-05-07T20:05:08.0460891Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:05:08.0461504Z 2025-05-07T20:05:08.0463030Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:05:08.0465843Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:05:08.0466920Z ^ 2025-05-07T20:05:08.0467280Z 2025-05-07T20:05:08.0468658Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:05:08.0470998Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:05:08.0472167Z ^ 2025-05-07T20:05:08.0472398Z 2025-05-07T20:05:08.0472809Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:05:08.0473406Z 2025-05-07T20:05:08.0474884Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:05:08.0477246Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:05:08.0478552Z ^ 2025-05-07T20:05:08.0478883Z 2025-05-07T20:05:12.4707462Z [508/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_forward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_split_weighted_vbe_gwd_kernel.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_split_weighted_vbe_gwd_kernel.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.12/cmake-build/gen_embedding_forward_split_weighted_vbe_gwd_kernel.cu -o CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_split_weighted_vbe_gwd_kernel.cu.o 2025-05-07T20:05:12.4735790Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:05:12.4739418Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:05:12.4740994Z ^ 2025-05-07T20:05:12.4741320Z 2025-05-07T20:05:12.4741896Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:05:12.4742729Z 2025-05-07T20:05:12.4744702Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:05:12.4748003Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:05:12.4749594Z ^ 2025-05-07T20:05:12.4750082Z 2025-05-07T20:05:12.4752346Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:05:12.4756154Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:05:12.4757613Z ^ 2025-05-07T20:05:12.4757942Z 2025-05-07T20:05:12.4758476Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:05:12.4759275Z 2025-05-07T20:05:12.4761294Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:05:12.4764548Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:05:12.4766413Z ^ 2025-05-07T20:05:12.4766862Z 2025-05-07T20:05:12.4768552Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.12/cmake-build/gen_embedding_forward_split_weighted_vbe_gwd_kernel.cu(245): warning #177-D: variable "offset_idx" was declared but never referenced 2025-05-07T20:05:12.4770716Z const auto offset_idx = idx * D_emb; 2025-05-07T20:05:12.4771627Z ^ 2025-05-07T20:05:12.4771933Z 2025-05-07T20:05:12.4773998Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:05:12.4777455Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:05:12.4778927Z ^ 2025-05-07T20:05:12.4779235Z 2025-05-07T20:05:12.4779795Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:05:12.4780626Z 2025-05-07T20:05:12.4782668Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:05:12.4785979Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:05:12.4787410Z ^ 2025-05-07T20:05:12.4787856Z 2025-05-07T20:05:12.4789589Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.12/cmake-build/gen_embedding_forward_split_weighted_vbe_gwd_kernel.cu(245): warning #177-D: variable "offset_idx" was declared but never referenced 2025-05-07T20:05:12.4791871Z const auto offset_idx = idx * D_emb; 2025-05-07T20:05:12.4792563Z ^ 2025-05-07T20:05:12.4792884Z 2025-05-07T20:05:12.4794920Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:05:12.4798453Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:05:12.4799989Z ^ 2025-05-07T20:05:12.4800297Z 2025-05-07T20:05:12.4800865Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:05:12.4801733Z 2025-05-07T20:05:12.4803716Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:05:12.4807024Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:05:12.4808496Z ^ 2025-05-07T20:05:12.4809138Z 2025-05-07T20:05:12.4810816Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.12/cmake-build/gen_embedding_forward_split_weighted_vbe_gwd_kernel.cu(245): warning #177-D: variable "offset_idx" was declared but never referenced 2025-05-07T20:05:12.4813126Z const auto offset_idx = idx * D_emb; 2025-05-07T20:05:12.4813819Z ^ 2025-05-07T20:05:12.4814176Z 2025-05-07T20:05:12.4816333Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:05:12.4819831Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:05:12.4821514Z ^ 2025-05-07T20:05:12.4821838Z 2025-05-07T20:05:12.4822430Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:05:12.4823233Z 2025-05-07T20:05:12.4825344Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:05:12.4828614Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:05:12.4830081Z ^ 2025-05-07T20:05:12.4830523Z 2025-05-07T20:05:12.4832452Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.12/cmake-build/gen_embedding_forward_split_weighted_vbe_gwd_kernel.cu(245): warning #177-D: variable "offset_idx" was declared but never referenced 2025-05-07T20:05:12.4834666Z const auto offset_idx = idx * D_emb; 2025-05-07T20:05:12.4835373Z ^ 2025-05-07T20:05:12.4835725Z 2025-05-07T20:05:13.7144600Z [509/608] : && /github/home/miniconda/envs/build_binary/bin/c++ -fPIC -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -L/github/home/miniconda/envs/build_binary/targets/x86_64-linux/lib -L/github/home/miniconda/envs/build_binary/targets/x86_64-linux/lib/stubs -s -shared -Wl,-soname,fbgemm_gpu_tbe_training_forward.so -o fbgemm_gpu_tbe_training_forward.so CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_split_pt2_cpu_wrapper.cpp.o CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_split_weighted_codegen_meta.cpp.o CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_split_unweighted_codegen_meta.cpp.o CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_dense_weighted_codegen_meta.cpp.o CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_dense_unweighted_codegen_meta.cpp.o CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_ssd_weighted_codegen_meta.cpp.o CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_ssd_unweighted_codegen_meta.cpp.o CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_split_weighted_vbe_codegen_meta.cpp.o CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_ssd_weighted_vbe_codegen_meta.cpp.o CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_split_unweighted_vbe_codegen_meta.cpp.o CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_ssd_unweighted_vbe_codegen_meta.cpp.o CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_split_pt2_cuda_wrapper.cpp.o CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_ssd_pt2_cuda_wrapper.cpp.o CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_split_weighted_kernel.cu.o CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_dense_weighted_kernel.cu.o CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_ssd_weighted_kernel.cu.o CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_split_unweighted_nobag_kernel.cu.o CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_dense_unweighted_nobag_kernel.cu.o CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_ssd_unweighted_nobag_kernel.cu.o CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_split_unweighted_kernel.cu.o CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_dense_unweighted_kernel.cu.o CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_ssd_unweighted_kernel.cu.o CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_split_weighted_codegen_cuda.cu.o CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_split_unweighted_codegen_cuda.cu.o CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_dense_weighted_codegen_cuda.cu.o CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_dense_unweighted_codegen_cuda.cu.o CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_ssd_weighted_codegen_cuda.cu.o CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_ssd_unweighted_codegen_cuda.cu.o CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_split_weighted_vbe_codegen_cuda.cu.o CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_split_weighted_vbe_kernel.cu.o CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_split_weighted_v2_kernel.cu.o CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_split_weighted_gwd_codegen_cuda.cu.o CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_split_weighted_vbe_gwd_codegen_cuda.cu.o CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_split_weighted_gwd_kernel.cu.o CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_dense_weighted_vbe_codegen_cuda.cu.o CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_dense_weighted_vbe_kernel.cu.o CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_split_weighted_vbe_gwd_kernel.cu.o CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_ssd_weighted_vbe_codegen_cuda.cu.o CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_ssd_weighted_vbe_kernel.cu.o CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_split_unweighted_vbe_codegen_cuda.cu.o CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_split_unweighted_vbe_kernel.cu.o CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_split_unweighted_v2_kernel.cu.o CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_split_unweighted_gwd_codegen_cuda.cu.o CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_split_unweighted_vbe_gwd_codegen_cuda.cu.o CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_split_unweighted_gwd_kernel.cu.o CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_dense_unweighted_vbe_codegen_cuda.cu.o CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_dense_unweighted_vbe_kernel.cu.o CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_split_unweighted_vbe_gwd_kernel.cu.o CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_ssd_unweighted_vbe_codegen_cuda.cu.o CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_ssd_unweighted_vbe_kernel.cu.o CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_split_unweighted_nobag_kernel_small.cu.o CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_dense_unweighted_nobag_kernel_small.cu.o CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_ssd_unweighted_nobag_kernel_small.cu.o -L/lib/intel64 -L/lib/intel64_win -L/lib/win-x64 -Wl,-rpath,"\$ORIGIN" /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/lib/libc10.so /github/home/miniconda/envs/build_binary/targets/x86_64-linux/lib/libnvrtc.so /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/lib/libc10_cuda.so /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/nvidia/nccl/lib/libnccl.so.2 /github/home/miniconda/envs/build_binary/targets/x86_64-linux/lib/stubs/libcuda.so fbgemm_gpu_tbe_common.so /github/home/miniconda/envs/build_binary/lib/stubs/libnvidia-ml.so -Wl,--no-as-needed,"/github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/lib/libtorch.so" -Wl,--as-needed /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/lib/libtorch.so -Wl,--no-as-needed,"/github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/lib/libtorch_cpu.so" -Wl,--as-needed -Wl,--no-as-needed,"/github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/lib/libtorch_cuda.so" -Wl,--as-needed /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/lib/libc10_cuda.so /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/lib/libc10.so /github/home/miniconda/envs/build_binary/targets/x86_64-linux/lib/libcudart.so -lcudadevrt -lcudart_static -ldl -Wl,-rpath-link,/__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.12/cmake-build && : 2025-05-07T20:05:14.3623741Z [510/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_index_select_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_index_select.dir/gen_embedding_backward_split_grad_index_select.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_index_select.dir/gen_embedding_backward_split_grad_index_select.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.12/cmake-build/gen_embedding_backward_split_grad_index_select.cu -o CMakeFiles/fbgemm_gpu_tbe_index_select.dir/gen_embedding_backward_split_grad_index_select.cu.o 2025-05-07T20:05:14.3641230Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:05:14.3643703Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:05:14.3644659Z ^ 2025-05-07T20:05:14.3644898Z 2025-05-07T20:05:14.3645282Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:05:14.3645774Z 2025-05-07T20:05:14.3646964Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:05:14.3649010Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:05:14.3650052Z ^ 2025-05-07T20:05:14.3650335Z 2025-05-07T20:05:14.3651545Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:05:14.3653683Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:05:14.3654588Z ^ 2025-05-07T20:05:14.3654822Z 2025-05-07T20:05:14.3655178Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:05:14.3655688Z 2025-05-07T20:05:14.3657031Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:05:14.3659054Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:05:14.3659984Z ^ 2025-05-07T20:05:14.3660264Z 2025-05-07T20:05:14.3661506Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:05:14.3663486Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:05:14.3664396Z ^ 2025-05-07T20:05:14.3664603Z 2025-05-07T20:05:14.3665232Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:05:14.3665743Z 2025-05-07T20:05:14.3666989Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:05:14.3668976Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:05:14.3673849Z ^ 2025-05-07T20:05:14.3674115Z 2025-05-07T20:05:14.3675375Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:05:14.3677285Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:05:14.3678160Z ^ 2025-05-07T20:05:14.3678366Z 2025-05-07T20:05:14.3678709Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:05:14.3679189Z 2025-05-07T20:05:14.3680370Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:05:14.3682309Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:05:14.3683206Z ^ 2025-05-07T20:05:14.3683486Z 2025-05-07T20:05:14.3684629Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:05:14.3686524Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:05:14.3687558Z ^ 2025-05-07T20:05:14.3687765Z 2025-05-07T20:05:14.3688089Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:05:14.3688564Z 2025-05-07T20:05:14.3689772Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:05:14.3691796Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:05:14.3692676Z ^ 2025-05-07T20:05:14.3692949Z 2025-05-07T20:05:14.5304567Z [511/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_dense_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward_dense.dir/gen_embedding_backward_dense_split_unweighted_nobag_kernel_cta.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward_dense.dir/gen_embedding_backward_dense_split_unweighted_nobag_kernel_cta.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.12/cmake-build/gen_embedding_backward_dense_split_unweighted_nobag_kernel_cta.cu -o CMakeFiles/fbgemm_gpu_tbe_training_backward_dense.dir/gen_embedding_backward_dense_split_unweighted_nobag_kernel_cta.cu.o 2025-05-07T20:05:14.5322084Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:05:14.5324088Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:05:14.5324978Z ^ 2025-05-07T20:05:14.5325184Z 2025-05-07T20:05:14.5325538Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:05:14.5326024Z 2025-05-07T20:05:14.5327219Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:05:14.5329799Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:05:14.5330967Z ^ 2025-05-07T20:05:14.5331428Z 2025-05-07T20:05:14.5333027Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:05:14.5335626Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:05:14.5336727Z ^ 2025-05-07T20:05:14.5337019Z 2025-05-07T20:05:14.5337417Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:05:14.5338028Z 2025-05-07T20:05:14.5339599Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:05:14.5342197Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:05:14.5343335Z ^ 2025-05-07T20:05:14.5343700Z 2025-05-07T20:05:14.5345290Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:05:14.5347859Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:05:14.5349025Z ^ 2025-05-07T20:05:14.5349260Z 2025-05-07T20:05:14.5349708Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:05:14.5350361Z 2025-05-07T20:05:14.5351919Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:05:14.5354101Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:05:14.5354971Z ^ 2025-05-07T20:05:14.5355262Z 2025-05-07T20:05:14.5356485Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:05:14.5358402Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:05:14.5359235Z ^ 2025-05-07T20:05:14.5359448Z 2025-05-07T20:05:14.5359808Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:05:14.5360289Z 2025-05-07T20:05:14.5361534Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:05:14.5363564Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:05:14.5364422Z ^ 2025-05-07T20:05:14.5364967Z 2025-05-07T20:05:14.5366257Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:05:14.5368725Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:05:14.5369662Z ^ 2025-05-07T20:05:14.5369875Z 2025-05-07T20:05:14.5370317Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:05:14.5370833Z 2025-05-07T20:05:14.5372018Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:05:14.5374070Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:05:14.5374968Z ^ 2025-05-07T20:05:14.5375234Z 2025-05-07T20:05:18.0749394Z [512/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_index_select_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_index_select.dir/gen_batch_index_select_dim0_forward_codegen_cuda.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_index_select.dir/gen_batch_index_select_dim0_forward_codegen_cuda.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.12/cmake-build/gen_batch_index_select_dim0_forward_codegen_cuda.cu -o CMakeFiles/fbgemm_gpu_tbe_index_select.dir/gen_batch_index_select_dim0_forward_codegen_cuda.cu.o 2025-05-07T20:05:18.0771638Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:05:18.0774253Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:05:18.0775528Z ^ 2025-05-07T20:05:18.0775781Z 2025-05-07T20:05:18.0776186Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:05:18.0776822Z 2025-05-07T20:05:18.0778526Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:05:18.0781079Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:05:18.0782197Z ^ 2025-05-07T20:05:18.0782572Z 2025-05-07T20:05:18.0784367Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:05:18.0786775Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:05:18.0787757Z ^ 2025-05-07T20:05:18.0787995Z 2025-05-07T20:05:18.0788410Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:05:18.0789059Z 2025-05-07T20:05:18.0790576Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:05:18.0793194Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:05:18.0794294Z ^ 2025-05-07T20:05:18.0794616Z 2025-05-07T20:05:18.0796119Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:05:18.0798383Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:05:18.0799457Z ^ 2025-05-07T20:05:18.0799892Z 2025-05-07T20:05:18.0800312Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:05:18.0800907Z 2025-05-07T20:05:18.0802379Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:05:18.0804733Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:05:18.0805786Z ^ 2025-05-07T20:05:18.0806134Z 2025-05-07T20:05:18.0807589Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:05:18.0809989Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:05:18.0811093Z ^ 2025-05-07T20:05:18.0811332Z 2025-05-07T20:05:18.0811773Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:05:18.0812370Z 2025-05-07T20:05:18.0813843Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:05:18.0816351Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:05:18.0817644Z ^ 2025-05-07T20:05:18.0817999Z 2025-05-07T20:05:18.0819506Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:05:18.0822138Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:05:18.0823273Z ^ 2025-05-07T20:05:18.0823522Z 2025-05-07T20:05:18.0823949Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:05:18.0824591Z 2025-05-07T20:05:18.0826166Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:05:18.0828617Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:05:18.0829745Z ^ 2025-05-07T20:05:18.0830061Z 2025-05-07T20:05:22.0645671Z [513/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_dense_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward_dense.dir/gen_embedding_backward_dense_split_unweighted_kernel_cta.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward_dense.dir/gen_embedding_backward_dense_split_unweighted_kernel_cta.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.12/cmake-build/gen_embedding_backward_dense_split_unweighted_kernel_cta.cu -o CMakeFiles/fbgemm_gpu_tbe_training_backward_dense.dir/gen_embedding_backward_dense_split_unweighted_kernel_cta.cu.o 2025-05-07T20:05:22.0664070Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:05:22.0666984Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:05:22.0667990Z ^ 2025-05-07T20:05:22.0668172Z 2025-05-07T20:05:22.0668752Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:05:22.0669277Z 2025-05-07T20:05:22.0670592Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:05:22.0673561Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:05:22.0674859Z ^ 2025-05-07T20:05:22.0675279Z 2025-05-07T20:05:22.0677095Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:05:22.0679725Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:05:22.0680899Z ^ 2025-05-07T20:05:22.0681109Z 2025-05-07T20:05:22.0681453Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:05:22.0681958Z 2025-05-07T20:05:22.0683229Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:05:22.0685407Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:05:22.0686378Z ^ 2025-05-07T20:05:22.0686750Z 2025-05-07T20:05:22.0688115Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:05:22.0690456Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:05:22.0691383Z ^ 2025-05-07T20:05:22.0691599Z 2025-05-07T20:05:22.0691981Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:05:22.0692515Z 2025-05-07T20:05:22.0693886Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:05:22.0696387Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:05:22.0697312Z ^ 2025-05-07T20:05:22.0697583Z 2025-05-07T20:05:22.0699133Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:05:22.0701384Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:05:22.0702324Z ^ 2025-05-07T20:05:22.0702540Z 2025-05-07T20:05:22.0702886Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:05:22.0703339Z 2025-05-07T20:05:22.0704712Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:05:22.0706756Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:05:22.0707769Z ^ 2025-05-07T20:05:22.0708012Z 2025-05-07T20:05:22.0709125Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:05:22.0710881Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:05:22.0711959Z ^ 2025-05-07T20:05:22.0712168Z 2025-05-07T20:05:22.0712478Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:05:22.0712909Z 2025-05-07T20:05:22.0714034Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:05:22.0715834Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:05:22.0716638Z ^ 2025-05-07T20:05:22.0716879Z 2025-05-07T20:05:24.4225372Z [514/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_index_select_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_index_select.dir/gen_batch_index_select_dim0_forward_kernel.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_index_select.dir/gen_batch_index_select_dim0_forward_kernel.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.12/cmake-build/gen_batch_index_select_dim0_forward_kernel.cu -o CMakeFiles/fbgemm_gpu_tbe_index_select.dir/gen_batch_index_select_dim0_forward_kernel.cu.o 2025-05-07T20:05:24.4247450Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:05:24.4250423Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:05:24.4251560Z ^ 2025-05-07T20:05:24.4251807Z 2025-05-07T20:05:24.4252226Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:05:24.4252854Z 2025-05-07T20:05:24.4254581Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:05:24.4257207Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:05:24.4258362Z ^ 2025-05-07T20:05:24.4258672Z 2025-05-07T20:05:24.4260229Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:05:24.4262726Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:05:24.4263744Z ^ 2025-05-07T20:05:24.4263970Z 2025-05-07T20:05:24.4264387Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:05:24.4265229Z 2025-05-07T20:05:24.4266970Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:05:24.4269452Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:05:24.4270710Z ^ 2025-05-07T20:05:24.4271006Z 2025-05-07T20:05:24.4272598Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:05:24.4275358Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:05:24.4276348Z ^ 2025-05-07T20:05:24.4276581Z 2025-05-07T20:05:24.4277058Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:05:24.4277758Z 2025-05-07T20:05:24.4279338Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:05:24.4281975Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:05:24.4283215Z ^ 2025-05-07T20:05:24.4283603Z 2025-05-07T20:05:24.4284892Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:05:24.4287153Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:05:24.4288412Z ^ 2025-05-07T20:05:24.4288628Z 2025-05-07T20:05:24.4289010Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:05:24.4289608Z 2025-05-07T20:05:24.4292888Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:05:24.4295671Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:05:24.4296954Z ^ 2025-05-07T20:05:24.4297335Z 2025-05-07T20:05:24.4299136Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:05:24.4301676Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:05:24.4302832Z ^ 2025-05-07T20:05:24.4303044Z 2025-05-07T20:05:24.4303453Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:05:24.4304155Z 2025-05-07T20:05:24.4305700Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:05:24.4308251Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:05:24.4309438Z ^ 2025-05-07T20:05:24.4309753Z 2025-05-07T20:05:24.7691723Z [515/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_py_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_py.dir/src/jagged_tensor_ops/jagged_tensor_ops_cpu.cpp.o -MF CMakeFiles/fbgemm_gpu_py.dir/src/jagged_tensor_ops/jagged_tensor_ops_cpu.cpp.o.d -o CMakeFiles/fbgemm_gpu_py.dir/src/jagged_tensor_ops/jagged_tensor_ops_cpu.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/jagged_tensor_ops/jagged_tensor_ops_cpu.cpp 2025-05-07T20:05:27.4705165Z clang-16: warning: argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T20:05:27.4726111Z [516/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_py_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_py.dir/src/input_combine_ops/input_combine.cu.o -MF CMakeFiles/fbgemm_gpu_py.dir/src/input_combine_ops/input_combine.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/input_combine_ops/input_combine.cu -o CMakeFiles/fbgemm_gpu_py.dir/src/input_combine_ops/input_combine.cu.o 2025-05-07T20:05:29.1219928Z [517/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_py_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_py.dir/src/memory_utils/memory_utils.cu.o -MF CMakeFiles/fbgemm_gpu_py.dir/src/memory_utils/memory_utils.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/memory_utils/memory_utils.cu -o CMakeFiles/fbgemm_gpu_py.dir/src/memory_utils/memory_utils.cu.o 2025-05-07T20:05:37.7051283Z [518/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_py_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_py.dir/src/memory_utils/memory_utils_ops.cu.o -MF CMakeFiles/fbgemm_gpu_py.dir/src/memory_utils/memory_utils_ops.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/memory_utils/memory_utils_ops.cu -o CMakeFiles/fbgemm_gpu_py.dir/src/memory_utils/memory_utils_ops.cu.o 2025-05-07T20:05:40.1861591Z [519/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_index_select_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_index_select.dir/gen_batch_index_select_dim0_forward_kernel_small.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_index_select.dir/gen_batch_index_select_dim0_forward_kernel_small.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.12/cmake-build/gen_batch_index_select_dim0_forward_kernel_small.cu -o CMakeFiles/fbgemm_gpu_tbe_index_select.dir/gen_batch_index_select_dim0_forward_kernel_small.cu.o 2025-05-07T20:05:40.1882109Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:05:40.1884482Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:05:40.1885543Z ^ 2025-05-07T20:05:40.1885718Z 2025-05-07T20:05:40.1886055Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:05:40.1886542Z 2025-05-07T20:05:40.1887794Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:05:40.1890205Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:05:40.1891254Z ^ 2025-05-07T20:05:40.1891553Z 2025-05-07T20:05:40.1892897Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:05:40.1895271Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:05:40.1896308Z ^ 2025-05-07T20:05:40.1896553Z 2025-05-07T20:05:40.1897028Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:05:40.1897650Z 2025-05-07T20:05:40.1899270Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:05:40.1901503Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:05:40.1902353Z ^ 2025-05-07T20:05:40.1902596Z 2025-05-07T20:05:40.1904174Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:05:40.1906449Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:05:40.1907531Z ^ 2025-05-07T20:05:40.1907853Z 2025-05-07T20:05:40.1908210Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:05:40.1908825Z 2025-05-07T20:05:40.1910339Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:05:40.1912986Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:05:40.1914075Z ^ 2025-05-07T20:05:40.1914422Z 2025-05-07T20:05:40.1915940Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:05:40.1918446Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:05:40.1919532Z ^ 2025-05-07T20:05:40.1919780Z 2025-05-07T20:05:40.1920155Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:05:40.1920753Z 2025-05-07T20:05:40.1922276Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:05:40.1924453Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:05:40.1925469Z ^ 2025-05-07T20:05:40.1925795Z 2025-05-07T20:05:40.1927249Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:05:40.1929745Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:05:40.1930849Z ^ 2025-05-07T20:05:40.1931063Z 2025-05-07T20:05:40.1931453Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:05:40.1932085Z 2025-05-07T20:05:40.1933589Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:05:40.1936047Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:05:40.1937150Z ^ 2025-05-07T20:05:40.1937492Z 2025-05-07T20:05:46.6608681Z [520/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_py_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_py.dir/src/histogram_binning_calibration_ops.cu.o -MF CMakeFiles/fbgemm_gpu_py.dir/src/histogram_binning_calibration_ops.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/histogram_binning_calibration_ops.cu -o CMakeFiles/fbgemm_gpu_py.dir/src/histogram_binning_calibration_ops.cu.o 2025-05-07T20:05:51.4306885Z [521/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_index_select_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_index_select.dir/gen_batch_index_select_dim0_backward_codegen_cuda.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_index_select.dir/gen_batch_index_select_dim0_backward_codegen_cuda.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.12/cmake-build/gen_batch_index_select_dim0_backward_codegen_cuda.cu -o CMakeFiles/fbgemm_gpu_tbe_index_select.dir/gen_batch_index_select_dim0_backward_codegen_cuda.cu.o 2025-05-07T20:05:51.4330252Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:05:51.4333018Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:05:51.4334208Z ^ 2025-05-07T20:05:51.4334465Z 2025-05-07T20:05:51.4334916Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:05:51.4335740Z 2025-05-07T20:05:51.4337432Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:05:51.4340161Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:05:51.4341345Z ^ 2025-05-07T20:05:51.4341717Z 2025-05-07T20:05:51.4343427Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:05:51.4346102Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:05:51.4347307Z ^ 2025-05-07T20:05:51.4347560Z 2025-05-07T20:05:51.4348037Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:05:51.4348699Z 2025-05-07T20:05:51.4365537Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:05:51.4368592Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:05:51.4369754Z ^ 2025-05-07T20:05:51.4370086Z 2025-05-07T20:05:51.4371772Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:05:51.4374427Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:05:51.4375583Z ^ 2025-05-07T20:05:51.4375832Z 2025-05-07T20:05:51.4376290Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:05:51.4376960Z 2025-05-07T20:05:51.4378652Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:05:51.4381097Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:05:51.4382218Z ^ 2025-05-07T20:05:51.4382580Z 2025-05-07T20:05:51.4384160Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:05:51.4386579Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:05:51.4387497Z ^ 2025-05-07T20:05:51.4387708Z 2025-05-07T20:05:51.4388099Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:05:51.4388710Z 2025-05-07T20:05:51.4390421Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:05:51.4393156Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:05:51.4394313Z ^ 2025-05-07T20:05:51.4394660Z 2025-05-07T20:05:51.4396425Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:05:51.4399022Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:05:51.4400201Z ^ 2025-05-07T20:05:51.4400442Z 2025-05-07T20:05:51.4400890Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:05:51.4401570Z 2025-05-07T20:05:51.4403259Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:05:51.4405956Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:05:51.4407124Z ^ 2025-05-07T20:05:51.4407482Z 2025-05-07T20:05:52.0092860Z [522/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_py_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_py.dir/src/permute_pooled_embedding_ops/permute_pooled_embedding_ops_split.cu.o -MF CMakeFiles/fbgemm_gpu_py.dir/src/permute_pooled_embedding_ops/permute_pooled_embedding_ops_split.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/permute_pooled_embedding_ops/permute_pooled_embedding_ops_split.cu -o CMakeFiles/fbgemm_gpu_py.dir/src/permute_pooled_embedding_ops/permute_pooled_embedding_ops_split.cu.o 2025-05-07T20:05:53.2681796Z [523/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_index_select_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_index_select.dir/gen_batch_index_select_dim0_backward_kernel_warp.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_index_select.dir/gen_batch_index_select_dim0_backward_kernel_warp.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.12/cmake-build/gen_batch_index_select_dim0_backward_kernel_warp.cu -o CMakeFiles/fbgemm_gpu_tbe_index_select.dir/gen_batch_index_select_dim0_backward_kernel_warp.cu.o 2025-05-07T20:05:53.2697613Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:05:53.2699495Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:05:53.2700300Z ^ 2025-05-07T20:05:53.2700501Z 2025-05-07T20:05:53.2700811Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:05:53.2701264Z 2025-05-07T20:05:53.2702431Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:05:53.2704419Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:05:53.2705220Z ^ 2025-05-07T20:05:53.2705465Z 2025-05-07T20:05:53.2706701Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:05:53.2708567Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:05:53.2709386Z ^ 2025-05-07T20:05:53.2709563Z 2025-05-07T20:05:53.2709896Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:05:53.2710351Z 2025-05-07T20:05:53.2711696Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:05:53.2713540Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:05:53.2714339Z ^ 2025-05-07T20:05:53.2714597Z 2025-05-07T20:05:53.2715715Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:05:53.2717524Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:05:53.2718316Z ^ 2025-05-07T20:05:53.2718504Z 2025-05-07T20:05:53.2718818Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:05:53.2719273Z 2025-05-07T20:05:53.2720416Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:05:53.2722221Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:05:53.2723142Z ^ 2025-05-07T20:05:53.2723393Z 2025-05-07T20:05:53.2724528Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:05:53.2726342Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:05:53.2727162Z ^ 2025-05-07T20:05:53.2727351Z 2025-05-07T20:05:53.2727651Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:05:53.2728130Z 2025-05-07T20:05:53.2729241Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:05:53.2731098Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:05:53.2731910Z ^ 2025-05-07T20:05:53.2732171Z 2025-05-07T20:05:53.2733290Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:05:53.2735106Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:05:53.2736026Z ^ 2025-05-07T20:05:53.2736213Z 2025-05-07T20:05:53.2736528Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:05:53.2736990Z 2025-05-07T20:05:53.2738228Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:05:53.2740094Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:05:53.2740936Z ^ 2025-05-07T20:05:53.2741180Z 2025-05-07T20:05:55.8724903Z [524/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_py_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_py.dir/src/jagged_tensor_ops/batched_dense_vec_jagged_2d_mul_forward.cu.o -MF CMakeFiles/fbgemm_gpu_py.dir/src/jagged_tensor_ops/batched_dense_vec_jagged_2d_mul_forward.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/jagged_tensor_ops/batched_dense_vec_jagged_2d_mul_forward.cu -o CMakeFiles/fbgemm_gpu_py.dir/src/jagged_tensor_ops/batched_dense_vec_jagged_2d_mul_forward.cu.o 2025-05-07T20:05:56.7355104Z [525/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_py_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_py.dir/src/jagged_tensor_ops/batched_dense_vec_jagged_2d_mul_backward.cu.o -MF CMakeFiles/fbgemm_gpu_py.dir/src/jagged_tensor_ops/batched_dense_vec_jagged_2d_mul_backward.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/jagged_tensor_ops/batched_dense_vec_jagged_2d_mul_backward.cu -o CMakeFiles/fbgemm_gpu_py.dir/src/jagged_tensor_ops/batched_dense_vec_jagged_2d_mul_backward.cu.o 2025-05-07T20:05:58.7376716Z [526/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_py_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_py.dir/src/jagged_tensor_ops/jagged_dense_bmm_forward.cu.o -MF CMakeFiles/fbgemm_gpu_py.dir/src/jagged_tensor_ops/jagged_dense_bmm_forward.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/jagged_tensor_ops/jagged_dense_bmm_forward.cu -o CMakeFiles/fbgemm_gpu_py.dir/src/jagged_tensor_ops/jagged_dense_bmm_forward.cu.o 2025-05-07T20:05:58.8715889Z [527/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_py_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_py.dir/src/jagged_tensor_ops/jagged_index_add_2d_forward.cu.o -MF CMakeFiles/fbgemm_gpu_py.dir/src/jagged_tensor_ops/jagged_index_add_2d_forward.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/jagged_tensor_ops/jagged_index_add_2d_forward.cu -o CMakeFiles/fbgemm_gpu_py.dir/src/jagged_tensor_ops/jagged_index_add_2d_forward.cu.o 2025-05-07T20:05:59.1196421Z [528/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_py_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_py.dir/src/jagged_tensor_ops/jagged_index_select_2d_forward.cu.o -MF CMakeFiles/fbgemm_gpu_py.dir/src/jagged_tensor_ops/jagged_index_select_2d_forward.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/jagged_tensor_ops/jagged_index_select_2d_forward.cu -o CMakeFiles/fbgemm_gpu_py.dir/src/jagged_tensor_ops/jagged_index_select_2d_forward.cu.o 2025-05-07T20:05:59.7093123Z [529/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_py_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_py.dir/src/jagged_tensor_ops/jagged_tensor_ops.cu.o -MF CMakeFiles/fbgemm_gpu_py.dir/src/jagged_tensor_ops/jagged_tensor_ops.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/jagged_tensor_ops/jagged_tensor_ops.cu -o CMakeFiles/fbgemm_gpu_py.dir/src/jagged_tensor_ops/jagged_tensor_ops.cu.o 2025-05-07T20:06:00.6807233Z [530/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_py_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_py.dir/src/layout_transform_ops/layout_transform_ops.cu.o -MF CMakeFiles/fbgemm_gpu_py.dir/src/layout_transform_ops/layout_transform_ops.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/layout_transform_ops/layout_transform_ops.cu -o CMakeFiles/fbgemm_gpu_py.dir/src/layout_transform_ops/layout_transform_ops.cu.o 2025-05-07T20:06:00.7417061Z [531/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_py_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_py.dir/src/jagged_tensor_ops/jagged_softmax_backward.cu.o -MF CMakeFiles/fbgemm_gpu_py.dir/src/jagged_tensor_ops/jagged_softmax_backward.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/jagged_tensor_ops/jagged_softmax_backward.cu -o CMakeFiles/fbgemm_gpu_py.dir/src/jagged_tensor_ops/jagged_softmax_backward.cu.o 2025-05-07T20:06:02.4534896Z [532/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_py_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_py.dir/src/jagged_tensor_ops/jagged_softmax_forward.cu.o -MF CMakeFiles/fbgemm_gpu_py.dir/src/jagged_tensor_ops/jagged_softmax_forward.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/jagged_tensor_ops/jagged_softmax_forward.cu -o CMakeFiles/fbgemm_gpu_py.dir/src/jagged_tensor_ops/jagged_softmax_forward.cu.o 2025-05-07T20:06:03.2500557Z [533/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_py_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_py.dir/src/jagged_tensor_ops/jagged_unique_indices.cu.o -MF CMakeFiles/fbgemm_gpu_py.dir/src/jagged_tensor_ops/jagged_unique_indices.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/jagged_tensor_ops/jagged_unique_indices.cu -o CMakeFiles/fbgemm_gpu_py.dir/src/jagged_tensor_ops/jagged_unique_indices.cu.o 2025-05-07T20:06:03.2523705Z In file included from tmpxft_00004275_00000000-6_jagged_unique_indices.compute_90a.cudafe1.stub.c:1: 2025-05-07T20:06:03.2526072Z /tmp/tmpxft_00004275_00000000-6_jagged_unique_indices.compute_90a.cudafe1.stub.c:46:1022: warning: integer literal is too large to be represented in a signed integer type, interpreting as unsigned [-Wimplicitly-unsigned-literal] 2025-05-07T20:06:03.2536366Z static void __device_stub__ZN10fbgemm_gpu28unique_indices_length_kernelIlLl9223372036854775807ELln9223372036854775808EEEvN2at27GenericPackedTensorAccessorIT_Lm1ENS1_17RestrictPtrTraitsEiEES5_S5_S5_(const _ZN2at22PackedTensorAccessor32IlLm1ENS_17RestrictPtrTraitsEEE&__par0, const _ZN2at22PackedTensorAccessor32IlLm1ENS_17RestrictPtrTraitsEEE&__par1, const _ZN2at22PackedTensorAccessor32IlLm1ENS_17RestrictPtrTraitsEEE&__par2, _ZN2at22PackedTensorAccessor32IlLm1ENS_17RestrictPtrTraitsEEE&__par3){__cudaLaunchPrologue(4);__cudaSetupArg(__par0, 0UL);__cudaSetupArg(__par1, 16UL);__cudaSetupArg(__par2, 32UL);__cudaSetupArg(__par3, 48UL);__cudaLaunch(((char *)((void ( *)(const _ZN2at22PackedTensorAccessor32IlLm1ENS_17RestrictPtrTraitsEEE, const _ZN2at22PackedTensorAccessor32IlLm1ENS_17RestrictPtrTraitsEEE, const _ZN2at22PackedTensorAccessor32IlLm1ENS_17RestrictPtrTraitsEEE, _ZN2at22PackedTensorAccessor32IlLm1ENS_17RestrictPtrTraitsEEE))fbgemm_gpu::unique_indices_length_kernel )));}namespace fbgemm_gpu{ 2025-05-07T20:06:03.2546217Z ^ 2025-05-07T20:06:03.2548720Z /tmp/tmpxft_00004275_00000000-6_jagged_unique_indices.compute_90a.cudafe1.stub.c:46:1022: warning: integer literal is too large to be represented in a signed integer type, interpreting as unsigned [-Wimplicitly-unsigned-literal] 2025-05-07T20:06:03.2552302Z /tmp/tmpxft_00004275_00000000-6_jagged_unique_indices.compute_90a.cudafe1.stub.c:46:1022: warning: integer literal is too large to be represented in a signed integer type, interpreting as unsigned [-Wimplicitly-unsigned-literal] 2025-05-07T20:06:03.2555873Z /tmp/tmpxft_00004275_00000000-6_jagged_unique_indices.compute_90a.cudafe1.stub.c:52:860: warning: integer literal is too large to be represented in a signed integer type, interpreting as unsigned [-Wimplicitly-unsigned-literal] 2025-05-07T20:06:03.2565643Z static void __device_stub__ZN10fbgemm_gpu24compute_hash_size_kernelIlLln9223372036854775808EEEvN2at27GenericPackedTensorAccessorIT_Lm1ENS1_17RestrictPtrTraitsEiEES5_lS5_(const _ZN2at22PackedTensorAccessor32IlLm1ENS_17RestrictPtrTraitsEEE&__par0, const _ZN2at22PackedTensorAccessor32IlLm1ENS_17RestrictPtrTraitsEEE&__par1, const int64_t __par2, _ZN2at22PackedTensorAccessor32IlLm1ENS_17RestrictPtrTraitsEEE&__par3){__cudaLaunchPrologue(4);__cudaSetupArg(__par0, 0UL);__cudaSetupArg(__par1, 16UL);__cudaSetupArgSimple(__par2, 32UL);__cudaSetupArg(__par3, 40UL);__cudaLaunch(((char *)((void ( *)(const _ZN2at22PackedTensorAccessor32IlLm1ENS_17RestrictPtrTraitsEEE, const _ZN2at22PackedTensorAccessor32IlLm1ENS_17RestrictPtrTraitsEEE, const int64_t, _ZN2at22PackedTensorAccessor32IlLm1ENS_17RestrictPtrTraitsEEE))fbgemm_gpu::compute_hash_size_kernel )));}namespace fbgemm_gpu{ 2025-05-07T20:06:03.2573786Z ^ 2025-05-07T20:06:03.2576110Z /tmp/tmpxft_00004275_00000000-6_jagged_unique_indices.compute_90a.cudafe1.stub.c:52:860: warning: integer literal is too large to be represented in a signed integer type, interpreting as unsigned [-Wimplicitly-unsigned-literal] 2025-05-07T20:06:03.2579368Z /tmp/tmpxft_00004275_00000000-6_jagged_unique_indices.compute_90a.cudafe1.stub.c:52:860: warning: integer literal is too large to be represented in a signed integer type, interpreting as unsigned [-Wimplicitly-unsigned-literal] 2025-05-07T20:06:03.2582568Z /tmp/tmpxft_00004275_00000000-6_jagged_unique_indices.compute_90a.cudafe1.stub.c:55:445: warning: integer literal is too large to be represented in a signed integer type, interpreting as unsigned [-Wimplicitly-unsigned-literal] 2025-05-07T20:06:03.2585869Z /tmp/tmpxft_00004275_00000000-6_jagged_unique_indices.compute_90a.cudafe1.stub.c:55:1476: warning: integer literal is too large to be represented in a signed integer type, interpreting as unsigned [-Wimplicitly-unsigned-literal] 2025-05-07T20:06:03.2587803Z 8 warnings generated. 2025-05-07T20:06:03.4312840Z [534/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_py_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_py.dir/src/jagged_tensor_ops/jagged_jagged_bmm_forward.cu.o -MF CMakeFiles/fbgemm_gpu_py.dir/src/jagged_tensor_ops/jagged_jagged_bmm_forward.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/jagged_tensor_ops/jagged_jagged_bmm_forward.cu -o CMakeFiles/fbgemm_gpu_py.dir/src/jagged_tensor_ops/jagged_jagged_bmm_forward.cu.o 2025-05-07T20:06:03.4433442Z [535/608] cd /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.12/cmake-build && bash /__w/FBGEMM/FBGEMM/fbgemm_gpu/../.github/scripts/fbgemm_gpu_postbuild.bash /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.12/cmake-build/asmjit.so 2025-05-07T20:06:03.4435661Z ################################################################################ 2025-05-07T20:06:03.4436285Z [CMAKE] Running post-build script ... 2025-05-07T20:06:03.4437142Z Target file: /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.12/cmake-build/asmjit.so 2025-05-07T20:06:03.4438017Z Removing all RPATHs ... 2025-05-07T20:06:03.4438493Z ################################################################################ 2025-05-07T20:06:03.4565025Z [536/608] cd /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.12/cmake-build && bash /__w/FBGEMM/FBGEMM/fbgemm_gpu/../.github/scripts/fbgemm_gpu_postbuild.bash /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.12/cmake-build/fbgemm.so 1 2025-05-07T20:06:03.4567529Z ################################################################################ 2025-05-07T20:06:03.4568194Z [CMAKE] Running post-build script ... 2025-05-07T20:06:03.4569102Z Target file: /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.12/cmake-build/fbgemm.so 2025-05-07T20:06:03.4569930Z Resetting RPATH to $ORIGIN ... 2025-05-07T20:06:03.4570604Z 0x000000000000000f (RPATH) Library rpath: [$ORIGIN] 2025-05-07T20:06:03.4571346Z ################################################################################ 2025-05-07T20:06:03.5567102Z [537/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_py_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_py.dir/src/metric_ops/metric_ops.cu.o -MF CMakeFiles/fbgemm_gpu_py.dir/src/metric_ops/metric_ops.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/metric_ops/metric_ops.cu -o CMakeFiles/fbgemm_gpu_py.dir/src/metric_ops/metric_ops.cu.o 2025-05-07T20:06:03.6107445Z [538/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_partial_rowwise_adam_split_unweighted_kernel_warp.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_partial_rowwise_adam_split_unweighted_kernel_warp.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.12/cmake-build/gen_embedding_backward_partial_rowwise_adam_split_unweighted_kernel_warp.cu -o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_partial_rowwise_adam_split_unweighted_kernel_warp.cu.o 2025-05-07T20:06:03.6132361Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:06:03.6135398Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:06:03.6136643Z ^ 2025-05-07T20:06:03.6136896Z 2025-05-07T20:06:03.6137354Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:06:03.6138035Z 2025-05-07T20:06:03.6139800Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:06:03.6142235Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:06:03.6143217Z ^ 2025-05-07T20:06:03.6143575Z 2025-05-07T20:06:03.6145055Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:06:03.6147760Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:06:03.6148945Z ^ 2025-05-07T20:06:03.6149195Z 2025-05-07T20:06:03.6149639Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:06:03.6150297Z 2025-05-07T20:06:03.6152135Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:06:03.6154845Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:06:03.6156025Z ^ 2025-05-07T20:06:03.6156513Z 2025-05-07T20:06:03.6158216Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:06:03.6160888Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:06:03.6162038Z ^ 2025-05-07T20:06:03.6162295Z 2025-05-07T20:06:03.6162727Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:06:03.6163386Z 2025-05-07T20:06:03.6165338Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:06:03.6168015Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:06:03.6169205Z ^ 2025-05-07T20:06:03.6169552Z 2025-05-07T20:06:03.6171201Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:06:03.6173836Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:06:03.6174974Z ^ 2025-05-07T20:06:03.6175375Z 2025-05-07T20:06:03.6175808Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:06:03.6176482Z 2025-05-07T20:06:03.6178139Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:06:03.6180907Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:06:03.6182044Z ^ 2025-05-07T20:06:03.6182418Z 2025-05-07T20:06:03.6184134Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:06:03.6186642Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:06:03.6187766Z ^ 2025-05-07T20:06:03.6188019Z 2025-05-07T20:06:03.6188416Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:06:03.6189048Z 2025-05-07T20:06:03.6190686Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:06:03.6193391Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:06:03.6194564Z ^ 2025-05-07T20:06:03.6194914Z 2025-05-07T20:06:03.6196693Z [539/608] cd /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.12/cmake-build && bash /__w/FBGEMM/FBGEMM/fbgemm_gpu/../.github/scripts/fbgemm_gpu_postbuild.bash /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.12/cmake-build/fbgemm_gpu_config.so 2025-05-07T20:06:03.6198817Z ################################################################################ 2025-05-07T20:06:03.6199419Z [CMAKE] Running post-build script ... 2025-05-07T20:06:03.6200371Z Target file: /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.12/cmake-build/fbgemm_gpu_config.so 2025-05-07T20:06:03.6201512Z Removing all RPATHs ... 2025-05-07T20:06:03.6201957Z ################################################################################ 2025-05-07T20:06:03.8000062Z [540/608] cd /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.12/cmake-build && bash /__w/FBGEMM/FBGEMM/fbgemm_gpu/../.github/scripts/fbgemm_gpu_postbuild.bash /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.12/cmake-build/fbgemm_gpu_tbe_cache.so 2025-05-07T20:06:03.8002238Z ################################################################################ 2025-05-07T20:06:03.8002764Z [CMAKE] Running post-build script ... 2025-05-07T20:06:03.8003605Z Target file: /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.12/cmake-build/fbgemm_gpu_tbe_cache.so 2025-05-07T20:06:03.8004481Z Removing all RPATHs ... 2025-05-07T20:06:03.8004883Z ################################################################################ 2025-05-07T20:06:03.8070961Z [541/608] cd /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.12/cmake-build && bash /__w/FBGEMM/FBGEMM/fbgemm_gpu/../.github/scripts/fbgemm_gpu_postbuild.bash /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.12/cmake-build/fbgemm_gpu_sparse_async_cumsum.so 1 2025-05-07T20:06:03.8073292Z ################################################################################ 2025-05-07T20:06:03.8073886Z [CMAKE] Running post-build script ... 2025-05-07T20:06:03.8074851Z Target file: /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.12/cmake-build/fbgemm_gpu_sparse_async_cumsum.so 2025-05-07T20:06:03.8075864Z Resetting RPATH to $ORIGIN ... 2025-05-07T20:06:03.8076737Z 0x000000000000000f (RPATH) Library rpath: [$ORIGIN] 2025-05-07T20:06:03.8077368Z ################################################################################ 2025-05-07T20:06:03.8155785Z [542/608] cd /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.12/cmake-build && bash /__w/FBGEMM/FBGEMM/fbgemm_gpu/../.github/scripts/fbgemm_gpu_postbuild.bash /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.12/cmake-build/fbgemm_gpu_tbe_common.so 1 2025-05-07T20:06:03.8157106Z ################################################################################ 2025-05-07T20:06:03.8157475Z [CMAKE] Running post-build script ... 2025-05-07T20:06:03.8158074Z Target file: /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.12/cmake-build/fbgemm_gpu_tbe_common.so 2025-05-07T20:06:03.8158664Z Resetting RPATH to $ORIGIN ... 2025-05-07T20:06:03.8159065Z 0x000000000000000f (RPATH) Library rpath: [$ORIGIN] 2025-05-07T20:06:03.8159583Z ################################################################################ 2025-05-07T20:06:03.9176701Z [543/608] cd /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.12/cmake-build && bash /__w/FBGEMM/FBGEMM/fbgemm_gpu/../.github/scripts/fbgemm_gpu_postbuild.bash /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.12/cmake-build/fbgemm_gpu_tbe_utils.so 2025-05-07T20:06:03.9177995Z ################################################################################ 2025-05-07T20:06:03.9178397Z [CMAKE] Running post-build script ... 2025-05-07T20:06:03.9178994Z Target file: /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.12/cmake-build/fbgemm_gpu_tbe_utils.so 2025-05-07T20:06:03.9179588Z Removing all RPATHs ... 2025-05-07T20:06:03.9179880Z ################################################################################ 2025-05-07T20:06:03.9448261Z [544/608] cd /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.12/cmake-build && bash /__w/FBGEMM/FBGEMM/fbgemm_gpu/../.github/scripts/fbgemm_gpu_postbuild.bash /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.12/cmake-build/fbgemm_gpu_tbe_inference.so 1 2025-05-07T20:06:03.9449559Z ################################################################################ 2025-05-07T20:06:03.9449954Z [CMAKE] Running post-build script ... 2025-05-07T20:06:03.9450566Z Target file: /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.12/cmake-build/fbgemm_gpu_tbe_inference.so 2025-05-07T20:06:03.9451370Z Resetting RPATH to $ORIGIN ... 2025-05-07T20:06:03.9451792Z 0x000000000000000f (RPATH) Library rpath: [$ORIGIN] 2025-05-07T20:06:03.9452227Z ################################################################################ 2025-05-07T20:06:03.9536295Z [545/608] cd /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.12/cmake-build && bash /__w/FBGEMM/FBGEMM/fbgemm_gpu/../.github/scripts/fbgemm_gpu_postbuild.bash /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.12/cmake-build/fbgemm_gpu_tbe_optimizers.so 2025-05-07T20:06:03.9538569Z ################################################################################ 2025-05-07T20:06:03.9539192Z [CMAKE] Running post-build script ... 2025-05-07T20:06:03.9540214Z Target file: /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.12/cmake-build/fbgemm_gpu_tbe_optimizers.so 2025-05-07T20:06:03.9541234Z Removing all RPATHs ... 2025-05-07T20:06:03.9541674Z ################################################################################ 2025-05-07T20:06:03.9642583Z [546/608] cd /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.12/cmake-build && bash /__w/FBGEMM/FBGEMM/fbgemm_gpu/../.github/scripts/fbgemm_gpu_postbuild.bash /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.12/cmake-build/fbgemm_gpu_tbe_training_backward_pt2.so 1 2025-05-07T20:06:03.9644929Z ################################################################################ 2025-05-07T20:06:03.9645558Z [CMAKE] Running post-build script ... 2025-05-07T20:06:03.9646639Z Target file: /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.12/cmake-build/fbgemm_gpu_tbe_training_backward_pt2.so 2025-05-07T20:06:03.9648113Z Resetting RPATH to $ORIGIN ... 2025-05-07T20:06:03.9648714Z 0x000000000000000f (RPATH) Library rpath: [$ORIGIN] 2025-05-07T20:06:03.9649373Z ################################################################################ 2025-05-07T20:06:03.9651784Z [547/608] cd /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.12/cmake-build && bash /__w/FBGEMM/FBGEMM/fbgemm_gpu/../.github/scripts/fbgemm_gpu_postbuild.bash /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.12/cmake-build/fbgemm_gpu_tbe_training_backward_split_host.so 1 2025-05-07T20:06:03.9654114Z ################################################################################ 2025-05-07T20:06:03.9654704Z [CMAKE] Running post-build script ... 2025-05-07T20:06:03.9655780Z Target file: /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.12/cmake-build/fbgemm_gpu_tbe_training_backward_split_host.so 2025-05-07T20:06:03.9657063Z Resetting RPATH to $ORIGIN ... 2025-05-07T20:06:03.9657699Z 0x000000000000000f (RPATH) Library rpath: [$ORIGIN] 2025-05-07T20:06:03.9658396Z ################################################################################ 2025-05-07T20:06:04.2347623Z [548/608] cd /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.12/cmake-build && bash /__w/FBGEMM/FBGEMM/fbgemm_gpu/../.github/scripts/fbgemm_gpu_postbuild.bash /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.12/cmake-build/fbgemm_gpu_tbe_training_forward.so 1 2025-05-07T20:06:04.2350191Z ################################################################################ 2025-05-07T20:06:04.2350796Z [CMAKE] Running post-build script ... 2025-05-07T20:06:04.2352054Z Target file: /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.12/cmake-build/fbgemm_gpu_tbe_training_forward.so 2025-05-07T20:06:04.2353171Z Resetting RPATH to $ORIGIN ... 2025-05-07T20:06:04.2353844Z 0x000000000000000f (RPATH) Library rpath: [$ORIGIN] 2025-05-07T20:06:04.2354607Z ################################################################################ 2025-05-07T20:06:04.6265415Z [549/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_partial_rowwise_adam_split_weighted_kernel_warp.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_partial_rowwise_adam_split_weighted_kernel_warp.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.12/cmake-build/gen_embedding_backward_partial_rowwise_adam_split_weighted_kernel_warp.cu -o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_partial_rowwise_adam_split_weighted_kernel_warp.cu.o 2025-05-07T20:06:04.6287449Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:06:04.6290280Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:06:04.6291431Z ^ 2025-05-07T20:06:04.6291738Z 2025-05-07T20:06:04.6292161Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:06:04.6292792Z 2025-05-07T20:06:04.6294354Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:06:04.6296866Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:06:04.6297957Z ^ 2025-05-07T20:06:04.6298317Z 2025-05-07T20:06:04.6299876Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:06:04.6302389Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:06:04.6303485Z ^ 2025-05-07T20:06:04.6303738Z 2025-05-07T20:06:04.6304142Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:06:04.6304946Z 2025-05-07T20:06:04.6306390Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:06:04.6308755Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:06:04.6309797Z ^ 2025-05-07T20:06:04.6310126Z 2025-05-07T20:06:04.6311796Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:06:04.6314070Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:06:04.6314999Z ^ 2025-05-07T20:06:04.6315196Z 2025-05-07T20:06:04.6315548Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:06:04.6316089Z 2025-05-07T20:06:04.6317458Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:06:04.6319943Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:06:04.6321128Z ^ 2025-05-07T20:06:04.6321433Z 2025-05-07T20:06:04.6322981Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:06:04.6325649Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:06:04.6326751Z ^ 2025-05-07T20:06:04.6326952Z 2025-05-07T20:06:04.6327375Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:06:04.6327979Z 2025-05-07T20:06:04.6329609Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:06:04.6332099Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:06:04.6333175Z ^ 2025-05-07T20:06:04.6333514Z 2025-05-07T20:06:04.6335059Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:06:04.6337552Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:06:04.6338659Z ^ 2025-05-07T20:06:04.6338899Z 2025-05-07T20:06:04.6339291Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:06:04.6339890Z 2025-05-07T20:06:04.6341507Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:06:04.6343938Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:06:04.6345056Z ^ 2025-05-07T20:06:04.6345535Z 2025-05-07T20:06:05.6260117Z [550/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_py_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_py.dir/src/jagged_tensor_ops/jagged_dense_elementwise_mul_forward.cu.o -MF CMakeFiles/fbgemm_gpu_py.dir/src/jagged_tensor_ops/jagged_dense_elementwise_mul_forward.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/jagged_tensor_ops/jagged_dense_elementwise_mul_forward.cu -o CMakeFiles/fbgemm_gpu_py.dir/src/jagged_tensor_ops/jagged_dense_elementwise_mul_forward.cu.o 2025-05-07T20:06:06.8497088Z [551/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_py_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_py.dir/src/jagged_tensor_ops/jagged_to_padded_dense_backward.cu.o -MF CMakeFiles/fbgemm_gpu_py.dir/src/jagged_tensor_ops/jagged_to_padded_dense_backward.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/jagged_tensor_ops/jagged_to_padded_dense_backward.cu -o CMakeFiles/fbgemm_gpu_py.dir/src/jagged_tensor_ops/jagged_to_padded_dense_backward.cu.o 2025-05-07T20:06:07.4639698Z [552/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_py_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_py.dir/src/permute_pooled_embedding_ops/permute_pooled_embedding_ops.cu.o -MF CMakeFiles/fbgemm_gpu_py.dir/src/permute_pooled_embedding_ops/permute_pooled_embedding_ops.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/permute_pooled_embedding_ops/permute_pooled_embedding_ops.cu -o CMakeFiles/fbgemm_gpu_py.dir/src/permute_pooled_embedding_ops/permute_pooled_embedding_ops.cu.o 2025-05-07T20:06:07.7669090Z [553/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_py_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_py.dir/src/jagged_tensor_ops/dense_to_jagged_forward.cu.o -MF CMakeFiles/fbgemm_gpu_py.dir/src/jagged_tensor_ops/dense_to_jagged_forward.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/jagged_tensor_ops/dense_to_jagged_forward.cu -o CMakeFiles/fbgemm_gpu_py.dir/src/jagged_tensor_ops/dense_to_jagged_forward.cu.o 2025-05-07T20:06:10.2931876Z [554/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_py_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_py.dir/src/jagged_tensor_ops/jagged_dense_elementwise_mul_backward.cu.o -MF CMakeFiles/fbgemm_gpu_py.dir/src/jagged_tensor_ops/jagged_dense_elementwise_mul_backward.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/jagged_tensor_ops/jagged_dense_elementwise_mul_backward.cu -o CMakeFiles/fbgemm_gpu_py.dir/src/jagged_tensor_ops/jagged_dense_elementwise_mul_backward.cu.o 2025-05-07T20:06:11.5759038Z [555/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_py_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_py.dir/src/jagged_tensor_ops/jagged_dense_dense_elementwise_add_jagged_output_forward.cu.o -MF CMakeFiles/fbgemm_gpu_py.dir/src/jagged_tensor_ops/jagged_dense_dense_elementwise_add_jagged_output_forward.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/jagged_tensor_ops/jagged_dense_dense_elementwise_add_jagged_output_forward.cu -o CMakeFiles/fbgemm_gpu_py.dir/src/jagged_tensor_ops/jagged_dense_dense_elementwise_add_jagged_output_forward.cu.o 2025-05-07T20:06:11.6006726Z [556/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_embedding_inplace_ops_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_embedding_inplace_ops.dir/src/embedding_inplace_ops/embedding_inplace_update.cu.o -MF CMakeFiles/fbgemm_gpu_embedding_inplace_ops.dir/src/embedding_inplace_ops/embedding_inplace_update.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/embedding_inplace_ops/embedding_inplace_update.cu -o CMakeFiles/fbgemm_gpu_embedding_inplace_ops.dir/src/embedding_inplace_ops/embedding_inplace_update.cu.o 2025-05-07T20:06:12.1636039Z [557/608] : && /github/home/miniconda/envs/build_binary/bin/c++ -fPIC -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -L/github/home/miniconda/envs/build_binary/targets/x86_64-linux/lib -L/github/home/miniconda/envs/build_binary/targets/x86_64-linux/lib/stubs -s -shared -Wl,-soname,fbgemm_gpu_embedding_inplace_ops.so -o fbgemm_gpu_embedding_inplace_ops.so CMakeFiles/fbgemm_gpu_embedding_inplace_ops.dir/src/embedding_inplace_ops/embedding_inplace_update_cpu.cpp.o CMakeFiles/fbgemm_gpu_embedding_inplace_ops.dir/src/embedding_inplace_ops/embedding_inplace_update_gpu.cpp.o CMakeFiles/fbgemm_gpu_embedding_inplace_ops.dir/src/embedding_inplace_ops/embedding_inplace_update.cu.o -L/lib/intel64 -L/lib/intel64_win -L/lib/win-x64 -Wl,-rpath,/lib/intel64:/lib/intel64_win:/lib/win-x64:/github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/lib:/github/home/miniconda/envs/build_binary/lib/stubs: /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/lib/libtorch.so /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/lib/libc10.so /github/home/miniconda/envs/build_binary/targets/x86_64-linux/lib/libnvrtc.so /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/lib/libc10_cuda.so /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/nvidia/nccl/lib/libnccl.so.2 /github/home/miniconda/envs/build_binary/targets/x86_64-linux/lib/stubs/libcuda.so /github/home/miniconda/envs/build_binary/lib/stubs/libnvidia-ml.so -Wl,--no-as-needed,"/github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/lib/libtorch_cpu.so" -Wl,--as-needed -Wl,--no-as-needed,"/github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/lib/libtorch_cuda.so" -Wl,--as-needed /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/lib/libc10_cuda.so /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/lib/libc10.so /github/home/miniconda/envs/build_binary/targets/x86_64-linux/lib/libcudart.so -Wl,--no-as-needed,"/github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/lib/libtorch.so" -Wl,--as-needed -lcudadevrt -lcudart_static -ldl && : 2025-05-07T20:06:12.1694088Z [558/608] cd /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.12/cmake-build && bash /__w/FBGEMM/FBGEMM/fbgemm_gpu/../.github/scripts/fbgemm_gpu_postbuild.bash /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.12/cmake-build/fbgemm_gpu_embedding_inplace_ops.so 2025-05-07T20:06:12.1696901Z ################################################################################ 2025-05-07T20:06:12.1697584Z [CMAKE] Running post-build script ... 2025-05-07T20:06:12.1698787Z Target file: /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.12/cmake-build/fbgemm_gpu_embedding_inplace_ops.so 2025-05-07T20:06:12.1699960Z Removing all RPATHs ... 2025-05-07T20:06:12.1700487Z ################################################################################ 2025-05-07T20:06:12.7947279Z [559/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_py_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_py.dir/src/permute_multi_embedding_ops/permute_multi_embedding_ops.cu.o -MF CMakeFiles/fbgemm_gpu_py.dir/src/permute_multi_embedding_ops/permute_multi_embedding_ops.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/permute_multi_embedding_ops/permute_multi_embedding_ops.cu -o CMakeFiles/fbgemm_gpu_py.dir/src/permute_multi_embedding_ops/permute_multi_embedding_ops.cu.o 2025-05-07T20:06:13.5993948Z [560/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_py_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_py.dir/src/quantize_ops/quantize_fp8_rowwise.cu.o -MF CMakeFiles/fbgemm_gpu_py.dir/src/quantize_ops/quantize_fp8_rowwise.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/quantize_ops/quantize_fp8_rowwise.cu -o CMakeFiles/fbgemm_gpu_py.dir/src/quantize_ops/quantize_fp8_rowwise.cu.o 2025-05-07T20:06:13.6016262Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/quantize_ops/quantize_fp8_rowwise.cu(19): warning #20050-D: inline qualifier ignored for "__global__" function 2025-05-07T20:06:13.6017881Z __attribute__((global)) inline void _float_to_FP8rowwise_cuda_kernel( 2025-05-07T20:06:13.6018783Z ^ 2025-05-07T20:06:13.6019039Z 2025-05-07T20:06:13.6019536Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:06:13.6020231Z 2025-05-07T20:06:13.6021217Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/quantize_ops/quantize_fp8_rowwise.cu(61): warning #20050-D: inline qualifier ignored for "__global__" function 2025-05-07T20:06:13.6022932Z __attribute__((global)) inline void _get_FP8_qparam_cuda_kernel( 2025-05-07T20:06:13.6023658Z ^ 2025-05-07T20:06:13.6023934Z 2025-05-07T20:06:13.6024932Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/quantize_ops/quantize_fp8_rowwise.cu(121): warning #20050-D: inline qualifier ignored for "__global__" function 2025-05-07T20:06:13.6026565Z __attribute__((global)) inline void _compute_FP8_quantize_cuda_kernel( 2025-05-07T20:06:13.6027330Z ^ 2025-05-07T20:06:13.6027575Z 2025-05-07T20:06:13.6028675Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/quantize_ops/quantize_fp8_rowwise.cu(161): warning #20050-D: inline qualifier ignored for "__global__" function 2025-05-07T20:06:13.6030301Z __attribute__((global)) inline void _FP8rowwise_to_float_cuda_kernel( 2025-05-07T20:06:13.6031086Z ^ 2025-05-07T20:06:13.6031334Z 2025-05-07T20:06:13.6032426Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/quantize_ops/quantize_fp8_rowwise.cu(19): warning #20050-D: inline qualifier ignored for "__global__" function 2025-05-07T20:06:13.6033989Z __attribute__((global)) inline void _float_to_FP8rowwise_cuda_kernel( 2025-05-07T20:06:13.6034774Z ^ 2025-05-07T20:06:13.6035020Z 2025-05-07T20:06:13.6035476Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:06:13.6036181Z 2025-05-07T20:06:13.6037156Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/quantize_ops/quantize_fp8_rowwise.cu(61): warning #20050-D: inline qualifier ignored for "__global__" function 2025-05-07T20:06:13.6038706Z __attribute__((global)) inline void _get_FP8_qparam_cuda_kernel( 2025-05-07T20:06:13.6039416Z ^ 2025-05-07T20:06:13.6039662Z 2025-05-07T20:06:13.6040667Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/quantize_ops/quantize_fp8_rowwise.cu(121): warning #20050-D: inline qualifier ignored for "__global__" function 2025-05-07T20:06:13.6042253Z __attribute__((global)) inline void _compute_FP8_quantize_cuda_kernel( 2025-05-07T20:06:13.6043025Z ^ 2025-05-07T20:06:13.6043331Z 2025-05-07T20:06:13.6044321Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/quantize_ops/quantize_fp8_rowwise.cu(161): warning #20050-D: inline qualifier ignored for "__global__" function 2025-05-07T20:06:13.6045932Z __attribute__((global)) inline void _FP8rowwise_to_float_cuda_kernel( 2025-05-07T20:06:13.6046708Z ^ 2025-05-07T20:06:13.6046952Z 2025-05-07T20:06:13.6047923Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/quantize_ops/quantize_fp8_rowwise.cu(19): warning #20050-D: inline qualifier ignored for "__global__" function 2025-05-07T20:06:13.6049515Z __attribute__((global)) inline void _float_to_FP8rowwise_cuda_kernel( 2025-05-07T20:06:13.6050267Z ^ 2025-05-07T20:06:13.6050528Z 2025-05-07T20:06:13.6050977Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:06:13.6051656Z 2025-05-07T20:06:13.6052657Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/quantize_ops/quantize_fp8_rowwise.cu(61): warning #20050-D: inline qualifier ignored for "__global__" function 2025-05-07T20:06:13.6054193Z __attribute__((global)) inline void _get_FP8_qparam_cuda_kernel( 2025-05-07T20:06:13.6054920Z ^ 2025-05-07T20:06:13.6055166Z 2025-05-07T20:06:13.6056147Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/quantize_ops/quantize_fp8_rowwise.cu(121): warning #20050-D: inline qualifier ignored for "__global__" function 2025-05-07T20:06:13.6057764Z __attribute__((global)) inline void _compute_FP8_quantize_cuda_kernel( 2025-05-07T20:06:13.6058547Z ^ 2025-05-07T20:06:13.6058790Z 2025-05-07T20:06:13.6059770Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/quantize_ops/quantize_fp8_rowwise.cu(161): warning #20050-D: inline qualifier ignored for "__global__" function 2025-05-07T20:06:13.6061422Z __attribute__((global)) inline void _FP8rowwise_to_float_cuda_kernel( 2025-05-07T20:06:13.6062170Z ^ 2025-05-07T20:06:13.6062439Z 2025-05-07T20:06:13.6063414Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/quantize_ops/quantize_fp8_rowwise.cu(19): warning #20050-D: inline qualifier ignored for "__global__" function 2025-05-07T20:06:13.6065377Z __attribute__((global)) inline void _float_to_FP8rowwise_cuda_kernel( 2025-05-07T20:06:13.6066168Z ^ 2025-05-07T20:06:13.6066441Z 2025-05-07T20:06:13.6066893Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:06:13.6067583Z 2025-05-07T20:06:13.6068579Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/quantize_ops/quantize_fp8_rowwise.cu(61): warning #20050-D: inline qualifier ignored for "__global__" function 2025-05-07T20:06:13.6070195Z __attribute__((global)) inline void _get_FP8_qparam_cuda_kernel( 2025-05-07T20:06:13.6070926Z ^ 2025-05-07T20:06:13.6071170Z 2025-05-07T20:06:13.6072223Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/quantize_ops/quantize_fp8_rowwise.cu(121): warning #20050-D: inline qualifier ignored for "__global__" function 2025-05-07T20:06:13.6073843Z __attribute__((global)) inline void _compute_FP8_quantize_cuda_kernel( 2025-05-07T20:06:13.6074626Z ^ 2025-05-07T20:06:13.6074870Z 2025-05-07T20:06:13.6075859Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/quantize_ops/quantize_fp8_rowwise.cu(161): warning #20050-D: inline qualifier ignored for "__global__" function 2025-05-07T20:06:13.6077466Z __attribute__((global)) inline void _FP8rowwise_to_float_cuda_kernel( 2025-05-07T20:06:13.6078213Z ^ 2025-05-07T20:06:13.6078483Z 2025-05-07T20:06:13.6079457Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/quantize_ops/quantize_fp8_rowwise.cu(19): warning #20050-D: inline qualifier ignored for "__global__" function 2025-05-07T20:06:13.6081055Z __attribute__((global)) inline void _float_to_FP8rowwise_cuda_kernel( 2025-05-07T20:06:13.6081811Z ^ 2025-05-07T20:06:13.6082082Z 2025-05-07T20:06:13.6082541Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:06:13.6083230Z 2025-05-07T20:06:13.6084204Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/quantize_ops/quantize_fp8_rowwise.cu(61): warning #20050-D: inline qualifier ignored for "__global__" function 2025-05-07T20:06:13.6085878Z __attribute__((global)) inline void _get_FP8_qparam_cuda_kernel( 2025-05-07T20:06:13.6086615Z ^ 2025-05-07T20:06:13.6086862Z 2025-05-07T20:06:13.6087855Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/quantize_ops/quantize_fp8_rowwise.cu(121): warning #20050-D: inline qualifier ignored for "__global__" function 2025-05-07T20:06:13.6089470Z __attribute__((global)) inline void _compute_FP8_quantize_cuda_kernel( 2025-05-07T20:06:13.6090230Z ^ 2025-05-07T20:06:13.6090505Z 2025-05-07T20:06:13.6091490Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/quantize_ops/quantize_fp8_rowwise.cu(161): warning #20050-D: inline qualifier ignored for "__global__" function 2025-05-07T20:06:13.6093097Z __attribute__((global)) inline void _FP8rowwise_to_float_cuda_kernel( 2025-05-07T20:06:13.6093846Z ^ 2025-05-07T20:06:13.6094091Z 2025-05-07T20:06:14.3902404Z [561/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_py_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_py.dir/src/quantize_ops/quantize_bfloat16.cu.o -MF CMakeFiles/fbgemm_gpu_py.dir/src/quantize_ops/quantize_bfloat16.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/quantize_ops/quantize_bfloat16.cu -o CMakeFiles/fbgemm_gpu_py.dir/src/quantize_ops/quantize_bfloat16.cu.o 2025-05-07T20:06:15.8624355Z [562/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_partial_rowwise_adam_split_unweighted_nobag_kernel_warp.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_partial_rowwise_adam_split_unweighted_nobag_kernel_warp.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.12/cmake-build/gen_embedding_backward_partial_rowwise_adam_split_unweighted_nobag_kernel_warp.cu -o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_partial_rowwise_adam_split_unweighted_nobag_kernel_warp.cu.o 2025-05-07T20:06:15.8637525Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:06:15.8638960Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:06:15.8639589Z ^ 2025-05-07T20:06:15.8639750Z 2025-05-07T20:06:15.8639998Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:06:15.8640358Z 2025-05-07T20:06:15.8642970Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:06:15.8644501Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:06:15.8645162Z ^ 2025-05-07T20:06:15.8645361Z 2025-05-07T20:06:15.8646307Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:06:15.8647722Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:06:15.8648383Z ^ 2025-05-07T20:06:15.8648535Z 2025-05-07T20:06:15.8648814Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:06:15.8649182Z 2025-05-07T20:06:15.8650055Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:06:15.8651554Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:06:15.8652198Z ^ 2025-05-07T20:06:15.8652431Z 2025-05-07T20:06:15.8653312Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:06:15.8654742Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:06:15.8655383Z ^ 2025-05-07T20:06:15.8655560Z 2025-05-07T20:06:15.8655811Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:06:15.8656178Z 2025-05-07T20:06:15.8657082Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:06:15.8658490Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:06:15.8659157Z ^ 2025-05-07T20:06:15.8659357Z 2025-05-07T20:06:15.8660222Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:06:15.8661705Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:06:15.8662365Z ^ 2025-05-07T20:06:15.8662515Z 2025-05-07T20:06:15.8662764Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:06:15.8663152Z 2025-05-07T20:06:15.8664066Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:06:15.8665909Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:06:15.8666555Z ^ 2025-05-07T20:06:15.8666786Z 2025-05-07T20:06:15.8667744Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:06:15.8669174Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:06:15.8669807Z ^ 2025-05-07T20:06:15.8669986Z 2025-05-07T20:06:15.8670240Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:06:15.8670603Z 2025-05-07T20:06:15.8671570Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:06:15.8673025Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:06:15.8673698Z ^ 2025-05-07T20:06:15.8673907Z 2025-05-07T20:06:16.9053203Z [563/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_index_select_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_index_select.dir/gen_batch_index_select_dim0_backward_kernel_cta.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_index_select.dir/gen_batch_index_select_dim0_backward_kernel_cta.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.12/cmake-build/gen_batch_index_select_dim0_backward_kernel_cta.cu -o CMakeFiles/fbgemm_gpu_tbe_index_select.dir/gen_batch_index_select_dim0_backward_kernel_cta.cu.o 2025-05-07T20:06:16.9065388Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:06:16.9066892Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:06:16.9067530Z ^ 2025-05-07T20:06:16.9067678Z 2025-05-07T20:06:16.9067923Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:06:16.9068283Z 2025-05-07T20:06:16.9069158Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:06:16.9070553Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:06:16.9071190Z ^ 2025-05-07T20:06:16.9071387Z 2025-05-07T20:06:16.9072349Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:06:16.9073740Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:06:16.9074370Z ^ 2025-05-07T20:06:16.9074512Z 2025-05-07T20:06:16.9074770Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:06:16.9075123Z 2025-05-07T20:06:16.9076046Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:06:16.9077454Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:06:16.9078079Z ^ 2025-05-07T20:06:16.9078289Z 2025-05-07T20:06:16.9079141Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:06:16.9080535Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:06:16.9081151Z ^ 2025-05-07T20:06:16.9081301Z 2025-05-07T20:06:16.9081542Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:06:16.9081897Z 2025-05-07T20:06:16.9082768Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:06:16.9084154Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:06:16.9084793Z ^ 2025-05-07T20:06:16.9084987Z 2025-05-07T20:06:16.9085909Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:06:16.9087284Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:06:16.9087968Z ^ 2025-05-07T20:06:16.9088113Z 2025-05-07T20:06:16.9088352Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:06:16.9088717Z 2025-05-07T20:06:16.9089579Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:06:16.9091025Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:06:16.9091657Z ^ 2025-05-07T20:06:16.9091869Z 2025-05-07T20:06:16.9092719Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:06:16.9094118Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:06:16.9094736Z ^ 2025-05-07T20:06:16.9094889Z 2025-05-07T20:06:16.9095127Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:06:16.9095482Z 2025-05-07T20:06:16.9096350Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:06:16.9097756Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:06:16.9098395Z ^ 2025-05-07T20:06:16.9098592Z 2025-05-07T20:06:17.5463370Z [564/608] : && /github/home/miniconda/envs/build_binary/bin/c++ -fPIC -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -L/github/home/miniconda/envs/build_binary/targets/x86_64-linux/lib -L/github/home/miniconda/envs/build_binary/targets/x86_64-linux/lib/stubs -s -shared -Wl,-soname,fbgemm_gpu_tbe_index_select.so -o fbgemm_gpu_tbe_index_select.so CMakeFiles/fbgemm_gpu_tbe_index_select.dir/codegen/training/index_select/batch_index_select_dim0_cpu_host.cpp.o CMakeFiles/fbgemm_gpu_tbe_index_select.dir/codegen/training/index_select/batch_index_select_dim0_ops.cpp.o CMakeFiles/fbgemm_gpu_tbe_index_select.dir/codegen/training/index_select/batch_index_select_dim0_host.cpp.o CMakeFiles/fbgemm_gpu_tbe_index_select.dir/gen_batch_index_select_dim0_forward_codegen_cuda.cu.o CMakeFiles/fbgemm_gpu_tbe_index_select.dir/gen_batch_index_select_dim0_forward_kernel.cu.o CMakeFiles/fbgemm_gpu_tbe_index_select.dir/gen_batch_index_select_dim0_forward_kernel_small.cu.o CMakeFiles/fbgemm_gpu_tbe_index_select.dir/gen_batch_index_select_dim0_backward_codegen_cuda.cu.o CMakeFiles/fbgemm_gpu_tbe_index_select.dir/gen_batch_index_select_dim0_backward_kernel_cta.cu.o CMakeFiles/fbgemm_gpu_tbe_index_select.dir/gen_batch_index_select_dim0_backward_kernel_warp.cu.o CMakeFiles/fbgemm_gpu_tbe_index_select.dir/gen_embedding_backward_split_grad_index_select.cu.o -L/lib/intel64 -L/lib/intel64_win -L/lib/win-x64 -Wl,-rpath,"\$ORIGIN" /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/lib/libc10.so /github/home/miniconda/envs/build_binary/targets/x86_64-linux/lib/libnvrtc.so /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/lib/libc10_cuda.so /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/nvidia/nccl/lib/libnccl.so.2 /github/home/miniconda/envs/build_binary/targets/x86_64-linux/lib/stubs/libcuda.so fbgemm_gpu_sparse_async_cumsum.so fbgemm_gpu_tbe_utils.so /github/home/miniconda/envs/build_binary/lib/stubs/libnvidia-ml.so -Wl,--no-as-needed,"/github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/lib/libtorch.so" -Wl,--as-needed /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/lib/libtorch.so -Wl,--no-as-needed,"/github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/lib/libtorch_cpu.so" -Wl,--as-needed -Wl,--no-as-needed,"/github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/lib/libtorch_cuda.so" -Wl,--as-needed /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/lib/libc10_cuda.so /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/lib/libc10.so /github/home/miniconda/envs/build_binary/targets/x86_64-linux/lib/libcudart.so -lcudadevrt -lcudart_static -ldl && : 2025-05-07T20:06:17.5839790Z [565/608] cd /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.12/cmake-build && bash /__w/FBGEMM/FBGEMM/fbgemm_gpu/../.github/scripts/fbgemm_gpu_postbuild.bash /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.12/cmake-build/fbgemm_gpu_tbe_index_select.so 1 2025-05-07T20:06:17.5841079Z ################################################################################ 2025-05-07T20:06:17.5841426Z [CMAKE] Running post-build script ... 2025-05-07T20:06:17.5842023Z Target file: /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.12/cmake-build/fbgemm_gpu_tbe_index_select.so 2025-05-07T20:06:17.5842611Z Resetting RPATH to $ORIGIN ... 2025-05-07T20:06:17.5842993Z 0x000000000000000f (RPATH) Library rpath: [$ORIGIN] 2025-05-07T20:06:17.5843435Z ################################################################################ 2025-05-07T20:06:21.5748376Z [566/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_py_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_py.dir/src/quantize_ops/quantize_fused_8bit_rowwise.cu.o -MF CMakeFiles/fbgemm_gpu_py.dir/src/quantize_ops/quantize_fused_8bit_rowwise.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/quantize_ops/quantize_fused_8bit_rowwise.cu -o CMakeFiles/fbgemm_gpu_py.dir/src/quantize_ops/quantize_fused_8bit_rowwise.cu.o 2025-05-07T20:06:21.5759654Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/quantize_ops/quantize_fused_8bit_rowwise.cu(19): warning #20050-D: inline qualifier ignored for "__global__" function 2025-05-07T20:06:21.5760536Z __attribute__((global)) inline void _float_to_fused8bitrowwise_cuda_kernel( 2025-05-07T20:06:21.5760957Z ^ 2025-05-07T20:06:21.5761107Z 2025-05-07T20:06:21.5761428Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:06:21.5761780Z 2025-05-07T20:06:21.5762325Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/quantize_ops/quantize_fused_8bit_rowwise.cu(52): warning #20050-D: inline qualifier ignored for "__global__" function 2025-05-07T20:06:21.5763143Z __attribute__((global)) inline void _get_8bit_qparam_cuda_kernel( 2025-05-07T20:06:21.5763536Z ^ 2025-05-07T20:06:21.5763668Z 2025-05-07T20:06:21.5764260Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/quantize_ops/quantize_fused_8bit_rowwise.cu(118): warning #20050-D: inline qualifier ignored for "__global__" function 2025-05-07T20:06:21.5765363Z __attribute__((global)) inline void _compute_8bit_quantize_cuda_kernel( 2025-05-07T20:06:21.5765779Z ^ 2025-05-07T20:06:21.5765910Z 2025-05-07T20:06:21.5766438Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/quantize_ops/quantize_fused_8bit_rowwise.cu(154): warning #20050-D: inline qualifier ignored for "__global__" function 2025-05-07T20:06:21.5767307Z __attribute__((global)) inline void _fused8bitrowwise_to_float_cuda_kernel( 2025-05-07T20:06:21.5767734Z ^ 2025-05-07T20:06:21.5767868Z 2025-05-07T20:06:21.5768393Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/quantize_ops/quantize_fused_8bit_rowwise.cu(195): warning #20050-D: inline qualifier ignored for "__global__" function 2025-05-07T20:06:21.5769302Z __attribute__((global)) inline void _fused8bitrowwise_to_float_mixed_dim_cuda_kernel( 2025-05-07T20:06:21.5769844Z ^ 2025-05-07T20:06:21.5769985Z 2025-05-07T20:06:21.5770508Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/quantize_ops/quantize_fused_8bit_rowwise.cu(19): warning #20050-D: inline qualifier ignored for "__global__" function 2025-05-07T20:06:21.5771373Z __attribute__((global)) inline void _float_to_fused8bitrowwise_cuda_kernel( 2025-05-07T20:06:21.5771787Z ^ 2025-05-07T20:06:21.5771916Z 2025-05-07T20:06:21.5772172Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:06:21.5772526Z 2025-05-07T20:06:21.5773052Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/quantize_ops/quantize_fused_8bit_rowwise.cu(52): warning #20050-D: inline qualifier ignored for "__global__" function 2025-05-07T20:06:21.5773886Z __attribute__((global)) inline void _get_8bit_qparam_cuda_kernel( 2025-05-07T20:06:21.5774277Z ^ 2025-05-07T20:06:21.5774407Z 2025-05-07T20:06:21.5774931Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/quantize_ops/quantize_fused_8bit_rowwise.cu(118): warning #20050-D: inline qualifier ignored for "__global__" function 2025-05-07T20:06:21.5775792Z __attribute__((global)) inline void _compute_8bit_quantize_cuda_kernel( 2025-05-07T20:06:21.5776192Z ^ 2025-05-07T20:06:21.5776333Z 2025-05-07T20:06:21.5776860Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/quantize_ops/quantize_fused_8bit_rowwise.cu(154): warning #20050-D: inline qualifier ignored for "__global__" function 2025-05-07T20:06:21.5777724Z __attribute__((global)) inline void _fused8bitrowwise_to_float_cuda_kernel( 2025-05-07T20:06:21.5778130Z ^ 2025-05-07T20:06:21.5778260Z 2025-05-07T20:06:21.5778799Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/quantize_ops/quantize_fused_8bit_rowwise.cu(195): warning #20050-D: inline qualifier ignored for "__global__" function 2025-05-07T20:06:21.5779767Z __attribute__((global)) inline void _fused8bitrowwise_to_float_mixed_dim_cuda_kernel( 2025-05-07T20:06:21.5780229Z ^ 2025-05-07T20:06:21.5780360Z 2025-05-07T20:06:21.5780964Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/quantize_ops/quantize_fused_8bit_rowwise.cu(19): warning #20050-D: inline qualifier ignored for "__global__" function 2025-05-07T20:06:21.5781815Z __attribute__((global)) inline void _float_to_fused8bitrowwise_cuda_kernel( 2025-05-07T20:06:21.5782233Z ^ 2025-05-07T20:06:21.5782362Z 2025-05-07T20:06:21.5782601Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:06:21.5782965Z 2025-05-07T20:06:21.5783538Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/quantize_ops/quantize_fused_8bit_rowwise.cu(52): warning #20050-D: inline qualifier ignored for "__global__" function 2025-05-07T20:06:21.5784373Z __attribute__((global)) inline void _get_8bit_qparam_cuda_kernel( 2025-05-07T20:06:21.5784743Z ^ 2025-05-07T20:06:21.5784872Z 2025-05-07T20:06:21.5785411Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/quantize_ops/quantize_fused_8bit_rowwise.cu(118): warning #20050-D: inline qualifier ignored for "__global__" function 2025-05-07T20:06:21.5786258Z __attribute__((global)) inline void _compute_8bit_quantize_cuda_kernel( 2025-05-07T20:06:21.5786672Z ^ 2025-05-07T20:06:21.5786801Z 2025-05-07T20:06:21.5787344Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/quantize_ops/quantize_fused_8bit_rowwise.cu(154): warning #20050-D: inline qualifier ignored for "__global__" function 2025-05-07T20:06:21.5788192Z __attribute__((global)) inline void _fused8bitrowwise_to_float_cuda_kernel( 2025-05-07T20:06:21.5788617Z ^ 2025-05-07T20:06:21.5788745Z 2025-05-07T20:06:21.5789273Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/quantize_ops/quantize_fused_8bit_rowwise.cu(195): warning #20050-D: inline qualifier ignored for "__global__" function 2025-05-07T20:06:21.5790171Z __attribute__((global)) inline void _fused8bitrowwise_to_float_mixed_dim_cuda_kernel( 2025-05-07T20:06:21.5790631Z ^ 2025-05-07T20:06:21.5790757Z 2025-05-07T20:06:21.5791279Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/quantize_ops/quantize_fused_8bit_rowwise.cu(19): warning #20050-D: inline qualifier ignored for "__global__" function 2025-05-07T20:06:21.5792315Z __attribute__((global)) inline void _float_to_fused8bitrowwise_cuda_kernel( 2025-05-07T20:06:21.5792727Z ^ 2025-05-07T20:06:21.5792873Z 2025-05-07T20:06:21.5793116Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:06:21.5793469Z 2025-05-07T20:06:21.5794003Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/quantize_ops/quantize_fused_8bit_rowwise.cu(52): warning #20050-D: inline qualifier ignored for "__global__" function 2025-05-07T20:06:21.5794822Z __attribute__((global)) inline void _get_8bit_qparam_cuda_kernel( 2025-05-07T20:06:21.5795217Z ^ 2025-05-07T20:06:21.5795348Z 2025-05-07T20:06:21.5795868Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/quantize_ops/quantize_fused_8bit_rowwise.cu(118): warning #20050-D: inline qualifier ignored for "__global__" function 2025-05-07T20:06:21.5796720Z __attribute__((global)) inline void _compute_8bit_quantize_cuda_kernel( 2025-05-07T20:06:21.5797128Z ^ 2025-05-07T20:06:21.5797256Z 2025-05-07T20:06:21.5797783Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/quantize_ops/quantize_fused_8bit_rowwise.cu(154): warning #20050-D: inline qualifier ignored for "__global__" function 2025-05-07T20:06:21.5798651Z __attribute__((global)) inline void _fused8bitrowwise_to_float_cuda_kernel( 2025-05-07T20:06:21.5799066Z ^ 2025-05-07T20:06:21.5799207Z 2025-05-07T20:06:21.5799733Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/quantize_ops/quantize_fused_8bit_rowwise.cu(195): warning #20050-D: inline qualifier ignored for "__global__" function 2025-05-07T20:06:21.5800707Z __attribute__((global)) inline void _fused8bitrowwise_to_float_mixed_dim_cuda_kernel( 2025-05-07T20:06:21.5801156Z ^ 2025-05-07T20:06:21.5801284Z 2025-05-07T20:06:21.5801817Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/quantize_ops/quantize_fused_8bit_rowwise.cu(19): warning #20050-D: inline qualifier ignored for "__global__" function 2025-05-07T20:06:21.5802712Z __attribute__((global)) inline void _float_to_fused8bitrowwise_cuda_kernel( 2025-05-07T20:06:21.5803136Z ^ 2025-05-07T20:06:21.5803296Z 2025-05-07T20:06:21.5803534Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:06:21.5803885Z 2025-05-07T20:06:21.5804407Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/quantize_ops/quantize_fused_8bit_rowwise.cu(52): warning #20050-D: inline qualifier ignored for "__global__" function 2025-05-07T20:06:21.5805235Z __attribute__((global)) inline void _get_8bit_qparam_cuda_kernel( 2025-05-07T20:06:21.5805663Z ^ 2025-05-07T20:06:21.5805800Z 2025-05-07T20:06:21.5806325Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/quantize_ops/quantize_fused_8bit_rowwise.cu(118): warning #20050-D: inline qualifier ignored for "__global__" function 2025-05-07T20:06:21.5807181Z __attribute__((global)) inline void _compute_8bit_quantize_cuda_kernel( 2025-05-07T20:06:21.5807587Z ^ 2025-05-07T20:06:21.5807735Z 2025-05-07T20:06:21.5808263Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/quantize_ops/quantize_fused_8bit_rowwise.cu(154): warning #20050-D: inline qualifier ignored for "__global__" function 2025-05-07T20:06:21.5809129Z __attribute__((global)) inline void _fused8bitrowwise_to_float_cuda_kernel( 2025-05-07T20:06:21.5809540Z ^ 2025-05-07T20:06:21.5809682Z 2025-05-07T20:06:21.5810210Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/quantize_ops/quantize_fused_8bit_rowwise.cu(195): warning #20050-D: inline qualifier ignored for "__global__" function 2025-05-07T20:06:21.5811103Z __attribute__((global)) inline void _fused8bitrowwise_to_float_mixed_dim_cuda_kernel( 2025-05-07T20:06:21.5811573Z ^ 2025-05-07T20:06:21.5811704Z 2025-05-07T20:06:24.6364030Z [567/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_py_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_py.dir/src/quantize_ops/quantize_mx.cu.o -MF CMakeFiles/fbgemm_gpu_py.dir/src/quantize_ops/quantize_mx.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/quantize_ops/quantize_mx.cu -o CMakeFiles/fbgemm_gpu_py.dir/src/quantize_ops/quantize_mx.cu.o 2025-05-07T20:06:25.8227172Z [568/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_py_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_py.dir/src/quantize_ops/quantize_fused_nbit_rowwise.cu.o -MF CMakeFiles/fbgemm_gpu_py.dir/src/quantize_ops/quantize_fused_nbit_rowwise.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/quantize_ops/quantize_fused_nbit_rowwise.cu -o CMakeFiles/fbgemm_gpu_py.dir/src/quantize_ops/quantize_fused_nbit_rowwise.cu.o 2025-05-07T20:06:25.8238563Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/quantize_ops/quantize_fused_nbit_rowwise.cu(19): warning #20050-D: inline qualifier ignored for "__global__" function 2025-05-07T20:06:25.8239435Z __attribute__((global)) inline void _float_to_fusednbitrowwise_cuda_kernel( 2025-05-07T20:06:25.8239869Z ^ 2025-05-07T20:06:25.8240005Z 2025-05-07T20:06:25.8240264Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:06:25.8240620Z 2025-05-07T20:06:25.8241152Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/quantize_ops/quantize_fused_nbit_rowwise.cu(78): warning #20050-D: inline qualifier ignored for "__global__" function 2025-05-07T20:06:25.8242022Z __attribute__((global)) inline void _fusednbitrowwise_to_float_cuda_kernel( 2025-05-07T20:06:25.8242440Z ^ 2025-05-07T20:06:25.8242587Z 2025-05-07T20:06:25.8243115Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/quantize_ops/quantize_fused_nbit_rowwise.cu(19): warning #20050-D: inline qualifier ignored for "__global__" function 2025-05-07T20:06:25.8243978Z __attribute__((global)) inline void _float_to_fusednbitrowwise_cuda_kernel( 2025-05-07T20:06:25.8244490Z ^ 2025-05-07T20:06:25.8244631Z 2025-05-07T20:06:25.8244865Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:06:25.8245259Z 2025-05-07T20:06:25.8245794Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/quantize_ops/quantize_fused_nbit_rowwise.cu(78): warning #20050-D: inline qualifier ignored for "__global__" function 2025-05-07T20:06:25.8246685Z __attribute__((global)) inline void _fusednbitrowwise_to_float_cuda_kernel( 2025-05-07T20:06:25.8247107Z ^ 2025-05-07T20:06:25.8247235Z 2025-05-07T20:06:25.8247757Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/quantize_ops/quantize_fused_nbit_rowwise.cu(19): warning #20050-D: inline qualifier ignored for "__global__" function 2025-05-07T20:06:25.8248621Z __attribute__((global)) inline void _float_to_fusednbitrowwise_cuda_kernel( 2025-05-07T20:06:25.8249041Z ^ 2025-05-07T20:06:25.8249168Z 2025-05-07T20:06:25.8249450Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:06:25.8249824Z 2025-05-07T20:06:25.8250345Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/quantize_ops/quantize_fused_nbit_rowwise.cu(78): warning #20050-D: inline qualifier ignored for "__global__" function 2025-05-07T20:06:25.8251196Z __attribute__((global)) inline void _fusednbitrowwise_to_float_cuda_kernel( 2025-05-07T20:06:25.8251617Z ^ 2025-05-07T20:06:25.8251750Z 2025-05-07T20:06:25.8252284Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/quantize_ops/quantize_fused_nbit_rowwise.cu(19): warning #20050-D: inline qualifier ignored for "__global__" function 2025-05-07T20:06:25.8253133Z __attribute__((global)) inline void _float_to_fusednbitrowwise_cuda_kernel( 2025-05-07T20:06:25.8253556Z ^ 2025-05-07T20:06:25.8253683Z 2025-05-07T20:06:25.8253920Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:06:25.8254285Z 2025-05-07T20:06:25.8254809Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/quantize_ops/quantize_fused_nbit_rowwise.cu(78): warning #20050-D: inline qualifier ignored for "__global__" function 2025-05-07T20:06:25.8255665Z __attribute__((global)) inline void _fusednbitrowwise_to_float_cuda_kernel( 2025-05-07T20:06:25.8256073Z ^ 2025-05-07T20:06:25.8256218Z 2025-05-07T20:06:25.8256738Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/quantize_ops/quantize_fused_nbit_rowwise.cu(19): warning #20050-D: inline qualifier ignored for "__global__" function 2025-05-07T20:06:25.8257621Z __attribute__((global)) inline void _float_to_fusednbitrowwise_cuda_kernel( 2025-05-07T20:06:25.8258045Z ^ 2025-05-07T20:06:25.8258174Z 2025-05-07T20:06:25.8258423Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:06:25.8258773Z 2025-05-07T20:06:25.8259292Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/quantize_ops/quantize_fused_nbit_rowwise.cu(78): warning #20050-D: inline qualifier ignored for "__global__" function 2025-05-07T20:06:25.8260152Z __attribute__((global)) inline void _fusednbitrowwise_to_float_cuda_kernel( 2025-05-07T20:06:25.8260567Z ^ 2025-05-07T20:06:25.8260707Z 2025-05-07T20:06:29.4633214Z [569/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_py_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_py.dir/src/quantize_ops/quantize_msfp.cu.o -MF CMakeFiles/fbgemm_gpu_py.dir/src/quantize_ops/quantize_msfp.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/quantize_ops/quantize_msfp.cu -o CMakeFiles/fbgemm_gpu_py.dir/src/quantize_ops/quantize_msfp.cu.o 2025-05-07T20:06:29.4655841Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/quantize_ops/quantize_msfp.cu(73): warning #20050-D: inline qualifier ignored for "__global__" function 2025-05-07T20:06:29.4657319Z __attribute__((global)) inline void _compute_msfp_shared_exponent_cuda_kernel( 2025-05-07T20:06:29.4658093Z ^ 2025-05-07T20:06:29.4658317Z 2025-05-07T20:06:29.4658748Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:06:29.4659384Z 2025-05-07T20:06:29.4660255Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/quantize_ops/quantize_msfp.cu(73): warning #20050-D: inline qualifier ignored for "__global__" function 2025-05-07T20:06:29.4661748Z __attribute__((global)) inline void _compute_msfp_shared_exponent_cuda_kernel( 2025-05-07T20:06:29.4662506Z ^ 2025-05-07T20:06:29.4662728Z 2025-05-07T20:06:29.4663150Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:06:29.4664001Z 2025-05-07T20:06:29.4665082Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/quantize_ops/quantize_msfp.cu(73): warning #20050-D: inline qualifier ignored for "__global__" function 2025-05-07T20:06:29.4666604Z __attribute__((global)) inline void _compute_msfp_shared_exponent_cuda_kernel( 2025-05-07T20:06:29.4667358Z ^ 2025-05-07T20:06:29.4667572Z 2025-05-07T20:06:29.4668028Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:06:29.4668641Z 2025-05-07T20:06:29.4669486Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/quantize_ops/quantize_msfp.cu(73): warning #20050-D: inline qualifier ignored for "__global__" function 2025-05-07T20:06:29.4670894Z __attribute__((global)) inline void _compute_msfp_shared_exponent_cuda_kernel( 2025-05-07T20:06:29.4671748Z ^ 2025-05-07T20:06:29.4672001Z 2025-05-07T20:06:29.4672423Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:06:29.4673009Z 2025-05-07T20:06:29.4673878Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/quantize_ops/quantize_msfp.cu(73): warning #20050-D: inline qualifier ignored for "__global__" function 2025-05-07T20:06:29.4675293Z __attribute__((global)) inline void _compute_msfp_shared_exponent_cuda_kernel( 2025-05-07T20:06:29.4676041Z ^ 2025-05-07T20:06:29.4676265Z 2025-05-07T20:06:29.4676699Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:06:29.4677308Z 2025-05-07T20:06:29.6417560Z [570/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_py_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_py.dir/src/jagged_tensor_ops/keyed_jagged_index_select_dim1.cu.o -MF CMakeFiles/fbgemm_gpu_py.dir/src/jagged_tensor_ops/keyed_jagged_index_select_dim1.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/jagged_tensor_ops/keyed_jagged_index_select_dim1.cu -o CMakeFiles/fbgemm_gpu_py.dir/src/jagged_tensor_ops/keyed_jagged_index_select_dim1.cu.o 2025-05-07T20:06:29.7308419Z [571/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_py_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_py.dir/src/quantize_ops/quantize_padded_fp8_rowwise.cu.o -MF CMakeFiles/fbgemm_gpu_py.dir/src/quantize_ops/quantize_padded_fp8_rowwise.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/quantize_ops/quantize_padded_fp8_rowwise.cu -o CMakeFiles/fbgemm_gpu_py.dir/src/quantize_ops/quantize_padded_fp8_rowwise.cu.o 2025-05-07T20:06:29.7328510Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/quantize_ops/quantize_padded_fp8_rowwise.cu(19): warning #20050-D: inline qualifier ignored for "__global__" function 2025-05-07T20:06:29.7330272Z __attribute__((global)) inline void _float_to_paddedFP8rowwise_cuda_kernel( 2025-05-07T20:06:29.7331024Z ^ 2025-05-07T20:06:29.7331237Z 2025-05-07T20:06:29.7331657Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:06:29.7332278Z 2025-05-07T20:06:29.7333212Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/quantize_ops/quantize_padded_fp8_rowwise.cu(94): warning #20050-D: inline qualifier ignored for "__global__" function 2025-05-07T20:06:29.7334663Z __attribute__((global)) inline void _get_padding_value_kernel( 2025-05-07T20:06:29.7335309Z ^ 2025-05-07T20:06:29.7335552Z 2025-05-07T20:06:29.7336525Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/quantize_ops/quantize_padded_fp8_rowwise.cu(110): warning #20050-D: inline qualifier ignored for "__global__" function 2025-05-07T20:06:29.7338076Z __attribute__((global)) inline void _single_thread_sum_padding_kernel( 2025-05-07T20:06:29.7338773Z ^ 2025-05-07T20:06:29.7338989Z 2025-05-07T20:06:29.7340013Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/quantize_ops/quantize_padded_fp8_rowwise.cu(137): warning #20050-D: inline qualifier ignored for "__global__" function 2025-05-07T20:06:29.7341587Z __attribute__((global)) inline void _PaddedFP8rowwise_to_float_1d_cuda_kernel( 2025-05-07T20:06:29.7342378Z ^ 2025-05-07T20:06:29.7342600Z 2025-05-07T20:06:29.7343562Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/quantize_ops/quantize_padded_fp8_rowwise.cu(166): warning #20050-D: inline qualifier ignored for "__global__" function 2025-05-07T20:06:29.7345289Z __attribute__((global)) inline void _PaddedFP8rowwise_to_float_2d_cuda_kernel( 2025-05-07T20:06:29.7346058Z ^ 2025-05-07T20:06:29.7346284Z 2025-05-07T20:06:29.7347213Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/quantize_ops/quantize_padded_fp8_rowwise.cu(19): warning #20050-D: inline qualifier ignored for "__global__" function 2025-05-07T20:06:29.7348640Z __attribute__((global)) inline void _float_to_paddedFP8rowwise_cuda_kernel( 2025-05-07T20:06:29.7349389Z ^ 2025-05-07T20:06:29.7349650Z 2025-05-07T20:06:29.7350100Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:06:29.7350735Z 2025-05-07T20:06:29.7351812Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/quantize_ops/quantize_padded_fp8_rowwise.cu(94): warning #20050-D: inline qualifier ignored for "__global__" function 2025-05-07T20:06:29.7353251Z __attribute__((global)) inline void _get_padding_value_kernel( 2025-05-07T20:06:29.7353925Z ^ 2025-05-07T20:06:29.7354140Z 2025-05-07T20:06:29.7355093Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/quantize_ops/quantize_padded_fp8_rowwise.cu(110): warning #20050-D: inline qualifier ignored for "__global__" function 2025-05-07T20:06:29.7356620Z __attribute__((global)) inline void _single_thread_sum_padding_kernel( 2025-05-07T20:06:29.7357303Z ^ 2025-05-07T20:06:29.7357535Z 2025-05-07T20:06:29.7358507Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/quantize_ops/quantize_padded_fp8_rowwise.cu(137): warning #20050-D: inline qualifier ignored for "__global__" function 2025-05-07T20:06:29.7360283Z __attribute__((global)) inline void _PaddedFP8rowwise_to_float_1d_cuda_kernel( 2025-05-07T20:06:29.7361027Z ^ 2025-05-07T20:06:29.7361263Z 2025-05-07T20:06:29.7362189Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/quantize_ops/quantize_padded_fp8_rowwise.cu(166): warning #20050-D: inline qualifier ignored for "__global__" function 2025-05-07T20:06:29.7363715Z __attribute__((global)) inline void _PaddedFP8rowwise_to_float_2d_cuda_kernel( 2025-05-07T20:06:29.7364567Z ^ 2025-05-07T20:06:29.7365038Z 2025-05-07T20:06:29.7366077Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/quantize_ops/quantize_padded_fp8_rowwise.cu(19): warning #20050-D: inline qualifier ignored for "__global__" function 2025-05-07T20:06:29.7367570Z __attribute__((global)) inline void _float_to_paddedFP8rowwise_cuda_kernel( 2025-05-07T20:06:29.7368283Z ^ 2025-05-07T20:06:29.7368507Z 2025-05-07T20:06:29.7369132Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:06:29.7369763Z 2025-05-07T20:06:29.7370685Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/quantize_ops/quantize_padded_fp8_rowwise.cu(94): warning #20050-D: inline qualifier ignored for "__global__" function 2025-05-07T20:06:29.7372103Z __attribute__((global)) inline void _get_padding_value_kernel( 2025-05-07T20:06:29.7372726Z ^ 2025-05-07T20:06:29.7372979Z 2025-05-07T20:06:29.7373914Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/quantize_ops/quantize_padded_fp8_rowwise.cu(110): warning #20050-D: inline qualifier ignored for "__global__" function 2025-05-07T20:06:29.7375474Z __attribute__((global)) inline void _single_thread_sum_padding_kernel( 2025-05-07T20:06:29.7376157Z ^ 2025-05-07T20:06:29.7376392Z 2025-05-07T20:06:29.7377367Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/quantize_ops/quantize_padded_fp8_rowwise.cu(137): warning #20050-D: inline qualifier ignored for "__global__" function 2025-05-07T20:06:29.7378974Z __attribute__((global)) inline void _PaddedFP8rowwise_to_float_1d_cuda_kernel( 2025-05-07T20:06:29.7379750Z ^ 2025-05-07T20:06:29.7379976Z 2025-05-07T20:06:29.7380940Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/quantize_ops/quantize_padded_fp8_rowwise.cu(166): warning #20050-D: inline qualifier ignored for "__global__" function 2025-05-07T20:06:29.7382500Z __attribute__((global)) inline void _PaddedFP8rowwise_to_float_2d_cuda_kernel( 2025-05-07T20:06:29.7383445Z ^ 2025-05-07T20:06:29.7383684Z 2025-05-07T20:06:29.7384578Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/quantize_ops/quantize_padded_fp8_rowwise.cu(19): warning #20050-D: inline qualifier ignored for "__global__" function 2025-05-07T20:06:29.7386080Z __attribute__((global)) inline void _float_to_paddedFP8rowwise_cuda_kernel( 2025-05-07T20:06:29.7386832Z ^ 2025-05-07T20:06:29.7387049Z 2025-05-07T20:06:29.7387461Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:06:29.7388074Z 2025-05-07T20:06:29.7389051Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/quantize_ops/quantize_padded_fp8_rowwise.cu(94): warning #20050-D: inline qualifier ignored for "__global__" function 2025-05-07T20:06:29.7390520Z __attribute__((global)) inline void _get_padding_value_kernel( 2025-05-07T20:06:29.7391156Z ^ 2025-05-07T20:06:29.7391384Z 2025-05-07T20:06:29.7392428Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/quantize_ops/quantize_padded_fp8_rowwise.cu(110): warning #20050-D: inline qualifier ignored for "__global__" function 2025-05-07T20:06:29.7393930Z __attribute__((global)) inline void _single_thread_sum_padding_kernel( 2025-05-07T20:06:29.7394653Z ^ 2025-05-07T20:06:29.7394877Z 2025-05-07T20:06:29.7395817Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/quantize_ops/quantize_padded_fp8_rowwise.cu(137): warning #20050-D: inline qualifier ignored for "__global__" function 2025-05-07T20:06:29.7397330Z __attribute__((global)) inline void _PaddedFP8rowwise_to_float_1d_cuda_kernel( 2025-05-07T20:06:29.7398089Z ^ 2025-05-07T20:06:29.7398310Z 2025-05-07T20:06:29.7399415Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/quantize_ops/quantize_padded_fp8_rowwise.cu(166): warning #20050-D: inline qualifier ignored for "__global__" function 2025-05-07T20:06:29.7401003Z __attribute__((global)) inline void _PaddedFP8rowwise_to_float_2d_cuda_kernel( 2025-05-07T20:06:29.7401731Z ^ 2025-05-07T20:06:29.7401976Z 2025-05-07T20:06:29.7403050Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/quantize_ops/quantize_padded_fp8_rowwise.cu(19): warning #20050-D: inline qualifier ignored for "__global__" function 2025-05-07T20:06:29.7404613Z __attribute__((global)) inline void _float_to_paddedFP8rowwise_cuda_kernel( 2025-05-07T20:06:29.7405330Z ^ 2025-05-07T20:06:29.7405562Z 2025-05-07T20:06:29.7405991Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:06:29.7406621Z 2025-05-07T20:06:29.7407678Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/quantize_ops/quantize_padded_fp8_rowwise.cu(94): warning #20050-D: inline qualifier ignored for "__global__" function 2025-05-07T20:06:29.7409148Z __attribute__((global)) inline void _get_padding_value_kernel( 2025-05-07T20:06:29.7409799Z ^ 2025-05-07T20:06:29.7410027Z 2025-05-07T20:06:29.7410944Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/quantize_ops/quantize_padded_fp8_rowwise.cu(110): warning #20050-D: inline qualifier ignored for "__global__" function 2025-05-07T20:06:29.7412508Z __attribute__((global)) inline void _single_thread_sum_padding_kernel( 2025-05-07T20:06:29.7413173Z ^ 2025-05-07T20:06:29.7413409Z 2025-05-07T20:06:29.7414350Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/quantize_ops/quantize_padded_fp8_rowwise.cu(137): warning #20050-D: inline qualifier ignored for "__global__" function 2025-05-07T20:06:29.7415932Z __attribute__((global)) inline void _PaddedFP8rowwise_to_float_1d_cuda_kernel( 2025-05-07T20:06:29.7416673Z ^ 2025-05-07T20:06:29.7416908Z 2025-05-07T20:06:29.7417880Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/quantize_ops/quantize_padded_fp8_rowwise.cu(166): warning #20050-D: inline qualifier ignored for "__global__" function 2025-05-07T20:06:29.7419418Z __attribute__((global)) inline void _PaddedFP8rowwise_to_float_2d_cuda_kernel( 2025-05-07T20:06:29.7420181Z ^ 2025-05-07T20:06:29.7420409Z 2025-05-07T20:06:31.2307427Z [572/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_py_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_py.dir/src/quantize_ops/quantize_hfp8.cu.o -MF CMakeFiles/fbgemm_gpu_py.dir/src/quantize_ops/quantize_hfp8.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/quantize_ops/quantize_hfp8.cu -o CMakeFiles/fbgemm_gpu_py.dir/src/quantize_ops/quantize_hfp8.cu.o 2025-05-07T20:06:38.2948869Z [573/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_py_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_py.dir/src/jagged_tensor_ops/jagged_to_padded_dense_forward.cu.o -MF CMakeFiles/fbgemm_gpu_py.dir/src/jagged_tensor_ops/jagged_to_padded_dense_forward.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/jagged_tensor_ops/jagged_to_padded_dense_forward.cu -o CMakeFiles/fbgemm_gpu_py.dir/src/jagged_tensor_ops/jagged_to_padded_dense_forward.cu.o 2025-05-07T20:06:42.3918195Z [574/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_py_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_py.dir/src/sparse_ops/sparse_bucketize_features.cu.o -MF CMakeFiles/fbgemm_gpu_py.dir/src/sparse_ops/sparse_bucketize_features.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/sparse_ops/sparse_bucketize_features.cu -o CMakeFiles/fbgemm_gpu_py.dir/src/sparse_ops/sparse_bucketize_features.cu.o 2025-05-07T20:06:42.3929612Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:06:42.3931027Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:06:42.3931661Z ^ 2025-05-07T20:06:42.3931813Z 2025-05-07T20:06:42.3932056Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:06:42.3932421Z 2025-05-07T20:06:42.3933328Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:06:42.3934831Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:06:42.3935485Z ^ 2025-05-07T20:06:42.3935686Z 2025-05-07T20:06:42.3936558Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:06:42.3937956Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:06:42.3938595Z ^ 2025-05-07T20:06:42.3938735Z 2025-05-07T20:06:42.3938980Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:06:42.3939350Z 2025-05-07T20:06:42.3940258Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:06:42.3941674Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:06:42.3942328Z ^ 2025-05-07T20:06:42.3942525Z 2025-05-07T20:06:42.3943386Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:06:42.3944844Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:06:42.3945465Z ^ 2025-05-07T20:06:42.3945622Z 2025-05-07T20:06:42.3945864Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:06:42.3946223Z 2025-05-07T20:06:42.3947155Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:06:42.3948552Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:06:42.3949193Z ^ 2025-05-07T20:06:42.3949389Z 2025-05-07T20:06:42.3950303Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:06:42.3951798Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:06:42.3952467Z ^ 2025-05-07T20:06:42.3952613Z 2025-05-07T20:06:42.3952856Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:06:42.3953227Z 2025-05-07T20:06:42.3954088Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:06:42.3955495Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:06:42.3956123Z ^ 2025-05-07T20:06:42.3956334Z 2025-05-07T20:06:42.3957190Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:06:42.3958588Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:06:42.3959277Z ^ 2025-05-07T20:06:42.3959430Z 2025-05-07T20:06:42.3959671Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:06:42.3960026Z 2025-05-07T20:06:42.3960897Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:06:42.3962292Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:06:42.3962930Z ^ 2025-05-07T20:06:42.3963124Z 2025-05-07T20:06:43.3407396Z [575/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_py_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_py.dir/src/sparse_ops/sparse_batched_unary_embeddings.cu.o -MF CMakeFiles/fbgemm_gpu_py.dir/src/sparse_ops/sparse_batched_unary_embeddings.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/sparse_ops/sparse_batched_unary_embeddings.cu -o CMakeFiles/fbgemm_gpu_py.dir/src/sparse_ops/sparse_batched_unary_embeddings.cu.o 2025-05-07T20:06:43.3418850Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:06:43.3420257Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:06:43.3420933Z ^ 2025-05-07T20:06:43.3421090Z 2025-05-07T20:06:43.3421336Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:06:43.3421695Z 2025-05-07T20:06:43.3422565Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:06:43.3424049Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:06:43.3424693Z ^ 2025-05-07T20:06:43.3424892Z 2025-05-07T20:06:43.3425752Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:06:43.3427158Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:06:43.3427790Z ^ 2025-05-07T20:06:43.3427930Z 2025-05-07T20:06:43.3428173Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:06:43.3428541Z 2025-05-07T20:06:43.3429406Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:06:43.3430811Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:06:43.3431438Z ^ 2025-05-07T20:06:43.3431737Z 2025-05-07T20:06:43.3432609Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:06:43.3434041Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:06:43.3434670Z ^ 2025-05-07T20:06:43.3434812Z 2025-05-07T20:06:43.3435099Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:06:43.3435453Z 2025-05-07T20:06:43.3436312Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:06:43.3437752Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:06:43.3438396Z ^ 2025-05-07T20:06:43.3438595Z 2025-05-07T20:06:43.3439449Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:06:43.3440838Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:06:43.3441467Z ^ 2025-05-07T20:06:43.3441605Z 2025-05-07T20:06:43.3441843Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:06:43.3442195Z 2025-05-07T20:06:43.3443072Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:06:43.3444469Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:06:43.3445109Z ^ 2025-05-07T20:06:43.3445304Z 2025-05-07T20:06:43.3446167Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:06:43.3447579Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:06:43.3448200Z ^ 2025-05-07T20:06:43.3448339Z 2025-05-07T20:06:43.3448592Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:06:43.3448944Z 2025-05-07T20:06:43.3449812Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:06:43.3451209Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:06:43.3451835Z ^ 2025-05-07T20:06:43.3452045Z 2025-05-07T20:06:47.0152814Z [576/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_py_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_py.dir/src/sparse_ops/sparse_async_batched_cumsum.cu.o -MF CMakeFiles/fbgemm_gpu_py.dir/src/sparse_ops/sparse_async_batched_cumsum.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/sparse_ops/sparse_async_batched_cumsum.cu -o CMakeFiles/fbgemm_gpu_py.dir/src/sparse_ops/sparse_async_batched_cumsum.cu.o 2025-05-07T20:06:47.0165925Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:06:47.0167332Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:06:47.0167973Z ^ 2025-05-07T20:06:47.0168117Z 2025-05-07T20:06:47.0168368Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:06:47.0168850Z 2025-05-07T20:06:47.0169723Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:06:47.0171143Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:06:47.0171772Z ^ 2025-05-07T20:06:47.0171984Z 2025-05-07T20:06:47.0172847Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:06:47.0174241Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:06:47.0174858Z ^ 2025-05-07T20:06:47.0175014Z 2025-05-07T20:06:47.0175259Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:06:47.0175615Z 2025-05-07T20:06:47.0176480Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:06:47.0186154Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:06:47.0186996Z ^ 2025-05-07T20:06:47.0187365Z 2025-05-07T20:06:47.0187915Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/sparse_ops/sparse_async_batched_cumsum.cu(16): warning #177-D: variable "kMaxThreads" was declared but never referenced 2025-05-07T20:06:47.0188711Z static constexpr uint32_t kMaxThreads = 1024; 2025-05-07T20:06:47.0189037Z ^ 2025-05-07T20:06:47.0189214Z 2025-05-07T20:06:47.0190149Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:06:47.0191665Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:06:47.0192298Z ^ 2025-05-07T20:06:47.0192441Z 2025-05-07T20:06:47.0192785Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:06:47.0193147Z 2025-05-07T20:06:47.0194007Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:06:47.0195421Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:06:47.0196071Z ^ 2025-05-07T20:06:47.0196272Z 2025-05-07T20:06:47.0196807Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/sparse_ops/sparse_async_batched_cumsum.cu(16): warning #177-D: variable "kMaxThreads" was declared but never referenced 2025-05-07T20:06:47.0197602Z static constexpr uint32_t kMaxThreads = 1024; 2025-05-07T20:06:47.0197924Z ^ 2025-05-07T20:06:47.0198104Z 2025-05-07T20:06:47.0198963Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:06:47.0200361Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:06:47.0200981Z ^ 2025-05-07T20:06:47.0201223Z 2025-05-07T20:06:47.0201466Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:06:47.0201819Z 2025-05-07T20:06:47.0202689Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:06:47.0204101Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:06:47.0204751Z ^ 2025-05-07T20:06:47.0204951Z 2025-05-07T20:06:47.0205484Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/sparse_ops/sparse_async_batched_cumsum.cu(16): warning #177-D: variable "kMaxThreads" was declared but never referenced 2025-05-07T20:06:47.0206273Z static constexpr uint32_t kMaxThreads = 1024; 2025-05-07T20:06:47.0206609Z ^ 2025-05-07T20:06:47.0206777Z 2025-05-07T20:06:47.0207638Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:06:47.0209033Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:06:47.0209651Z ^ 2025-05-07T20:06:47.0209805Z 2025-05-07T20:06:47.0210046Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:06:47.0210404Z 2025-05-07T20:06:47.0211319Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:06:47.0212706Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:06:47.0213380Z ^ 2025-05-07T20:06:47.0213580Z 2025-05-07T20:06:47.0214129Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/sparse_ops/sparse_async_batched_cumsum.cu(16): warning #177-D: variable "kMaxThreads" was declared but never referenced 2025-05-07T20:06:47.0214908Z static constexpr uint32_t kMaxThreads = 1024; 2025-05-07T20:06:47.0215250Z ^ 2025-05-07T20:06:47.0215415Z 2025-05-07T20:06:50.9018409Z [577/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_py_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_py.dir/src/sparse_ops/sparse_compute_frequency_sequence.cu.o -MF CMakeFiles/fbgemm_gpu_py.dir/src/sparse_ops/sparse_compute_frequency_sequence.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/sparse_ops/sparse_compute_frequency_sequence.cu -o CMakeFiles/fbgemm_gpu_py.dir/src/sparse_ops/sparse_compute_frequency_sequence.cu.o 2025-05-07T20:06:50.9030139Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:06:50.9031659Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:06:50.9032307Z ^ 2025-05-07T20:06:50.9032458Z 2025-05-07T20:06:50.9032709Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:06:50.9033088Z 2025-05-07T20:06:50.9033968Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:06:50.9035472Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:06:50.9036114Z ^ 2025-05-07T20:06:50.9036334Z 2025-05-07T20:06:50.9037255Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:06:50.9038663Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:06:50.9039288Z ^ 2025-05-07T20:06:50.9039451Z 2025-05-07T20:06:50.9039730Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:06:50.9040094Z 2025-05-07T20:06:50.9040978Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:06:50.9042378Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:06:50.9043031Z ^ 2025-05-07T20:06:50.9043231Z 2025-05-07T20:06:50.9044081Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:06:50.9045485Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:06:50.9046126Z ^ 2025-05-07T20:06:50.9046273Z 2025-05-07T20:06:50.9046519Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:06:50.9046889Z 2025-05-07T20:06:50.9047761Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:06:50.9049206Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:06:50.9049841Z ^ 2025-05-07T20:06:50.9050056Z 2025-05-07T20:06:50.9050919Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:06:50.9052323Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:06:50.9052948Z ^ 2025-05-07T20:06:50.9053094Z 2025-05-07T20:06:50.9053352Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:06:50.9053708Z 2025-05-07T20:06:50.9054578Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:06:50.9055988Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:06:50.9056635Z ^ 2025-05-07T20:06:50.9056834Z 2025-05-07T20:06:50.9057693Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:06:50.9059202Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:06:50.9059835Z ^ 2025-05-07T20:06:50.9059979Z 2025-05-07T20:06:50.9060250Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:06:50.9060607Z 2025-05-07T20:06:50.9061492Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:06:50.9062892Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:06:50.9063564Z ^ 2025-05-07T20:06:50.9063767Z 2025-05-07T20:06:52.6781580Z [578/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_py_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_py.dir/src/sparse_ops/sparse_expand_into_jagged_permute.cu.o -MF CMakeFiles/fbgemm_gpu_py.dir/src/sparse_ops/sparse_expand_into_jagged_permute.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/sparse_ops/sparse_expand_into_jagged_permute.cu -o CMakeFiles/fbgemm_gpu_py.dir/src/sparse_ops/sparse_expand_into_jagged_permute.cu.o 2025-05-07T20:06:52.6793488Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:06:52.6794916Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:06:52.6795545Z ^ 2025-05-07T20:06:52.6795707Z 2025-05-07T20:06:52.6795955Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:06:52.6796427Z 2025-05-07T20:06:52.6797308Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:06:52.6798766Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:06:52.6799413Z ^ 2025-05-07T20:06:52.6799616Z 2025-05-07T20:06:52.6800491Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:06:52.6801937Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:06:52.6802574Z ^ 2025-05-07T20:06:52.6802718Z 2025-05-07T20:06:52.6802962Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:06:52.6803334Z 2025-05-07T20:06:52.6804198Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:06:52.6805609Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:06:52.6806242Z ^ 2025-05-07T20:06:52.6806454Z 2025-05-07T20:06:52.6807311Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:06:52.6808704Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:06:52.6809321Z ^ 2025-05-07T20:06:52.6809475Z 2025-05-07T20:06:52.6809716Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:06:52.6810105Z 2025-05-07T20:06:52.6810991Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:06:52.6812381Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:06:52.6813024Z ^ 2025-05-07T20:06:52.6813222Z 2025-05-07T20:06:52.6814077Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:06:52.6815478Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:06:52.6816109Z ^ 2025-05-07T20:06:52.6816251Z 2025-05-07T20:06:52.6816492Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:06:52.6816859Z 2025-05-07T20:06:52.6817722Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:06:52.6819123Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:06:52.6819752Z ^ 2025-05-07T20:06:52.6819997Z 2025-05-07T20:06:52.6820850Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:06:52.6822244Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:06:52.6822893Z ^ 2025-05-07T20:06:52.6823035Z 2025-05-07T20:06:52.6823293Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:06:52.6823648Z 2025-05-07T20:06:52.6824509Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:06:52.6825943Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:06:52.6826593Z ^ 2025-05-07T20:06:52.6826787Z 2025-05-07T20:06:58.9596051Z [579/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_py_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_py.dir/src/sparse_ops/sparse_block_bucketize_features.cu.o -MF CMakeFiles/fbgemm_gpu_py.dir/src/sparse_ops/sparse_block_bucketize_features.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/sparse_ops/sparse_block_bucketize_features.cu -o CMakeFiles/fbgemm_gpu_py.dir/src/sparse_ops/sparse_block_bucketize_features.cu.o 2025-05-07T20:06:58.9608131Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:06:58.9609563Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:06:58.9610317Z ^ 2025-05-07T20:06:58.9610473Z 2025-05-07T20:06:58.9610752Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:06:58.9611123Z 2025-05-07T20:06:58.9612068Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:06:58.9613517Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:06:58.9614182Z ^ 2025-05-07T20:06:58.9614390Z 2025-05-07T20:06:58.9615323Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:06:58.9616755Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:06:58.9617421Z ^ 2025-05-07T20:06:58.9617580Z 2025-05-07T20:06:58.9617839Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:06:58.9618209Z 2025-05-07T20:06:58.9619108Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:06:58.9620526Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:06:58.9621197Z ^ 2025-05-07T20:06:58.9621403Z 2025-05-07T20:06:58.9622295Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:06:58.9623694Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:06:58.9624348Z ^ 2025-05-07T20:06:58.9624555Z 2025-05-07T20:06:58.9624826Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:06:58.9625191Z 2025-05-07T20:06:58.9626069Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:06:58.9627510Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:06:58.9628164Z ^ 2025-05-07T20:06:58.9628401Z 2025-05-07T20:06:58.9629264Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:06:58.9630682Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:06:58.9631313Z ^ 2025-05-07T20:06:58.9631592Z 2025-05-07T20:06:58.9631850Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:06:58.9632217Z 2025-05-07T20:06:58.9633111Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:06:58.9634526Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:06:58.9635240Z ^ 2025-05-07T20:06:58.9635445Z 2025-05-07T20:06:58.9636327Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:06:58.9637766Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:06:58.9638421Z ^ 2025-05-07T20:06:58.9638568Z 2025-05-07T20:06:58.9638818Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:06:58.9639198Z 2025-05-07T20:06:58.9640111Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:06:58.9641543Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:06:58.9642192Z ^ 2025-05-07T20:06:58.9642414Z 2025-05-07T20:06:59.9787448Z [580/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_py_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_py.dir/src/sparse_ops/sparse_group_index.cu.o -MF CMakeFiles/fbgemm_gpu_py.dir/src/sparse_ops/sparse_group_index.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/sparse_ops/sparse_group_index.cu -o CMakeFiles/fbgemm_gpu_py.dir/src/sparse_ops/sparse_group_index.cu.o 2025-05-07T20:06:59.9798916Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:06:59.9800379Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:06:59.9801125Z ^ 2025-05-07T20:06:59.9801281Z 2025-05-07T20:06:59.9801535Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:06:59.9801922Z 2025-05-07T20:06:59.9802856Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:06:59.9804296Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:06:59.9804945Z ^ 2025-05-07T20:06:59.9805173Z 2025-05-07T20:06:59.9806095Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:06:59.9807519Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:06:59.9808161Z ^ 2025-05-07T20:06:59.9808311Z 2025-05-07T20:06:59.9808589Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:06:59.9808957Z 2025-05-07T20:06:59.9809842Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:06:59.9811334Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:06:59.9812005Z ^ 2025-05-07T20:06:59.9812212Z 2025-05-07T20:06:59.9813074Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:06:59.9814502Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:06:59.9815199Z ^ 2025-05-07T20:06:59.9815346Z 2025-05-07T20:06:59.9815596Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:06:59.9815978Z 2025-05-07T20:06:59.9816848Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:06:59.9818281Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:06:59.9818933Z ^ 2025-05-07T20:06:59.9819156Z 2025-05-07T20:06:59.9820014Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:06:59.9821458Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:06:59.9822091Z ^ 2025-05-07T20:06:59.9822259Z 2025-05-07T20:06:59.9822507Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:06:59.9822869Z 2025-05-07T20:06:59.9823765Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:06:59.9825215Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:06:59.9825885Z ^ 2025-05-07T20:06:59.9826091Z 2025-05-07T20:06:59.9826992Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:06:59.9828417Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:06:59.9829081Z ^ 2025-05-07T20:06:59.9829229Z 2025-05-07T20:06:59.9829483Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:06:59.9829877Z 2025-05-07T20:06:59.9830786Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:06:59.9832295Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:06:59.9832947Z ^ 2025-05-07T20:06:59.9833181Z 2025-05-07T20:07:03.5246035Z [581/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_py_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_py.dir/src/sparse_ops/sparse_index_add.cu.o -MF CMakeFiles/fbgemm_gpu_py.dir/src/sparse_ops/sparse_index_add.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/sparse_ops/sparse_index_add.cu -o CMakeFiles/fbgemm_gpu_py.dir/src/sparse_ops/sparse_index_add.cu.o 2025-05-07T20:07:03.5257463Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:07:03.5258951Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:07:03.5259614Z ^ 2025-05-07T20:07:03.5259768Z 2025-05-07T20:07:03.5260027Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:07:03.5260420Z 2025-05-07T20:07:03.5261357Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:07:03.5262797Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:07:03.5263452Z ^ 2025-05-07T20:07:03.5263674Z 2025-05-07T20:07:03.5264603Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:07:03.5266221Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:07:03.5266868Z ^ 2025-05-07T20:07:03.5267040Z 2025-05-07T20:07:03.5267289Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:07:03.5267654Z 2025-05-07T20:07:03.5268552Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:07:03.5270003Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:07:03.5270665Z ^ 2025-05-07T20:07:03.5270872Z 2025-05-07T20:07:03.5271836Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:07:03.5273292Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:07:03.5273943Z ^ 2025-05-07T20:07:03.5274090Z 2025-05-07T20:07:03.5274338Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:07:03.5274728Z 2025-05-07T20:07:03.5275605Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:07:03.5277040Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:07:03.5277679Z ^ 2025-05-07T20:07:03.5277910Z 2025-05-07T20:07:03.5278777Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:07:03.5280193Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:07:03.5280823Z ^ 2025-05-07T20:07:03.5280990Z 2025-05-07T20:07:03.5281241Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:07:03.5281604Z 2025-05-07T20:07:03.5282499Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:07:03.5283964Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:07:03.5284641Z ^ 2025-05-07T20:07:03.5284849Z 2025-05-07T20:07:03.5285753Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:07:03.5287184Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:07:03.5287841Z ^ 2025-05-07T20:07:03.5287991Z 2025-05-07T20:07:03.5288287Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:07:03.5288681Z 2025-05-07T20:07:03.5289556Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:07:03.5290989Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:07:03.5291637Z ^ 2025-05-07T20:07:03.5291862Z 2025-05-07T20:07:04.6468998Z [582/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_py_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_py.dir/src/sparse_ops/sparse_invert_permute.cu.o -MF CMakeFiles/fbgemm_gpu_py.dir/src/sparse_ops/sparse_invert_permute.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/sparse_ops/sparse_invert_permute.cu -o CMakeFiles/fbgemm_gpu_py.dir/src/sparse_ops/sparse_invert_permute.cu.o 2025-05-07T20:07:04.6480542Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:07:04.6482065Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:07:04.6482712Z ^ 2025-05-07T20:07:04.6482893Z 2025-05-07T20:07:04.6483215Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:07:04.6483585Z 2025-05-07T20:07:04.6484498Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:07:04.6487297Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:07:04.6488007Z ^ 2025-05-07T20:07:04.6488218Z 2025-05-07T20:07:04.6489111Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:07:04.6490511Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:07:04.6491166Z ^ 2025-05-07T20:07:04.6491315Z 2025-05-07T20:07:04.6491566Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:07:04.6491943Z 2025-05-07T20:07:04.6492822Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:07:04.6494253Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:07:04.6494901Z ^ 2025-05-07T20:07:04.6495130Z 2025-05-07T20:07:04.6495999Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:07:04.6497465Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:07:04.6498097Z ^ 2025-05-07T20:07:04.6498266Z 2025-05-07T20:07:04.6498514Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:07:04.6498877Z 2025-05-07T20:07:04.6499781Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:07:04.6501196Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:07:04.6501863Z ^ 2025-05-07T20:07:04.6502072Z 2025-05-07T20:07:04.6502944Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:07:04.6504360Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:07:04.6505016Z ^ 2025-05-07T20:07:04.6505164Z 2025-05-07T20:07:04.6505414Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:07:04.6505804Z 2025-05-07T20:07:04.6506716Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:07:04.6508138Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:07:04.6508821Z ^ 2025-05-07T20:07:04.6509056Z 2025-05-07T20:07:04.6509919Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:07:04.6511337Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:07:04.6512082Z ^ 2025-05-07T20:07:04.6512236Z 2025-05-07T20:07:04.6512510Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:07:04.6512871Z 2025-05-07T20:07:04.6513746Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:07:04.6515183Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:07:04.6515852Z ^ 2025-05-07T20:07:04.6516055Z 2025-05-07T20:07:05.4502448Z [583/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_py_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_py.dir/src/sparse_ops/sparse_index_select.cu.o -MF CMakeFiles/fbgemm_gpu_py.dir/src/sparse_ops/sparse_index_select.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/sparse_ops/sparse_index_select.cu -o CMakeFiles/fbgemm_gpu_py.dir/src/sparse_ops/sparse_index_select.cu.o 2025-05-07T20:07:05.4514252Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:07:05.4515654Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:07:05.4516372Z ^ 2025-05-07T20:07:05.4516520Z 2025-05-07T20:07:05.4516766Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:07:05.4517134Z 2025-05-07T20:07:05.4517999Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:07:05.4519471Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:07:05.4520102Z ^ 2025-05-07T20:07:05.4520312Z 2025-05-07T20:07:05.4521169Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:07:05.4522571Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:07:05.4523187Z ^ 2025-05-07T20:07:05.4523337Z 2025-05-07T20:07:05.4523577Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:07:05.4523933Z 2025-05-07T20:07:05.4524796Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:07:05.4526207Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:07:05.4526847Z ^ 2025-05-07T20:07:05.4527044Z 2025-05-07T20:07:05.4527901Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:07:05.4529338Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:07:05.4529966Z ^ 2025-05-07T20:07:05.4530106Z 2025-05-07T20:07:05.4530350Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:07:05.4530720Z 2025-05-07T20:07:05.4531591Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:07:05.4533004Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:07:05.4533630Z ^ 2025-05-07T20:07:05.4533828Z 2025-05-07T20:07:05.4534692Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:07:05.4536065Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:07:05.4536693Z ^ 2025-05-07T20:07:05.4536836Z 2025-05-07T20:07:05.4537093Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:07:05.4537479Z 2025-05-07T20:07:05.4538348Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:07:05.4539806Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:07:05.4540453Z ^ 2025-05-07T20:07:05.4540650Z 2025-05-07T20:07:05.4541503Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:07:05.4542929Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:07:05.4543550Z ^ 2025-05-07T20:07:05.4543703Z 2025-05-07T20:07:05.4543943Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:07:05.4544296Z 2025-05-07T20:07:05.4545172Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:07:05.4546561Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:07:05.4547193Z ^ 2025-05-07T20:07:05.4547388Z 2025-05-07T20:07:09.3614069Z [584/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_py_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_py.dir/src/sparse_ops/sparse_pack_segments_backward.cu.o -MF CMakeFiles/fbgemm_gpu_py.dir/src/sparse_ops/sparse_pack_segments_backward.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/sparse_ops/sparse_pack_segments_backward.cu -o CMakeFiles/fbgemm_gpu_py.dir/src/sparse_ops/sparse_pack_segments_backward.cu.o 2025-05-07T20:07:09.3625848Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:07:09.3627331Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:07:09.3627968Z ^ 2025-05-07T20:07:09.3628113Z 2025-05-07T20:07:09.3628392Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:07:09.3628764Z 2025-05-07T20:07:09.3629693Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:07:09.3631113Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:07:09.3631837Z ^ 2025-05-07T20:07:09.3632054Z 2025-05-07T20:07:09.3632917Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:07:09.3634320Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:07:09.3634940Z ^ 2025-05-07T20:07:09.3635080Z 2025-05-07T20:07:09.3635332Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:07:09.3635686Z 2025-05-07T20:07:09.3636553Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:07:09.3637951Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:07:09.3638664Z ^ 2025-05-07T20:07:09.3638863Z 2025-05-07T20:07:09.3639720Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:07:09.3641126Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:07:09.3641751Z ^ 2025-05-07T20:07:09.3641892Z 2025-05-07T20:07:09.3642132Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:07:09.3642492Z 2025-05-07T20:07:09.3643372Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:07:09.3644767Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:07:09.3645403Z ^ 2025-05-07T20:07:09.3645598Z 2025-05-07T20:07:09.3646462Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:07:09.3647851Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:07:09.3648518Z ^ 2025-05-07T20:07:09.3648656Z 2025-05-07T20:07:09.3648906Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:07:09.3649259Z 2025-05-07T20:07:09.3650158Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:07:09.3651565Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:07:09.3652195Z ^ 2025-05-07T20:07:09.3652389Z 2025-05-07T20:07:09.3653280Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:07:09.3654676Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:07:09.3655289Z ^ 2025-05-07T20:07:09.3655439Z 2025-05-07T20:07:09.3655676Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:07:09.3656030Z 2025-05-07T20:07:09.3656906Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:07:09.3658303Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:07:09.3658942Z ^ 2025-05-07T20:07:09.3659136Z 2025-05-07T20:07:09.5083077Z [585/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_py_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_py.dir/src/sparse_ops/sparse_permute102.cu.o -MF CMakeFiles/fbgemm_gpu_py.dir/src/sparse_ops/sparse_permute102.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/sparse_ops/sparse_permute102.cu -o CMakeFiles/fbgemm_gpu_py.dir/src/sparse_ops/sparse_permute102.cu.o 2025-05-07T20:07:09.5094733Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:07:09.5096156Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:07:09.5096779Z ^ 2025-05-07T20:07:09.5096936Z 2025-05-07T20:07:09.5097179Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:07:09.5097532Z 2025-05-07T20:07:09.5098470Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:07:09.5099868Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:07:09.5100514Z ^ 2025-05-07T20:07:09.5100715Z 2025-05-07T20:07:09.5101586Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:07:09.5102974Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:07:09.5103603Z ^ 2025-05-07T20:07:09.5103742Z 2025-05-07T20:07:09.5103983Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:07:09.5104349Z 2025-05-07T20:07:09.5105211Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:07:09.5106640Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:07:09.5107320Z ^ 2025-05-07T20:07:09.5107516Z 2025-05-07T20:07:09.5108373Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:07:09.5109777Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:07:09.5110415Z ^ 2025-05-07T20:07:09.5110556Z 2025-05-07T20:07:09.5110797Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:07:09.5111162Z 2025-05-07T20:07:09.5112107Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:07:09.5113534Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:07:09.5114161Z ^ 2025-05-07T20:07:09.5114373Z 2025-05-07T20:07:09.5115232Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:07:09.5116622Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:07:09.5117281Z ^ 2025-05-07T20:07:09.5117422Z 2025-05-07T20:07:09.5117679Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:07:09.5118032Z 2025-05-07T20:07:09.5118926Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:07:09.5120334Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:07:09.5120973Z ^ 2025-05-07T20:07:09.5121170Z 2025-05-07T20:07:09.5122054Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:07:09.5123462Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:07:09.5124090Z ^ 2025-05-07T20:07:09.5124234Z 2025-05-07T20:07:09.5124476Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:07:09.5124827Z 2025-05-07T20:07:09.5125702Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:07:09.5127101Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:07:09.5127747Z ^ 2025-05-07T20:07:09.5127944Z 2025-05-07T20:07:11.7443285Z [586/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_py_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_py.dir/src/sparse_ops/sparse_permute_embeddings.cu.o -MF CMakeFiles/fbgemm_gpu_py.dir/src/sparse_ops/sparse_permute_embeddings.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/sparse_ops/sparse_permute_embeddings.cu -o CMakeFiles/fbgemm_gpu_py.dir/src/sparse_ops/sparse_permute_embeddings.cu.o 2025-05-07T20:07:11.7455122Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:07:11.7456527Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:07:11.7457167Z ^ 2025-05-07T20:07:11.7457315Z 2025-05-07T20:07:11.7457676Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:07:11.7458046Z 2025-05-07T20:07:11.7458919Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:07:11.7460332Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:07:11.7460974Z ^ 2025-05-07T20:07:11.7461190Z 2025-05-07T20:07:11.7462049Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:07:11.7463446Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:07:11.7464070Z ^ 2025-05-07T20:07:11.7464225Z 2025-05-07T20:07:11.7464470Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:07:11.7465044Z 2025-05-07T20:07:11.7465923Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:07:11.7467395Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:07:11.7468046Z ^ 2025-05-07T20:07:11.7468248Z 2025-05-07T20:07:11.7469118Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:07:11.7470510Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:07:11.7471156Z ^ 2025-05-07T20:07:11.7471304Z 2025-05-07T20:07:11.7471611Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:07:11.7471983Z 2025-05-07T20:07:11.7472858Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:07:11.7474277Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:07:11.7474914Z ^ 2025-05-07T20:07:11.7475129Z 2025-05-07T20:07:11.7475992Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:07:11.7477442Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:07:11.7478064Z ^ 2025-05-07T20:07:11.7478223Z 2025-05-07T20:07:11.7478469Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:07:11.7478867Z 2025-05-07T20:07:11.7479737Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:07:11.7481147Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:07:11.7481834Z ^ 2025-05-07T20:07:11.7482037Z 2025-05-07T20:07:11.7482895Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:07:11.7484302Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:07:11.7484940Z ^ 2025-05-07T20:07:11.7485081Z 2025-05-07T20:07:11.7485321Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:07:11.7485685Z 2025-05-07T20:07:11.7486552Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:07:11.7487961Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:07:11.7488592Z ^ 2025-05-07T20:07:11.7488790Z 2025-05-07T20:07:11.8369813Z [587/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_py_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_py.dir/src/sparse_ops/sparse_range.cu.o -MF CMakeFiles/fbgemm_gpu_py.dir/src/sparse_ops/sparse_range.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/sparse_ops/sparse_range.cu -o CMakeFiles/fbgemm_gpu_py.dir/src/sparse_ops/sparse_range.cu.o 2025-05-07T20:07:11.8381132Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:07:11.8382547Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:07:11.8383231Z ^ 2025-05-07T20:07:11.8383390Z 2025-05-07T20:07:11.8383638Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:07:11.8383994Z 2025-05-07T20:07:11.8384877Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:07:11.8386273Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:07:11.8386911Z ^ 2025-05-07T20:07:11.8387105Z 2025-05-07T20:07:11.8387966Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:07:11.8389353Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:07:11.8389984Z ^ 2025-05-07T20:07:11.8390125Z 2025-05-07T20:07:11.8390365Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:07:11.8390730Z 2025-05-07T20:07:11.8391673Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:07:11.8393127Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:07:11.8393758Z ^ 2025-05-07T20:07:11.8393970Z 2025-05-07T20:07:11.8394828Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:07:11.8396216Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:07:11.8396833Z ^ 2025-05-07T20:07:11.8396984Z 2025-05-07T20:07:11.8397224Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:07:11.8397578Z 2025-05-07T20:07:11.8398441Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:07:11.8399844Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:07:11.8400480Z ^ 2025-05-07T20:07:11.8400676Z 2025-05-07T20:07:11.8401532Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:07:11.8402955Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:07:11.8403582Z ^ 2025-05-07T20:07:11.8403722Z 2025-05-07T20:07:11.8403992Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:07:11.8404360Z 2025-05-07T20:07:11.8405220Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:07:11.8406653Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:07:11.8407285Z ^ 2025-05-07T20:07:11.8407482Z 2025-05-07T20:07:11.8408354Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:07:11.8409736Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:07:11.8410373Z ^ 2025-05-07T20:07:11.8410526Z 2025-05-07T20:07:11.8410791Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:07:11.8411150Z 2025-05-07T20:07:11.8412025Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:07:11.8413451Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:07:11.8414119Z ^ 2025-05-07T20:07:11.8414321Z 2025-05-07T20:07:13.7325836Z [588/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_py_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_py.dir/src/sparse_ops/sparse_permute_1d.cu.o -MF CMakeFiles/fbgemm_gpu_py.dir/src/sparse_ops/sparse_permute_1d.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/sparse_ops/sparse_permute_1d.cu -o CMakeFiles/fbgemm_gpu_py.dir/src/sparse_ops/sparse_permute_1d.cu.o 2025-05-07T20:07:13.7337389Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:07:13.7338845Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:07:13.7339484Z ^ 2025-05-07T20:07:13.7339628Z 2025-05-07T20:07:13.7339887Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:07:13.7340244Z 2025-05-07T20:07:13.7341118Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:07:13.7342521Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:07:13.7343160Z ^ 2025-05-07T20:07:13.7343355Z 2025-05-07T20:07:13.7344212Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:07:13.7345601Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:07:13.7346218Z ^ 2025-05-07T20:07:13.7346382Z 2025-05-07T20:07:13.7346633Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:07:13.7348244Z 2025-05-07T20:07:13.7349150Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:07:13.7350564Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:07:13.7351235Z ^ 2025-05-07T20:07:13.7351442Z 2025-05-07T20:07:13.7352450Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:07:13.7353860Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:07:13.7354520Z ^ 2025-05-07T20:07:13.7354674Z 2025-05-07T20:07:13.7354928Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:07:13.7355312Z 2025-05-07T20:07:13.7356186Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:07:13.7357615Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:07:13.7358265Z ^ 2025-05-07T20:07:13.7358546Z 2025-05-07T20:07:13.7359413Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:07:13.7360841Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:07:13.7361518Z ^ 2025-05-07T20:07:13.7361695Z 2025-05-07T20:07:13.7361945Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:07:13.7362305Z 2025-05-07T20:07:13.7363197Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:07:13.7364642Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:07:13.7365483Z ^ 2025-05-07T20:07:13.7365691Z 2025-05-07T20:07:13.7366555Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:07:13.7367965Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:07:13.7368596Z ^ 2025-05-07T20:07:13.7368736Z 2025-05-07T20:07:13.7368975Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:07:13.7369338Z 2025-05-07T20:07:13.7370205Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:07:13.7371601Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:07:13.7372224Z ^ 2025-05-07T20:07:13.7372430Z 2025-05-07T20:07:13.9222013Z [589/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_py_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_py.dir/src/sparse_ops/sparse_pack_segments_forward.cu.o -MF CMakeFiles/fbgemm_gpu_py.dir/src/sparse_ops/sparse_pack_segments_forward.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/sparse_ops/sparse_pack_segments_forward.cu -o CMakeFiles/fbgemm_gpu_py.dir/src/sparse_ops/sparse_pack_segments_forward.cu.o 2025-05-07T20:07:13.9233761Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:07:13.9235185Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:07:13.9235816Z ^ 2025-05-07T20:07:13.9235961Z 2025-05-07T20:07:13.9236221Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:07:13.9236577Z 2025-05-07T20:07:13.9237447Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:07:13.9238851Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:07:13.9239494Z ^ 2025-05-07T20:07:13.9239690Z 2025-05-07T20:07:13.9240544Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:07:13.9241932Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:07:13.9242559Z ^ 2025-05-07T20:07:13.9242733Z 2025-05-07T20:07:13.9242973Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:07:13.9243340Z 2025-05-07T20:07:13.9244199Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:07:13.9245599Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:07:13.9246221Z ^ 2025-05-07T20:07:13.9246420Z 2025-05-07T20:07:13.9247289Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:07:13.9248669Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:07:13.9249301Z ^ 2025-05-07T20:07:13.9249440Z 2025-05-07T20:07:13.9249691Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:07:13.9250046Z 2025-05-07T20:07:13.9250910Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:07:13.9252314Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:07:13.9252989Z ^ 2025-05-07T20:07:13.9253188Z 2025-05-07T20:07:13.9254042Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:07:13.9255478Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:07:13.9256092Z ^ 2025-05-07T20:07:13.9256246Z 2025-05-07T20:07:13.9256487Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:07:13.9256840Z 2025-05-07T20:07:13.9257746Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:07:13.9259136Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:07:13.9259775Z ^ 2025-05-07T20:07:13.9259970Z 2025-05-07T20:07:13.9260835Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:07:13.9262212Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:07:13.9262832Z ^ 2025-05-07T20:07:13.9262971Z 2025-05-07T20:07:13.9263210Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:07:13.9263572Z 2025-05-07T20:07:13.9264428Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:07:13.9266079Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:07:13.9266783Z ^ 2025-05-07T20:07:13.9266998Z 2025-05-07T20:07:14.6189858Z [590/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_py_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_py.dir/src/intraining_embedding_pruning_ops/intraining_embedding_pruning.cu.o -MF CMakeFiles/fbgemm_gpu_py.dir/src/intraining_embedding_pruning_ops/intraining_embedding_pruning.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/intraining_embedding_pruning_ops/intraining_embedding_pruning.cu -o CMakeFiles/fbgemm_gpu_py.dir/src/intraining_embedding_pruning_ops/intraining_embedding_pruning.cu.o 2025-05-07T20:07:14.7082698Z [591/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_py_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_py.dir/src/sparse_ops/sparse_zipf.cu.o -MF CMakeFiles/fbgemm_gpu_py.dir/src/sparse_ops/sparse_zipf.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/sparse_ops/sparse_zipf.cu -o CMakeFiles/fbgemm_gpu_py.dir/src/sparse_ops/sparse_zipf.cu.o 2025-05-07T20:07:14.7094040Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:07:14.7095461Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:07:14.7096089Z ^ 2025-05-07T20:07:14.7096249Z 2025-05-07T20:07:14.7096495Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:07:14.7096850Z 2025-05-07T20:07:14.7097736Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:07:14.7099215Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:07:14.7099868Z ^ 2025-05-07T20:07:14.7100070Z 2025-05-07T20:07:14.7100998Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:07:14.7102384Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:07:14.7103017Z ^ 2025-05-07T20:07:14.7103160Z 2025-05-07T20:07:14.7103461Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:07:14.7103835Z 2025-05-07T20:07:14.7104705Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:07:14.7106111Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:07:14.7106740Z ^ 2025-05-07T20:07:14.7106952Z 2025-05-07T20:07:14.7107805Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:07:14.7109213Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:07:14.7109837Z ^ 2025-05-07T20:07:14.7109992Z 2025-05-07T20:07:14.7110231Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:07:14.7110582Z 2025-05-07T20:07:14.7111449Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:07:14.7112973Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:07:14.7113619Z ^ 2025-05-07T20:07:14.7113819Z 2025-05-07T20:07:14.7114674Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:07:14.7116062Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:07:14.7116690Z ^ 2025-05-07T20:07:14.7116828Z 2025-05-07T20:07:14.7117069Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:07:14.7117433Z 2025-05-07T20:07:14.7118296Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:07:14.7119697Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:07:14.7120317Z ^ 2025-05-07T20:07:14.7120512Z 2025-05-07T20:07:14.7121382Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:07:14.7122826Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:07:14.7123464Z ^ 2025-05-07T20:07:14.7123611Z 2025-05-07T20:07:14.7123909Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:07:14.7124276Z 2025-05-07T20:07:14.7125146Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:07:14.7126569Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:07:14.7127267Z ^ 2025-05-07T20:07:14.7127475Z 2025-05-07T20:07:15.2946843Z [592/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_py_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_py.dir/src/sparse_ops/sparse_permute_2d.cu.o -MF CMakeFiles/fbgemm_gpu_py.dir/src/sparse_ops/sparse_permute_2d.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/sparse_ops/sparse_permute_2d.cu -o CMakeFiles/fbgemm_gpu_py.dir/src/sparse_ops/sparse_permute_2d.cu.o 2025-05-07T20:07:15.2958384Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:07:15.2959785Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:07:15.2960427Z ^ 2025-05-07T20:07:15.2960573Z 2025-05-07T20:07:15.2960826Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:07:15.2961295Z 2025-05-07T20:07:15.2962161Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:07:15.2963563Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:07:15.2964256Z ^ 2025-05-07T20:07:15.2964469Z 2025-05-07T20:07:15.2965483Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:07:15.2966995Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:07:15.2967616Z ^ 2025-05-07T20:07:15.2967777Z 2025-05-07T20:07:15.2968027Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:07:15.2968389Z 2025-05-07T20:07:15.2969265Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:07:15.2970666Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:07:15.2971308Z ^ 2025-05-07T20:07:15.2971505Z 2025-05-07T20:07:15.2972356Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:07:15.2973750Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:07:15.2974380Z ^ 2025-05-07T20:07:15.2974521Z 2025-05-07T20:07:15.2974761Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:07:15.2975124Z 2025-05-07T20:07:15.2975989Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:07:15.2977523Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:07:15.2978133Z ^ 2025-05-07T20:07:15.2978333Z 2025-05-07T20:07:15.2979171Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:07:15.2980526Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:07:15.2981126Z ^ 2025-05-07T20:07:15.2981262Z 2025-05-07T20:07:15.2981513Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:07:15.2981859Z 2025-05-07T20:07:15.2982700Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:07:15.2984061Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:07:15.2984684Z ^ 2025-05-07T20:07:15.2984875Z 2025-05-07T20:07:15.2985749Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:07:15.2987103Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:07:15.2987712Z ^ 2025-05-07T20:07:15.2987884Z 2025-05-07T20:07:15.2988120Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:07:15.2988462Z 2025-05-07T20:07:15.2989314Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:07:15.2990704Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:07:15.2991329Z ^ 2025-05-07T20:07:15.2991589Z 2025-05-07T20:07:15.3051855Z [593/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_py_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_py.dir/src/sparse_ops/sparse_segment_sum_csr.cu.o -MF CMakeFiles/fbgemm_gpu_py.dir/src/sparse_ops/sparse_segment_sum_csr.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/sparse_ops/sparse_segment_sum_csr.cu -o CMakeFiles/fbgemm_gpu_py.dir/src/sparse_ops/sparse_segment_sum_csr.cu.o 2025-05-07T20:07:15.3062986Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:07:15.3064361Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:07:15.3065129Z ^ 2025-05-07T20:07:15.3065543Z 2025-05-07T20:07:15.3065813Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:07:15.3066172Z 2025-05-07T20:07:15.3067038Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:07:15.3068496Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:07:15.3069143Z ^ 2025-05-07T20:07:15.3069341Z 2025-05-07T20:07:15.3070194Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:07:15.3071703Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:07:15.3072340Z ^ 2025-05-07T20:07:15.3072484Z 2025-05-07T20:07:15.3072723Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:07:15.3073092Z 2025-05-07T20:07:15.3073957Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:07:15.3075375Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:07:15.3076010Z ^ 2025-05-07T20:07:15.3076205Z 2025-05-07T20:07:15.3077074Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:07:15.3078446Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:07:15.3079070Z ^ 2025-05-07T20:07:15.3079207Z 2025-05-07T20:07:15.3079456Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:07:15.3079845Z 2025-05-07T20:07:15.3080705Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:07:15.3082105Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:07:15.3082742Z ^ 2025-05-07T20:07:15.3082938Z 2025-05-07T20:07:15.3083793Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:07:15.3085256Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:07:15.3085856Z ^ 2025-05-07T20:07:15.3086004Z 2025-05-07T20:07:15.3086240Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:07:15.3086582Z 2025-05-07T20:07:15.3087433Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:07:15.3088788Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:07:15.3089438Z ^ 2025-05-07T20:07:15.3089629Z 2025-05-07T20:07:15.3090474Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:07:15.3091845Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:07:15.3092458Z ^ 2025-05-07T20:07:15.3092592Z 2025-05-07T20:07:15.3092837Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:07:15.3093179Z 2025-05-07T20:07:15.3094045Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:07:15.3095416Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:07:15.3096027Z ^ 2025-05-07T20:07:15.3096229Z 2025-05-07T20:07:17.7953001Z [594/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_py_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_py.dir/src/sparse_ops/sparse_reorder_batched_ad.cu.o -MF CMakeFiles/fbgemm_gpu_py.dir/src/sparse_ops/sparse_reorder_batched_ad.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/sparse_ops/sparse_reorder_batched_ad.cu -o CMakeFiles/fbgemm_gpu_py.dir/src/sparse_ops/sparse_reorder_batched_ad.cu.o 2025-05-07T20:07:17.7964666Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:07:17.7966426Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:07:17.7967147Z ^ 2025-05-07T20:07:17.7967293Z 2025-05-07T20:07:17.7967539Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:07:17.7967910Z 2025-05-07T20:07:17.7968880Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:07:17.7970302Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:07:17.7970933Z ^ 2025-05-07T20:07:17.7971143Z 2025-05-07T20:07:17.7972053Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:07:17.7973452Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:07:17.7974067Z ^ 2025-05-07T20:07:17.7974206Z 2025-05-07T20:07:17.7974457Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:07:17.7974812Z 2025-05-07T20:07:17.7975679Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:07:17.7977084Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:07:17.7977820Z ^ 2025-05-07T20:07:17.7978012Z 2025-05-07T20:07:17.7978842Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:07:17.7980195Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:07:17.7980857Z ^ 2025-05-07T20:07:17.7980993Z 2025-05-07T20:07:17.7981229Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:07:17.7981588Z 2025-05-07T20:07:17.7982424Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:07:17.7983798Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:07:17.7984409Z ^ 2025-05-07T20:07:17.7984603Z 2025-05-07T20:07:17.7985442Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:07:17.7986781Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:07:17.7987391Z ^ 2025-05-07T20:07:17.7987526Z 2025-05-07T20:07:17.7987774Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:07:17.7988118Z 2025-05-07T20:07:17.7988955Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:07:17.7990353Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:07:17.7990969Z ^ 2025-05-07T20:07:17.7991158Z 2025-05-07T20:07:17.7992292Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:07:17.7993691Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:07:17.7994308Z ^ 2025-05-07T20:07:17.7994464Z 2025-05-07T20:07:17.7994706Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:07:17.7995060Z 2025-05-07T20:07:17.7995984Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:07:17.7997369Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:07:17.7998004Z ^ 2025-05-07T20:07:17.7998202Z 2025-05-07T20:07:18.5070971Z [595/608] : && /github/home/miniconda/envs/build_binary/bin/c++ -fPIC -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -L/github/home/miniconda/envs/build_binary/targets/x86_64-linux/lib -L/github/home/miniconda/envs/build_binary/targets/x86_64-linux/lib/stubs -s -shared -Wl,-soname,fbgemm_gpu_py.so -o fbgemm_gpu_py.so CMakeFiles/fbgemm_gpu_py.dir/src/memory_utils/memory_utils.cpp.o CMakeFiles/fbgemm_gpu_py.dir/src/memory_utils/memory_utils_ops.cpp.o CMakeFiles/fbgemm_gpu_py.dir/src/merge_pooled_embedding_ops/merge_pooled_embedding_ops_cpu.cpp.o CMakeFiles/fbgemm_gpu_py.dir/src/permute_multi_embedding_ops/permute_multi_embedding_function.cpp.o CMakeFiles/fbgemm_gpu_py.dir/src/permute_multi_embedding_ops/permute_multi_embedding_ops_cpu.cpp.o CMakeFiles/fbgemm_gpu_py.dir/src/permute_pooled_embedding_ops/permute_pooled_embedding_function.cpp.o CMakeFiles/fbgemm_gpu_py.dir/src/permute_pooled_embedding_ops/permute_pooled_embedding_ops_cpu.cpp.o CMakeFiles/fbgemm_gpu_py.dir/src/permute_pooled_embedding_ops/permute_pooled_embedding_ops_split_cpu.cpp.o CMakeFiles/fbgemm_gpu_py.dir/src/jagged_tensor_ops/jagged_tensor_ops_autograd.cpp.o CMakeFiles/fbgemm_gpu_py.dir/src/jagged_tensor_ops/jagged_tensor_ops_meta.cpp.o CMakeFiles/fbgemm_gpu_py.dir/src/jagged_tensor_ops/jagged_tensor_ops_cpu.cpp.o CMakeFiles/fbgemm_gpu_py.dir/src/input_combine_ops/input_combine_cpu.cpp.o CMakeFiles/fbgemm_gpu_py.dir/src/layout_transform_ops/layout_transform_ops_cpu.cpp.o CMakeFiles/fbgemm_gpu_py.dir/src/quantize_ops/quantize_ops_cpu.cpp.o CMakeFiles/fbgemm_gpu_py.dir/src/quantize_ops/quantize_ops_meta.cpp.o CMakeFiles/fbgemm_gpu_py.dir/src/sparse_ops/sparse_async_batched_cumsum.cpp.o CMakeFiles/fbgemm_gpu_py.dir/src/sparse_ops/sparse_ops_cpu.cpp.o CMakeFiles/fbgemm_gpu_py.dir/src/sparse_ops/sparse_ops_meta.cpp.o CMakeFiles/fbgemm_gpu_py.dir/src/tbe/eeg/eeg_models.cpp.o CMakeFiles/fbgemm_gpu_py.dir/src/tbe/eeg/eeg_utils.cpp.o CMakeFiles/fbgemm_gpu_py.dir/src/tbe/eeg/indices_estimator_ops.cpp.o CMakeFiles/fbgemm_gpu_py.dir/src/tbe/eeg/indices_estimator.cpp.o CMakeFiles/fbgemm_gpu_py.dir/src/tbe/eeg/indices_generator_ops.cpp.o CMakeFiles/fbgemm_gpu_py.dir/src/tbe/eeg/indices_generator.cpp.o CMakeFiles/fbgemm_gpu_py.dir/src/intraining_embedding_pruning_ops/intraining_embedding_pruning_gpu.cpp.o CMakeFiles/fbgemm_gpu_py.dir/src/layout_transform_ops/layout_transform_ops_gpu.cpp.o CMakeFiles/fbgemm_gpu_py.dir/src/permute_pooled_embedding_ops/permute_pooled_embedding_ops_gpu.cpp.o CMakeFiles/fbgemm_gpu_py.dir/src/permute_pooled_embedding_ops/permute_pooled_embedding_ops_split_gpu.cpp.o CMakeFiles/fbgemm_gpu_py.dir/src/quantize_ops/quantize_ops_gpu.cpp.o CMakeFiles/fbgemm_gpu_py.dir/src/sparse_ops/sparse_ops_gpu.cpp.o CMakeFiles/fbgemm_gpu_py.dir/src/metric_ops/metric_ops_host.cpp.o CMakeFiles/fbgemm_gpu_py.dir/src/input_combine_ops/input_combine_gpu.cpp.o CMakeFiles/fbgemm_gpu_py.dir/src/merge_pooled_embedding_ops/merge_pooled_embedding_ops_gpu.cpp.o CMakeFiles/fbgemm_gpu_py.dir/src/topology_utils.cpp.o CMakeFiles/fbgemm_gpu_py.dir/src/histogram_binning_calibration_ops.cu.o CMakeFiles/fbgemm_gpu_py.dir/src/input_combine_ops/input_combine.cu.o CMakeFiles/fbgemm_gpu_py.dir/src/intraining_embedding_pruning_ops/intraining_embedding_pruning.cu.o CMakeFiles/fbgemm_gpu_py.dir/src/memory_utils/memory_utils.cu.o CMakeFiles/fbgemm_gpu_py.dir/src/memory_utils/memory_utils_ops.cu.o CMakeFiles/fbgemm_gpu_py.dir/src/jagged_tensor_ops/batched_dense_vec_jagged_2d_mul_backward.cu.o CMakeFiles/fbgemm_gpu_py.dir/src/jagged_tensor_ops/batched_dense_vec_jagged_2d_mul_forward.cu.o CMakeFiles/fbgemm_gpu_py.dir/src/jagged_tensor_ops/dense_to_jagged_forward.cu.o CMakeFiles/fbgemm_gpu_py.dir/src/jagged_tensor_ops/jagged_dense_bmm_forward.cu.o CMakeFiles/fbgemm_gpu_py.dir/src/jagged_tensor_ops/jagged_dense_dense_elementwise_add_jagged_output_forward.cu.o CMakeFiles/fbgemm_gpu_py.dir/src/jagged_tensor_ops/jagged_dense_elementwise_mul_backward.cu.o CMakeFiles/fbgemm_gpu_py.dir/src/jagged_tensor_ops/jagged_dense_elementwise_mul_forward.cu.o CMakeFiles/fbgemm_gpu_py.dir/src/jagged_tensor_ops/jagged_index_add_2d_forward.cu.o CMakeFiles/fbgemm_gpu_py.dir/src/jagged_tensor_ops/jagged_index_select_2d_forward.cu.o CMakeFiles/fbgemm_gpu_py.dir/src/jagged_tensor_ops/jagged_jagged_bmm_forward.cu.o CMakeFiles/fbgemm_gpu_py.dir/src/jagged_tensor_ops/jagged_softmax_backward.cu.o CMakeFiles/fbgemm_gpu_py.dir/src/jagged_tensor_ops/jagged_softmax_forward.cu.o CMakeFiles/fbgemm_gpu_py.dir/src/jagged_tensor_ops/jagged_tensor_ops.cu.o CMakeFiles/fbgemm_gpu_py.dir/src/jagged_tensor_ops/jagged_to_padded_dense_backward.cu.o CMakeFiles/fbgemm_gpu_py.dir/src/jagged_tensor_ops/jagged_to_padded_dense_forward.cu.o CMakeFiles/fbgemm_gpu_py.dir/src/jagged_tensor_ops/jagged_unique_indices.cu.o CMakeFiles/fbgemm_gpu_py.dir/src/jagged_tensor_ops/keyed_jagged_index_select_dim1.cu.o CMakeFiles/fbgemm_gpu_py.dir/src/layout_transform_ops/layout_transform_ops.cu.o CMakeFiles/fbgemm_gpu_py.dir/src/metric_ops/metric_ops.cu.o CMakeFiles/fbgemm_gpu_py.dir/src/permute_pooled_embedding_ops/permute_pooled_embedding_ops_split.cu.o CMakeFiles/fbgemm_gpu_py.dir/src/permute_pooled_embedding_ops/permute_pooled_embedding_ops.cu.o CMakeFiles/fbgemm_gpu_py.dir/src/permute_multi_embedding_ops/permute_multi_embedding_ops.cu.o CMakeFiles/fbgemm_gpu_py.dir/src/quantize_ops/quantize_bfloat16.cu.o CMakeFiles/fbgemm_gpu_py.dir/src/quantize_ops/quantize_fp8_rowwise.cu.o CMakeFiles/fbgemm_gpu_py.dir/src/quantize_ops/quantize_fused_8bit_rowwise.cu.o CMakeFiles/fbgemm_gpu_py.dir/src/quantize_ops/quantize_fused_nbit_rowwise.cu.o CMakeFiles/fbgemm_gpu_py.dir/src/quantize_ops/quantize_hfp8.cu.o CMakeFiles/fbgemm_gpu_py.dir/src/quantize_ops/quantize_msfp.cu.o CMakeFiles/fbgemm_gpu_py.dir/src/quantize_ops/quantize_padded_fp8_rowwise.cu.o CMakeFiles/fbgemm_gpu_py.dir/src/quantize_ops/quantize_mx.cu.o CMakeFiles/fbgemm_gpu_py.dir/src/sparse_ops/sparse_async_batched_cumsum.cu.o CMakeFiles/fbgemm_gpu_py.dir/src/sparse_ops/sparse_block_bucketize_features.cu.o CMakeFiles/fbgemm_gpu_py.dir/src/sparse_ops/sparse_bucketize_features.cu.o CMakeFiles/fbgemm_gpu_py.dir/src/sparse_ops/sparse_batched_unary_embeddings.cu.o CMakeFiles/fbgemm_gpu_py.dir/src/sparse_ops/sparse_compute_frequency_sequence.cu.o CMakeFiles/fbgemm_gpu_py.dir/src/sparse_ops/sparse_expand_into_jagged_permute.cu.o CMakeFiles/fbgemm_gpu_py.dir/src/sparse_ops/sparse_group_index.cu.o CMakeFiles/fbgemm_gpu_py.dir/src/sparse_ops/sparse_index_add.cu.o CMakeFiles/fbgemm_gpu_py.dir/src/sparse_ops/sparse_index_select.cu.o CMakeFiles/fbgemm_gpu_py.dir/src/sparse_ops/sparse_invert_permute.cu.o CMakeFiles/fbgemm_gpu_py.dir/src/sparse_ops/sparse_pack_segments_backward.cu.o CMakeFiles/fbgemm_gpu_py.dir/src/sparse_ops/sparse_pack_segments_forward.cu.o CMakeFiles/fbgemm_gpu_py.dir/src/sparse_ops/sparse_permute_1d.cu.o CMakeFiles/fbgemm_gpu_py.dir/src/sparse_ops/sparse_permute_2d.cu.o CMakeFiles/fbgemm_gpu_py.dir/src/sparse_ops/sparse_permute102.cu.o CMakeFiles/fbgemm_gpu_py.dir/src/sparse_ops/sparse_permute_embeddings.cu.o CMakeFiles/fbgemm_gpu_py.dir/src/sparse_ops/sparse_range.cu.o CMakeFiles/fbgemm_gpu_py.dir/src/sparse_ops/sparse_reorder_batched_ad.cu.o CMakeFiles/fbgemm_gpu_py.dir/src/sparse_ops/sparse_segment_sum_csr.cu.o CMakeFiles/fbgemm_gpu_py.dir/src/sparse_ops/sparse_zipf.cu.o -L/lib/intel64 -L/lib/intel64_win -L/lib/win-x64 -Wl,-rpath,"\$ORIGIN" /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/lib/libc10.so /github/home/miniconda/envs/build_binary/targets/x86_64-linux/lib/libnvrtc.so /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/lib/libc10_cuda.so /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/nvidia/nccl/lib/libnccl.so.2 /github/home/miniconda/envs/build_binary/targets/x86_64-linux/lib/stubs/libcuda.so fbgemm.so fbgemm_gpu_embedding_inplace_ops.so fbgemm_gpu_tbe_index_select.so fbgemm_gpu_tbe_cache.so fbgemm_gpu_tbe_optimizers.so /github/home/miniconda/envs/build_binary/lib/stubs/libnvidia-ml.so -Wl,--no-as-needed,"/github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/lib/libtorch.so" -Wl,--as-needed fbgemm_gpu_sparse_async_cumsum.so fbgemm_gpu_tbe_utils.so /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/lib/libtorch.so -Wl,--no-as-needed,"/github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/lib/libtorch_cpu.so" -Wl,--as-needed -Wl,--no-as-needed,"/github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/lib/libtorch_cuda.so" -Wl,--as-needed /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/lib/libc10_cuda.so /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/lib/libc10.so /github/home/miniconda/envs/build_binary/targets/x86_64-linux/lib/libcudart.so -lcudadevrt -lcudart_static -ldl -Wl,-rpath-link,/__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.12/cmake-build && : 2025-05-07T20:07:18.5714001Z [596/608] cd /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.12/cmake-build && bash /__w/FBGEMM/FBGEMM/fbgemm_gpu/../.github/scripts/fbgemm_gpu_postbuild.bash /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.12/cmake-build/fbgemm_gpu_py.so 1 2025-05-07T20:07:18.5715224Z ################################################################################ 2025-05-07T20:07:18.5715577Z [CMAKE] Running post-build script ... 2025-05-07T20:07:18.5716129Z Target file: /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.12/cmake-build/fbgemm_gpu_py.so 2025-05-07T20:07:18.5716668Z Resetting RPATH to $ORIGIN ... 2025-05-07T20:07:18.5717056Z 0x000000000000000f (RPATH) Library rpath: [$ORIGIN] 2025-05-07T20:07:18.5717469Z ################################################################################ 2025-05-07T20:08:38.3808086Z [597/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_partial_rowwise_adam_split_unweighted_kernel_cta.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_partial_rowwise_adam_split_unweighted_kernel_cta.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.12/cmake-build/gen_embedding_backward_partial_rowwise_adam_split_unweighted_kernel_cta.cu -o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_partial_rowwise_adam_split_unweighted_kernel_cta.cu.o 2025-05-07T20:08:38.3819920Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:08:38.3821235Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:08:38.3821818Z ^ 2025-05-07T20:08:38.3821970Z 2025-05-07T20:08:38.3822293Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:08:38.3822635Z 2025-05-07T20:08:38.3823452Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:08:38.3824753Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:08:38.3825360Z ^ 2025-05-07T20:08:38.3825547Z 2025-05-07T20:08:38.3826353Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:08:38.3827643Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:08:38.3828240Z ^ 2025-05-07T20:08:38.3828376Z 2025-05-07T20:08:38.3828621Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:08:38.3828952Z 2025-05-07T20:08:38.3829753Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:08:38.3831069Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:08:38.3831756Z ^ 2025-05-07T20:08:38.3831961Z 2025-05-07T20:08:38.3832977Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:08:38.3834405Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:08:38.3835027Z ^ 2025-05-07T20:08:38.3835185Z 2025-05-07T20:08:38.3835429Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:08:38.3835784Z 2025-05-07T20:08:38.3836695Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:08:38.3838096Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:08:38.3838735Z ^ 2025-05-07T20:08:38.3838934Z 2025-05-07T20:08:38.3839788Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:08:38.3841190Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:08:38.3841824Z ^ 2025-05-07T20:08:38.3841966Z 2025-05-07T20:08:38.3842210Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:08:38.3842578Z 2025-05-07T20:08:38.3843449Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:08:38.3844926Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:08:38.3845541Z ^ 2025-05-07T20:08:38.3845741Z 2025-05-07T20:08:38.3846536Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:08:38.3847834Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:08:38.3848407Z ^ 2025-05-07T20:08:38.3848541Z 2025-05-07T20:08:38.3848778Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:08:38.3849108Z 2025-05-07T20:08:38.3849908Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:08:38.3851213Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:08:38.3851809Z ^ 2025-05-07T20:08:38.3851991Z 2025-05-07T20:08:46.4166333Z [598/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_partial_rowwise_adam_split_weighted_kernel_cta.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_partial_rowwise_adam_split_weighted_kernel_cta.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.12/cmake-build/gen_embedding_backward_partial_rowwise_adam_split_weighted_kernel_cta.cu -o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_partial_rowwise_adam_split_weighted_kernel_cta.cu.o 2025-05-07T20:08:46.4178917Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:08:46.4180343Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:08:46.4180977Z ^ 2025-05-07T20:08:46.4181121Z 2025-05-07T20:08:46.4181363Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:08:46.4181731Z 2025-05-07T20:08:46.4182582Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:08:46.4183959Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:08:46.4184686Z ^ 2025-05-07T20:08:46.4184885Z 2025-05-07T20:08:46.4185681Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:08:46.4186970Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:08:46.4187546Z ^ 2025-05-07T20:08:46.4187694Z 2025-05-07T20:08:46.4187922Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:08:46.4188253Z 2025-05-07T20:08:46.4189140Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:08:46.4190442Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:08:46.4191078Z ^ 2025-05-07T20:08:46.4191266Z 2025-05-07T20:08:46.4192311Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:08:46.4193855Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:08:46.4194523Z ^ 2025-05-07T20:08:46.4194667Z 2025-05-07T20:08:46.4194914Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:08:46.4195288Z 2025-05-07T20:08:46.4196150Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:08:46.4197555Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:08:46.4198292Z ^ 2025-05-07T20:08:46.4198488Z 2025-05-07T20:08:46.4199279Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:08:46.4200583Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:08:46.4201157Z ^ 2025-05-07T20:08:46.4201305Z 2025-05-07T20:08:46.4201531Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:08:46.4201858Z 2025-05-07T20:08:46.4202673Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:08:46.4204001Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:08:46.4204608Z ^ 2025-05-07T20:08:46.4204792Z 2025-05-07T20:08:46.4205591Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:08:46.4206889Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:08:46.4207482Z ^ 2025-05-07T20:08:46.4207615Z 2025-05-07T20:08:46.4207842Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:08:46.4208187Z 2025-05-07T20:08:46.4208994Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:08:46.4210300Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:08:46.4210886Z ^ 2025-05-07T20:08:46.4211081Z 2025-05-07T20:08:52.3845480Z [599/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_partial_rowwise_adam_split_unweighted_nobag_kernel_cta.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_partial_rowwise_adam_split_unweighted_nobag_kernel_cta.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.12/cmake-build/gen_embedding_backward_partial_rowwise_adam_split_unweighted_nobag_kernel_cta.cu -o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_partial_rowwise_adam_split_unweighted_nobag_kernel_cta.cu.o 2025-05-07T20:08:52.3865919Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:08:52.3867337Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:08:52.3868007Z ^ 2025-05-07T20:08:52.3868160Z 2025-05-07T20:08:52.3868432Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:08:52.3868800Z 2025-05-07T20:08:52.3869676Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:08:52.3871095Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:08:52.3871833Z ^ 2025-05-07T20:08:52.3872039Z 2025-05-07T20:08:52.3872902Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:08:52.3874392Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:08:52.3875019Z ^ 2025-05-07T20:08:52.3875183Z 2025-05-07T20:08:52.3875431Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:08:52.3875793Z 2025-05-07T20:08:52.3876728Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:08:52.3878242Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:08:52.3878846Z ^ 2025-05-07T20:08:52.3879038Z 2025-05-07T20:08:52.3879887Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:08:52.3881368Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:08:52.3881999Z ^ 2025-05-07T20:08:52.3882144Z 2025-05-07T20:08:52.3882405Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:08:52.3882755Z 2025-05-07T20:08:52.3883600Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:08:52.3884980Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:08:52.3885604Z ^ 2025-05-07T20:08:52.3885822Z 2025-05-07T20:08:52.3886664Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:08:52.3888098Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:08:52.3888719Z ^ 2025-05-07T20:08:52.3888880Z 2025-05-07T20:08:52.3889109Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:08:52.3889442Z 2025-05-07T20:08:52.3890271Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:08:52.3891565Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:08:52.3892174Z ^ 2025-05-07T20:08:52.3892364Z 2025-05-07T20:08:52.3893161Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:08:52.3894470Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:08:52.3895070Z ^ 2025-05-07T20:08:52.3895204Z 2025-05-07T20:08:52.3895433Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:08:52.3895784Z 2025-05-07T20:08:52.3896596Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:08:52.3897946Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:08:52.3898536Z ^ 2025-05-07T20:08:52.3898744Z 2025-05-07T20:08:53.9492861Z [600/608] : && /github/home/miniconda/envs/build_binary/bin/c++ -fPIC -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -L/github/home/miniconda/envs/build_binary/targets/x86_64-linux/lib -L/github/home/miniconda/envs/build_binary/targets/x86_64-linux/lib/stubs -s -shared -Wl,-soname,fbgemm_gpu_tbe_training_backward.so -o fbgemm_gpu_tbe_training_backward.so CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/codegen/training/backward/embedding_backward_dense_host_cpu.cpp.o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_dense_split_cpu.cpp.o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_split_adagrad_cpu.cpp.o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_split_rowwise_adagrad_cpu.cpp.o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_split_sgd_cpu.cpp.o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_split_adam_cpu.cpp.o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_split_lamb_cpu.cpp.o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_split_lars_sgd_cpu.cpp.o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_split_partial_rowwise_adam_cpu.cpp.o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_split_partial_rowwise_lamb_cpu.cpp.o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_split_none_cpu.cpp.o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_split_rowwise_adagrad_with_counter_cpu.cpp.o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_split_approx_sgd_cpu.cpp.o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_split_approx_rowwise_adagrad_cpu.cpp.o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_split_approx_rowwise_adagrad_with_counter_cpu.cpp.o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_split_approx_rowwise_adagrad_with_weight_decay_cpu.cpp.o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_split_rowwise_adagrad_with_weight_decay_cpu.cpp.o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_split_rowwise_weighted_adagrad_cpu.cpp.o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_adagrad_split_cpu.cpp.o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_rowwise_adagrad_split_cpu.cpp.o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_sgd_split_cpu.cpp.o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_split_grad_embedding_ops.cu.o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_split_indice_weights_codegen_cuda.cu.o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_dense_indice_weights_codegen_cuda.cu.o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_ssd_indice_weights_codegen_cuda.cu.o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_rowwise_adagrad_ssd_weighted_cuda.cu.o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_rowwise_adagrad_ssd_weighted_kernel_cta.cu.o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_rowwise_adagrad_ssd_weighted_kernel_warp.cu.o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_rowwise_adagrad_ssd_unweighted_nobag_cuda.cu.o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_rowwise_adagrad_ssd_unweighted_nobag_kernel_cta.cu.o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_rowwise_adagrad_ssd_unweighted_nobag_kernel_warp.cu.o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_rowwise_adagrad_ssd_unweighted_cuda.cu.o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_rowwise_adagrad_ssd_unweighted_kernel_cta.cu.o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_rowwise_adagrad_ssd_unweighted_kernel_warp.cu.o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_adagrad_split_weighted_cuda.cu.o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_adagrad_split_weighted_kernel_cta.cu.o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_adagrad_split_weighted_kernel_warp.cu.o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_adagrad_split_unweighted_nobag_cuda.cu.o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_adagrad_split_unweighted_nobag_kernel_cta.cu.o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_adagrad_split_unweighted_nobag_kernel_warp.cu.o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_adagrad_split_unweighted_cuda.cu.o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_adagrad_split_unweighted_kernel_cta.cu.o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_adagrad_split_unweighted_kernel_warp.cu.o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_rowwise_adagrad_split_weighted_cuda.cu.o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_rowwise_adagrad_split_weighted_kernel_cta.cu.o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_rowwise_adagrad_split_weighted_kernel_warp.cu.o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_rowwise_adagrad_split_unweighted_nobag_cuda.cu.o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_rowwise_adagrad_split_unweighted_nobag_kernel_cta.cu.o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_rowwise_adagrad_split_unweighted_nobag_kernel_warp.cu.o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_rowwise_adagrad_split_unweighted_cuda.cu.o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_rowwise_adagrad_split_unweighted_kernel_cta.cu.o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_rowwise_adagrad_split_unweighted_kernel_warp.cu.o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_sgd_split_weighted_cuda.cu.o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_sgd_split_weighted_kernel_cta.cu.o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_sgd_split_weighted_kernel_warp.cu.o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_sgd_split_unweighted_nobag_cuda.cu.o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_sgd_split_unweighted_nobag_kernel_cta.cu.o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_sgd_split_unweighted_nobag_kernel_warp.cu.o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_sgd_split_unweighted_cuda.cu.o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_sgd_split_unweighted_kernel_cta.cu.o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_sgd_split_unweighted_kernel_warp.cu.o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_adam_split_weighted_cuda.cu.o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_adam_split_weighted_kernel_cta.cu.o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_adam_split_weighted_kernel_warp.cu.o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_adam_split_unweighted_nobag_cuda.cu.o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_adam_split_unweighted_nobag_kernel_cta.cu.o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_adam_split_unweighted_nobag_kernel_warp.cu.o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_adam_split_unweighted_cuda.cu.o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_adam_split_unweighted_kernel_cta.cu.o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_adam_split_unweighted_kernel_warp.cu.o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_lamb_split_weighted_cuda.cu.o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_lamb_split_weighted_kernel_cta.cu.o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_lamb_split_weighted_kernel_warp.cu.o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_lamb_split_unweighted_nobag_cuda.cu.o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_lamb_split_unweighted_nobag_kernel_cta.cu.o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_lamb_split_unweighted_nobag_kernel_warp.cu.o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_lamb_split_unweighted_cuda.cu.o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_lamb_split_unweighted_kernel_cta.cu.o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_lamb_split_unweighted_kernel_warp.cu.o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_lars_sgd_split_weighted_cuda.cu.o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_lars_sgd_split_weighted_kernel_cta.cu.o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_lars_sgd_split_weighted_kernel_warp.cu.o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_lars_sgd_split_unweighted_nobag_cuda.cu.o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_lars_sgd_split_unweighted_nobag_kernel_cta.cu.o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_lars_sgd_split_unweighted_nobag_kernel_warp.cu.o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_lars_sgd_split_unweighted_cuda.cu.o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_lars_sgd_split_unweighted_kernel_cta.cu.o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_lars_sgd_split_unweighted_kernel_warp.cu.o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_partial_rowwise_adam_split_weighted_cuda.cu.o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_partial_rowwise_adam_split_weighted_kernel_cta.cu.o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_partial_rowwise_adam_split_weighted_kernel_warp.cu.o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_partial_rowwise_adam_split_unweighted_nobag_cuda.cu.o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_partial_rowwise_adam_split_unweighted_nobag_kernel_cta.cu.o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_partial_rowwise_adam_split_unweighted_nobag_kernel_warp.cu.o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_partial_rowwise_adam_split_unweighted_cuda.cu.o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_partial_rowwise_adam_split_unweighted_kernel_cta.cu.o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_partial_rowwise_adam_split_unweighted_kernel_warp.cu.o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_partial_rowwise_lamb_split_weighted_cuda.cu.o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_partial_rowwise_lamb_split_weighted_kernel_cta.cu.o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_partial_rowwise_lamb_split_weighted_kernel_warp.cu.o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_partial_rowwise_lamb_split_unweighted_nobag_cuda.cu.o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_partial_rowwise_lamb_split_unweighted_nobag_kernel_cta.cu.o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_partial_rowwise_lamb_split_unweighted_nobag_kernel_warp.cu.o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_partial_rowwise_lamb_split_unweighted_cuda.cu.o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_partial_rowwise_lamb_split_unweighted_kernel_cta.cu.o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_partial_rowwise_lamb_split_unweighted_kernel_warp.cu.o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_none_split_weighted_cuda.cu.o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_none_split_weighted_kernel_cta.cu.o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_none_split_weighted_kernel_warp.cu.o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_none_split_unweighted_nobag_cuda.cu.o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_none_split_unweighted_nobag_kernel_cta.cu.o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_none_split_unweighted_nobag_kernel_warp.cu.o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_none_split_unweighted_cuda.cu.o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_none_split_unweighted_kernel_cta.cu.o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_none_split_unweighted_kernel_warp.cu.o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_rowwise_adagrad_with_counter_split_weighted_cuda.cu.o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_rowwise_adagrad_with_counter_split_weighted_kernel_cta.cu.o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_rowwise_adagrad_with_counter_split_weighted_kernel_warp.cu.o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_rowwise_adagrad_with_counter_split_unweighted_nobag_cuda.cu.o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_rowwise_adagrad_with_counter_split_unweighted_nobag_kernel_cta.cu.o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_rowwise_adagrad_with_counter_split_unweighted_nobag_kernel_warp.cu.o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_rowwise_adagrad_with_counter_split_unweighted_cuda.cu.o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_rowwise_adagrad_with_counter_split_unweighted_kernel_cta.cu.o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_rowwise_adagrad_with_counter_split_unweighted_kernel_warp.cu.o -L/lib/intel64 -L/lib/intel64_win -L/lib/win-x64 -Wl,-rpath,"\$ORIGIN" /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/lib/libc10.so /github/home/miniconda/envs/build_binary/targets/x86_64-linux/lib/libnvrtc.so /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/lib/libc10_cuda.so /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/nvidia/nccl/lib/libnccl.so.2 /github/home/miniconda/envs/build_binary/targets/x86_64-linux/lib/stubs/libcuda.so fbgemm_gpu_tbe_cache.so fbgemm_gpu_tbe_common.so fbgemm_gpu_sparse_async_cumsum.so /github/home/miniconda/envs/build_binary/lib/stubs/libnvidia-ml.so -Wl,--no-as-needed,"/github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/lib/libtorch.so" -Wl,--as-needed fbgemm.so fbgemm_gpu_config.so fbgemm_gpu_tbe_utils.so /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/lib/libtorch.so -Wl,--no-as-needed,"/github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/lib/libtorch_cpu.so" -Wl,--as-needed -Wl,--no-as-needed,"/github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/lib/libtorch_cuda.so" -Wl,--as-needed /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/lib/libc10_cuda.so /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/lib/libc10.so /github/home/miniconda/envs/build_binary/targets/x86_64-linux/lib/libcudart.so -lcudadevrt -lcudart_static -ldl -Wl,-rpath-link,/__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.12/cmake-build && : 2025-05-07T20:08:54.5198043Z [601/608] : && /github/home/miniconda/envs/build_binary/bin/c++ -fPIC -L/github/home/miniconda/envs/build_binary/targets/x86_64-linux/lib -L/github/home/miniconda/envs/build_binary/targets/x86_64-linux/lib/stubs -s -shared -Wl,-soname,fbgemm_gpu_tbe_training_backward_gwd.so -o fbgemm_gpu_tbe_training_backward_gwd.so CMakeFiles/fbgemm_gpu_tbe_training_backward_gwd.dir/gen_embedding_backward_rowwise_adagrad_split_weighted_gwd_cuda.cu.o CMakeFiles/fbgemm_gpu_tbe_training_backward_gwd.dir/gen_embedding_backward_rowwise_adagrad_split_weighted_gwd_kernel_cta.cu.o CMakeFiles/fbgemm_gpu_tbe_training_backward_gwd.dir/gen_embedding_backward_rowwise_adagrad_split_weighted_gwd_kernel_warp.cu.o CMakeFiles/fbgemm_gpu_tbe_training_backward_gwd.dir/gen_embedding_backward_rowwise_adagrad_split_unweighted_gwd_cuda.cu.o CMakeFiles/fbgemm_gpu_tbe_training_backward_gwd.dir/gen_embedding_backward_rowwise_adagrad_split_unweighted_gwd_kernel_cta.cu.o CMakeFiles/fbgemm_gpu_tbe_training_backward_gwd.dir/gen_embedding_backward_rowwise_adagrad_split_unweighted_gwd_kernel_warp.cu.o CMakeFiles/fbgemm_gpu_tbe_training_backward_gwd.dir/gen_embedding_backward_rowwise_adagrad_split_weighted_vbe_gwd_cuda.cu.o CMakeFiles/fbgemm_gpu_tbe_training_backward_gwd.dir/gen_embedding_backward_rowwise_adagrad_split_weighted_vbe_gwd_kernel_cta.cu.o CMakeFiles/fbgemm_gpu_tbe_training_backward_gwd.dir/gen_embedding_backward_rowwise_adagrad_split_weighted_vbe_gwd_kernel_warp.cu.o CMakeFiles/fbgemm_gpu_tbe_training_backward_gwd.dir/gen_embedding_backward_rowwise_adagrad_split_unweighted_vbe_gwd_cuda.cu.o CMakeFiles/fbgemm_gpu_tbe_training_backward_gwd.dir/gen_embedding_backward_rowwise_adagrad_split_unweighted_vbe_gwd_kernel_cta.cu.o CMakeFiles/fbgemm_gpu_tbe_training_backward_gwd.dir/gen_embedding_backward_rowwise_adagrad_split_unweighted_vbe_gwd_kernel_warp.cu.o -L/lib/intel64 -L/lib/intel64_win -L/lib/win-x64 -Wl,-rpath,"\$ORIGIN" /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/lib/libc10.so /github/home/miniconda/envs/build_binary/targets/x86_64-linux/lib/libnvrtc.so /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/lib/libc10_cuda.so /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/nvidia/nccl/lib/libnccl.so.2 /github/home/miniconda/envs/build_binary/targets/x86_64-linux/lib/stubs/libcuda.so fbgemm_gpu_tbe_training_backward.so /github/home/miniconda/envs/build_binary/lib/stubs/libnvidia-ml.so -Wl,--no-as-needed,"/github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/lib/libtorch.so" -Wl,--as-needed /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/lib/libtorch.so -Wl,--no-as-needed,"/github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/lib/libtorch_cpu.so" -Wl,--as-needed -Wl,--no-as-needed,"/github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/lib/libtorch_cuda.so" -Wl,--as-needed /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/lib/libc10_cuda.so /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/lib/libc10.so /github/home/miniconda/envs/build_binary/targets/x86_64-linux/lib/libcudart.so -lcudadevrt -lcudart_static -lrt -lpthread -ldl -Wl,-rpath-link,/__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.12/cmake-build -L"/github/home/miniconda/envs/build_binary/targets/x86_64-linux/lib/stubs" -L"/github/home/miniconda/envs/build_binary/targets/x86_64-linux/lib" && : 2025-05-07T20:08:54.5555356Z [602/608] : && /github/home/miniconda/envs/build_binary/bin/c++ -fPIC -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -L/github/home/miniconda/envs/build_binary/targets/x86_64-linux/lib -L/github/home/miniconda/envs/build_binary/targets/x86_64-linux/lib/stubs -s -shared -Wl,-soname,fbgemm_gpu_tbe_training_backward_dense.so -o fbgemm_gpu_tbe_training_backward_dense.so CMakeFiles/fbgemm_gpu_tbe_training_backward_dense.dir/gen_embedding_backward_dense_split_weighted_meta.cpp.o CMakeFiles/fbgemm_gpu_tbe_training_backward_dense.dir/gen_embedding_backward_dense_split_unweighted_nobag_meta.cpp.o CMakeFiles/fbgemm_gpu_tbe_training_backward_dense.dir/gen_embedding_backward_dense_split_unweighted_meta.cpp.o CMakeFiles/fbgemm_gpu_tbe_training_backward_dense.dir/gen_embedding_backward_split_dense.cpp.o CMakeFiles/fbgemm_gpu_tbe_training_backward_dense.dir/gen_embedding_backward_dense_split_weighted_cuda.cu.o CMakeFiles/fbgemm_gpu_tbe_training_backward_dense.dir/gen_embedding_backward_dense_split_weighted_kernel_cta.cu.o CMakeFiles/fbgemm_gpu_tbe_training_backward_dense.dir/gen_embedding_backward_dense_split_weighted_kernel_warp.cu.o CMakeFiles/fbgemm_gpu_tbe_training_backward_dense.dir/gen_embedding_backward_dense_split_unweighted_nobag_cuda.cu.o CMakeFiles/fbgemm_gpu_tbe_training_backward_dense.dir/gen_embedding_backward_dense_split_unweighted_nobag_kernel_cta.cu.o CMakeFiles/fbgemm_gpu_tbe_training_backward_dense.dir/gen_embedding_backward_dense_split_unweighted_nobag_kernel_warp.cu.o CMakeFiles/fbgemm_gpu_tbe_training_backward_dense.dir/gen_embedding_backward_dense_split_unweighted_cuda.cu.o CMakeFiles/fbgemm_gpu_tbe_training_backward_dense.dir/gen_embedding_backward_dense_split_unweighted_kernel_cta.cu.o CMakeFiles/fbgemm_gpu_tbe_training_backward_dense.dir/gen_embedding_backward_dense_split_unweighted_kernel_warp.cu.o -L/lib/intel64 -L/lib/intel64_win -L/lib/win-x64 -Wl,-rpath,"\$ORIGIN" /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/lib/libc10.so /github/home/miniconda/envs/build_binary/targets/x86_64-linux/lib/libnvrtc.so /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/lib/libc10_cuda.so /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/nvidia/nccl/lib/libnccl.so.2 /github/home/miniconda/envs/build_binary/targets/x86_64-linux/lib/stubs/libcuda.so fbgemm_gpu_tbe_training_backward.so /github/home/miniconda/envs/build_binary/lib/stubs/libnvidia-ml.so -Wl,--no-as-needed,"/github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/lib/libtorch.so" -Wl,--as-needed /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/lib/libtorch.so -Wl,--no-as-needed,"/github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/lib/libtorch_cpu.so" -Wl,--as-needed -Wl,--no-as-needed,"/github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/lib/libtorch_cuda.so" -Wl,--as-needed /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/lib/libc10_cuda.so /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/lib/libc10.so /github/home/miniconda/envs/build_binary/targets/x86_64-linux/lib/libcudart.so -lcudadevrt -lcudart_static -ldl -Wl,-rpath-link,/__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.12/cmake-build && : 2025-05-07T20:08:54.6110211Z [603/608] cd /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.12/cmake-build && bash /__w/FBGEMM/FBGEMM/fbgemm_gpu/../.github/scripts/fbgemm_gpu_postbuild.bash /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.12/cmake-build/fbgemm_gpu_tbe_training_backward_dense.so 1 2025-05-07T20:08:54.6111814Z ################################################################################ 2025-05-07T20:08:54.6112203Z [CMAKE] Running post-build script ... 2025-05-07T20:08:54.6112857Z Target file: /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.12/cmake-build/fbgemm_gpu_tbe_training_backward_dense.so 2025-05-07T20:08:54.6113503Z Resetting RPATH to $ORIGIN ... 2025-05-07T20:08:54.6113901Z 0x000000000000000f (RPATH) Library rpath: [$ORIGIN] 2025-05-07T20:08:54.6114559Z ################################################################################ 2025-05-07T20:08:54.6463680Z [604/608] cd /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.12/cmake-build && bash /__w/FBGEMM/FBGEMM/fbgemm_gpu/../.github/scripts/fbgemm_gpu_postbuild.bash /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.12/cmake-build/fbgemm_gpu_tbe_training_backward_gwd.so 1 2025-05-07T20:08:54.6465413Z ################################################################################ 2025-05-07T20:08:54.6465779Z [CMAKE] Running post-build script ... 2025-05-07T20:08:54.6466425Z Target file: /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.12/cmake-build/fbgemm_gpu_tbe_training_backward_gwd.so 2025-05-07T20:08:54.6467091Z Resetting RPATH to $ORIGIN ... 2025-05-07T20:08:54.6467470Z 0x000000000000000f (RPATH) Library rpath: [$ORIGIN] 2025-05-07T20:08:54.6467968Z ################################################################################ 2025-05-07T20:08:54.7232900Z [605/608] cd /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.12/cmake-build && bash /__w/FBGEMM/FBGEMM/fbgemm_gpu/../.github/scripts/fbgemm_gpu_postbuild.bash /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.12/cmake-build/fbgemm_gpu_tbe_training_backward.so 1 2025-05-07T20:08:54.7236699Z ################################################################################ 2025-05-07T20:08:54.7237390Z [CMAKE] Running post-build script ... 2025-05-07T20:08:54.7238014Z Target file: /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.12/cmake-build/fbgemm_gpu_tbe_training_backward.so 2025-05-07T20:08:54.7238754Z Resetting RPATH to $ORIGIN ... 2025-05-07T20:08:54.7239105Z 0x000000000000000f (RPATH) Library rpath: [$ORIGIN] 2025-05-07T20:08:54.7239507Z ################################################################################ 2025-05-07T20:08:54.8664470Z [606/608] : && /github/home/miniconda/envs/build_binary/bin/c++ -fPIC -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -L/github/home/miniconda/envs/build_binary/targets/x86_64-linux/lib -L/github/home/miniconda/envs/build_binary/targets/x86_64-linux/lib/stubs -s -shared -Wl,-soname,fbgemm_gpu_tbe_training_backward_vbe.so -o fbgemm_gpu_tbe_training_backward_vbe.so CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_rowwise_adagrad_split_weighted_vbe_meta.cpp.o CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_rowwise_adagrad_ssd_weighted_vbe_meta.cpp.o CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_rowwise_adagrad_split_unweighted_vbe_meta.cpp.o CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_rowwise_adagrad_ssd_unweighted_vbe_meta.cpp.o CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_rowwise_adagrad_with_counter_split_weighted_vbe_meta.cpp.o CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_rowwise_adagrad_with_counter_split_unweighted_vbe_meta.cpp.o CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_sgd_split_weighted_vbe_meta.cpp.o CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_sgd_split_unweighted_vbe_meta.cpp.o CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_dense_split_weighted_vbe_meta.cpp.o CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_dense_split_unweighted_vbe_meta.cpp.o CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_adam_split_weighted_vbe_meta.cpp.o CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_adam_split_unweighted_vbe_meta.cpp.o CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_rowwise_adagrad_split_weighted_vbe_cuda.cu.o CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_rowwise_adagrad_split_weighted_vbe_kernel_cta.cu.o CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_rowwise_adagrad_split_weighted_vbe_kernel_warp.cu.o CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_rowwise_adagrad_ssd_weighted_vbe_cuda.cu.o CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_rowwise_adagrad_ssd_weighted_vbe_kernel_cta.cu.o CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_rowwise_adagrad_ssd_weighted_vbe_kernel_warp.cu.o CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_rowwise_adagrad_split_unweighted_vbe_cuda.cu.o CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_rowwise_adagrad_split_unweighted_vbe_kernel_cta.cu.o CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_rowwise_adagrad_split_unweighted_vbe_kernel_warp.cu.o CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_rowwise_adagrad_ssd_unweighted_vbe_cuda.cu.o CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_rowwise_adagrad_ssd_unweighted_vbe_kernel_cta.cu.o CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_rowwise_adagrad_ssd_unweighted_vbe_kernel_warp.cu.o CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_rowwise_adagrad_with_counter_split_weighted_vbe_cuda.cu.o CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_rowwise_adagrad_with_counter_split_weighted_vbe_kernel_cta.cu.o CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_rowwise_adagrad_with_counter_split_weighted_vbe_kernel_warp.cu.o CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_rowwise_adagrad_with_counter_split_unweighted_vbe_cuda.cu.o CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_rowwise_adagrad_with_counter_split_unweighted_vbe_kernel_cta.cu.o CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_rowwise_adagrad_with_counter_split_unweighted_vbe_kernel_warp.cu.o CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_sgd_split_weighted_vbe_cuda.cu.o CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_sgd_split_weighted_vbe_kernel_cta.cu.o CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_sgd_split_weighted_vbe_kernel_warp.cu.o CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_sgd_split_unweighted_vbe_cuda.cu.o CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_sgd_split_unweighted_vbe_kernel_cta.cu.o CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_sgd_split_unweighted_vbe_kernel_warp.cu.o CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_dense_split_weighted_vbe_cuda.cu.o CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_dense_split_weighted_vbe_kernel_cta.cu.o CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_dense_split_weighted_vbe_kernel_warp.cu.o CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_dense_split_unweighted_vbe_cuda.cu.o CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_dense_split_unweighted_vbe_kernel_cta.cu.o CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_dense_split_unweighted_vbe_kernel_warp.cu.o CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_adam_split_weighted_vbe_cuda.cu.o CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_adam_split_weighted_vbe_kernel_cta.cu.o CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_adam_split_weighted_vbe_kernel_warp.cu.o CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_adam_split_unweighted_vbe_cuda.cu.o CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_adam_split_unweighted_vbe_kernel_cta.cu.o CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_adam_split_unweighted_vbe_kernel_warp.cu.o -L/lib/intel64 -L/lib/intel64_win -L/lib/win-x64 -Wl,-rpath,"\$ORIGIN" /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/lib/libc10.so /github/home/miniconda/envs/build_binary/targets/x86_64-linux/lib/libnvrtc.so /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/lib/libc10_cuda.so /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/nvidia/nccl/lib/libnccl.so.2 /github/home/miniconda/envs/build_binary/targets/x86_64-linux/lib/stubs/libcuda.so fbgemm_gpu_tbe_training_backward.so /github/home/miniconda/envs/build_binary/lib/stubs/libnvidia-ml.so -Wl,--no-as-needed,"/github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/lib/libtorch.so" -Wl,--as-needed /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/lib/libtorch.so -Wl,--no-as-needed,"/github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/lib/libtorch_cpu.so" -Wl,--as-needed -Wl,--no-as-needed,"/github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/lib/libtorch_cuda.so" -Wl,--as-needed /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/lib/libc10_cuda.so /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/torch/lib/libc10.so /github/home/miniconda/envs/build_binary/targets/x86_64-linux/lib/libcudart.so -lcudadevrt -lcudart_static -ldl -Wl,-rpath-link,/__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.12/cmake-build && : 2025-05-07T20:08:55.1591899Z [607/608] cd /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.12/cmake-build && bash /__w/FBGEMM/FBGEMM/fbgemm_gpu/../.github/scripts/fbgemm_gpu_postbuild.bash /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.12/cmake-build/fbgemm_gpu_tbe_training_backward_vbe.so 1 2025-05-07T20:08:55.1593381Z ################################################################################ 2025-05-07T20:08:55.1593781Z [CMAKE] Running post-build script ... 2025-05-07T20:08:55.1594439Z Target file: /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.12/cmake-build/fbgemm_gpu_tbe_training_backward_vbe.so 2025-05-07T20:08:55.1595091Z Resetting RPATH to $ORIGIN ... 2025-05-07T20:08:55.1595493Z 0x000000000000000f (RPATH) Library rpath: [$ORIGIN] 2025-05-07T20:08:55.1595916Z ################################################################################ 2025-05-07T20:08:55.1597099Z [607/608] cd /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.12/cmake-build && /github/home/miniconda/envs/build_binary/lib/python3.12/site-packages/cmake/data/bin/cmake -P cmake_install.cmake 2025-05-07T20:08:55.1634319Z -- Install configuration: "Release" 2025-05-07T20:08:55.1636067Z -- Installing: /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.12/cmake-install/fbgemm_gpu/asmjit.so 2025-05-07T20:08:55.1656504Z -- Installing: /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.12/cmake-install/fbgemm_gpu/fbgemm.so 2025-05-07T20:08:55.1659159Z -- Installing: /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.12/cmake-install/fbgemm_gpu/fbgemm_gpu_tbe_cache.so 2025-05-07T20:08:55.1671342Z -- Installing: /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.12/cmake-install/fbgemm_gpu/fbgemm_gpu_tbe_inference.so 2025-05-07T20:08:55.1674427Z -- Installing: /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.12/cmake-install/fbgemm_gpu/fbgemm_gpu_config.so 2025-05-07T20:08:55.1693649Z -- Installing: /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.12/cmake-install/fbgemm_gpu/fbgemm_gpu_tbe_utils.so 2025-05-07T20:08:55.1715530Z -- Installing: /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.12/cmake-install/fbgemm_gpu/fbgemm_gpu_sparse_async_cumsum.so 2025-05-07T20:08:55.1718600Z -- Installing: /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.12/cmake-install/fbgemm_gpu/fbgemm_gpu_tbe_common.so 2025-05-07T20:08:55.1719641Z -- Installing: /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.12/cmake-install/fbgemm_gpu/fbgemm_gpu_tbe_optimizers.so 2025-05-07T20:08:55.1745791Z -- Installing: /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.12/cmake-install/fbgemm_gpu/fbgemm_gpu_tbe_training_forward.so 2025-05-07T20:08:55.1746827Z -- Installing: /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.12/cmake-install/fbgemm_gpu/fbgemm_gpu_tbe_training_backward_pt2.so 2025-05-07T20:08:55.1747997Z -- Installing: /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.12/cmake-install/fbgemm_gpu/fbgemm_gpu_tbe_training_backward.so 2025-05-07T20:09:01.4021353Z -- Installing: /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.12/cmake-install/fbgemm_gpu/fbgemm_gpu_tbe_training_backward_gwd.so 2025-05-07T20:09:02.5329143Z -- Installing: /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.12/cmake-install/fbgemm_gpu/fbgemm_gpu_tbe_training_backward_vbe.so 2025-05-07T20:09:05.1455135Z -- Installing: /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.12/cmake-install/fbgemm_gpu/fbgemm_gpu_tbe_training_backward_dense.so 2025-05-07T20:09:05.6113980Z -- Installing: /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.12/cmake-install/fbgemm_gpu/fbgemm_gpu_tbe_training_backward_split_host.so 2025-05-07T20:09:05.6117286Z -- Installing: /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.12/cmake-install/fbgemm_gpu/fbgemm_gpu_tbe_index_select.so 2025-05-07T20:09:05.6120340Z -- Installing: /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.12/cmake-install/fbgemm_gpu/split_embedding_codegen_lookup_invokers/lookup_adagrad.py 2025-05-07T20:09:05.6121612Z -- Installing: /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.12/cmake-install/fbgemm_gpu/split_embedding_codegen_lookup_invokers/lookup_rowwise_adagrad.py 2025-05-07T20:09:05.6122889Z -- Installing: /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.12/cmake-install/fbgemm_gpu/split_embedding_codegen_lookup_invokers/lookup_sgd.py 2025-05-07T20:09:05.6124084Z -- Installing: /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.12/cmake-install/fbgemm_gpu/split_embedding_codegen_lookup_invokers/lookup_adam.py 2025-05-07T20:09:05.6125299Z -- Installing: /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.12/cmake-install/fbgemm_gpu/split_embedding_codegen_lookup_invokers/lookup_lamb.py 2025-05-07T20:09:05.6126534Z -- Installing: /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.12/cmake-install/fbgemm_gpu/split_embedding_codegen_lookup_invokers/lookup_lars_sgd.py 2025-05-07T20:09:05.6128034Z -- Installing: /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.12/cmake-install/fbgemm_gpu/split_embedding_codegen_lookup_invokers/lookup_partial_rowwise_adam.py 2025-05-07T20:09:05.6129386Z -- Installing: /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.12/cmake-install/fbgemm_gpu/split_embedding_codegen_lookup_invokers/lookup_partial_rowwise_lamb.py 2025-05-07T20:09:05.6130663Z -- Installing: /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.12/cmake-install/fbgemm_gpu/split_embedding_codegen_lookup_invokers/lookup_none.py 2025-05-07T20:09:05.6131966Z -- Installing: /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.12/cmake-install/fbgemm_gpu/split_embedding_codegen_lookup_invokers/lookup_rowwise_adagrad_with_counter.py 2025-05-07T20:09:05.6133347Z -- Installing: /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.12/cmake-install/fbgemm_gpu/split_embedding_codegen_lookup_invokers/lookup_rowwise_adagrad_ssd.py 2025-05-07T20:09:05.6134616Z -- Installing: /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.12/cmake-install/fbgemm_gpu/split_embedding_codegen_lookup_invokers/__init__.py 2025-05-07T20:09:05.6135803Z -- Installing: /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.12/cmake-install/fbgemm_gpu/split_embedding_codegen_lookup_invokers/lookup_args.py 2025-05-07T20:09:05.6137031Z -- Installing: /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.12/cmake-install/fbgemm_gpu/split_embedding_codegen_lookup_invokers/lookup_args_ssd.py 2025-05-07T20:09:05.6138350Z -- Installing: /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.12/cmake-install/fbgemm_gpu/split_embedding_optimizer_codegen/split_embedding_optimizer_rowwise_adagrad.py 2025-05-07T20:09:05.6139739Z -- Installing: /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.12/cmake-install/fbgemm_gpu/split_embedding_optimizer_codegen/optimizer_args.py 2025-05-07T20:09:05.6140868Z -- Installing: /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.12/cmake-install/fbgemm_gpu/fbgemm_gpu_embedding_inplace_ops.so 2025-05-07T20:09:05.6144021Z -- Installing: /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.12/cmake-install/fbgemm_gpu/fbgemm_gpu_py.so 2025-05-07T20:09:05.6194277Z 2025-05-07T20:09:05.6243739Z 2025-05-07T20:09:05.6244455Z copying fbgemm_gpu/__init__.py -> _skbuild/linux-x86_64-3.12/cmake-install/fbgemm_gpu/__init__.py 2025-05-07T20:09:05.6247156Z copying fbgemm_gpu/batched_unary_embeddings_ops.py -> _skbuild/linux-x86_64-3.12/cmake-install/fbgemm_gpu/batched_unary_embeddings_ops.py 2025-05-07T20:09:05.6248295Z copying fbgemm_gpu/enums.py -> _skbuild/linux-x86_64-3.12/cmake-install/fbgemm_gpu/enums.py 2025-05-07T20:09:05.6249122Z copying fbgemm_gpu/metrics.py -> _skbuild/linux-x86_64-3.12/cmake-install/fbgemm_gpu/metrics.py 2025-05-07T20:09:05.6250203Z copying fbgemm_gpu/permute_pooled_embedding_modules.py -> _skbuild/linux-x86_64-3.12/cmake-install/fbgemm_gpu/permute_pooled_embedding_modules.py 2025-05-07T20:09:05.6251498Z copying fbgemm_gpu/permute_pooled_embedding_modules_split.py -> _skbuild/linux-x86_64-3.12/cmake-install/fbgemm_gpu/permute_pooled_embedding_modules_split.py 2025-05-07T20:09:05.6252726Z copying fbgemm_gpu/quantize_comm.py -> _skbuild/linux-x86_64-3.12/cmake-install/fbgemm_gpu/quantize_comm.py 2025-05-07T20:09:05.6253635Z copying fbgemm_gpu/quantize_utils.py -> _skbuild/linux-x86_64-3.12/cmake-install/fbgemm_gpu/quantize_utils.py 2025-05-07T20:09:05.6254539Z copying fbgemm_gpu/runtime_monitor.py -> _skbuild/linux-x86_64-3.12/cmake-install/fbgemm_gpu/runtime_monitor.py 2025-05-07T20:09:05.6255494Z copying fbgemm_gpu/sparse_ops.py -> _skbuild/linux-x86_64-3.12/cmake-install/fbgemm_gpu/sparse_ops.py 2025-05-07T20:09:05.6256524Z copying fbgemm_gpu/split_embedding_configs.py -> _skbuild/linux-x86_64-3.12/cmake-install/fbgemm_gpu/split_embedding_configs.py 2025-05-07T20:09:05.6257760Z copying fbgemm_gpu/split_embedding_inference_converter.py -> _skbuild/linux-x86_64-3.12/cmake-install/fbgemm_gpu/split_embedding_inference_converter.py 2025-05-07T20:09:05.6259385Z copying fbgemm_gpu/split_embedding_optimizer_ops.py -> _skbuild/linux-x86_64-3.12/cmake-install/fbgemm_gpu/split_embedding_optimizer_ops.py 2025-05-07T20:09:05.6260469Z copying fbgemm_gpu/split_embedding_utils.py -> _skbuild/linux-x86_64-3.12/cmake-install/fbgemm_gpu/split_embedding_utils.py 2025-05-07T20:09:05.6261590Z copying fbgemm_gpu/split_table_batched_embeddings_ops.py -> _skbuild/linux-x86_64-3.12/cmake-install/fbgemm_gpu/split_table_batched_embeddings_ops.py 2025-05-07T20:09:05.6262869Z copying fbgemm_gpu/split_table_batched_embeddings_ops_common.py -> _skbuild/linux-x86_64-3.12/cmake-install/fbgemm_gpu/split_table_batched_embeddings_ops_common.py 2025-05-07T20:09:05.6264318Z copying fbgemm_gpu/split_table_batched_embeddings_ops_inference.py -> _skbuild/linux-x86_64-3.12/cmake-install/fbgemm_gpu/split_table_batched_embeddings_ops_inference.py 2025-05-07T20:09:05.6265917Z copying fbgemm_gpu/split_table_batched_embeddings_ops_training.py -> _skbuild/linux-x86_64-3.12/cmake-install/fbgemm_gpu/split_table_batched_embeddings_ops_training.py 2025-05-07T20:09:05.6267465Z copying fbgemm_gpu/split_table_batched_embeddings_ops_training_common.py -> _skbuild/linux-x86_64-3.12/cmake-install/fbgemm_gpu/split_table_batched_embeddings_ops_training_common.py 2025-05-07T20:09:05.6268881Z copying fbgemm_gpu/ssd_split_table_batched_embeddings_ops.py -> _skbuild/linux-x86_64-3.12/cmake-install/fbgemm_gpu/ssd_split_table_batched_embeddings_ops.py 2025-05-07T20:09:05.6270140Z copying fbgemm_gpu/tbe_input_multiplexer.py -> _skbuild/linux-x86_64-3.12/cmake-install/fbgemm_gpu/tbe_input_multiplexer.py 2025-05-07T20:09:05.6271004Z copying fbgemm_gpu/uvm.py -> _skbuild/linux-x86_64-3.12/cmake-install/fbgemm_gpu/uvm.py 2025-05-07T20:09:05.6271912Z creating directory _skbuild/linux-x86_64-3.12/cmake-install/fbgemm_gpu/config 2025-05-07T20:09:05.6272759Z copying fbgemm_gpu/config/__init__.py -> _skbuild/linux-x86_64-3.12/cmake-install/fbgemm_gpu/config/__init__.py 2025-05-07T20:09:05.6273737Z copying fbgemm_gpu/config/feature_list.py -> _skbuild/linux-x86_64-3.12/cmake-install/fbgemm_gpu/config/feature_list.py 2025-05-07T20:09:05.6274694Z creating directory _skbuild/linux-x86_64-3.12/cmake-install/fbgemm_gpu/docs 2025-05-07T20:09:05.6275462Z copying fbgemm_gpu/docs/__init__.py -> _skbuild/linux-x86_64-3.12/cmake-install/fbgemm_gpu/docs/__init__.py 2025-05-07T20:09:05.6276383Z copying fbgemm_gpu/docs/common.py -> _skbuild/linux-x86_64-3.12/cmake-install/fbgemm_gpu/docs/common.py 2025-05-07T20:09:05.6277314Z copying fbgemm_gpu/docs/examples.py -> _skbuild/linux-x86_64-3.12/cmake-install/fbgemm_gpu/docs/examples.py 2025-05-07T20:09:05.6278306Z copying fbgemm_gpu/docs/jagged_tensor_ops.py -> _skbuild/linux-x86_64-3.12/cmake-install/fbgemm_gpu/docs/jagged_tensor_ops.py 2025-05-07T20:09:05.6279466Z copying fbgemm_gpu/docs/merge_pooled_embedding_ops.py -> _skbuild/linux-x86_64-3.12/cmake-install/fbgemm_gpu/docs/merge_pooled_embedding_ops.py 2025-05-07T20:09:05.6280748Z copying fbgemm_gpu/docs/permute_pooled_embedding_ops.py -> _skbuild/linux-x86_64-3.12/cmake-install/fbgemm_gpu/docs/permute_pooled_embedding_ops.py 2025-05-07T20:09:05.6281906Z copying fbgemm_gpu/docs/quantize_ops.py -> _skbuild/linux-x86_64-3.12/cmake-install/fbgemm_gpu/docs/quantize_ops.py 2025-05-07T20:09:05.6282899Z copying fbgemm_gpu/docs/sparse_ops.py -> _skbuild/linux-x86_64-3.12/cmake-install/fbgemm_gpu/docs/sparse_ops.py 2025-05-07T20:09:05.6283779Z copying fbgemm_gpu/docs/version.py -> _skbuild/linux-x86_64-3.12/cmake-install/fbgemm_gpu/docs/version.py 2025-05-07T20:09:05.6284617Z creating directory _skbuild/linux-x86_64-3.12/cmake-install/fbgemm_gpu/quantize 2025-05-07T20:09:05.6285434Z copying fbgemm_gpu/quantize/__init__.py -> _skbuild/linux-x86_64-3.12/cmake-install/fbgemm_gpu/quantize/__init__.py 2025-05-07T20:09:05.6286469Z copying fbgemm_gpu/quantize/quantize_ops.py -> _skbuild/linux-x86_64-3.12/cmake-install/fbgemm_gpu/quantize/quantize_ops.py 2025-05-07T20:09:05.6287362Z creating directory _skbuild/linux-x86_64-3.12/cmake-install/fbgemm_gpu/sll 2025-05-07T20:09:05.6288171Z copying fbgemm_gpu/sll/__init__.py -> _skbuild/linux-x86_64-3.12/cmake-install/fbgemm_gpu/sll/__init__.py 2025-05-07T20:09:05.6288916Z creating directory _skbuild/linux-x86_64-3.12/cmake-install/fbgemm_gpu/tbe 2025-05-07T20:09:05.6289710Z copying fbgemm_gpu/tbe/__init__.py -> _skbuild/linux-x86_64-3.12/cmake-install/fbgemm_gpu/tbe/__init__.py 2025-05-07T20:09:05.6290468Z creating directory _skbuild/linux-x86_64-3.12/cmake-install/fbgemm_gpu/triton 2025-05-07T20:09:05.6291303Z copying fbgemm_gpu/triton/__init__.py -> _skbuild/linux-x86_64-3.12/cmake-install/fbgemm_gpu/triton/__init__.py 2025-05-07T20:09:05.6292232Z copying fbgemm_gpu/triton/common.py -> _skbuild/linux-x86_64-3.12/cmake-install/fbgemm_gpu/triton/common.py 2025-05-07T20:09:05.6293183Z copying fbgemm_gpu/triton/quantize.py -> _skbuild/linux-x86_64-3.12/cmake-install/fbgemm_gpu/triton/quantize.py 2025-05-07T20:09:05.6294174Z copying fbgemm_gpu/triton/quantize_ref.py -> _skbuild/linux-x86_64-3.12/cmake-install/fbgemm_gpu/triton/quantize_ref.py 2025-05-07T20:09:05.6295063Z creating directory _skbuild/linux-x86_64-3.12/cmake-install/fbgemm_gpu/utils 2025-05-07T20:09:05.6295820Z copying fbgemm_gpu/utils/__init__.py -> _skbuild/linux-x86_64-3.12/cmake-install/fbgemm_gpu/utils/__init__.py 2025-05-07T20:09:05.6296740Z copying fbgemm_gpu/utils/filestore.py -> _skbuild/linux-x86_64-3.12/cmake-install/fbgemm_gpu/utils/filestore.py 2025-05-07T20:09:05.6297695Z copying fbgemm_gpu/utils/loader.py -> _skbuild/linux-x86_64-3.12/cmake-install/fbgemm_gpu/utils/loader.py 2025-05-07T20:09:05.6298661Z copying fbgemm_gpu/utils/torch_library.py -> _skbuild/linux-x86_64-3.12/cmake-install/fbgemm_gpu/utils/torch_library.py 2025-05-07T20:09:05.6299571Z creating directory _skbuild/linux-x86_64-3.12/cmake-install/fbgemm_gpu/sll/cpu 2025-05-07T20:09:05.6300409Z copying fbgemm_gpu/sll/cpu/__init__.py -> _skbuild/linux-x86_64-3.12/cmake-install/fbgemm_gpu/sll/cpu/__init__.py 2025-05-07T20:09:05.6301343Z copying fbgemm_gpu/sll/cpu/cpu_sll.py -> _skbuild/linux-x86_64-3.12/cmake-install/fbgemm_gpu/sll/cpu/cpu_sll.py 2025-05-07T20:09:05.6302243Z creating directory _skbuild/linux-x86_64-3.12/cmake-install/fbgemm_gpu/sll/meta 2025-05-07T20:09:05.6303040Z copying fbgemm_gpu/sll/meta/__init__.py -> _skbuild/linux-x86_64-3.12/cmake-install/fbgemm_gpu/sll/meta/__init__.py 2025-05-07T20:09:05.6304000Z copying fbgemm_gpu/sll/meta/meta_sll.py -> _skbuild/linux-x86_64-3.12/cmake-install/fbgemm_gpu/sll/meta/meta_sll.py 2025-05-07T20:09:05.6304819Z creating directory _skbuild/linux-x86_64-3.12/cmake-install/fbgemm_gpu/sll/triton 2025-05-07T20:09:05.6305748Z copying fbgemm_gpu/sll/triton/__init__.py -> _skbuild/linux-x86_64-3.12/cmake-install/fbgemm_gpu/sll/triton/__init__.py 2025-05-07T20:09:05.6306783Z copying fbgemm_gpu/sll/triton/common.py -> _skbuild/linux-x86_64-3.12/cmake-install/fbgemm_gpu/sll/triton/common.py 2025-05-07T20:09:05.6308000Z copying fbgemm_gpu/sll/triton/triton_dense_jagged_cat_jagged_out.py -> _skbuild/linux-x86_64-3.12/cmake-install/fbgemm_gpu/sll/triton/triton_dense_jagged_cat_jagged_out.py 2025-05-07T20:09:05.6309499Z copying fbgemm_gpu/sll/triton/triton_jagged2_to_padded_dense.py -> _skbuild/linux-x86_64-3.12/cmake-install/fbgemm_gpu/sll/triton/triton_jagged2_to_padded_dense.py 2025-05-07T20:09:05.6310816Z copying fbgemm_gpu/sll/triton/triton_jagged_bmm.py -> _skbuild/linux-x86_64-3.12/cmake-install/fbgemm_gpu/sll/triton/triton_jagged_bmm.py 2025-05-07T20:09:05.6312138Z copying fbgemm_gpu/sll/triton/triton_jagged_bmm_jagged_out.py -> _skbuild/linux-x86_64-3.12/cmake-install/fbgemm_gpu/sll/triton/triton_jagged_bmm_jagged_out.py 2025-05-07T20:09:05.6313640Z copying fbgemm_gpu/sll/triton/triton_jagged_dense_elementwise_add.py -> _skbuild/linux-x86_64-3.12/cmake-install/fbgemm_gpu/sll/triton/triton_jagged_dense_elementwise_add.py 2025-05-07T20:09:05.6315259Z copying fbgemm_gpu/sll/triton/triton_jagged_dense_elementwise_mul_jagged_out.py -> _skbuild/linux-x86_64-3.12/cmake-install/fbgemm_gpu/sll/triton/triton_jagged_dense_elementwise_mul_jagged_out.py 2025-05-07T20:09:05.6316959Z copying fbgemm_gpu/sll/triton/triton_jagged_dense_flash_attention.py -> _skbuild/linux-x86_64-3.12/cmake-install/fbgemm_gpu/sll/triton/triton_jagged_dense_flash_attention.py 2025-05-07T20:09:05.6318487Z copying fbgemm_gpu/sll/triton/triton_jagged_flash_attention_basic.py -> _skbuild/linux-x86_64-3.12/cmake-install/fbgemm_gpu/sll/triton/triton_jagged_flash_attention_basic.py 2025-05-07T20:09:05.6320098Z copying fbgemm_gpu/sll/triton/triton_jagged_self_substraction_jagged_out.py -> _skbuild/linux-x86_64-3.12/cmake-install/fbgemm_gpu/sll/triton/triton_jagged_self_substraction_jagged_out.py 2025-05-07T20:09:05.6321596Z copying fbgemm_gpu/sll/triton/triton_jagged_softmax.py -> _skbuild/linux-x86_64-3.12/cmake-install/fbgemm_gpu/sll/triton/triton_jagged_softmax.py 2025-05-07T20:09:05.6323009Z copying fbgemm_gpu/sll/triton/triton_multi_head_jagged_flash_attention.py -> _skbuild/linux-x86_64-3.12/cmake-install/fbgemm_gpu/sll/triton/triton_multi_head_jagged_flash_attention.py 2025-05-07T20:09:05.6324167Z creating directory _skbuild/linux-x86_64-3.12/cmake-install/fbgemm_gpu/tbe/bench 2025-05-07T20:09:05.6324991Z copying fbgemm_gpu/tbe/bench/__init__.py -> _skbuild/linux-x86_64-3.12/cmake-install/fbgemm_gpu/tbe/bench/__init__.py 2025-05-07T20:09:05.6326037Z copying fbgemm_gpu/tbe/bench/bench_config.py -> _skbuild/linux-x86_64-3.12/cmake-install/fbgemm_gpu/tbe/bench/bench_config.py 2025-05-07T20:09:05.6327075Z copying fbgemm_gpu/tbe/bench/bench_runs.py -> _skbuild/linux-x86_64-3.12/cmake-install/fbgemm_gpu/tbe/bench/bench_runs.py 2025-05-07T20:09:05.6328102Z copying fbgemm_gpu/tbe/bench/eeg_cli.py -> _skbuild/linux-x86_64-3.12/cmake-install/fbgemm_gpu/tbe/bench/eeg_cli.py 2025-05-07T20:09:05.6329303Z copying fbgemm_gpu/tbe/bench/embedding_ops_common_config.py -> _skbuild/linux-x86_64-3.12/cmake-install/fbgemm_gpu/tbe/bench/embedding_ops_common_config.py 2025-05-07T20:09:05.6330557Z copying fbgemm_gpu/tbe/bench/eval_compression.py -> _skbuild/linux-x86_64-3.12/cmake-install/fbgemm_gpu/tbe/bench/eval_compression.py 2025-05-07T20:09:05.6331711Z copying fbgemm_gpu/tbe/bench/reporter.py -> _skbuild/linux-x86_64-3.12/cmake-install/fbgemm_gpu/tbe/bench/reporter.py 2025-05-07T20:09:05.6332788Z copying fbgemm_gpu/tbe/bench/tbe_data_config.py -> _skbuild/linux-x86_64-3.12/cmake-install/fbgemm_gpu/tbe/bench/tbe_data_config.py 2025-05-07T20:09:05.6333983Z copying fbgemm_gpu/tbe/bench/tbe_data_config_loader.py -> _skbuild/linux-x86_64-3.12/cmake-install/fbgemm_gpu/tbe/bench/tbe_data_config_loader.py 2025-05-07T20:09:05.6335367Z copying fbgemm_gpu/tbe/bench/tbe_data_config_param_models.py -> _skbuild/linux-x86_64-3.12/cmake-install/fbgemm_gpu/tbe/bench/tbe_data_config_param_models.py 2025-05-07T20:09:05.6336515Z copying fbgemm_gpu/tbe/bench/utils.py -> _skbuild/linux-x86_64-3.12/cmake-install/fbgemm_gpu/tbe/bench/utils.py 2025-05-07T20:09:05.6337347Z creating directory _skbuild/linux-x86_64-3.12/cmake-install/fbgemm_gpu/tbe/cache 2025-05-07T20:09:05.6338112Z copying fbgemm_gpu/tbe/cache/__init__.py -> _skbuild/linux-x86_64-3.12/cmake-install/fbgemm_gpu/tbe/cache/__init__.py 2025-05-07T20:09:05.6339221Z copying fbgemm_gpu/tbe/cache/split_embeddings_cache_ops.py -> _skbuild/linux-x86_64-3.12/cmake-install/fbgemm_gpu/tbe/cache/split_embeddings_cache_ops.py 2025-05-07T20:09:05.6340205Z creating directory _skbuild/linux-x86_64-3.12/cmake-install/fbgemm_gpu/tbe/ssd 2025-05-07T20:09:05.6340937Z copying fbgemm_gpu/tbe/ssd/__init__.py -> _skbuild/linux-x86_64-3.12/cmake-install/fbgemm_gpu/tbe/ssd/__init__.py 2025-05-07T20:09:05.6341799Z copying fbgemm_gpu/tbe/ssd/common.py -> _skbuild/linux-x86_64-3.12/cmake-install/fbgemm_gpu/tbe/ssd/common.py 2025-05-07T20:09:05.6342671Z copying fbgemm_gpu/tbe/ssd/inference.py -> _skbuild/linux-x86_64-3.12/cmake-install/fbgemm_gpu/tbe/ssd/inference.py 2025-05-07T20:09:05.6343591Z copying fbgemm_gpu/tbe/ssd/training.py -> _skbuild/linux-x86_64-3.12/cmake-install/fbgemm_gpu/tbe/ssd/training.py 2025-05-07T20:09:05.6344368Z creating directory _skbuild/linux-x86_64-3.12/cmake-install/fbgemm_gpu/tbe/stats 2025-05-07T20:09:05.6345299Z copying fbgemm_gpu/tbe/stats/__init__.py -> _skbuild/linux-x86_64-3.12/cmake-install/fbgemm_gpu/tbe/stats/__init__.py 2025-05-07T20:09:05.6346337Z copying fbgemm_gpu/tbe/stats/bench_params_reporter.py -> _skbuild/linux-x86_64-3.12/cmake-install/fbgemm_gpu/tbe/stats/bench_params_reporter.py 2025-05-07T20:09:05.6347225Z creating directory _skbuild/linux-x86_64-3.12/cmake-install/fbgemm_gpu/tbe/utils 2025-05-07T20:09:05.6348014Z copying fbgemm_gpu/tbe/utils/__init__.py -> _skbuild/linux-x86_64-3.12/cmake-install/fbgemm_gpu/tbe/utils/__init__.py 2025-05-07T20:09:05.6348918Z copying fbgemm_gpu/tbe/utils/common.py -> _skbuild/linux-x86_64-3.12/cmake-install/fbgemm_gpu/tbe/utils/common.py 2025-05-07T20:09:05.6349811Z copying fbgemm_gpu/tbe/utils/offsets.py -> _skbuild/linux-x86_64-3.12/cmake-install/fbgemm_gpu/tbe/utils/offsets.py 2025-05-07T20:09:05.6350747Z copying fbgemm_gpu/tbe/utils/quantize.py -> _skbuild/linux-x86_64-3.12/cmake-install/fbgemm_gpu/tbe/utils/quantize.py 2025-05-07T20:09:05.6351741Z copying fbgemm_gpu/tbe/utils/requests.py -> _skbuild/linux-x86_64-3.12/cmake-install/fbgemm_gpu/tbe/utils/requests.py 2025-05-07T20:09:05.6352555Z creating directory _skbuild/linux-x86_64-3.12/cmake-install/fbgemm_gpu/tbe/ssd/utils 2025-05-07T20:09:05.6353384Z copying fbgemm_gpu/tbe/ssd/utils/__init__.py -> _skbuild/linux-x86_64-3.12/cmake-install/fbgemm_gpu/tbe/ssd/utils/__init__.py 2025-05-07T20:09:05.6354551Z copying fbgemm_gpu/tbe/ssd/utils/partially_materialized_tensor.py -> _skbuild/linux-x86_64-3.12/cmake-install/fbgemm_gpu/tbe/ssd/utils/partially_materialized_tensor.py 2025-05-07T20:09:05.6355590Z creating directory _skbuild/linux-x86_64-3.12/cmake-install/fbgemm_gpu/triton/jagged 2025-05-07T20:09:05.6356486Z copying fbgemm_gpu/triton/jagged/__init__.py -> _skbuild/linux-x86_64-3.12/cmake-install/fbgemm_gpu/triton/jagged/__init__.py 2025-05-07T20:09:05.6357584Z copying fbgemm_gpu/triton/jagged/triton_jagged_tensor_ops.py -> _skbuild/linux-x86_64-3.12/cmake-install/fbgemm_gpu/triton/jagged/triton_jagged_tensor_ops.py 2025-05-07T20:09:05.6358285Z 2025-05-07T20:09:05.6471209Z INFO:root:running bdist_wheel 2025-05-07T20:09:05.6519887Z INFO:root:running build 2025-05-07T20:09:05.6520260Z INFO:root:running build_py 2025-05-07T20:09:05.6525619Z INFO:root:creating _skbuild/linux-x86_64-3.12/setuptools/lib.linux-x86_64-cpython-312/fbgemm_gpu 2025-05-07T20:09:05.6527628Z INFO:root:copying _skbuild/linux-x86_64-3.12/cmake-install/fbgemm_gpu/__init__.py -> _skbuild/linux-x86_64-3.12/setuptools/lib.linux-x86_64-cpython-312/fbgemm_gpu 2025-05-07T20:09:05.6530231Z INFO:root:copying _skbuild/linux-x86_64-3.12/cmake-install/fbgemm_gpu/batched_unary_embeddings_ops.py -> _skbuild/linux-x86_64-3.12/setuptools/lib.linux-x86_64-cpython-312/fbgemm_gpu 2025-05-07T20:09:05.6531649Z INFO:root:copying _skbuild/linux-x86_64-3.12/cmake-install/fbgemm_gpu/enums.py -> _skbuild/linux-x86_64-3.12/setuptools/lib.linux-x86_64-cpython-312/fbgemm_gpu 2025-05-07T20:09:05.6532891Z INFO:root:copying _skbuild/linux-x86_64-3.12/cmake-install/fbgemm_gpu/metrics.py -> _skbuild/linux-x86_64-3.12/setuptools/lib.linux-x86_64-cpython-312/fbgemm_gpu 2025-05-07T20:09:05.6534372Z INFO:root:copying _skbuild/linux-x86_64-3.12/cmake-install/fbgemm_gpu/permute_pooled_embedding_modules.py -> _skbuild/linux-x86_64-3.12/setuptools/lib.linux-x86_64-cpython-312/fbgemm_gpu 2025-05-07T20:09:05.6535879Z INFO:root:copying _skbuild/linux-x86_64-3.12/cmake-install/fbgemm_gpu/permute_pooled_embedding_modules_split.py -> _skbuild/linux-x86_64-3.12/setuptools/lib.linux-x86_64-cpython-312/fbgemm_gpu 2025-05-07T20:09:05.6537282Z INFO:root:copying _skbuild/linux-x86_64-3.12/cmake-install/fbgemm_gpu/quantize_comm.py -> _skbuild/linux-x86_64-3.12/setuptools/lib.linux-x86_64-cpython-312/fbgemm_gpu 2025-05-07T20:09:05.6539154Z INFO:root:copying _skbuild/linux-x86_64-3.12/cmake-install/fbgemm_gpu/quantize_utils.py -> _skbuild/linux-x86_64-3.12/setuptools/lib.linux-x86_64-cpython-312/fbgemm_gpu 2025-05-07T20:09:05.6540764Z INFO:root:copying _skbuild/linux-x86_64-3.12/cmake-install/fbgemm_gpu/runtime_monitor.py -> _skbuild/linux-x86_64-3.12/setuptools/lib.linux-x86_64-cpython-312/fbgemm_gpu 2025-05-07T20:09:05.6542261Z INFO:root:copying _skbuild/linux-x86_64-3.12/cmake-install/fbgemm_gpu/sparse_ops.py -> _skbuild/linux-x86_64-3.12/setuptools/lib.linux-x86_64-cpython-312/fbgemm_gpu 2025-05-07T20:09:05.6544300Z INFO:root:copying _skbuild/linux-x86_64-3.12/cmake-install/fbgemm_gpu/split_embedding_configs.py -> _skbuild/linux-x86_64-3.12/setuptools/lib.linux-x86_64-cpython-312/fbgemm_gpu 2025-05-07T20:09:05.6546494Z INFO:root:copying _skbuild/linux-x86_64-3.12/cmake-install/fbgemm_gpu/split_embedding_inference_converter.py -> _skbuild/linux-x86_64-3.12/setuptools/lib.linux-x86_64-cpython-312/fbgemm_gpu 2025-05-07T20:09:05.6547949Z INFO:root:copying _skbuild/linux-x86_64-3.12/cmake-install/fbgemm_gpu/split_embedding_optimizer_ops.py -> _skbuild/linux-x86_64-3.12/setuptools/lib.linux-x86_64-cpython-312/fbgemm_gpu 2025-05-07T20:09:05.6549448Z INFO:root:copying _skbuild/linux-x86_64-3.12/cmake-install/fbgemm_gpu/split_embedding_utils.py -> _skbuild/linux-x86_64-3.12/setuptools/lib.linux-x86_64-cpython-312/fbgemm_gpu 2025-05-07T20:09:05.6550991Z INFO:root:copying _skbuild/linux-x86_64-3.12/cmake-install/fbgemm_gpu/split_table_batched_embeddings_ops.py -> _skbuild/linux-x86_64-3.12/setuptools/lib.linux-x86_64-cpython-312/fbgemm_gpu 2025-05-07T20:09:05.6552591Z INFO:root:copying _skbuild/linux-x86_64-3.12/cmake-install/fbgemm_gpu/split_table_batched_embeddings_ops_common.py -> _skbuild/linux-x86_64-3.12/setuptools/lib.linux-x86_64-cpython-312/fbgemm_gpu 2025-05-07T20:09:05.6554297Z INFO:root:copying _skbuild/linux-x86_64-3.12/cmake-install/fbgemm_gpu/split_table_batched_embeddings_ops_inference.py -> _skbuild/linux-x86_64-3.12/setuptools/lib.linux-x86_64-cpython-312/fbgemm_gpu 2025-05-07T20:09:05.6556426Z INFO:root:copying _skbuild/linux-x86_64-3.12/cmake-install/fbgemm_gpu/split_table_batched_embeddings_ops_training.py -> _skbuild/linux-x86_64-3.12/setuptools/lib.linux-x86_64-cpython-312/fbgemm_gpu 2025-05-07T20:09:05.6559968Z INFO:root:copying _skbuild/linux-x86_64-3.12/cmake-install/fbgemm_gpu/split_table_batched_embeddings_ops_training_common.py -> _skbuild/linux-x86_64-3.12/setuptools/lib.linux-x86_64-cpython-312/fbgemm_gpu 2025-05-07T20:09:05.6561511Z INFO:root:copying _skbuild/linux-x86_64-3.12/cmake-install/fbgemm_gpu/ssd_split_table_batched_embeddings_ops.py -> _skbuild/linux-x86_64-3.12/setuptools/lib.linux-x86_64-cpython-312/fbgemm_gpu 2025-05-07T20:09:05.6562982Z INFO:root:copying _skbuild/linux-x86_64-3.12/cmake-install/fbgemm_gpu/tbe_input_multiplexer.py -> _skbuild/linux-x86_64-3.12/setuptools/lib.linux-x86_64-cpython-312/fbgemm_gpu 2025-05-07T20:09:05.6564293Z INFO:root:copying _skbuild/linux-x86_64-3.12/cmake-install/fbgemm_gpu/uvm.py -> _skbuild/linux-x86_64-3.12/setuptools/lib.linux-x86_64-cpython-312/fbgemm_gpu 2025-05-07T20:09:05.6566901Z INFO:root:creating _skbuild/linux-x86_64-3.12/setuptools/lib.linux-x86_64-cpython-312/fbgemm_gpu/config 2025-05-07T20:09:05.6568141Z INFO:root:copying _skbuild/linux-x86_64-3.12/cmake-install/fbgemm_gpu/config/__init__.py -> _skbuild/linux-x86_64-3.12/setuptools/lib.linux-x86_64-cpython-312/fbgemm_gpu/config 2025-05-07T20:09:05.6569701Z INFO:root:copying _skbuild/linux-x86_64-3.12/cmake-install/fbgemm_gpu/config/feature_list.py -> _skbuild/linux-x86_64-3.12/setuptools/lib.linux-x86_64-cpython-312/fbgemm_gpu/config 2025-05-07T20:09:05.6571987Z INFO:root:creating _skbuild/linux-x86_64-3.12/setuptools/lib.linux-x86_64-cpython-312/fbgemm_gpu/docs 2025-05-07T20:09:05.6573213Z INFO:root:copying _skbuild/linux-x86_64-3.12/cmake-install/fbgemm_gpu/docs/__init__.py -> _skbuild/linux-x86_64-3.12/setuptools/lib.linux-x86_64-cpython-312/fbgemm_gpu/docs 2025-05-07T20:09:05.6574752Z INFO:root:copying _skbuild/linux-x86_64-3.12/cmake-install/fbgemm_gpu/docs/common.py -> _skbuild/linux-x86_64-3.12/setuptools/lib.linux-x86_64-cpython-312/fbgemm_gpu/docs 2025-05-07T20:09:05.6576221Z INFO:root:copying _skbuild/linux-x86_64-3.12/cmake-install/fbgemm_gpu/docs/examples.py -> _skbuild/linux-x86_64-3.12/setuptools/lib.linux-x86_64-cpython-312/fbgemm_gpu/docs 2025-05-07T20:09:05.6577696Z INFO:root:copying _skbuild/linux-x86_64-3.12/cmake-install/fbgemm_gpu/docs/jagged_tensor_ops.py -> _skbuild/linux-x86_64-3.12/setuptools/lib.linux-x86_64-cpython-312/fbgemm_gpu/docs 2025-05-07T20:09:05.6579266Z INFO:root:copying _skbuild/linux-x86_64-3.12/cmake-install/fbgemm_gpu/docs/merge_pooled_embedding_ops.py -> _skbuild/linux-x86_64-3.12/setuptools/lib.linux-x86_64-cpython-312/fbgemm_gpu/docs 2025-05-07T20:09:05.6580765Z INFO:root:copying _skbuild/linux-x86_64-3.12/cmake-install/fbgemm_gpu/docs/permute_pooled_embedding_ops.py -> _skbuild/linux-x86_64-3.12/setuptools/lib.linux-x86_64-cpython-312/fbgemm_gpu/docs 2025-05-07T20:09:05.6582224Z INFO:root:copying _skbuild/linux-x86_64-3.12/cmake-install/fbgemm_gpu/docs/quantize_ops.py -> _skbuild/linux-x86_64-3.12/setuptools/lib.linux-x86_64-cpython-312/fbgemm_gpu/docs 2025-05-07T20:09:05.6583602Z INFO:root:copying _skbuild/linux-x86_64-3.12/cmake-install/fbgemm_gpu/docs/sparse_ops.py -> _skbuild/linux-x86_64-3.12/setuptools/lib.linux-x86_64-cpython-312/fbgemm_gpu/docs 2025-05-07T20:09:05.6585495Z INFO:root:copying _skbuild/linux-x86_64-3.12/cmake-install/fbgemm_gpu/docs/version.py -> _skbuild/linux-x86_64-3.12/setuptools/lib.linux-x86_64-cpython-312/fbgemm_gpu/docs 2025-05-07T20:09:05.6587939Z INFO:root:creating _skbuild/linux-x86_64-3.12/setuptools/lib.linux-x86_64-cpython-312/fbgemm_gpu/quantize 2025-05-07T20:09:05.6589155Z INFO:root:copying _skbuild/linux-x86_64-3.12/cmake-install/fbgemm_gpu/quantize/__init__.py -> _skbuild/linux-x86_64-3.12/setuptools/lib.linux-x86_64-cpython-312/fbgemm_gpu/quantize 2025-05-07T20:09:05.6590741Z INFO:root:copying _skbuild/linux-x86_64-3.12/cmake-install/fbgemm_gpu/quantize/quantize_ops.py -> _skbuild/linux-x86_64-3.12/setuptools/lib.linux-x86_64-cpython-312/fbgemm_gpu/quantize 2025-05-07T20:09:05.6592715Z INFO:root:creating _skbuild/linux-x86_64-3.12/setuptools/lib.linux-x86_64-cpython-312/fbgemm_gpu/sll 2025-05-07T20:09:05.6593933Z INFO:root:copying _skbuild/linux-x86_64-3.12/cmake-install/fbgemm_gpu/sll/__init__.py -> _skbuild/linux-x86_64-3.12/setuptools/lib.linux-x86_64-cpython-312/fbgemm_gpu/sll 2025-05-07T20:09:05.6595892Z INFO:root:creating _skbuild/linux-x86_64-3.12/setuptools/lib.linux-x86_64-cpython-312/fbgemm_gpu/tbe 2025-05-07T20:09:05.6597046Z INFO:root:copying _skbuild/linux-x86_64-3.12/cmake-install/fbgemm_gpu/tbe/__init__.py -> _skbuild/linux-x86_64-3.12/setuptools/lib.linux-x86_64-cpython-312/fbgemm_gpu/tbe 2025-05-07T20:09:05.6599100Z INFO:root:creating _skbuild/linux-x86_64-3.12/setuptools/lib.linux-x86_64-cpython-312/fbgemm_gpu/triton 2025-05-07T20:09:05.6600253Z INFO:root:copying _skbuild/linux-x86_64-3.12/cmake-install/fbgemm_gpu/triton/__init__.py -> _skbuild/linux-x86_64-3.12/setuptools/lib.linux-x86_64-cpython-312/fbgemm_gpu/triton 2025-05-07T20:09:05.6601821Z INFO:root:copying _skbuild/linux-x86_64-3.12/cmake-install/fbgemm_gpu/triton/common.py -> _skbuild/linux-x86_64-3.12/setuptools/lib.linux-x86_64-cpython-312/fbgemm_gpu/triton 2025-05-07T20:09:05.6603301Z INFO:root:copying _skbuild/linux-x86_64-3.12/cmake-install/fbgemm_gpu/triton/quantize.py -> _skbuild/linux-x86_64-3.12/setuptools/lib.linux-x86_64-cpython-312/fbgemm_gpu/triton 2025-05-07T20:09:05.6605466Z INFO:root:copying _skbuild/linux-x86_64-3.12/cmake-install/fbgemm_gpu/triton/quantize_ref.py -> _skbuild/linux-x86_64-3.12/setuptools/lib.linux-x86_64-cpython-312/fbgemm_gpu/triton 2025-05-07T20:09:05.6607451Z INFO:root:creating _skbuild/linux-x86_64-3.12/setuptools/lib.linux-x86_64-cpython-312/fbgemm_gpu/utils 2025-05-07T20:09:05.6608639Z INFO:root:copying _skbuild/linux-x86_64-3.12/cmake-install/fbgemm_gpu/utils/__init__.py -> _skbuild/linux-x86_64-3.12/setuptools/lib.linux-x86_64-cpython-312/fbgemm_gpu/utils 2025-05-07T20:09:05.6610202Z INFO:root:copying _skbuild/linux-x86_64-3.12/cmake-install/fbgemm_gpu/utils/filestore.py -> _skbuild/linux-x86_64-3.12/setuptools/lib.linux-x86_64-cpython-312/fbgemm_gpu/utils 2025-05-07T20:09:05.6611610Z INFO:root:copying _skbuild/linux-x86_64-3.12/cmake-install/fbgemm_gpu/utils/loader.py -> _skbuild/linux-x86_64-3.12/setuptools/lib.linux-x86_64-cpython-312/fbgemm_gpu/utils 2025-05-07T20:09:05.6613073Z INFO:root:copying _skbuild/linux-x86_64-3.12/cmake-install/fbgemm_gpu/utils/torch_library.py -> _skbuild/linux-x86_64-3.12/setuptools/lib.linux-x86_64-cpython-312/fbgemm_gpu/utils 2025-05-07T20:09:05.6614950Z INFO:root:creating _skbuild/linux-x86_64-3.12/setuptools/lib.linux-x86_64-cpython-312/fbgemm_gpu/sll/cpu 2025-05-07T20:09:05.6616124Z INFO:root:copying _skbuild/linux-x86_64-3.12/cmake-install/fbgemm_gpu/sll/cpu/__init__.py -> _skbuild/linux-x86_64-3.12/setuptools/lib.linux-x86_64-cpython-312/fbgemm_gpu/sll/cpu 2025-05-07T20:09:05.6617700Z INFO:root:copying _skbuild/linux-x86_64-3.12/cmake-install/fbgemm_gpu/sll/cpu/cpu_sll.py -> _skbuild/linux-x86_64-3.12/setuptools/lib.linux-x86_64-cpython-312/fbgemm_gpu/sll/cpu 2025-05-07T20:09:05.6619795Z INFO:root:creating _skbuild/linux-x86_64-3.12/setuptools/lib.linux-x86_64-cpython-312/fbgemm_gpu/sll/meta 2025-05-07T20:09:05.6620970Z INFO:root:copying _skbuild/linux-x86_64-3.12/cmake-install/fbgemm_gpu/sll/meta/__init__.py -> _skbuild/linux-x86_64-3.12/setuptools/lib.linux-x86_64-cpython-312/fbgemm_gpu/sll/meta 2025-05-07T20:09:05.6622512Z INFO:root:copying _skbuild/linux-x86_64-3.12/cmake-install/fbgemm_gpu/sll/meta/meta_sll.py -> _skbuild/linux-x86_64-3.12/setuptools/lib.linux-x86_64-cpython-312/fbgemm_gpu/sll/meta 2025-05-07T20:09:05.6625052Z INFO:root:creating _skbuild/linux-x86_64-3.12/setuptools/lib.linux-x86_64-cpython-312/fbgemm_gpu/sll/triton 2025-05-07T20:09:05.6626249Z INFO:root:copying _skbuild/linux-x86_64-3.12/cmake-install/fbgemm_gpu/sll/triton/__init__.py -> _skbuild/linux-x86_64-3.12/setuptools/lib.linux-x86_64-cpython-312/fbgemm_gpu/sll/triton 2025-05-07T20:09:05.6627993Z INFO:root:copying _skbuild/linux-x86_64-3.12/cmake-install/fbgemm_gpu/sll/triton/common.py -> _skbuild/linux-x86_64-3.12/setuptools/lib.linux-x86_64-cpython-312/fbgemm_gpu/sll/triton 2025-05-07T20:09:05.6629778Z INFO:root:copying _skbuild/linux-x86_64-3.12/cmake-install/fbgemm_gpu/sll/triton/triton_dense_jagged_cat_jagged_out.py -> _skbuild/linux-x86_64-3.12/setuptools/lib.linux-x86_64-cpython-312/fbgemm_gpu/sll/triton 2025-05-07T20:09:05.6631403Z INFO:root:copying _skbuild/linux-x86_64-3.12/cmake-install/fbgemm_gpu/sll/triton/triton_jagged2_to_padded_dense.py -> _skbuild/linux-x86_64-3.12/setuptools/lib.linux-x86_64-cpython-312/fbgemm_gpu/sll/triton 2025-05-07T20:09:05.6633114Z INFO:root:copying _skbuild/linux-x86_64-3.12/cmake-install/fbgemm_gpu/sll/triton/triton_jagged_bmm.py -> _skbuild/linux-x86_64-3.12/setuptools/lib.linux-x86_64-cpython-312/fbgemm_gpu/sll/triton 2025-05-07T20:09:05.6634685Z INFO:root:copying _skbuild/linux-x86_64-3.12/cmake-install/fbgemm_gpu/sll/triton/triton_jagged_bmm_jagged_out.py -> _skbuild/linux-x86_64-3.12/setuptools/lib.linux-x86_64-cpython-312/fbgemm_gpu/sll/triton 2025-05-07T20:09:05.6636422Z INFO:root:copying _skbuild/linux-x86_64-3.12/cmake-install/fbgemm_gpu/sll/triton/triton_jagged_dense_elementwise_add.py -> _skbuild/linux-x86_64-3.12/setuptools/lib.linux-x86_64-cpython-312/fbgemm_gpu/sll/triton 2025-05-07T20:09:05.6638128Z INFO:root:copying _skbuild/linux-x86_64-3.12/cmake-install/fbgemm_gpu/sll/triton/triton_jagged_dense_elementwise_mul_jagged_out.py -> _skbuild/linux-x86_64-3.12/setuptools/lib.linux-x86_64-cpython-312/fbgemm_gpu/sll/triton 2025-05-07T20:09:05.6639867Z INFO:root:copying _skbuild/linux-x86_64-3.12/cmake-install/fbgemm_gpu/sll/triton/triton_jagged_dense_flash_attention.py -> _skbuild/linux-x86_64-3.12/setuptools/lib.linux-x86_64-cpython-312/fbgemm_gpu/sll/triton 2025-05-07T20:09:05.6641556Z INFO:root:copying _skbuild/linux-x86_64-3.12/cmake-install/fbgemm_gpu/sll/triton/triton_jagged_flash_attention_basic.py -> _skbuild/linux-x86_64-3.12/setuptools/lib.linux-x86_64-cpython-312/fbgemm_gpu/sll/triton 2025-05-07T20:09:05.6643332Z INFO:root:copying _skbuild/linux-x86_64-3.12/cmake-install/fbgemm_gpu/sll/triton/triton_jagged_self_substraction_jagged_out.py -> _skbuild/linux-x86_64-3.12/setuptools/lib.linux-x86_64-cpython-312/fbgemm_gpu/sll/triton 2025-05-07T20:09:05.6645015Z INFO:root:copying _skbuild/linux-x86_64-3.12/cmake-install/fbgemm_gpu/sll/triton/triton_jagged_softmax.py -> _skbuild/linux-x86_64-3.12/setuptools/lib.linux-x86_64-cpython-312/fbgemm_gpu/sll/triton 2025-05-07T20:09:05.6646659Z INFO:root:copying _skbuild/linux-x86_64-3.12/cmake-install/fbgemm_gpu/sll/triton/triton_multi_head_jagged_flash_attention.py -> _skbuild/linux-x86_64-3.12/setuptools/lib.linux-x86_64-cpython-312/fbgemm_gpu/sll/triton 2025-05-07T20:09:05.6648829Z INFO:root:creating _skbuild/linux-x86_64-3.12/setuptools/lib.linux-x86_64-cpython-312/fbgemm_gpu/tbe/bench 2025-05-07T20:09:05.6650008Z INFO:root:copying _skbuild/linux-x86_64-3.12/cmake-install/fbgemm_gpu/tbe/bench/__init__.py -> _skbuild/linux-x86_64-3.12/setuptools/lib.linux-x86_64-cpython-312/fbgemm_gpu/tbe/bench 2025-05-07T20:09:05.6651605Z INFO:root:copying _skbuild/linux-x86_64-3.12/cmake-install/fbgemm_gpu/tbe/bench/bench_config.py -> _skbuild/linux-x86_64-3.12/setuptools/lib.linux-x86_64-cpython-312/fbgemm_gpu/tbe/bench 2025-05-07T20:09:05.6653177Z INFO:root:copying _skbuild/linux-x86_64-3.12/cmake-install/fbgemm_gpu/tbe/bench/bench_runs.py -> _skbuild/linux-x86_64-3.12/setuptools/lib.linux-x86_64-cpython-312/fbgemm_gpu/tbe/bench 2025-05-07T20:09:05.6654745Z INFO:root:copying _skbuild/linux-x86_64-3.12/cmake-install/fbgemm_gpu/tbe/bench/eeg_cli.py -> _skbuild/linux-x86_64-3.12/setuptools/lib.linux-x86_64-cpython-312/fbgemm_gpu/tbe/bench 2025-05-07T20:09:05.6656309Z INFO:root:copying _skbuild/linux-x86_64-3.12/cmake-install/fbgemm_gpu/tbe/bench/embedding_ops_common_config.py -> _skbuild/linux-x86_64-3.12/setuptools/lib.linux-x86_64-cpython-312/fbgemm_gpu/tbe/bench 2025-05-07T20:09:05.6657903Z INFO:root:copying _skbuild/linux-x86_64-3.12/cmake-install/fbgemm_gpu/tbe/bench/eval_compression.py -> _skbuild/linux-x86_64-3.12/setuptools/lib.linux-x86_64-cpython-312/fbgemm_gpu/tbe/bench 2025-05-07T20:09:05.6659556Z INFO:root:copying _skbuild/linux-x86_64-3.12/cmake-install/fbgemm_gpu/tbe/bench/reporter.py -> _skbuild/linux-x86_64-3.12/setuptools/lib.linux-x86_64-cpython-312/fbgemm_gpu/tbe/bench 2025-05-07T20:09:05.6661021Z INFO:root:copying _skbuild/linux-x86_64-3.12/cmake-install/fbgemm_gpu/tbe/bench/tbe_data_config.py -> _skbuild/linux-x86_64-3.12/setuptools/lib.linux-x86_64-cpython-312/fbgemm_gpu/tbe/bench 2025-05-07T20:09:05.6662801Z INFO:root:copying _skbuild/linux-x86_64-3.12/cmake-install/fbgemm_gpu/tbe/bench/tbe_data_config_loader.py -> _skbuild/linux-x86_64-3.12/setuptools/lib.linux-x86_64-cpython-312/fbgemm_gpu/tbe/bench 2025-05-07T20:09:05.6664359Z INFO:root:copying _skbuild/linux-x86_64-3.12/cmake-install/fbgemm_gpu/tbe/bench/tbe_data_config_param_models.py -> _skbuild/linux-x86_64-3.12/setuptools/lib.linux-x86_64-cpython-312/fbgemm_gpu/tbe/bench 2025-05-07T20:09:05.6666013Z INFO:root:copying _skbuild/linux-x86_64-3.12/cmake-install/fbgemm_gpu/tbe/bench/utils.py -> _skbuild/linux-x86_64-3.12/setuptools/lib.linux-x86_64-cpython-312/fbgemm_gpu/tbe/bench 2025-05-07T20:09:05.6668129Z INFO:root:creating _skbuild/linux-x86_64-3.12/setuptools/lib.linux-x86_64-cpython-312/fbgemm_gpu/tbe/cache 2025-05-07T20:09:05.6669247Z INFO:root:copying _skbuild/linux-x86_64-3.12/cmake-install/fbgemm_gpu/tbe/cache/__init__.py -> _skbuild/linux-x86_64-3.12/setuptools/lib.linux-x86_64-cpython-312/fbgemm_gpu/tbe/cache 2025-05-07T20:09:05.6670762Z INFO:root:copying _skbuild/linux-x86_64-3.12/cmake-install/fbgemm_gpu/tbe/cache/split_embeddings_cache_ops.py -> _skbuild/linux-x86_64-3.12/setuptools/lib.linux-x86_64-cpython-312/fbgemm_gpu/tbe/cache 2025-05-07T20:09:05.6672957Z INFO:root:creating _skbuild/linux-x86_64-3.12/setuptools/lib.linux-x86_64-cpython-312/fbgemm_gpu/tbe/ssd 2025-05-07T20:09:05.6674325Z INFO:root:copying _skbuild/linux-x86_64-3.12/cmake-install/fbgemm_gpu/tbe/ssd/__init__.py -> _skbuild/linux-x86_64-3.12/setuptools/lib.linux-x86_64-cpython-312/fbgemm_gpu/tbe/ssd 2025-05-07T20:09:05.6675760Z INFO:root:copying _skbuild/linux-x86_64-3.12/cmake-install/fbgemm_gpu/tbe/ssd/common.py -> _skbuild/linux-x86_64-3.12/setuptools/lib.linux-x86_64-cpython-312/fbgemm_gpu/tbe/ssd 2025-05-07T20:09:05.6677313Z INFO:root:copying _skbuild/linux-x86_64-3.12/cmake-install/fbgemm_gpu/tbe/ssd/inference.py -> _skbuild/linux-x86_64-3.12/setuptools/lib.linux-x86_64-cpython-312/fbgemm_gpu/tbe/ssd 2025-05-07T20:09:05.6678850Z INFO:root:copying _skbuild/linux-x86_64-3.12/cmake-install/fbgemm_gpu/tbe/ssd/training.py -> _skbuild/linux-x86_64-3.12/setuptools/lib.linux-x86_64-cpython-312/fbgemm_gpu/tbe/ssd 2025-05-07T20:09:05.6681653Z INFO:root:creating _skbuild/linux-x86_64-3.12/setuptools/lib.linux-x86_64-cpython-312/fbgemm_gpu/tbe/stats 2025-05-07T20:09:05.6682855Z INFO:root:copying _skbuild/linux-x86_64-3.12/cmake-install/fbgemm_gpu/tbe/stats/__init__.py -> _skbuild/linux-x86_64-3.12/setuptools/lib.linux-x86_64-cpython-312/fbgemm_gpu/tbe/stats 2025-05-07T20:09:05.6684562Z INFO:root:copying _skbuild/linux-x86_64-3.12/cmake-install/fbgemm_gpu/tbe/stats/bench_params_reporter.py -> _skbuild/linux-x86_64-3.12/setuptools/lib.linux-x86_64-cpython-312/fbgemm_gpu/tbe/stats 2025-05-07T20:09:05.6686479Z INFO:root:creating _skbuild/linux-x86_64-3.12/setuptools/lib.linux-x86_64-cpython-312/fbgemm_gpu/tbe/utils 2025-05-07T20:09:05.6687622Z INFO:root:copying _skbuild/linux-x86_64-3.12/cmake-install/fbgemm_gpu/tbe/utils/__init__.py -> _skbuild/linux-x86_64-3.12/setuptools/lib.linux-x86_64-cpython-312/fbgemm_gpu/tbe/utils 2025-05-07T20:09:05.6689206Z INFO:root:copying _skbuild/linux-x86_64-3.12/cmake-install/fbgemm_gpu/tbe/utils/common.py -> _skbuild/linux-x86_64-3.12/setuptools/lib.linux-x86_64-cpython-312/fbgemm_gpu/tbe/utils 2025-05-07T20:09:05.6690710Z INFO:root:copying _skbuild/linux-x86_64-3.12/cmake-install/fbgemm_gpu/tbe/utils/offsets.py -> _skbuild/linux-x86_64-3.12/setuptools/lib.linux-x86_64-cpython-312/fbgemm_gpu/tbe/utils 2025-05-07T20:09:05.6692155Z INFO:root:copying _skbuild/linux-x86_64-3.12/cmake-install/fbgemm_gpu/tbe/utils/quantize.py -> _skbuild/linux-x86_64-3.12/setuptools/lib.linux-x86_64-cpython-312/fbgemm_gpu/tbe/utils 2025-05-07T20:09:05.6693661Z INFO:root:copying _skbuild/linux-x86_64-3.12/cmake-install/fbgemm_gpu/tbe/utils/requests.py -> _skbuild/linux-x86_64-3.12/setuptools/lib.linux-x86_64-cpython-312/fbgemm_gpu/tbe/utils 2025-05-07T20:09:05.6695623Z INFO:root:creating _skbuild/linux-x86_64-3.12/setuptools/lib.linux-x86_64-cpython-312/fbgemm_gpu/tbe/ssd/utils 2025-05-07T20:09:05.6697020Z INFO:root:copying _skbuild/linux-x86_64-3.12/cmake-install/fbgemm_gpu/tbe/ssd/utils/__init__.py -> _skbuild/linux-x86_64-3.12/setuptools/lib.linux-x86_64-cpython-312/fbgemm_gpu/tbe/ssd/utils 2025-05-07T20:09:05.6698611Z INFO:root:copying _skbuild/linux-x86_64-3.12/cmake-install/fbgemm_gpu/tbe/ssd/utils/partially_materialized_tensor.py -> _skbuild/linux-x86_64-3.12/setuptools/lib.linux-x86_64-cpython-312/fbgemm_gpu/tbe/ssd/utils 2025-05-07T20:09:05.6700168Z INFO:root:creating _skbuild/linux-x86_64-3.12/setuptools/lib.linux-x86_64-cpython-312/fbgemm_gpu/triton/jagged 2025-05-07T20:09:05.6701364Z INFO:root:copying _skbuild/linux-x86_64-3.12/cmake-install/fbgemm_gpu/triton/jagged/__init__.py -> _skbuild/linux-x86_64-3.12/setuptools/lib.linux-x86_64-cpython-312/fbgemm_gpu/triton/jagged 2025-05-07T20:09:05.6702919Z INFO:root:copying _skbuild/linux-x86_64-3.12/cmake-install/fbgemm_gpu/triton/jagged/triton_jagged_tensor_ops.py -> _skbuild/linux-x86_64-3.12/setuptools/lib.linux-x86_64-cpython-312/fbgemm_gpu/triton/jagged 2025-05-07T20:09:05.6748486Z INFO:root:copying _skbuild/linux-x86_64-3.12/cmake-install/fbgemm_gpu/asmjit.so -> _skbuild/linux-x86_64-3.12/setuptools/lib.linux-x86_64-cpython-312/fbgemm_gpu 2025-05-07T20:09:05.6774036Z INFO:root:copying _skbuild/linux-x86_64-3.12/cmake-install/fbgemm_gpu/fbgemm.so -> _skbuild/linux-x86_64-3.12/setuptools/lib.linux-x86_64-cpython-312/fbgemm_gpu 2025-05-07T20:09:05.7021803Z INFO:root:copying _skbuild/linux-x86_64-3.12/cmake-install/fbgemm_gpu/fbgemm_gpu_tbe_cache.so -> _skbuild/linux-x86_64-3.12/setuptools/lib.linux-x86_64-cpython-312/fbgemm_gpu 2025-05-07T20:09:05.8192706Z INFO:root:copying _skbuild/linux-x86_64-3.12/cmake-install/fbgemm_gpu/fbgemm_gpu_tbe_inference.so -> _skbuild/linux-x86_64-3.12/setuptools/lib.linux-x86_64-cpython-312/fbgemm_gpu 2025-05-07T20:09:09.2474306Z INFO:root:copying _skbuild/linux-x86_64-3.12/cmake-install/fbgemm_gpu/fbgemm_gpu_config.so -> _skbuild/linux-x86_64-3.12/setuptools/lib.linux-x86_64-cpython-312/fbgemm_gpu 2025-05-07T20:09:09.2478704Z INFO:root:copying _skbuild/linux-x86_64-3.12/cmake-install/fbgemm_gpu/fbgemm_gpu_tbe_utils.so -> _skbuild/linux-x86_64-3.12/setuptools/lib.linux-x86_64-cpython-312/fbgemm_gpu 2025-05-07T20:09:09.3759116Z INFO:root:copying _skbuild/linux-x86_64-3.12/cmake-install/fbgemm_gpu/fbgemm_gpu_sparse_async_cumsum.so -> _skbuild/linux-x86_64-3.12/setuptools/lib.linux-x86_64-cpython-312/fbgemm_gpu 2025-05-07T20:09:09.3872764Z INFO:root:copying _skbuild/linux-x86_64-3.12/cmake-install/fbgemm_gpu/fbgemm_gpu_tbe_common.so -> _skbuild/linux-x86_64-3.12/setuptools/lib.linux-x86_64-cpython-312/fbgemm_gpu 2025-05-07T20:09:09.4091253Z INFO:root:copying _skbuild/linux-x86_64-3.12/cmake-install/fbgemm_gpu/fbgemm_gpu_tbe_optimizers.so -> _skbuild/linux-x86_64-3.12/setuptools/lib.linux-x86_64-cpython-312/fbgemm_gpu 2025-05-07T20:09:09.4785769Z INFO:root:copying _skbuild/linux-x86_64-3.12/cmake-install/fbgemm_gpu/fbgemm_gpu_tbe_training_forward.so -> _skbuild/linux-x86_64-3.12/setuptools/lib.linux-x86_64-cpython-312/fbgemm_gpu 2025-05-07T20:09:12.2655443Z INFO:root:copying _skbuild/linux-x86_64-3.12/cmake-install/fbgemm_gpu/fbgemm_gpu_tbe_training_backward_pt2.so -> _skbuild/linux-x86_64-3.12/setuptools/lib.linux-x86_64-cpython-312/fbgemm_gpu 2025-05-07T20:09:12.3473819Z INFO:root:copying _skbuild/linux-x86_64-3.12/cmake-install/fbgemm_gpu/fbgemm_gpu_tbe_training_backward.so -> _skbuild/linux-x86_64-3.12/setuptools/lib.linux-x86_64-cpython-312/fbgemm_gpu 2025-05-07T20:09:19.5634774Z INFO:root:copying _skbuild/linux-x86_64-3.12/cmake-install/fbgemm_gpu/fbgemm_gpu_tbe_training_backward_gwd.so -> _skbuild/linux-x86_64-3.12/setuptools/lib.linux-x86_64-cpython-312/fbgemm_gpu 2025-05-07T20:09:20.7767673Z INFO:root:copying _skbuild/linux-x86_64-3.12/cmake-install/fbgemm_gpu/fbgemm_gpu_tbe_training_backward_vbe.so -> _skbuild/linux-x86_64-3.12/setuptools/lib.linux-x86_64-cpython-312/fbgemm_gpu 2025-05-07T20:09:23.3921079Z INFO:root:copying _skbuild/linux-x86_64-3.12/cmake-install/fbgemm_gpu/fbgemm_gpu_tbe_training_backward_dense.so -> _skbuild/linux-x86_64-3.12/setuptools/lib.linux-x86_64-cpython-312/fbgemm_gpu 2025-05-07T20:09:23.8578174Z INFO:root:copying _skbuild/linux-x86_64-3.12/cmake-install/fbgemm_gpu/fbgemm_gpu_tbe_training_backward_split_host.so -> _skbuild/linux-x86_64-3.12/setuptools/lib.linux-x86_64-cpython-312/fbgemm_gpu 2025-05-07T20:09:23.8954107Z INFO:root:copying _skbuild/linux-x86_64-3.12/cmake-install/fbgemm_gpu/fbgemm_gpu_tbe_index_select.so -> _skbuild/linux-x86_64-3.12/setuptools/lib.linux-x86_64-cpython-312/fbgemm_gpu 2025-05-07T20:09:24.1673535Z INFO:root:creating _skbuild/linux-x86_64-3.12/setuptools/lib.linux-x86_64-cpython-312/fbgemm_gpu/split_embedding_codegen_lookup_invokers 2025-05-07T20:09:24.1675122Z INFO:root:copying _skbuild/linux-x86_64-3.12/cmake-install/fbgemm_gpu/split_embedding_codegen_lookup_invokers/lookup_adagrad.py -> _skbuild/linux-x86_64-3.12/setuptools/lib.linux-x86_64-cpython-312/fbgemm_gpu/split_embedding_codegen_lookup_invokers 2025-05-07T20:09:24.1678649Z INFO:root:copying _skbuild/linux-x86_64-3.12/cmake-install/fbgemm_gpu/split_embedding_codegen_lookup_invokers/lookup_rowwise_adagrad.py -> _skbuild/linux-x86_64-3.12/setuptools/lib.linux-x86_64-cpython-312/fbgemm_gpu/split_embedding_codegen_lookup_invokers 2025-05-07T20:09:24.1684505Z INFO:root:copying _skbuild/linux-x86_64-3.12/cmake-install/fbgemm_gpu/split_embedding_codegen_lookup_invokers/lookup_sgd.py -> _skbuild/linux-x86_64-3.12/setuptools/lib.linux-x86_64-cpython-312/fbgemm_gpu/split_embedding_codegen_lookup_invokers 2025-05-07T20:09:24.1690480Z INFO:root:copying _skbuild/linux-x86_64-3.12/cmake-install/fbgemm_gpu/split_embedding_codegen_lookup_invokers/lookup_adam.py -> _skbuild/linux-x86_64-3.12/setuptools/lib.linux-x86_64-cpython-312/fbgemm_gpu/split_embedding_codegen_lookup_invokers 2025-05-07T20:09:24.1697158Z INFO:root:copying _skbuild/linux-x86_64-3.12/cmake-install/fbgemm_gpu/split_embedding_codegen_lookup_invokers/lookup_lamb.py -> _skbuild/linux-x86_64-3.12/setuptools/lib.linux-x86_64-cpython-312/fbgemm_gpu/split_embedding_codegen_lookup_invokers 2025-05-07T20:09:24.1703415Z INFO:root:copying _skbuild/linux-x86_64-3.12/cmake-install/fbgemm_gpu/split_embedding_codegen_lookup_invokers/lookup_lars_sgd.py -> _skbuild/linux-x86_64-3.12/setuptools/lib.linux-x86_64-cpython-312/fbgemm_gpu/split_embedding_codegen_lookup_invokers 2025-05-07T20:09:24.1709087Z INFO:root:copying _skbuild/linux-x86_64-3.12/cmake-install/fbgemm_gpu/split_embedding_codegen_lookup_invokers/lookup_partial_rowwise_adam.py -> _skbuild/linux-x86_64-3.12/setuptools/lib.linux-x86_64-cpython-312/fbgemm_gpu/split_embedding_codegen_lookup_invokers 2025-05-07T20:09:24.1714757Z INFO:root:copying _skbuild/linux-x86_64-3.12/cmake-install/fbgemm_gpu/split_embedding_codegen_lookup_invokers/lookup_partial_rowwise_lamb.py -> _skbuild/linux-x86_64-3.12/setuptools/lib.linux-x86_64-cpython-312/fbgemm_gpu/split_embedding_codegen_lookup_invokers 2025-05-07T20:09:24.1720124Z INFO:root:copying _skbuild/linux-x86_64-3.12/cmake-install/fbgemm_gpu/split_embedding_codegen_lookup_invokers/lookup_none.py -> _skbuild/linux-x86_64-3.12/setuptools/lib.linux-x86_64-cpython-312/fbgemm_gpu/split_embedding_codegen_lookup_invokers 2025-05-07T20:09:24.1726172Z INFO:root:copying _skbuild/linux-x86_64-3.12/cmake-install/fbgemm_gpu/split_embedding_codegen_lookup_invokers/lookup_rowwise_adagrad_with_counter.py -> _skbuild/linux-x86_64-3.12/setuptools/lib.linux-x86_64-cpython-312/fbgemm_gpu/split_embedding_codegen_lookup_invokers 2025-05-07T20:09:24.1731334Z INFO:root:copying _skbuild/linux-x86_64-3.12/cmake-install/fbgemm_gpu/split_embedding_codegen_lookup_invokers/lookup_rowwise_adagrad_ssd.py -> _skbuild/linux-x86_64-3.12/setuptools/lib.linux-x86_64-cpython-312/fbgemm_gpu/split_embedding_codegen_lookup_invokers 2025-05-07T20:09:24.1737460Z INFO:root:copying _skbuild/linux-x86_64-3.12/cmake-install/fbgemm_gpu/split_embedding_codegen_lookup_invokers/__init__.py -> _skbuild/linux-x86_64-3.12/setuptools/lib.linux-x86_64-cpython-312/fbgemm_gpu/split_embedding_codegen_lookup_invokers 2025-05-07T20:09:24.1755719Z INFO:root:copying _skbuild/linux-x86_64-3.12/cmake-install/fbgemm_gpu/split_embedding_codegen_lookup_invokers/lookup_args.py -> _skbuild/linux-x86_64-3.12/setuptools/lib.linux-x86_64-cpython-312/fbgemm_gpu/split_embedding_codegen_lookup_invokers 2025-05-07T20:09:24.1757816Z INFO:root:copying _skbuild/linux-x86_64-3.12/cmake-install/fbgemm_gpu/split_embedding_codegen_lookup_invokers/lookup_args_ssd.py -> _skbuild/linux-x86_64-3.12/setuptools/lib.linux-x86_64-cpython-312/fbgemm_gpu/split_embedding_codegen_lookup_invokers 2025-05-07T20:09:24.1766491Z INFO:root:creating _skbuild/linux-x86_64-3.12/setuptools/lib.linux-x86_64-cpython-312/fbgemm_gpu/split_embedding_optimizer_codegen 2025-05-07T20:09:24.1769139Z INFO:root:copying _skbuild/linux-x86_64-3.12/cmake-install/fbgemm_gpu/split_embedding_optimizer_codegen/split_embedding_optimizer_rowwise_adagrad.py -> _skbuild/linux-x86_64-3.12/setuptools/lib.linux-x86_64-cpython-312/fbgemm_gpu/split_embedding_optimizer_codegen 2025-05-07T20:09:24.1775040Z INFO:root:copying _skbuild/linux-x86_64-3.12/cmake-install/fbgemm_gpu/split_embedding_optimizer_codegen/optimizer_args.py -> _skbuild/linux-x86_64-3.12/setuptools/lib.linux-x86_64-cpython-312/fbgemm_gpu/split_embedding_optimizer_codegen 2025-05-07T20:09:24.1779858Z INFO:root:copying _skbuild/linux-x86_64-3.12/cmake-install/fbgemm_gpu/fbgemm_gpu_embedding_inplace_ops.so -> _skbuild/linux-x86_64-3.12/setuptools/lib.linux-x86_64-cpython-312/fbgemm_gpu 2025-05-07T20:09:24.1808813Z INFO:root:copying _skbuild/linux-x86_64-3.12/cmake-install/fbgemm_gpu/fbgemm_gpu_py.so -> _skbuild/linux-x86_64-3.12/setuptools/lib.linux-x86_64-cpython-312/fbgemm_gpu 2025-05-07T20:09:24.7553768Z INFO:root:copying _skbuild/linux-x86_64-3.12/cmake-install/fbgemm_gpu/__init__.py -> _skbuild/linux-x86_64-3.12/setuptools/lib.linux-x86_64-cpython-312/fbgemm_gpu 2025-05-07T20:09:24.7555133Z INFO:root:copying _skbuild/linux-x86_64-3.12/cmake-install/fbgemm_gpu/batched_unary_embeddings_ops.py -> _skbuild/linux-x86_64-3.12/setuptools/lib.linux-x86_64-cpython-312/fbgemm_gpu 2025-05-07T20:09:24.7556482Z INFO:root:copying _skbuild/linux-x86_64-3.12/cmake-install/fbgemm_gpu/enums.py -> _skbuild/linux-x86_64-3.12/setuptools/lib.linux-x86_64-cpython-312/fbgemm_gpu 2025-05-07T20:09:24.7557725Z INFO:root:copying _skbuild/linux-x86_64-3.12/cmake-install/fbgemm_gpu/metrics.py -> _skbuild/linux-x86_64-3.12/setuptools/lib.linux-x86_64-cpython-312/fbgemm_gpu 2025-05-07T20:09:24.7559054Z INFO:root:copying _skbuild/linux-x86_64-3.12/cmake-install/fbgemm_gpu/permute_pooled_embedding_modules.py -> _skbuild/linux-x86_64-3.12/setuptools/lib.linux-x86_64-cpython-312/fbgemm_gpu 2025-05-07T20:09:24.7560531Z INFO:root:copying _skbuild/linux-x86_64-3.12/cmake-install/fbgemm_gpu/permute_pooled_embedding_modules_split.py -> _skbuild/linux-x86_64-3.12/setuptools/lib.linux-x86_64-cpython-312/fbgemm_gpu 2025-05-07T20:09:24.7561926Z INFO:root:copying _skbuild/linux-x86_64-3.12/cmake-install/fbgemm_gpu/quantize_comm.py -> _skbuild/linux-x86_64-3.12/setuptools/lib.linux-x86_64-cpython-312/fbgemm_gpu 2025-05-07T20:09:24.7563233Z INFO:root:copying _skbuild/linux-x86_64-3.12/cmake-install/fbgemm_gpu/quantize_utils.py -> _skbuild/linux-x86_64-3.12/setuptools/lib.linux-x86_64-cpython-312/fbgemm_gpu 2025-05-07T20:09:24.7564841Z INFO:root:copying _skbuild/linux-x86_64-3.12/cmake-install/fbgemm_gpu/runtime_monitor.py -> _skbuild/linux-x86_64-3.12/setuptools/lib.linux-x86_64-cpython-312/fbgemm_gpu 2025-05-07T20:09:24.7566602Z INFO:root:copying _skbuild/linux-x86_64-3.12/cmake-install/fbgemm_gpu/sparse_ops.py -> _skbuild/linux-x86_64-3.12/setuptools/lib.linux-x86_64-cpython-312/fbgemm_gpu 2025-05-07T20:09:24.7569515Z INFO:root:copying _skbuild/linux-x86_64-3.12/cmake-install/fbgemm_gpu/split_embedding_configs.py -> _skbuild/linux-x86_64-3.12/setuptools/lib.linux-x86_64-cpython-312/fbgemm_gpu 2025-05-07T20:09:24.7571173Z INFO:root:copying _skbuild/linux-x86_64-3.12/cmake-install/fbgemm_gpu/split_embedding_inference_converter.py -> _skbuild/linux-x86_64-3.12/setuptools/lib.linux-x86_64-cpython-312/fbgemm_gpu 2025-05-07T20:09:24.7572851Z INFO:root:copying _skbuild/linux-x86_64-3.12/cmake-install/fbgemm_gpu/split_embedding_optimizer_ops.py -> _skbuild/linux-x86_64-3.12/setuptools/lib.linux-x86_64-cpython-312/fbgemm_gpu 2025-05-07T20:09:24.7574359Z INFO:root:copying _skbuild/linux-x86_64-3.12/cmake-install/fbgemm_gpu/split_embedding_utils.py -> _skbuild/linux-x86_64-3.12/setuptools/lib.linux-x86_64-cpython-312/fbgemm_gpu 2025-05-07T20:09:24.7575966Z INFO:root:copying _skbuild/linux-x86_64-3.12/cmake-install/fbgemm_gpu/split_table_batched_embeddings_ops.py -> _skbuild/linux-x86_64-3.12/setuptools/lib.linux-x86_64-cpython-312/fbgemm_gpu 2025-05-07T20:09:24.7577578Z INFO:root:copying _skbuild/linux-x86_64-3.12/cmake-install/fbgemm_gpu/split_table_batched_embeddings_ops_common.py -> _skbuild/linux-x86_64-3.12/setuptools/lib.linux-x86_64-cpython-312/fbgemm_gpu 2025-05-07T20:09:24.7579234Z INFO:root:copying _skbuild/linux-x86_64-3.12/cmake-install/fbgemm_gpu/split_table_batched_embeddings_ops_inference.py -> _skbuild/linux-x86_64-3.12/setuptools/lib.linux-x86_64-cpython-312/fbgemm_gpu 2025-05-07T20:09:24.7581559Z INFO:root:copying _skbuild/linux-x86_64-3.12/cmake-install/fbgemm_gpu/split_table_batched_embeddings_ops_training.py -> _skbuild/linux-x86_64-3.12/setuptools/lib.linux-x86_64-cpython-312/fbgemm_gpu 2025-05-07T20:09:24.7584896Z INFO:root:copying _skbuild/linux-x86_64-3.12/cmake-install/fbgemm_gpu/split_table_batched_embeddings_ops_training_common.py -> _skbuild/linux-x86_64-3.12/setuptools/lib.linux-x86_64-cpython-312/fbgemm_gpu 2025-05-07T20:09:24.7586556Z INFO:root:copying _skbuild/linux-x86_64-3.12/cmake-install/fbgemm_gpu/ssd_split_table_batched_embeddings_ops.py -> _skbuild/linux-x86_64-3.12/setuptools/lib.linux-x86_64-cpython-312/fbgemm_gpu 2025-05-07T20:09:24.7588110Z INFO:root:copying _skbuild/linux-x86_64-3.12/cmake-install/fbgemm_gpu/tbe_input_multiplexer.py -> _skbuild/linux-x86_64-3.12/setuptools/lib.linux-x86_64-cpython-312/fbgemm_gpu 2025-05-07T20:09:24.7589577Z INFO:root:copying _skbuild/linux-x86_64-3.12/cmake-install/fbgemm_gpu/uvm.py -> _skbuild/linux-x86_64-3.12/setuptools/lib.linux-x86_64-cpython-312/fbgemm_gpu 2025-05-07T20:09:24.7591487Z INFO:root:copying _skbuild/linux-x86_64-3.12/cmake-install/fbgemm_gpu/config/__init__.py -> _skbuild/linux-x86_64-3.12/setuptools/lib.linux-x86_64-cpython-312/fbgemm_gpu/config 2025-05-07T20:09:24.7593269Z INFO:root:copying _skbuild/linux-x86_64-3.12/cmake-install/fbgemm_gpu/config/feature_list.py -> _skbuild/linux-x86_64-3.12/setuptools/lib.linux-x86_64-cpython-312/fbgemm_gpu/config 2025-05-07T20:09:24.7594827Z INFO:root:copying _skbuild/linux-x86_64-3.12/cmake-install/fbgemm_gpu/docs/__init__.py -> _skbuild/linux-x86_64-3.12/setuptools/lib.linux-x86_64-cpython-312/fbgemm_gpu/docs 2025-05-07T20:09:24.7596506Z INFO:root:copying _skbuild/linux-x86_64-3.12/cmake-install/fbgemm_gpu/docs/common.py -> _skbuild/linux-x86_64-3.12/setuptools/lib.linux-x86_64-cpython-312/fbgemm_gpu/docs 2025-05-07T20:09:24.7598126Z INFO:root:copying _skbuild/linux-x86_64-3.12/cmake-install/fbgemm_gpu/docs/examples.py -> _skbuild/linux-x86_64-3.12/setuptools/lib.linux-x86_64-cpython-312/fbgemm_gpu/docs 2025-05-07T20:09:24.7599760Z INFO:root:copying _skbuild/linux-x86_64-3.12/cmake-install/fbgemm_gpu/docs/jagged_tensor_ops.py -> _skbuild/linux-x86_64-3.12/setuptools/lib.linux-x86_64-cpython-312/fbgemm_gpu/docs 2025-05-07T20:09:24.7601419Z INFO:root:copying _skbuild/linux-x86_64-3.12/cmake-install/fbgemm_gpu/docs/merge_pooled_embedding_ops.py -> _skbuild/linux-x86_64-3.12/setuptools/lib.linux-x86_64-cpython-312/fbgemm_gpu/docs 2025-05-07T20:09:24.7603558Z INFO:root:copying _skbuild/linux-x86_64-3.12/cmake-install/fbgemm_gpu/docs/permute_pooled_embedding_ops.py -> _skbuild/linux-x86_64-3.12/setuptools/lib.linux-x86_64-cpython-312/fbgemm_gpu/docs 2025-05-07T20:09:24.7605316Z INFO:root:copying _skbuild/linux-x86_64-3.12/cmake-install/fbgemm_gpu/docs/quantize_ops.py -> _skbuild/linux-x86_64-3.12/setuptools/lib.linux-x86_64-cpython-312/fbgemm_gpu/docs 2025-05-07T20:09:24.7607080Z INFO:root:copying _skbuild/linux-x86_64-3.12/cmake-install/fbgemm_gpu/docs/sparse_ops.py -> _skbuild/linux-x86_64-3.12/setuptools/lib.linux-x86_64-cpython-312/fbgemm_gpu/docs 2025-05-07T20:09:24.7609012Z INFO:root:copying _skbuild/linux-x86_64-3.12/cmake-install/fbgemm_gpu/docs/version.py -> _skbuild/linux-x86_64-3.12/setuptools/lib.linux-x86_64-cpython-312/fbgemm_gpu/docs 2025-05-07T20:09:24.7610494Z INFO:root:copying _skbuild/linux-x86_64-3.12/cmake-install/fbgemm_gpu/quantize/__init__.py -> _skbuild/linux-x86_64-3.12/setuptools/lib.linux-x86_64-cpython-312/fbgemm_gpu/quantize 2025-05-07T20:09:24.7613900Z INFO:root:copying _skbuild/linux-x86_64-3.12/cmake-install/fbgemm_gpu/quantize/quantize_ops.py -> _skbuild/linux-x86_64-3.12/setuptools/lib.linux-x86_64-cpython-312/fbgemm_gpu/quantize 2025-05-07T20:09:24.7615386Z INFO:root:copying _skbuild/linux-x86_64-3.12/cmake-install/fbgemm_gpu/sll/__init__.py -> _skbuild/linux-x86_64-3.12/setuptools/lib.linux-x86_64-cpython-312/fbgemm_gpu/sll 2025-05-07T20:09:24.7617471Z INFO:root:copying _skbuild/linux-x86_64-3.12/cmake-install/fbgemm_gpu/tbe/__init__.py -> _skbuild/linux-x86_64-3.12/setuptools/lib.linux-x86_64-cpython-312/fbgemm_gpu/tbe 2025-05-07T20:09:24.7619085Z INFO:root:copying _skbuild/linux-x86_64-3.12/cmake-install/fbgemm_gpu/triton/__init__.py -> _skbuild/linux-x86_64-3.12/setuptools/lib.linux-x86_64-cpython-312/fbgemm_gpu/triton 2025-05-07T20:09:24.7620690Z INFO:root:copying _skbuild/linux-x86_64-3.12/cmake-install/fbgemm_gpu/triton/common.py -> _skbuild/linux-x86_64-3.12/setuptools/lib.linux-x86_64-cpython-312/fbgemm_gpu/triton 2025-05-07T20:09:24.7622227Z INFO:root:copying _skbuild/linux-x86_64-3.12/cmake-install/fbgemm_gpu/triton/quantize.py -> _skbuild/linux-x86_64-3.12/setuptools/lib.linux-x86_64-cpython-312/fbgemm_gpu/triton 2025-05-07T20:09:24.7623987Z INFO:root:copying _skbuild/linux-x86_64-3.12/cmake-install/fbgemm_gpu/triton/quantize_ref.py -> _skbuild/linux-x86_64-3.12/setuptools/lib.linux-x86_64-cpython-312/fbgemm_gpu/triton 2025-05-07T20:09:24.7625603Z INFO:root:copying _skbuild/linux-x86_64-3.12/cmake-install/fbgemm_gpu/utils/__init__.py -> _skbuild/linux-x86_64-3.12/setuptools/lib.linux-x86_64-cpython-312/fbgemm_gpu/utils 2025-05-07T20:09:24.7627217Z INFO:root:copying _skbuild/linux-x86_64-3.12/cmake-install/fbgemm_gpu/utils/filestore.py -> _skbuild/linux-x86_64-3.12/setuptools/lib.linux-x86_64-cpython-312/fbgemm_gpu/utils 2025-05-07T20:09:24.7628760Z INFO:root:copying _skbuild/linux-x86_64-3.12/cmake-install/fbgemm_gpu/utils/loader.py -> _skbuild/linux-x86_64-3.12/setuptools/lib.linux-x86_64-cpython-312/fbgemm_gpu/utils 2025-05-07T20:09:24.7630308Z INFO:root:copying _skbuild/linux-x86_64-3.12/cmake-install/fbgemm_gpu/utils/torch_library.py -> _skbuild/linux-x86_64-3.12/setuptools/lib.linux-x86_64-cpython-312/fbgemm_gpu/utils 2025-05-07T20:09:24.7631974Z INFO:root:copying _skbuild/linux-x86_64-3.12/cmake-install/fbgemm_gpu/sll/cpu/__init__.py -> _skbuild/linux-x86_64-3.12/setuptools/lib.linux-x86_64-cpython-312/fbgemm_gpu/sll/cpu 2025-05-07T20:09:24.7633763Z INFO:root:copying _skbuild/linux-x86_64-3.12/cmake-install/fbgemm_gpu/sll/cpu/cpu_sll.py -> _skbuild/linux-x86_64-3.12/setuptools/lib.linux-x86_64-cpython-312/fbgemm_gpu/sll/cpu 2025-05-07T20:09:24.7635978Z INFO:root:copying _skbuild/linux-x86_64-3.12/cmake-install/fbgemm_gpu/sll/meta/__init__.py -> _skbuild/linux-x86_64-3.12/setuptools/lib.linux-x86_64-cpython-312/fbgemm_gpu/sll/meta 2025-05-07T20:09:24.7637507Z INFO:root:copying _skbuild/linux-x86_64-3.12/cmake-install/fbgemm_gpu/sll/meta/meta_sll.py -> _skbuild/linux-x86_64-3.12/setuptools/lib.linux-x86_64-cpython-312/fbgemm_gpu/sll/meta 2025-05-07T20:09:24.7638989Z INFO:root:copying _skbuild/linux-x86_64-3.12/cmake-install/fbgemm_gpu/sll/triton/__init__.py -> _skbuild/linux-x86_64-3.12/setuptools/lib.linux-x86_64-cpython-312/fbgemm_gpu/sll/triton 2025-05-07T20:09:24.7640606Z INFO:root:copying _skbuild/linux-x86_64-3.12/cmake-install/fbgemm_gpu/sll/triton/common.py -> _skbuild/linux-x86_64-3.12/setuptools/lib.linux-x86_64-cpython-312/fbgemm_gpu/sll/triton 2025-05-07T20:09:24.7642345Z INFO:root:copying _skbuild/linux-x86_64-3.12/cmake-install/fbgemm_gpu/sll/triton/triton_dense_jagged_cat_jagged_out.py -> _skbuild/linux-x86_64-3.12/setuptools/lib.linux-x86_64-cpython-312/fbgemm_gpu/sll/triton 2025-05-07T20:09:24.7643991Z INFO:root:copying _skbuild/linux-x86_64-3.12/cmake-install/fbgemm_gpu/sll/triton/triton_jagged2_to_padded_dense.py -> _skbuild/linux-x86_64-3.12/setuptools/lib.linux-x86_64-cpython-312/fbgemm_gpu/sll/triton 2025-05-07T20:09:24.7645568Z INFO:root:copying _skbuild/linux-x86_64-3.12/cmake-install/fbgemm_gpu/sll/triton/triton_jagged_bmm.py -> _skbuild/linux-x86_64-3.12/setuptools/lib.linux-x86_64-cpython-312/fbgemm_gpu/sll/triton 2025-05-07T20:09:24.7647127Z INFO:root:copying _skbuild/linux-x86_64-3.12/cmake-install/fbgemm_gpu/sll/triton/triton_jagged_bmm_jagged_out.py -> _skbuild/linux-x86_64-3.12/setuptools/lib.linux-x86_64-cpython-312/fbgemm_gpu/sll/triton 2025-05-07T20:09:24.7648839Z INFO:root:copying _skbuild/linux-x86_64-3.12/cmake-install/fbgemm_gpu/sll/triton/triton_jagged_dense_elementwise_add.py -> _skbuild/linux-x86_64-3.12/setuptools/lib.linux-x86_64-cpython-312/fbgemm_gpu/sll/triton 2025-05-07T20:09:24.7650548Z INFO:root:copying _skbuild/linux-x86_64-3.12/cmake-install/fbgemm_gpu/sll/triton/triton_jagged_dense_elementwise_mul_jagged_out.py -> _skbuild/linux-x86_64-3.12/setuptools/lib.linux-x86_64-cpython-312/fbgemm_gpu/sll/triton 2025-05-07T20:09:24.7652268Z INFO:root:copying _skbuild/linux-x86_64-3.12/cmake-install/fbgemm_gpu/sll/triton/triton_jagged_dense_flash_attention.py -> _skbuild/linux-x86_64-3.12/setuptools/lib.linux-x86_64-cpython-312/fbgemm_gpu/sll/triton 2025-05-07T20:09:24.7654205Z INFO:root:copying _skbuild/linux-x86_64-3.12/cmake-install/fbgemm_gpu/sll/triton/triton_jagged_flash_attention_basic.py -> _skbuild/linux-x86_64-3.12/setuptools/lib.linux-x86_64-cpython-312/fbgemm_gpu/sll/triton 2025-05-07T20:09:24.7656034Z INFO:root:copying _skbuild/linux-x86_64-3.12/cmake-install/fbgemm_gpu/sll/triton/triton_jagged_self_substraction_jagged_out.py -> _skbuild/linux-x86_64-3.12/setuptools/lib.linux-x86_64-cpython-312/fbgemm_gpu/sll/triton 2025-05-07T20:09:24.7657690Z INFO:root:copying _skbuild/linux-x86_64-3.12/cmake-install/fbgemm_gpu/sll/triton/triton_jagged_softmax.py -> _skbuild/linux-x86_64-3.12/setuptools/lib.linux-x86_64-cpython-312/fbgemm_gpu/sll/triton 2025-05-07T20:09:24.7659376Z INFO:root:copying _skbuild/linux-x86_64-3.12/cmake-install/fbgemm_gpu/sll/triton/triton_multi_head_jagged_flash_attention.py -> _skbuild/linux-x86_64-3.12/setuptools/lib.linux-x86_64-cpython-312/fbgemm_gpu/sll/triton 2025-05-07T20:09:24.7661094Z INFO:root:copying _skbuild/linux-x86_64-3.12/cmake-install/fbgemm_gpu/tbe/bench/__init__.py -> _skbuild/linux-x86_64-3.12/setuptools/lib.linux-x86_64-cpython-312/fbgemm_gpu/tbe/bench 2025-05-07T20:09:24.7662788Z INFO:root:copying _skbuild/linux-x86_64-3.12/cmake-install/fbgemm_gpu/tbe/bench/bench_config.py -> _skbuild/linux-x86_64-3.12/setuptools/lib.linux-x86_64-cpython-312/fbgemm_gpu/tbe/bench 2025-05-07T20:09:24.7664554Z INFO:root:copying _skbuild/linux-x86_64-3.12/cmake-install/fbgemm_gpu/tbe/bench/bench_runs.py -> _skbuild/linux-x86_64-3.12/setuptools/lib.linux-x86_64-cpython-312/fbgemm_gpu/tbe/bench 2025-05-07T20:09:24.7666564Z INFO:root:copying _skbuild/linux-x86_64-3.12/cmake-install/fbgemm_gpu/tbe/bench/eeg_cli.py -> _skbuild/linux-x86_64-3.12/setuptools/lib.linux-x86_64-cpython-312/fbgemm_gpu/tbe/bench 2025-05-07T20:09:24.7668412Z INFO:root:copying _skbuild/linux-x86_64-3.12/cmake-install/fbgemm_gpu/tbe/bench/embedding_ops_common_config.py -> _skbuild/linux-x86_64-3.12/setuptools/lib.linux-x86_64-cpython-312/fbgemm_gpu/tbe/bench 2025-05-07T20:09:24.7670076Z INFO:root:copying _skbuild/linux-x86_64-3.12/cmake-install/fbgemm_gpu/tbe/bench/eval_compression.py -> _skbuild/linux-x86_64-3.12/setuptools/lib.linux-x86_64-cpython-312/fbgemm_gpu/tbe/bench 2025-05-07T20:09:24.7673666Z INFO:root:copying _skbuild/linux-x86_64-3.12/cmake-install/fbgemm_gpu/tbe/bench/reporter.py -> _skbuild/linux-x86_64-3.12/setuptools/lib.linux-x86_64-cpython-312/fbgemm_gpu/tbe/bench 2025-05-07T20:09:24.7675278Z INFO:root:copying _skbuild/linux-x86_64-3.12/cmake-install/fbgemm_gpu/tbe/bench/tbe_data_config.py -> _skbuild/linux-x86_64-3.12/setuptools/lib.linux-x86_64-cpython-312/fbgemm_gpu/tbe/bench 2025-05-07T20:09:24.7676934Z INFO:root:copying _skbuild/linux-x86_64-3.12/cmake-install/fbgemm_gpu/tbe/bench/tbe_data_config_loader.py -> _skbuild/linux-x86_64-3.12/setuptools/lib.linux-x86_64-cpython-312/fbgemm_gpu/tbe/bench 2025-05-07T20:09:24.7678565Z INFO:root:copying _skbuild/linux-x86_64-3.12/cmake-install/fbgemm_gpu/tbe/bench/tbe_data_config_param_models.py -> _skbuild/linux-x86_64-3.12/setuptools/lib.linux-x86_64-cpython-312/fbgemm_gpu/tbe/bench 2025-05-07T20:09:24.7680072Z INFO:root:copying _skbuild/linux-x86_64-3.12/cmake-install/fbgemm_gpu/tbe/bench/utils.py -> _skbuild/linux-x86_64-3.12/setuptools/lib.linux-x86_64-cpython-312/fbgemm_gpu/tbe/bench 2025-05-07T20:09:24.7681669Z INFO:root:copying _skbuild/linux-x86_64-3.12/cmake-install/fbgemm_gpu/tbe/cache/__init__.py -> _skbuild/linux-x86_64-3.12/setuptools/lib.linux-x86_64-cpython-312/fbgemm_gpu/tbe/cache 2025-05-07T20:09:24.7683426Z INFO:root:copying _skbuild/linux-x86_64-3.12/cmake-install/fbgemm_gpu/tbe/cache/split_embeddings_cache_ops.py -> _skbuild/linux-x86_64-3.12/setuptools/lib.linux-x86_64-cpython-312/fbgemm_gpu/tbe/cache 2025-05-07T20:09:24.7685061Z INFO:root:copying _skbuild/linux-x86_64-3.12/cmake-install/fbgemm_gpu/tbe/ssd/__init__.py -> _skbuild/linux-x86_64-3.12/setuptools/lib.linux-x86_64-cpython-312/fbgemm_gpu/tbe/ssd 2025-05-07T20:09:24.7686516Z INFO:root:copying _skbuild/linux-x86_64-3.12/cmake-install/fbgemm_gpu/tbe/ssd/common.py -> _skbuild/linux-x86_64-3.12/setuptools/lib.linux-x86_64-cpython-312/fbgemm_gpu/tbe/ssd 2025-05-07T20:09:24.7688022Z INFO:root:copying _skbuild/linux-x86_64-3.12/cmake-install/fbgemm_gpu/tbe/ssd/inference.py -> _skbuild/linux-x86_64-3.12/setuptools/lib.linux-x86_64-cpython-312/fbgemm_gpu/tbe/ssd 2025-05-07T20:09:24.7690164Z INFO:root:copying _skbuild/linux-x86_64-3.12/cmake-install/fbgemm_gpu/tbe/ssd/training.py -> _skbuild/linux-x86_64-3.12/setuptools/lib.linux-x86_64-cpython-312/fbgemm_gpu/tbe/ssd 2025-05-07T20:09:24.7692563Z INFO:root:copying _skbuild/linux-x86_64-3.12/cmake-install/fbgemm_gpu/tbe/stats/__init__.py -> _skbuild/linux-x86_64-3.12/setuptools/lib.linux-x86_64-cpython-312/fbgemm_gpu/tbe/stats 2025-05-07T20:09:24.7694283Z INFO:root:copying _skbuild/linux-x86_64-3.12/cmake-install/fbgemm_gpu/tbe/stats/bench_params_reporter.py -> _skbuild/linux-x86_64-3.12/setuptools/lib.linux-x86_64-cpython-312/fbgemm_gpu/tbe/stats 2025-05-07T20:09:24.7695818Z INFO:root:copying _skbuild/linux-x86_64-3.12/cmake-install/fbgemm_gpu/tbe/utils/__init__.py -> _skbuild/linux-x86_64-3.12/setuptools/lib.linux-x86_64-cpython-312/fbgemm_gpu/tbe/utils 2025-05-07T20:09:24.7697439Z INFO:root:copying _skbuild/linux-x86_64-3.12/cmake-install/fbgemm_gpu/tbe/utils/common.py -> _skbuild/linux-x86_64-3.12/setuptools/lib.linux-x86_64-cpython-312/fbgemm_gpu/tbe/utils 2025-05-07T20:09:24.7699013Z INFO:root:copying _skbuild/linux-x86_64-3.12/cmake-install/fbgemm_gpu/tbe/utils/offsets.py -> _skbuild/linux-x86_64-3.12/setuptools/lib.linux-x86_64-cpython-312/fbgemm_gpu/tbe/utils 2025-05-07T20:09:24.7700558Z INFO:root:copying _skbuild/linux-x86_64-3.12/cmake-install/fbgemm_gpu/tbe/utils/quantize.py -> _skbuild/linux-x86_64-3.12/setuptools/lib.linux-x86_64-cpython-312/fbgemm_gpu/tbe/utils 2025-05-07T20:09:24.7702177Z INFO:root:copying _skbuild/linux-x86_64-3.12/cmake-install/fbgemm_gpu/tbe/utils/requests.py -> _skbuild/linux-x86_64-3.12/setuptools/lib.linux-x86_64-cpython-312/fbgemm_gpu/tbe/utils 2025-05-07T20:09:24.7704101Z INFO:root:copying _skbuild/linux-x86_64-3.12/cmake-install/fbgemm_gpu/tbe/ssd/utils/__init__.py -> _skbuild/linux-x86_64-3.12/setuptools/lib.linux-x86_64-cpython-312/fbgemm_gpu/tbe/ssd/utils 2025-05-07T20:09:24.7705818Z INFO:root:copying _skbuild/linux-x86_64-3.12/cmake-install/fbgemm_gpu/tbe/ssd/utils/partially_materialized_tensor.py -> _skbuild/linux-x86_64-3.12/setuptools/lib.linux-x86_64-cpython-312/fbgemm_gpu/tbe/ssd/utils 2025-05-07T20:09:24.7707450Z INFO:root:copying _skbuild/linux-x86_64-3.12/cmake-install/fbgemm_gpu/triton/jagged/__init__.py -> _skbuild/linux-x86_64-3.12/setuptools/lib.linux-x86_64-cpython-312/fbgemm_gpu/triton/jagged 2025-05-07T20:09:24.7709059Z INFO:root:copying _skbuild/linux-x86_64-3.12/cmake-install/fbgemm_gpu/triton/jagged/triton_jagged_tensor_ops.py -> _skbuild/linux-x86_64-3.12/setuptools/lib.linux-x86_64-cpython-312/fbgemm_gpu/triton/jagged 2025-05-07T20:09:24.7727139Z INFO:skbuild:copied 90 files 2025-05-07T20:09:24.7727468Z INFO:root:running build_ext 2025-05-07T20:09:24.7727949Z INFO:root:installing to _skbuild/linux-x86_64-3.12/setuptools/bdist.linux-x86_64/wheel 2025-05-07T20:09:24.7728436Z INFO:root:running install 2025-05-07T20:09:24.7787578Z INFO:root:running install_lib 2025-05-07T20:09:24.7788438Z INFO:root:creating _skbuild/linux-x86_64-3.12/setuptools/bdist.linux-x86_64/wheel 2025-05-07T20:09:24.7789173Z INFO:root:creating _skbuild/linux-x86_64-3.12/setuptools/bdist.linux-x86_64/wheel/fbgemm_gpu 2025-05-07T20:09:24.7789929Z INFO:root:creating _skbuild/linux-x86_64-3.12/setuptools/bdist.linux-x86_64/wheel/fbgemm_gpu/config 2025-05-07T20:09:24.7791117Z INFO:root:copying _skbuild/linux-x86_64-3.12/setuptools/lib.linux-x86_64-cpython-312/fbgemm_gpu/config/__init__.py -> _skbuild/linux-x86_64-3.12/setuptools/bdist.linux-x86_64/wheel/./fbgemm_gpu/config 2025-05-07T20:09:24.7792826Z INFO:root:copying _skbuild/linux-x86_64-3.12/setuptools/lib.linux-x86_64-cpython-312/fbgemm_gpu/config/feature_list.py -> _skbuild/linux-x86_64-3.12/setuptools/bdist.linux-x86_64/wheel/./fbgemm_gpu/config 2025-05-07T20:09:24.7794142Z INFO:root:creating _skbuild/linux-x86_64-3.12/setuptools/bdist.linux-x86_64/wheel/fbgemm_gpu/docs 2025-05-07T20:09:24.7795285Z INFO:root:copying _skbuild/linux-x86_64-3.12/setuptools/lib.linux-x86_64-cpython-312/fbgemm_gpu/docs/__init__.py -> _skbuild/linux-x86_64-3.12/setuptools/bdist.linux-x86_64/wheel/./fbgemm_gpu/docs 2025-05-07T20:09:24.7796812Z INFO:root:copying _skbuild/linux-x86_64-3.12/setuptools/lib.linux-x86_64-cpython-312/fbgemm_gpu/docs/common.py -> _skbuild/linux-x86_64-3.12/setuptools/bdist.linux-x86_64/wheel/./fbgemm_gpu/docs 2025-05-07T20:09:24.7798336Z INFO:root:copying _skbuild/linux-x86_64-3.12/setuptools/lib.linux-x86_64-cpython-312/fbgemm_gpu/docs/examples.py -> _skbuild/linux-x86_64-3.12/setuptools/bdist.linux-x86_64/wheel/./fbgemm_gpu/docs 2025-05-07T20:09:24.7799911Z INFO:root:copying _skbuild/linux-x86_64-3.12/setuptools/lib.linux-x86_64-cpython-312/fbgemm_gpu/docs/jagged_tensor_ops.py -> _skbuild/linux-x86_64-3.12/setuptools/bdist.linux-x86_64/wheel/./fbgemm_gpu/docs 2025-05-07T20:09:24.7801562Z INFO:root:copying _skbuild/linux-x86_64-3.12/setuptools/lib.linux-x86_64-cpython-312/fbgemm_gpu/docs/merge_pooled_embedding_ops.py -> _skbuild/linux-x86_64-3.12/setuptools/bdist.linux-x86_64/wheel/./fbgemm_gpu/docs 2025-05-07T20:09:24.7803254Z INFO:root:copying _skbuild/linux-x86_64-3.12/setuptools/lib.linux-x86_64-cpython-312/fbgemm_gpu/docs/permute_pooled_embedding_ops.py -> _skbuild/linux-x86_64-3.12/setuptools/bdist.linux-x86_64/wheel/./fbgemm_gpu/docs 2025-05-07T20:09:24.7804880Z INFO:root:copying _skbuild/linux-x86_64-3.12/setuptools/lib.linux-x86_64-cpython-312/fbgemm_gpu/docs/quantize_ops.py -> _skbuild/linux-x86_64-3.12/setuptools/bdist.linux-x86_64/wheel/./fbgemm_gpu/docs 2025-05-07T20:09:24.7806506Z INFO:root:copying _skbuild/linux-x86_64-3.12/setuptools/lib.linux-x86_64-cpython-312/fbgemm_gpu/docs/sparse_ops.py -> _skbuild/linux-x86_64-3.12/setuptools/bdist.linux-x86_64/wheel/./fbgemm_gpu/docs 2025-05-07T20:09:24.7808092Z INFO:root:copying _skbuild/linux-x86_64-3.12/setuptools/lib.linux-x86_64-cpython-312/fbgemm_gpu/docs/version.py -> _skbuild/linux-x86_64-3.12/setuptools/bdist.linux-x86_64/wheel/./fbgemm_gpu/docs 2025-05-07T20:09:24.7809245Z INFO:root:creating _skbuild/linux-x86_64-3.12/setuptools/bdist.linux-x86_64/wheel/fbgemm_gpu/quantize 2025-05-07T20:09:24.7810495Z INFO:root:copying _skbuild/linux-x86_64-3.12/setuptools/lib.linux-x86_64-cpython-312/fbgemm_gpu/quantize/__init__.py -> _skbuild/linux-x86_64-3.12/setuptools/bdist.linux-x86_64/wheel/./fbgemm_gpu/quantize 2025-05-07T20:09:24.7812111Z INFO:root:copying _skbuild/linux-x86_64-3.12/setuptools/lib.linux-x86_64-cpython-312/fbgemm_gpu/quantize/quantize_ops.py -> _skbuild/linux-x86_64-3.12/setuptools/bdist.linux-x86_64/wheel/./fbgemm_gpu/quantize 2025-05-07T20:09:24.7813295Z INFO:root:creating _skbuild/linux-x86_64-3.12/setuptools/bdist.linux-x86_64/wheel/fbgemm_gpu/sll 2025-05-07T20:09:24.7814063Z INFO:root:creating _skbuild/linux-x86_64-3.12/setuptools/bdist.linux-x86_64/wheel/fbgemm_gpu/sll/cpu 2025-05-07T20:09:24.7815230Z INFO:root:copying _skbuild/linux-x86_64-3.12/setuptools/lib.linux-x86_64-cpython-312/fbgemm_gpu/sll/cpu/__init__.py -> _skbuild/linux-x86_64-3.12/setuptools/bdist.linux-x86_64/wheel/./fbgemm_gpu/sll/cpu 2025-05-07T20:09:24.7816800Z INFO:root:copying _skbuild/linux-x86_64-3.12/setuptools/lib.linux-x86_64-cpython-312/fbgemm_gpu/sll/cpu/cpu_sll.py -> _skbuild/linux-x86_64-3.12/setuptools/bdist.linux-x86_64/wheel/./fbgemm_gpu/sll/cpu 2025-05-07T20:09:24.7817989Z INFO:root:creating _skbuild/linux-x86_64-3.12/setuptools/bdist.linux-x86_64/wheel/fbgemm_gpu/sll/meta 2025-05-07T20:09:24.7819163Z INFO:root:copying _skbuild/linux-x86_64-3.12/setuptools/lib.linux-x86_64-cpython-312/fbgemm_gpu/sll/meta/__init__.py -> _skbuild/linux-x86_64-3.12/setuptools/bdist.linux-x86_64/wheel/./fbgemm_gpu/sll/meta 2025-05-07T20:09:24.7820759Z INFO:root:copying _skbuild/linux-x86_64-3.12/setuptools/lib.linux-x86_64-cpython-312/fbgemm_gpu/sll/meta/meta_sll.py -> _skbuild/linux-x86_64-3.12/setuptools/bdist.linux-x86_64/wheel/./fbgemm_gpu/sll/meta 2025-05-07T20:09:24.7822010Z INFO:root:creating _skbuild/linux-x86_64-3.12/setuptools/bdist.linux-x86_64/wheel/fbgemm_gpu/sll/triton 2025-05-07T20:09:24.7823223Z INFO:root:copying _skbuild/linux-x86_64-3.12/setuptools/lib.linux-x86_64-cpython-312/fbgemm_gpu/sll/triton/__init__.py -> _skbuild/linux-x86_64-3.12/setuptools/bdist.linux-x86_64/wheel/./fbgemm_gpu/sll/triton 2025-05-07T20:09:24.7824846Z INFO:root:copying _skbuild/linux-x86_64-3.12/setuptools/lib.linux-x86_64-cpython-312/fbgemm_gpu/sll/triton/common.py -> _skbuild/linux-x86_64-3.12/setuptools/bdist.linux-x86_64/wheel/./fbgemm_gpu/sll/triton 2025-05-07T20:09:24.7826581Z INFO:root:copying _skbuild/linux-x86_64-3.12/setuptools/lib.linux-x86_64-cpython-312/fbgemm_gpu/sll/triton/triton_dense_jagged_cat_jagged_out.py -> _skbuild/linux-x86_64-3.12/setuptools/bdist.linux-x86_64/wheel/./fbgemm_gpu/sll/triton 2025-05-07T20:09:24.7828411Z INFO:root:copying _skbuild/linux-x86_64-3.12/setuptools/lib.linux-x86_64-cpython-312/fbgemm_gpu/sll/triton/triton_jagged2_to_padded_dense.py -> _skbuild/linux-x86_64-3.12/setuptools/bdist.linux-x86_64/wheel/./fbgemm_gpu/sll/triton 2025-05-07T20:09:24.7830176Z INFO:root:copying _skbuild/linux-x86_64-3.12/setuptools/lib.linux-x86_64-cpython-312/fbgemm_gpu/sll/triton/triton_jagged_bmm.py -> _skbuild/linux-x86_64-3.12/setuptools/bdist.linux-x86_64/wheel/./fbgemm_gpu/sll/triton 2025-05-07T20:09:24.7832004Z INFO:root:copying _skbuild/linux-x86_64-3.12/setuptools/lib.linux-x86_64-cpython-312/fbgemm_gpu/sll/triton/triton_jagged_bmm_jagged_out.py -> _skbuild/linux-x86_64-3.12/setuptools/bdist.linux-x86_64/wheel/./fbgemm_gpu/sll/triton 2025-05-07T20:09:24.7833887Z INFO:root:copying _skbuild/linux-x86_64-3.12/setuptools/lib.linux-x86_64-cpython-312/fbgemm_gpu/sll/triton/triton_jagged_dense_elementwise_add.py -> _skbuild/linux-x86_64-3.12/setuptools/bdist.linux-x86_64/wheel/./fbgemm_gpu/sll/triton 2025-05-07T20:09:24.7835815Z INFO:root:copying _skbuild/linux-x86_64-3.12/setuptools/lib.linux-x86_64-cpython-312/fbgemm_gpu/sll/triton/triton_jagged_dense_elementwise_mul_jagged_out.py -> _skbuild/linux-x86_64-3.12/setuptools/bdist.linux-x86_64/wheel/./fbgemm_gpu/sll/triton 2025-05-07T20:09:24.7837728Z INFO:root:copying _skbuild/linux-x86_64-3.12/setuptools/lib.linux-x86_64-cpython-312/fbgemm_gpu/sll/triton/triton_jagged_dense_flash_attention.py -> _skbuild/linux-x86_64-3.12/setuptools/bdist.linux-x86_64/wheel/./fbgemm_gpu/sll/triton 2025-05-07T20:09:24.7839625Z INFO:root:copying _skbuild/linux-x86_64-3.12/setuptools/lib.linux-x86_64-cpython-312/fbgemm_gpu/sll/triton/triton_jagged_flash_attention_basic.py -> _skbuild/linux-x86_64-3.12/setuptools/bdist.linux-x86_64/wheel/./fbgemm_gpu/sll/triton 2025-05-07T20:09:24.7841508Z INFO:root:copying _skbuild/linux-x86_64-3.12/setuptools/lib.linux-x86_64-cpython-312/fbgemm_gpu/sll/triton/triton_jagged_self_substraction_jagged_out.py -> _skbuild/linux-x86_64-3.12/setuptools/bdist.linux-x86_64/wheel/./fbgemm_gpu/sll/triton 2025-05-07T20:09:24.7843352Z INFO:root:copying _skbuild/linux-x86_64-3.12/setuptools/lib.linux-x86_64-cpython-312/fbgemm_gpu/sll/triton/triton_jagged_softmax.py -> _skbuild/linux-x86_64-3.12/setuptools/bdist.linux-x86_64/wheel/./fbgemm_gpu/sll/triton 2025-05-07T20:09:24.7845182Z INFO:root:copying _skbuild/linux-x86_64-3.12/setuptools/lib.linux-x86_64-cpython-312/fbgemm_gpu/sll/triton/triton_multi_head_jagged_flash_attention.py -> _skbuild/linux-x86_64-3.12/setuptools/bdist.linux-x86_64/wheel/./fbgemm_gpu/sll/triton 2025-05-07T20:09:24.7846869Z INFO:root:copying _skbuild/linux-x86_64-3.12/setuptools/lib.linux-x86_64-cpython-312/fbgemm_gpu/sll/__init__.py -> _skbuild/linux-x86_64-3.12/setuptools/bdist.linux-x86_64/wheel/./fbgemm_gpu/sll 2025-05-07T20:09:24.7847997Z INFO:root:creating _skbuild/linux-x86_64-3.12/setuptools/bdist.linux-x86_64/wheel/fbgemm_gpu/tbe 2025-05-07T20:09:24.7848774Z INFO:root:creating _skbuild/linux-x86_64-3.12/setuptools/bdist.linux-x86_64/wheel/fbgemm_gpu/tbe/bench 2025-05-07T20:09:24.7850000Z INFO:root:copying _skbuild/linux-x86_64-3.12/setuptools/lib.linux-x86_64-cpython-312/fbgemm_gpu/tbe/bench/__init__.py -> _skbuild/linux-x86_64-3.12/setuptools/bdist.linux-x86_64/wheel/./fbgemm_gpu/tbe/bench 2025-05-07T20:09:24.7851647Z INFO:root:copying _skbuild/linux-x86_64-3.12/setuptools/lib.linux-x86_64-cpython-312/fbgemm_gpu/tbe/bench/bench_config.py -> _skbuild/linux-x86_64-3.12/setuptools/bdist.linux-x86_64/wheel/./fbgemm_gpu/tbe/bench 2025-05-07T20:09:24.7853298Z INFO:root:copying _skbuild/linux-x86_64-3.12/setuptools/lib.linux-x86_64-cpython-312/fbgemm_gpu/tbe/bench/bench_runs.py -> _skbuild/linux-x86_64-3.12/setuptools/bdist.linux-x86_64/wheel/./fbgemm_gpu/tbe/bench 2025-05-07T20:09:24.7854910Z INFO:root:copying _skbuild/linux-x86_64-3.12/setuptools/lib.linux-x86_64-cpython-312/fbgemm_gpu/tbe/bench/eeg_cli.py -> _skbuild/linux-x86_64-3.12/setuptools/bdist.linux-x86_64/wheel/./fbgemm_gpu/tbe/bench 2025-05-07T20:09:24.7856609Z INFO:root:copying _skbuild/linux-x86_64-3.12/setuptools/lib.linux-x86_64-cpython-312/fbgemm_gpu/tbe/bench/embedding_ops_common_config.py -> _skbuild/linux-x86_64-3.12/setuptools/bdist.linux-x86_64/wheel/./fbgemm_gpu/tbe/bench 2025-05-07T20:09:24.7858367Z INFO:root:copying _skbuild/linux-x86_64-3.12/setuptools/lib.linux-x86_64-cpython-312/fbgemm_gpu/tbe/bench/eval_compression.py -> _skbuild/linux-x86_64-3.12/setuptools/bdist.linux-x86_64/wheel/./fbgemm_gpu/tbe/bench 2025-05-07T20:09:24.7860018Z INFO:root:copying _skbuild/linux-x86_64-3.12/setuptools/lib.linux-x86_64-cpython-312/fbgemm_gpu/tbe/bench/reporter.py -> _skbuild/linux-x86_64-3.12/setuptools/bdist.linux-x86_64/wheel/./fbgemm_gpu/tbe/bench 2025-05-07T20:09:24.7861694Z INFO:root:copying _skbuild/linux-x86_64-3.12/setuptools/lib.linux-x86_64-cpython-312/fbgemm_gpu/tbe/bench/tbe_data_config.py -> _skbuild/linux-x86_64-3.12/setuptools/bdist.linux-x86_64/wheel/./fbgemm_gpu/tbe/bench 2025-05-07T20:09:24.7863427Z INFO:root:copying _skbuild/linux-x86_64-3.12/setuptools/lib.linux-x86_64-cpython-312/fbgemm_gpu/tbe/bench/tbe_data_config_loader.py -> _skbuild/linux-x86_64-3.12/setuptools/bdist.linux-x86_64/wheel/./fbgemm_gpu/tbe/bench 2025-05-07T20:09:24.7865314Z INFO:root:copying _skbuild/linux-x86_64-3.12/setuptools/lib.linux-x86_64-cpython-312/fbgemm_gpu/tbe/bench/tbe_data_config_param_models.py -> _skbuild/linux-x86_64-3.12/setuptools/bdist.linux-x86_64/wheel/./fbgemm_gpu/tbe/bench 2025-05-07T20:09:24.7867068Z INFO:root:copying _skbuild/linux-x86_64-3.12/setuptools/lib.linux-x86_64-cpython-312/fbgemm_gpu/tbe/bench/utils.py -> _skbuild/linux-x86_64-3.12/setuptools/bdist.linux-x86_64/wheel/./fbgemm_gpu/tbe/bench 2025-05-07T20:09:24.7868273Z INFO:root:creating _skbuild/linux-x86_64-3.12/setuptools/bdist.linux-x86_64/wheel/fbgemm_gpu/tbe/cache 2025-05-07T20:09:24.7869481Z INFO:root:copying _skbuild/linux-x86_64-3.12/setuptools/lib.linux-x86_64-cpython-312/fbgemm_gpu/tbe/cache/__init__.py -> _skbuild/linux-x86_64-3.12/setuptools/bdist.linux-x86_64/wheel/./fbgemm_gpu/tbe/cache 2025-05-07T20:09:24.7871175Z INFO:root:copying _skbuild/linux-x86_64-3.12/setuptools/lib.linux-x86_64-cpython-312/fbgemm_gpu/tbe/cache/split_embeddings_cache_ops.py -> _skbuild/linux-x86_64-3.12/setuptools/bdist.linux-x86_64/wheel/./fbgemm_gpu/tbe/cache 2025-05-07T20:09:24.7872515Z INFO:root:creating _skbuild/linux-x86_64-3.12/setuptools/bdist.linux-x86_64/wheel/fbgemm_gpu/tbe/ssd 2025-05-07T20:09:24.7873309Z INFO:root:creating _skbuild/linux-x86_64-3.12/setuptools/bdist.linux-x86_64/wheel/fbgemm_gpu/tbe/ssd/utils 2025-05-07T20:09:24.7874581Z INFO:root:copying _skbuild/linux-x86_64-3.12/setuptools/lib.linux-x86_64-cpython-312/fbgemm_gpu/tbe/ssd/utils/__init__.py -> _skbuild/linux-x86_64-3.12/setuptools/bdist.linux-x86_64/wheel/./fbgemm_gpu/tbe/ssd/utils 2025-05-07T20:09:24.7876377Z INFO:root:copying _skbuild/linux-x86_64-3.12/setuptools/lib.linux-x86_64-cpython-312/fbgemm_gpu/tbe/ssd/utils/partially_materialized_tensor.py -> _skbuild/linux-x86_64-3.12/setuptools/bdist.linux-x86_64/wheel/./fbgemm_gpu/tbe/ssd/utils 2025-05-07T20:09:24.7878152Z INFO:root:copying _skbuild/linux-x86_64-3.12/setuptools/lib.linux-x86_64-cpython-312/fbgemm_gpu/tbe/ssd/__init__.py -> _skbuild/linux-x86_64-3.12/setuptools/bdist.linux-x86_64/wheel/./fbgemm_gpu/tbe/ssd 2025-05-07T20:09:24.7879723Z INFO:root:copying _skbuild/linux-x86_64-3.12/setuptools/lib.linux-x86_64-cpython-312/fbgemm_gpu/tbe/ssd/common.py -> _skbuild/linux-x86_64-3.12/setuptools/bdist.linux-x86_64/wheel/./fbgemm_gpu/tbe/ssd 2025-05-07T20:09:24.7881336Z INFO:root:copying _skbuild/linux-x86_64-3.12/setuptools/lib.linux-x86_64-cpython-312/fbgemm_gpu/tbe/ssd/inference.py -> _skbuild/linux-x86_64-3.12/setuptools/bdist.linux-x86_64/wheel/./fbgemm_gpu/tbe/ssd 2025-05-07T20:09:24.7882932Z INFO:root:copying _skbuild/linux-x86_64-3.12/setuptools/lib.linux-x86_64-cpython-312/fbgemm_gpu/tbe/ssd/training.py -> _skbuild/linux-x86_64-3.12/setuptools/bdist.linux-x86_64/wheel/./fbgemm_gpu/tbe/ssd 2025-05-07T20:09:24.7884140Z INFO:root:creating _skbuild/linux-x86_64-3.12/setuptools/bdist.linux-x86_64/wheel/fbgemm_gpu/tbe/stats 2025-05-07T20:09:24.7885347Z INFO:root:copying _skbuild/linux-x86_64-3.12/setuptools/lib.linux-x86_64-cpython-312/fbgemm_gpu/tbe/stats/__init__.py -> _skbuild/linux-x86_64-3.12/setuptools/bdist.linux-x86_64/wheel/./fbgemm_gpu/tbe/stats 2025-05-07T20:09:24.7887010Z INFO:root:copying _skbuild/linux-x86_64-3.12/setuptools/lib.linux-x86_64-cpython-312/fbgemm_gpu/tbe/stats/bench_params_reporter.py -> _skbuild/linux-x86_64-3.12/setuptools/bdist.linux-x86_64/wheel/./fbgemm_gpu/tbe/stats 2025-05-07T20:09:24.7888286Z INFO:root:creating _skbuild/linux-x86_64-3.12/setuptools/bdist.linux-x86_64/wheel/fbgemm_gpu/tbe/utils 2025-05-07T20:09:24.7889497Z INFO:root:copying _skbuild/linux-x86_64-3.12/setuptools/lib.linux-x86_64-cpython-312/fbgemm_gpu/tbe/utils/__init__.py -> _skbuild/linux-x86_64-3.12/setuptools/bdist.linux-x86_64/wheel/./fbgemm_gpu/tbe/utils 2025-05-07T20:09:24.7891148Z INFO:root:copying _skbuild/linux-x86_64-3.12/setuptools/lib.linux-x86_64-cpython-312/fbgemm_gpu/tbe/utils/common.py -> _skbuild/linux-x86_64-3.12/setuptools/bdist.linux-x86_64/wheel/./fbgemm_gpu/tbe/utils 2025-05-07T20:09:24.7892833Z INFO:root:copying _skbuild/linux-x86_64-3.12/setuptools/lib.linux-x86_64-cpython-312/fbgemm_gpu/tbe/utils/offsets.py -> _skbuild/linux-x86_64-3.12/setuptools/bdist.linux-x86_64/wheel/./fbgemm_gpu/tbe/utils 2025-05-07T20:09:24.7894465Z INFO:root:copying _skbuild/linux-x86_64-3.12/setuptools/lib.linux-x86_64-cpython-312/fbgemm_gpu/tbe/utils/quantize.py -> _skbuild/linux-x86_64-3.12/setuptools/bdist.linux-x86_64/wheel/./fbgemm_gpu/tbe/utils 2025-05-07T20:09:24.7896108Z INFO:root:copying _skbuild/linux-x86_64-3.12/setuptools/lib.linux-x86_64-cpython-312/fbgemm_gpu/tbe/utils/requests.py -> _skbuild/linux-x86_64-3.12/setuptools/bdist.linux-x86_64/wheel/./fbgemm_gpu/tbe/utils 2025-05-07T20:09:24.7897674Z INFO:root:copying _skbuild/linux-x86_64-3.12/setuptools/lib.linux-x86_64-cpython-312/fbgemm_gpu/tbe/__init__.py -> _skbuild/linux-x86_64-3.12/setuptools/bdist.linux-x86_64/wheel/./fbgemm_gpu/tbe 2025-05-07T20:09:24.7898810Z INFO:root:creating _skbuild/linux-x86_64-3.12/setuptools/bdist.linux-x86_64/wheel/fbgemm_gpu/triton 2025-05-07T20:09:24.7899601Z INFO:root:creating _skbuild/linux-x86_64-3.12/setuptools/bdist.linux-x86_64/wheel/fbgemm_gpu/triton/jagged 2025-05-07T20:09:24.7900858Z INFO:root:copying _skbuild/linux-x86_64-3.12/setuptools/lib.linux-x86_64-cpython-312/fbgemm_gpu/triton/jagged/__init__.py -> _skbuild/linux-x86_64-3.12/setuptools/bdist.linux-x86_64/wheel/./fbgemm_gpu/triton/jagged 2025-05-07T20:09:24.7902608Z INFO:root:copying _skbuild/linux-x86_64-3.12/setuptools/lib.linux-x86_64-cpython-312/fbgemm_gpu/triton/jagged/triton_jagged_tensor_ops.py -> _skbuild/linux-x86_64-3.12/setuptools/bdist.linux-x86_64/wheel/./fbgemm_gpu/triton/jagged 2025-05-07T20:09:24.7904280Z INFO:root:copying _skbuild/linux-x86_64-3.12/setuptools/lib.linux-x86_64-cpython-312/fbgemm_gpu/triton/__init__.py -> _skbuild/linux-x86_64-3.12/setuptools/bdist.linux-x86_64/wheel/./fbgemm_gpu/triton 2025-05-07T20:09:24.7905835Z INFO:root:copying _skbuild/linux-x86_64-3.12/setuptools/lib.linux-x86_64-cpython-312/fbgemm_gpu/triton/common.py -> _skbuild/linux-x86_64-3.12/setuptools/bdist.linux-x86_64/wheel/./fbgemm_gpu/triton 2025-05-07T20:09:24.7907443Z INFO:root:copying _skbuild/linux-x86_64-3.12/setuptools/lib.linux-x86_64-cpython-312/fbgemm_gpu/triton/quantize.py -> _skbuild/linux-x86_64-3.12/setuptools/bdist.linux-x86_64/wheel/./fbgemm_gpu/triton 2025-05-07T20:09:24.7909026Z INFO:root:copying _skbuild/linux-x86_64-3.12/setuptools/lib.linux-x86_64-cpython-312/fbgemm_gpu/triton/quantize_ref.py -> _skbuild/linux-x86_64-3.12/setuptools/bdist.linux-x86_64/wheel/./fbgemm_gpu/triton 2025-05-07T20:09:24.7910208Z INFO:root:creating _skbuild/linux-x86_64-3.12/setuptools/bdist.linux-x86_64/wheel/fbgemm_gpu/utils 2025-05-07T20:09:24.7911356Z INFO:root:copying _skbuild/linux-x86_64-3.12/setuptools/lib.linux-x86_64-cpython-312/fbgemm_gpu/utils/__init__.py -> _skbuild/linux-x86_64-3.12/setuptools/bdist.linux-x86_64/wheel/./fbgemm_gpu/utils 2025-05-07T20:09:24.7912951Z INFO:root:copying _skbuild/linux-x86_64-3.12/setuptools/lib.linux-x86_64-cpython-312/fbgemm_gpu/utils/filestore.py -> _skbuild/linux-x86_64-3.12/setuptools/bdist.linux-x86_64/wheel/./fbgemm_gpu/utils 2025-05-07T20:09:24.7914515Z INFO:root:copying _skbuild/linux-x86_64-3.12/setuptools/lib.linux-x86_64-cpython-312/fbgemm_gpu/utils/loader.py -> _skbuild/linux-x86_64-3.12/setuptools/bdist.linux-x86_64/wheel/./fbgemm_gpu/utils 2025-05-07T20:09:24.7916084Z INFO:root:copying _skbuild/linux-x86_64-3.12/setuptools/lib.linux-x86_64-cpython-312/fbgemm_gpu/utils/torch_library.py -> _skbuild/linux-x86_64-3.12/setuptools/bdist.linux-x86_64/wheel/./fbgemm_gpu/utils 2025-05-07T20:09:24.7917611Z INFO:root:copying _skbuild/linux-x86_64-3.12/setuptools/lib.linux-x86_64-cpython-312/fbgemm_gpu/asmjit.so -> _skbuild/linux-x86_64-3.12/setuptools/bdist.linux-x86_64/wheel/./fbgemm_gpu 2025-05-07T20:09:24.7919041Z INFO:root:copying _skbuild/linux-x86_64-3.12/setuptools/lib.linux-x86_64-cpython-312/fbgemm_gpu/fbgemm.so -> _skbuild/linux-x86_64-3.12/setuptools/bdist.linux-x86_64/wheel/./fbgemm_gpu 2025-05-07T20:09:24.7934054Z INFO:root:copying _skbuild/linux-x86_64-3.12/setuptools/lib.linux-x86_64-cpython-312/fbgemm_gpu/fbgemm_gpu_tbe_cache.so -> _skbuild/linux-x86_64-3.12/setuptools/bdist.linux-x86_64/wheel/./fbgemm_gpu 2025-05-07T20:09:24.8071384Z INFO:root:copying _skbuild/linux-x86_64-3.12/setuptools/lib.linux-x86_64-cpython-312/fbgemm_gpu/fbgemm_gpu_tbe_inference.so -> _skbuild/linux-x86_64-3.12/setuptools/bdist.linux-x86_64/wheel/./fbgemm_gpu 2025-05-07T20:09:25.0815668Z INFO:root:copying _skbuild/linux-x86_64-3.12/setuptools/lib.linux-x86_64-cpython-312/fbgemm_gpu/fbgemm_gpu_config.so -> _skbuild/linux-x86_64-3.12/setuptools/bdist.linux-x86_64/wheel/./fbgemm_gpu 2025-05-07T20:09:25.0817350Z INFO:root:copying _skbuild/linux-x86_64-3.12/setuptools/lib.linux-x86_64-cpython-312/fbgemm_gpu/fbgemm_gpu_tbe_utils.so -> _skbuild/linux-x86_64-3.12/setuptools/bdist.linux-x86_64/wheel/./fbgemm_gpu 2025-05-07T20:09:25.0920147Z INFO:root:copying _skbuild/linux-x86_64-3.12/setuptools/lib.linux-x86_64-cpython-312/fbgemm_gpu/fbgemm_gpu_sparse_async_cumsum.so -> _skbuild/linux-x86_64-3.12/setuptools/bdist.linux-x86_64/wheel/./fbgemm_gpu 2025-05-07T20:09:25.0935624Z INFO:root:copying _skbuild/linux-x86_64-3.12/setuptools/lib.linux-x86_64-cpython-312/fbgemm_gpu/fbgemm_gpu_tbe_common.so -> _skbuild/linux-x86_64-3.12/setuptools/bdist.linux-x86_64/wheel/./fbgemm_gpu 2025-05-07T20:09:25.0954148Z INFO:root:copying _skbuild/linux-x86_64-3.12/setuptools/lib.linux-x86_64-cpython-312/fbgemm_gpu/fbgemm_gpu_tbe_optimizers.so -> _skbuild/linux-x86_64-3.12/setuptools/bdist.linux-x86_64/wheel/./fbgemm_gpu 2025-05-07T20:09:25.1014089Z INFO:root:copying _skbuild/linux-x86_64-3.12/setuptools/lib.linux-x86_64-cpython-312/fbgemm_gpu/fbgemm_gpu_tbe_training_forward.so -> _skbuild/linux-x86_64-3.12/setuptools/bdist.linux-x86_64/wheel/./fbgemm_gpu 2025-05-07T20:09:25.3142745Z INFO:root:copying _skbuild/linux-x86_64-3.12/setuptools/lib.linux-x86_64-cpython-312/fbgemm_gpu/fbgemm_gpu_tbe_training_backward_pt2.so -> _skbuild/linux-x86_64-3.12/setuptools/bdist.linux-x86_64/wheel/./fbgemm_gpu 2025-05-07T20:09:25.3210308Z INFO:root:copying _skbuild/linux-x86_64-3.12/setuptools/lib.linux-x86_64-cpython-312/fbgemm_gpu/fbgemm_gpu_tbe_training_backward.so -> _skbuild/linux-x86_64-3.12/setuptools/bdist.linux-x86_64/wheel/./fbgemm_gpu 2025-05-07T20:09:25.8884329Z INFO:root:copying _skbuild/linux-x86_64-3.12/setuptools/lib.linux-x86_64-cpython-312/fbgemm_gpu/fbgemm_gpu_tbe_training_backward_gwd.so -> _skbuild/linux-x86_64-3.12/setuptools/bdist.linux-x86_64/wheel/./fbgemm_gpu 2025-05-07T20:09:25.9783113Z INFO:root:copying _skbuild/linux-x86_64-3.12/setuptools/lib.linux-x86_64-cpython-312/fbgemm_gpu/fbgemm_gpu_tbe_training_backward_vbe.so -> _skbuild/linux-x86_64-3.12/setuptools/bdist.linux-x86_64/wheel/./fbgemm_gpu 2025-05-07T20:09:26.1813128Z INFO:root:copying _skbuild/linux-x86_64-3.12/setuptools/lib.linux-x86_64-cpython-312/fbgemm_gpu/fbgemm_gpu_tbe_training_backward_dense.so -> _skbuild/linux-x86_64-3.12/setuptools/bdist.linux-x86_64/wheel/./fbgemm_gpu 2025-05-07T20:09:26.2178294Z INFO:root:copying _skbuild/linux-x86_64-3.12/setuptools/lib.linux-x86_64-cpython-312/fbgemm_gpu/fbgemm_gpu_tbe_training_backward_split_host.so -> _skbuild/linux-x86_64-3.12/setuptools/bdist.linux-x86_64/wheel/./fbgemm_gpu 2025-05-07T20:09:26.2210943Z INFO:root:copying _skbuild/linux-x86_64-3.12/setuptools/lib.linux-x86_64-cpython-312/fbgemm_gpu/fbgemm_gpu_tbe_index_select.so -> _skbuild/linux-x86_64-3.12/setuptools/bdist.linux-x86_64/wheel/./fbgemm_gpu 2025-05-07T20:09:26.2424620Z INFO:root:creating _skbuild/linux-x86_64-3.12/setuptools/bdist.linux-x86_64/wheel/fbgemm_gpu/split_embedding_codegen_lookup_invokers 2025-05-07T20:09:26.2426500Z INFO:root:copying _skbuild/linux-x86_64-3.12/setuptools/lib.linux-x86_64-cpython-312/fbgemm_gpu/split_embedding_codegen_lookup_invokers/lookup_adagrad.py -> _skbuild/linux-x86_64-3.12/setuptools/bdist.linux-x86_64/wheel/./fbgemm_gpu/split_embedding_codegen_lookup_invokers 2025-05-07T20:09:26.2428809Z INFO:root:copying _skbuild/linux-x86_64-3.12/setuptools/lib.linux-x86_64-cpython-312/fbgemm_gpu/split_embedding_codegen_lookup_invokers/lookup_rowwise_adagrad.py -> _skbuild/linux-x86_64-3.12/setuptools/bdist.linux-x86_64/wheel/./fbgemm_gpu/split_embedding_codegen_lookup_invokers 2025-05-07T20:09:26.2431003Z INFO:root:copying _skbuild/linux-x86_64-3.12/setuptools/lib.linux-x86_64-cpython-312/fbgemm_gpu/split_embedding_codegen_lookup_invokers/lookup_sgd.py -> _skbuild/linux-x86_64-3.12/setuptools/bdist.linux-x86_64/wheel/./fbgemm_gpu/split_embedding_codegen_lookup_invokers 2025-05-07T20:09:26.2433305Z INFO:root:copying _skbuild/linux-x86_64-3.12/setuptools/lib.linux-x86_64-cpython-312/fbgemm_gpu/split_embedding_codegen_lookup_invokers/lookup_adam.py -> _skbuild/linux-x86_64-3.12/setuptools/bdist.linux-x86_64/wheel/./fbgemm_gpu/split_embedding_codegen_lookup_invokers 2025-05-07T20:09:26.2435466Z INFO:root:copying _skbuild/linux-x86_64-3.12/setuptools/lib.linux-x86_64-cpython-312/fbgemm_gpu/split_embedding_codegen_lookup_invokers/lookup_lamb.py -> _skbuild/linux-x86_64-3.12/setuptools/bdist.linux-x86_64/wheel/./fbgemm_gpu/split_embedding_codegen_lookup_invokers 2025-05-07T20:09:26.2437616Z INFO:root:copying _skbuild/linux-x86_64-3.12/setuptools/lib.linux-x86_64-cpython-312/fbgemm_gpu/split_embedding_codegen_lookup_invokers/lookup_lars_sgd.py -> _skbuild/linux-x86_64-3.12/setuptools/bdist.linux-x86_64/wheel/./fbgemm_gpu/split_embedding_codegen_lookup_invokers 2025-05-07T20:09:26.2439819Z INFO:root:copying _skbuild/linux-x86_64-3.12/setuptools/lib.linux-x86_64-cpython-312/fbgemm_gpu/split_embedding_codegen_lookup_invokers/lookup_partial_rowwise_adam.py -> _skbuild/linux-x86_64-3.12/setuptools/bdist.linux-x86_64/wheel/./fbgemm_gpu/split_embedding_codegen_lookup_invokers 2025-05-07T20:09:26.2442058Z INFO:root:copying _skbuild/linux-x86_64-3.12/setuptools/lib.linux-x86_64-cpython-312/fbgemm_gpu/split_embedding_codegen_lookup_invokers/lookup_partial_rowwise_lamb.py -> _skbuild/linux-x86_64-3.12/setuptools/bdist.linux-x86_64/wheel/./fbgemm_gpu/split_embedding_codegen_lookup_invokers 2025-05-07T20:09:26.2444312Z INFO:root:copying _skbuild/linux-x86_64-3.12/setuptools/lib.linux-x86_64-cpython-312/fbgemm_gpu/split_embedding_codegen_lookup_invokers/lookup_none.py -> _skbuild/linux-x86_64-3.12/setuptools/bdist.linux-x86_64/wheel/./fbgemm_gpu/split_embedding_codegen_lookup_invokers 2025-05-07T20:09:26.2446535Z INFO:root:copying _skbuild/linux-x86_64-3.12/setuptools/lib.linux-x86_64-cpython-312/fbgemm_gpu/split_embedding_codegen_lookup_invokers/lookup_rowwise_adagrad_with_counter.py -> _skbuild/linux-x86_64-3.12/setuptools/bdist.linux-x86_64/wheel/./fbgemm_gpu/split_embedding_codegen_lookup_invokers 2025-05-07T20:09:26.2448826Z INFO:root:copying _skbuild/linux-x86_64-3.12/setuptools/lib.linux-x86_64-cpython-312/fbgemm_gpu/split_embedding_codegen_lookup_invokers/lookup_rowwise_adagrad_ssd.py -> _skbuild/linux-x86_64-3.12/setuptools/bdist.linux-x86_64/wheel/./fbgemm_gpu/split_embedding_codegen_lookup_invokers 2025-05-07T20:09:26.2450996Z INFO:root:copying _skbuild/linux-x86_64-3.12/setuptools/lib.linux-x86_64-cpython-312/fbgemm_gpu/split_embedding_codegen_lookup_invokers/__init__.py -> _skbuild/linux-x86_64-3.12/setuptools/bdist.linux-x86_64/wheel/./fbgemm_gpu/split_embedding_codegen_lookup_invokers 2025-05-07T20:09:26.2453115Z INFO:root:copying _skbuild/linux-x86_64-3.12/setuptools/lib.linux-x86_64-cpython-312/fbgemm_gpu/split_embedding_codegen_lookup_invokers/lookup_args.py -> _skbuild/linux-x86_64-3.12/setuptools/bdist.linux-x86_64/wheel/./fbgemm_gpu/split_embedding_codegen_lookup_invokers 2025-05-07T20:09:26.2455248Z INFO:root:copying _skbuild/linux-x86_64-3.12/setuptools/lib.linux-x86_64-cpython-312/fbgemm_gpu/split_embedding_codegen_lookup_invokers/lookup_args_ssd.py -> _skbuild/linux-x86_64-3.12/setuptools/bdist.linux-x86_64/wheel/./fbgemm_gpu/split_embedding_codegen_lookup_invokers 2025-05-07T20:09:26.2456846Z INFO:root:creating _skbuild/linux-x86_64-3.12/setuptools/bdist.linux-x86_64/wheel/fbgemm_gpu/split_embedding_optimizer_codegen 2025-05-07T20:09:26.2458524Z INFO:root:copying _skbuild/linux-x86_64-3.12/setuptools/lib.linux-x86_64-cpython-312/fbgemm_gpu/split_embedding_optimizer_codegen/split_embedding_optimizer_rowwise_adagrad.py -> _skbuild/linux-x86_64-3.12/setuptools/bdist.linux-x86_64/wheel/./fbgemm_gpu/split_embedding_optimizer_codegen 2025-05-07T20:09:26.2460748Z INFO:root:copying _skbuild/linux-x86_64-3.12/setuptools/lib.linux-x86_64-cpython-312/fbgemm_gpu/split_embedding_optimizer_codegen/optimizer_args.py -> _skbuild/linux-x86_64-3.12/setuptools/bdist.linux-x86_64/wheel/./fbgemm_gpu/split_embedding_optimizer_codegen 2025-05-07T20:09:26.2462612Z INFO:root:copying _skbuild/linux-x86_64-3.12/setuptools/lib.linux-x86_64-cpython-312/fbgemm_gpu/fbgemm_gpu_embedding_inplace_ops.so -> _skbuild/linux-x86_64-3.12/setuptools/bdist.linux-x86_64/wheel/./fbgemm_gpu 2025-05-07T20:09:26.2464155Z INFO:root:copying _skbuild/linux-x86_64-3.12/setuptools/lib.linux-x86_64-cpython-312/fbgemm_gpu/fbgemm_gpu_py.so -> _skbuild/linux-x86_64-3.12/setuptools/bdist.linux-x86_64/wheel/./fbgemm_gpu 2025-05-07T20:09:26.2899952Z INFO:root:copying _skbuild/linux-x86_64-3.12/setuptools/lib.linux-x86_64-cpython-312/fbgemm_gpu/__init__.py -> _skbuild/linux-x86_64-3.12/setuptools/bdist.linux-x86_64/wheel/./fbgemm_gpu 2025-05-07T20:09:26.2901567Z INFO:root:copying _skbuild/linux-x86_64-3.12/setuptools/lib.linux-x86_64-cpython-312/fbgemm_gpu/batched_unary_embeddings_ops.py -> _skbuild/linux-x86_64-3.12/setuptools/bdist.linux-x86_64/wheel/./fbgemm_gpu 2025-05-07T20:09:26.2903087Z INFO:root:copying _skbuild/linux-x86_64-3.12/setuptools/lib.linux-x86_64-cpython-312/fbgemm_gpu/enums.py -> _skbuild/linux-x86_64-3.12/setuptools/bdist.linux-x86_64/wheel/./fbgemm_gpu 2025-05-07T20:09:26.2904525Z INFO:root:copying _skbuild/linux-x86_64-3.12/setuptools/lib.linux-x86_64-cpython-312/fbgemm_gpu/metrics.py -> _skbuild/linux-x86_64-3.12/setuptools/bdist.linux-x86_64/wheel/./fbgemm_gpu 2025-05-07T20:09:26.2906078Z INFO:root:copying _skbuild/linux-x86_64-3.12/setuptools/lib.linux-x86_64-cpython-312/fbgemm_gpu/permute_pooled_embedding_modules.py -> _skbuild/linux-x86_64-3.12/setuptools/bdist.linux-x86_64/wheel/./fbgemm_gpu 2025-05-07T20:09:26.2907950Z INFO:root:copying _skbuild/linux-x86_64-3.12/setuptools/lib.linux-x86_64-cpython-312/fbgemm_gpu/permute_pooled_embedding_modules_split.py -> _skbuild/linux-x86_64-3.12/setuptools/bdist.linux-x86_64/wheel/./fbgemm_gpu 2025-05-07T20:09:26.2909560Z INFO:root:copying _skbuild/linux-x86_64-3.12/setuptools/lib.linux-x86_64-cpython-312/fbgemm_gpu/quantize_comm.py -> _skbuild/linux-x86_64-3.12/setuptools/bdist.linux-x86_64/wheel/./fbgemm_gpu 2025-05-07T20:09:26.2911086Z INFO:root:copying _skbuild/linux-x86_64-3.12/setuptools/lib.linux-x86_64-cpython-312/fbgemm_gpu/quantize_utils.py -> _skbuild/linux-x86_64-3.12/setuptools/bdist.linux-x86_64/wheel/./fbgemm_gpu 2025-05-07T20:09:26.2912705Z INFO:root:copying _skbuild/linux-x86_64-3.12/setuptools/lib.linux-x86_64-cpython-312/fbgemm_gpu/runtime_monitor.py -> _skbuild/linux-x86_64-3.12/setuptools/bdist.linux-x86_64/wheel/./fbgemm_gpu 2025-05-07T20:09:26.2914232Z INFO:root:copying _skbuild/linux-x86_64-3.12/setuptools/lib.linux-x86_64-cpython-312/fbgemm_gpu/sparse_ops.py -> _skbuild/linux-x86_64-3.12/setuptools/bdist.linux-x86_64/wheel/./fbgemm_gpu 2025-05-07T20:09:26.2915787Z INFO:root:copying _skbuild/linux-x86_64-3.12/setuptools/lib.linux-x86_64-cpython-312/fbgemm_gpu/split_embedding_configs.py -> _skbuild/linux-x86_64-3.12/setuptools/bdist.linux-x86_64/wheel/./fbgemm_gpu 2025-05-07T20:09:26.2917432Z INFO:root:copying _skbuild/linux-x86_64-3.12/setuptools/lib.linux-x86_64-cpython-312/fbgemm_gpu/split_embedding_inference_converter.py -> _skbuild/linux-x86_64-3.12/setuptools/bdist.linux-x86_64/wheel/./fbgemm_gpu 2025-05-07T20:09:26.2919219Z INFO:root:copying _skbuild/linux-x86_64-3.12/setuptools/lib.linux-x86_64-cpython-312/fbgemm_gpu/split_embedding_optimizer_ops.py -> _skbuild/linux-x86_64-3.12/setuptools/bdist.linux-x86_64/wheel/./fbgemm_gpu 2025-05-07T20:09:26.2920909Z INFO:root:copying _skbuild/linux-x86_64-3.12/setuptools/lib.linux-x86_64-cpython-312/fbgemm_gpu/split_embedding_utils.py -> _skbuild/linux-x86_64-3.12/setuptools/bdist.linux-x86_64/wheel/./fbgemm_gpu 2025-05-07T20:09:26.2922532Z INFO:root:copying _skbuild/linux-x86_64-3.12/setuptools/lib.linux-x86_64-cpython-312/fbgemm_gpu/split_table_batched_embeddings_ops.py -> _skbuild/linux-x86_64-3.12/setuptools/bdist.linux-x86_64/wheel/./fbgemm_gpu 2025-05-07T20:09:26.2924332Z INFO:root:copying _skbuild/linux-x86_64-3.12/setuptools/lib.linux-x86_64-cpython-312/fbgemm_gpu/split_table_batched_embeddings_ops_common.py -> _skbuild/linux-x86_64-3.12/setuptools/bdist.linux-x86_64/wheel/./fbgemm_gpu 2025-05-07T20:09:26.2926112Z INFO:root:copying _skbuild/linux-x86_64-3.12/setuptools/lib.linux-x86_64-cpython-312/fbgemm_gpu/split_table_batched_embeddings_ops_inference.py -> _skbuild/linux-x86_64-3.12/setuptools/bdist.linux-x86_64/wheel/./fbgemm_gpu 2025-05-07T20:09:26.2927859Z INFO:root:copying _skbuild/linux-x86_64-3.12/setuptools/lib.linux-x86_64-cpython-312/fbgemm_gpu/split_table_batched_embeddings_ops_training.py -> _skbuild/linux-x86_64-3.12/setuptools/bdist.linux-x86_64/wheel/./fbgemm_gpu 2025-05-07T20:09:26.2929663Z INFO:root:copying _skbuild/linux-x86_64-3.12/setuptools/lib.linux-x86_64-cpython-312/fbgemm_gpu/split_table_batched_embeddings_ops_training_common.py -> _skbuild/linux-x86_64-3.12/setuptools/bdist.linux-x86_64/wheel/./fbgemm_gpu 2025-05-07T20:09:26.2931426Z INFO:root:copying _skbuild/linux-x86_64-3.12/setuptools/lib.linux-x86_64-cpython-312/fbgemm_gpu/ssd_split_table_batched_embeddings_ops.py -> _skbuild/linux-x86_64-3.12/setuptools/bdist.linux-x86_64/wheel/./fbgemm_gpu 2025-05-07T20:09:26.2933034Z INFO:root:copying _skbuild/linux-x86_64-3.12/setuptools/lib.linux-x86_64-cpython-312/fbgemm_gpu/tbe_input_multiplexer.py -> _skbuild/linux-x86_64-3.12/setuptools/bdist.linux-x86_64/wheel/./fbgemm_gpu 2025-05-07T20:09:26.2934511Z INFO:root:copying _skbuild/linux-x86_64-3.12/setuptools/lib.linux-x86_64-cpython-312/fbgemm_gpu/uvm.py -> _skbuild/linux-x86_64-3.12/setuptools/bdist.linux-x86_64/wheel/./fbgemm_gpu 2025-05-07T20:09:26.2935420Z INFO:skbuild:copied 125 files 2025-05-07T20:09:26.2935718Z INFO:root:running install_egg_info 2025-05-07T20:09:26.2976130Z INFO:root:running egg_info 2025-05-07T20:09:26.3019984Z INFO:root:writing fbgemm_gpu_nightly.egg-info/PKG-INFO 2025-05-07T20:09:26.3021830Z INFO:root:writing dependency_links to fbgemm_gpu_nightly.egg-info/dependency_links.txt 2025-05-07T20:09:26.3023961Z INFO:root:writing requirements to fbgemm_gpu_nightly.egg-info/requires.txt 2025-05-07T20:09:26.3024988Z INFO:root:writing top-level names to fbgemm_gpu_nightly.egg-info/top_level.txt 2025-05-07T20:09:26.3131238Z INFO:root:reading manifest file 'fbgemm_gpu_nightly.egg-info/SOURCES.txt' 2025-05-07T20:09:26.3163722Z INFO:root:writing manifest file 'fbgemm_gpu_nightly.egg-info/SOURCES.txt' 2025-05-07T20:09:26.3165022Z INFO:root:Copying fbgemm_gpu_nightly.egg-info to _skbuild/linux-x86_64-3.12/setuptools/bdist.linux-x86_64/wheel/./fbgemm_gpu_nightly-2025.5.7-py3.12.egg-info 2025-05-07T20:09:26.3172316Z INFO:root:running install_scripts 2025-05-07T20:09:26.3172881Z INFO:skbuild:copied 0 files 2025-05-07T20:09:29.1026664Z INFO:root:creating _skbuild/linux-x86_64-3.12/setuptools/bdist.linux-x86_64/wheel/fbgemm_gpu_nightly-2025.5.7.dist-info/WHEEL 2025-05-07T20:09:29.1028138Z INFO:wheel:creating '/__w/FBGEMM/FBGEMM/fbgemm_gpu/dist/.tmp-8whs2af1/fbgemm_gpu_nightly-2025.5.7-cp312-cp312-manylinux_2_28_x86_64.whl' and adding '_skbuild/linux-x86_64-3.12/setuptools/bdist.linux-x86_64/wheel' to it 2025-05-07T20:09:29.1030727Z INFO:wheel:adding 'fbgemm_gpu/__init__.py' 2025-05-07T20:09:29.1294950Z INFO:wheel:adding 'fbgemm_gpu/asmjit.so' 2025-05-07T20:09:29.1309462Z INFO:wheel:adding 'fbgemm_gpu/batched_unary_embeddings_ops.py' 2025-05-07T20:09:29.1310222Z INFO:wheel:adding 'fbgemm_gpu/enums.py' 2025-05-07T20:09:29.3334409Z INFO:wheel:adding 'fbgemm_gpu/fbgemm.so' 2025-05-07T20:09:29.3467205Z INFO:wheel:adding 'fbgemm_gpu/fbgemm_gpu_config.so' 2025-05-07T20:09:29.3600359Z INFO:wheel:adding 'fbgemm_gpu/fbgemm_gpu_embedding_inplace_ops.so' 2025-05-07T20:09:31.0743051Z INFO:wheel:adding 'fbgemm_gpu/fbgemm_gpu_py.so' 2025-05-07T20:09:31.2768554Z INFO:wheel:adding 'fbgemm_gpu/fbgemm_gpu_sparse_async_cumsum.so' 2025-05-07T20:09:31.9872140Z INFO:wheel:adding 'fbgemm_gpu/fbgemm_gpu_tbe_cache.so' 2025-05-07T20:09:32.0950677Z INFO:wheel:adding 'fbgemm_gpu/fbgemm_gpu_tbe_common.so' 2025-05-07T20:09:32.6889950Z INFO:wheel:adding 'fbgemm_gpu/fbgemm_gpu_tbe_index_select.so' 2025-05-07T20:09:50.5355423Z INFO:wheel:adding 'fbgemm_gpu/fbgemm_gpu_tbe_inference.so' 2025-05-07T20:09:51.7721240Z INFO:wheel:adding 'fbgemm_gpu/fbgemm_gpu_tbe_optimizers.so' 2025-05-07T20:10:18.9240811Z INFO:wheel:adding 'fbgemm_gpu/fbgemm_gpu_tbe_training_backward.so' 2025-05-07T20:10:21.7379883Z INFO:wheel:adding 'fbgemm_gpu/fbgemm_gpu_tbe_training_backward_dense.so' 2025-05-07T20:10:25.3462077Z INFO:wheel:adding 'fbgemm_gpu/fbgemm_gpu_tbe_training_backward_gwd.so' 2025-05-07T20:10:26.0373758Z INFO:wheel:adding 'fbgemm_gpu/fbgemm_gpu_tbe_training_backward_pt2.so' 2025-05-07T20:10:26.2559173Z INFO:wheel:adding 'fbgemm_gpu/fbgemm_gpu_tbe_training_backward_split_host.so' 2025-05-07T20:10:34.8638553Z INFO:wheel:adding 'fbgemm_gpu/fbgemm_gpu_tbe_training_backward_vbe.so' 2025-05-07T20:10:45.7679776Z INFO:wheel:adding 'fbgemm_gpu/fbgemm_gpu_tbe_training_forward.so' 2025-05-07T20:10:47.2353971Z INFO:wheel:adding 'fbgemm_gpu/fbgemm_gpu_tbe_utils.so' 2025-05-07T20:10:47.2711105Z INFO:wheel:adding 'fbgemm_gpu/metrics.py' 2025-05-07T20:10:47.2711804Z INFO:wheel:adding 'fbgemm_gpu/permute_pooled_embedding_modules.py' 2025-05-07T20:10:47.2712364Z INFO:wheel:adding 'fbgemm_gpu/permute_pooled_embedding_modules_split.py' 2025-05-07T20:10:47.2715718Z INFO:wheel:adding 'fbgemm_gpu/quantize_comm.py' 2025-05-07T20:10:47.2719107Z INFO:wheel:adding 'fbgemm_gpu/quantize_utils.py' 2025-05-07T20:10:47.2722519Z INFO:wheel:adding 'fbgemm_gpu/runtime_monitor.py' 2025-05-07T20:10:47.2733763Z INFO:wheel:adding 'fbgemm_gpu/sparse_ops.py' 2025-05-07T20:10:47.2737764Z INFO:wheel:adding 'fbgemm_gpu/split_embedding_configs.py' 2025-05-07T20:10:47.2740987Z INFO:wheel:adding 'fbgemm_gpu/split_embedding_inference_converter.py' 2025-05-07T20:10:47.2742913Z INFO:wheel:adding 'fbgemm_gpu/split_embedding_optimizer_ops.py' 2025-05-07T20:10:47.2744693Z INFO:wheel:adding 'fbgemm_gpu/split_embedding_utils.py' 2025-05-07T20:10:47.2746864Z INFO:wheel:adding 'fbgemm_gpu/split_table_batched_embeddings_ops.py' 2025-05-07T20:10:47.2750411Z INFO:wheel:adding 'fbgemm_gpu/split_table_batched_embeddings_ops_common.py' 2025-05-07T20:10:47.2772252Z INFO:wheel:adding 'fbgemm_gpu/split_table_batched_embeddings_ops_inference.py' 2025-05-07T20:10:47.2817584Z INFO:wheel:adding 'fbgemm_gpu/split_table_batched_embeddings_ops_training.py' 2025-05-07T20:10:47.2820150Z INFO:wheel:adding 'fbgemm_gpu/split_table_batched_embeddings_ops_training_common.py' 2025-05-07T20:10:47.2822027Z INFO:wheel:adding 'fbgemm_gpu/ssd_split_table_batched_embeddings_ops.py' 2025-05-07T20:10:47.2824260Z INFO:wheel:adding 'fbgemm_gpu/tbe_input_multiplexer.py' 2025-05-07T20:10:47.2826163Z INFO:wheel:adding 'fbgemm_gpu/uvm.py' 2025-05-07T20:10:47.2828781Z INFO:wheel:adding 'fbgemm_gpu/config/__init__.py' 2025-05-07T20:10:47.2830976Z INFO:wheel:adding 'fbgemm_gpu/config/feature_list.py' 2025-05-07T20:10:47.2833370Z INFO:wheel:adding 'fbgemm_gpu/docs/__init__.py' 2025-05-07T20:10:47.2834977Z INFO:wheel:adding 'fbgemm_gpu/docs/common.py' 2025-05-07T20:10:47.2837196Z INFO:wheel:adding 'fbgemm_gpu/docs/examples.py' 2025-05-07T20:10:47.2840030Z INFO:wheel:adding 'fbgemm_gpu/docs/jagged_tensor_ops.py' 2025-05-07T20:10:47.2842148Z INFO:wheel:adding 'fbgemm_gpu/docs/merge_pooled_embedding_ops.py' 2025-05-07T20:10:47.2844713Z INFO:wheel:adding 'fbgemm_gpu/docs/permute_pooled_embedding_ops.py' 2025-05-07T20:10:47.2846721Z INFO:wheel:adding 'fbgemm_gpu/docs/quantize_ops.py' 2025-05-07T20:10:47.2852842Z INFO:wheel:adding 'fbgemm_gpu/docs/sparse_ops.py' 2025-05-07T20:10:47.2855103Z INFO:wheel:adding 'fbgemm_gpu/docs/version.py' 2025-05-07T20:10:47.2857179Z INFO:wheel:adding 'fbgemm_gpu/quantize/__init__.py' 2025-05-07T20:10:47.2859396Z INFO:wheel:adding 'fbgemm_gpu/quantize/quantize_ops.py' 2025-05-07T20:10:47.2861666Z INFO:wheel:adding 'fbgemm_gpu/sll/__init__.py' 2025-05-07T20:10:47.2864076Z INFO:wheel:adding 'fbgemm_gpu/sll/cpu/__init__.py' 2025-05-07T20:10:47.2871097Z INFO:wheel:adding 'fbgemm_gpu/sll/cpu/cpu_sll.py' 2025-05-07T20:10:47.2874031Z INFO:wheel:adding 'fbgemm_gpu/sll/meta/__init__.py' 2025-05-07T20:10:47.2876819Z INFO:wheel:adding 'fbgemm_gpu/sll/meta/meta_sll.py' 2025-05-07T20:10:47.2879694Z INFO:wheel:adding 'fbgemm_gpu/sll/triton/__init__.py' 2025-05-07T20:10:47.2881514Z INFO:wheel:adding 'fbgemm_gpu/sll/triton/common.py' 2025-05-07T20:10:47.2883700Z INFO:wheel:adding 'fbgemm_gpu/sll/triton/triton_dense_jagged_cat_jagged_out.py' 2025-05-07T20:10:47.2886410Z INFO:wheel:adding 'fbgemm_gpu/sll/triton/triton_jagged2_to_padded_dense.py' 2025-05-07T20:10:47.2890330Z INFO:wheel:adding 'fbgemm_gpu/sll/triton/triton_jagged_bmm.py' 2025-05-07T20:10:47.2895111Z INFO:wheel:adding 'fbgemm_gpu/sll/triton/triton_jagged_bmm_jagged_out.py' 2025-05-07T20:10:47.2897616Z INFO:wheel:adding 'fbgemm_gpu/sll/triton/triton_jagged_dense_elementwise_add.py' 2025-05-07T20:10:47.2900230Z INFO:wheel:adding 'fbgemm_gpu/sll/triton/triton_jagged_dense_elementwise_mul_jagged_out.py' 2025-05-07T20:10:47.2906049Z INFO:wheel:adding 'fbgemm_gpu/sll/triton/triton_jagged_dense_flash_attention.py' 2025-05-07T20:10:47.2911844Z INFO:wheel:adding 'fbgemm_gpu/sll/triton/triton_jagged_flash_attention_basic.py' 2025-05-07T20:10:47.2914481Z INFO:wheel:adding 'fbgemm_gpu/sll/triton/triton_jagged_self_substraction_jagged_out.py' 2025-05-07T20:10:47.2918540Z INFO:wheel:adding 'fbgemm_gpu/sll/triton/triton_jagged_softmax.py' 2025-05-07T20:10:47.2924176Z INFO:wheel:adding 'fbgemm_gpu/sll/triton/triton_multi_head_jagged_flash_attention.py' 2025-05-07T20:10:47.2927174Z INFO:wheel:adding 'fbgemm_gpu/split_embedding_codegen_lookup_invokers/__init__.py' 2025-05-07T20:10:47.2930466Z INFO:wheel:adding 'fbgemm_gpu/split_embedding_codegen_lookup_invokers/lookup_adagrad.py' 2025-05-07T20:10:47.2934391Z INFO:wheel:adding 'fbgemm_gpu/split_embedding_codegen_lookup_invokers/lookup_adam.py' 2025-05-07T20:10:47.2936851Z INFO:wheel:adding 'fbgemm_gpu/split_embedding_codegen_lookup_invokers/lookup_args.py' 2025-05-07T20:10:47.2939093Z INFO:wheel:adding 'fbgemm_gpu/split_embedding_codegen_lookup_invokers/lookup_args_ssd.py' 2025-05-07T20:10:47.2942386Z INFO:wheel:adding 'fbgemm_gpu/split_embedding_codegen_lookup_invokers/lookup_lamb.py' 2025-05-07T20:10:47.2945855Z INFO:wheel:adding 'fbgemm_gpu/split_embedding_codegen_lookup_invokers/lookup_lars_sgd.py' 2025-05-07T20:10:47.2949096Z INFO:wheel:adding 'fbgemm_gpu/split_embedding_codegen_lookup_invokers/lookup_none.py' 2025-05-07T20:10:47.2953224Z INFO:wheel:adding 'fbgemm_gpu/split_embedding_codegen_lookup_invokers/lookup_partial_rowwise_adam.py' 2025-05-07T20:10:47.2956724Z INFO:wheel:adding 'fbgemm_gpu/split_embedding_codegen_lookup_invokers/lookup_partial_rowwise_lamb.py' 2025-05-07T20:10:47.2960099Z INFO:wheel:adding 'fbgemm_gpu/split_embedding_codegen_lookup_invokers/lookup_rowwise_adagrad.py' 2025-05-07T20:10:47.2963709Z INFO:wheel:adding 'fbgemm_gpu/split_embedding_codegen_lookup_invokers/lookup_rowwise_adagrad_ssd.py' 2025-05-07T20:10:47.2967991Z INFO:wheel:adding 'fbgemm_gpu/split_embedding_codegen_lookup_invokers/lookup_rowwise_adagrad_with_counter.py' 2025-05-07T20:10:47.2970987Z INFO:wheel:adding 'fbgemm_gpu/split_embedding_codegen_lookup_invokers/lookup_sgd.py' 2025-05-07T20:10:47.2973300Z INFO:wheel:adding 'fbgemm_gpu/split_embedding_optimizer_codegen/optimizer_args.py' 2025-05-07T20:10:47.2976160Z INFO:wheel:adding 'fbgemm_gpu/split_embedding_optimizer_codegen/split_embedding_optimizer_rowwise_adagrad.py' 2025-05-07T20:10:47.2978004Z INFO:wheel:adding 'fbgemm_gpu/tbe/__init__.py' 2025-05-07T20:10:47.2980476Z INFO:wheel:adding 'fbgemm_gpu/tbe/bench/__init__.py' 2025-05-07T20:10:47.2982867Z INFO:wheel:adding 'fbgemm_gpu/tbe/bench/bench_config.py' 2025-05-07T20:10:47.2988118Z INFO:wheel:adding 'fbgemm_gpu/tbe/bench/bench_runs.py' 2025-05-07T20:10:47.2990960Z INFO:wheel:adding 'fbgemm_gpu/tbe/bench/eeg_cli.py' 2025-05-07T20:10:47.2993778Z INFO:wheel:adding 'fbgemm_gpu/tbe/bench/embedding_ops_common_config.py' 2025-05-07T20:10:47.2995975Z INFO:wheel:adding 'fbgemm_gpu/tbe/bench/eval_compression.py' 2025-05-07T20:10:47.2997814Z INFO:wheel:adding 'fbgemm_gpu/tbe/bench/reporter.py' 2025-05-07T20:10:47.3001319Z INFO:wheel:adding 'fbgemm_gpu/tbe/bench/tbe_data_config.py' 2025-05-07T20:10:47.3004422Z INFO:wheel:adding 'fbgemm_gpu/tbe/bench/tbe_data_config_loader.py' 2025-05-07T20:10:47.3007183Z INFO:wheel:adding 'fbgemm_gpu/tbe/bench/tbe_data_config_param_models.py' 2025-05-07T20:10:47.3009218Z INFO:wheel:adding 'fbgemm_gpu/tbe/bench/utils.py' 2025-05-07T20:10:47.3011185Z INFO:wheel:adding 'fbgemm_gpu/tbe/cache/__init__.py' 2025-05-07T20:10:47.3013147Z INFO:wheel:adding 'fbgemm_gpu/tbe/cache/split_embeddings_cache_ops.py' 2025-05-07T20:10:47.3015018Z INFO:wheel:adding 'fbgemm_gpu/tbe/ssd/__init__.py' 2025-05-07T20:10:47.3016674Z INFO:wheel:adding 'fbgemm_gpu/tbe/ssd/common.py' 2025-05-07T20:10:47.3022859Z INFO:wheel:adding 'fbgemm_gpu/tbe/ssd/inference.py' 2025-05-07T20:10:47.3049900Z INFO:wheel:adding 'fbgemm_gpu/tbe/ssd/training.py' 2025-05-07T20:10:47.3051880Z INFO:wheel:adding 'fbgemm_gpu/tbe/ssd/utils/__init__.py' 2025-05-07T20:10:47.3055111Z INFO:wheel:adding 'fbgemm_gpu/tbe/ssd/utils/partially_materialized_tensor.py' 2025-05-07T20:10:47.3057132Z INFO:wheel:adding 'fbgemm_gpu/tbe/stats/__init__.py' 2025-05-07T20:10:47.3060133Z INFO:wheel:adding 'fbgemm_gpu/tbe/stats/bench_params_reporter.py' 2025-05-07T20:10:47.3062192Z INFO:wheel:adding 'fbgemm_gpu/tbe/utils/__init__.py' 2025-05-07T20:10:47.3064063Z INFO:wheel:adding 'fbgemm_gpu/tbe/utils/common.py' 2025-05-07T20:10:47.3066467Z INFO:wheel:adding 'fbgemm_gpu/tbe/utils/offsets.py' 2025-05-07T20:10:47.3069253Z INFO:wheel:adding 'fbgemm_gpu/tbe/utils/quantize.py' 2025-05-07T20:10:47.3075315Z INFO:wheel:adding 'fbgemm_gpu/tbe/utils/requests.py' 2025-05-07T20:10:47.3077749Z INFO:wheel:adding 'fbgemm_gpu/triton/__init__.py' 2025-05-07T20:10:47.3079842Z INFO:wheel:adding 'fbgemm_gpu/triton/common.py' 2025-05-07T20:10:47.3087733Z INFO:wheel:adding 'fbgemm_gpu/triton/quantize.py' 2025-05-07T20:10:47.3092724Z INFO:wheel:adding 'fbgemm_gpu/triton/quantize_ref.py' 2025-05-07T20:10:47.3094993Z INFO:wheel:adding 'fbgemm_gpu/triton/jagged/__init__.py' 2025-05-07T20:10:47.3103241Z INFO:wheel:adding 'fbgemm_gpu/triton/jagged/triton_jagged_tensor_ops.py' 2025-05-07T20:10:47.3105849Z INFO:wheel:adding 'fbgemm_gpu/utils/__init__.py' 2025-05-07T20:10:47.3108398Z INFO:wheel:adding 'fbgemm_gpu/utils/filestore.py' 2025-05-07T20:10:47.3110322Z INFO:wheel:adding 'fbgemm_gpu/utils/loader.py' 2025-05-07T20:10:47.3118354Z INFO:wheel:adding 'fbgemm_gpu/utils/torch_library.py' 2025-05-07T20:10:47.3118901Z INFO:wheel:adding 'fbgemm_gpu_nightly-2025.5.7.dist-info/METADATA' 2025-05-07T20:10:47.3119422Z INFO:wheel:adding 'fbgemm_gpu_nightly-2025.5.7.dist-info/WHEEL' 2025-05-07T20:10:47.3119947Z INFO:wheel:adding 'fbgemm_gpu_nightly-2025.5.7.dist-info/top_level.txt' 2025-05-07T20:10:47.3124323Z INFO:wheel:adding 'fbgemm_gpu_nightly-2025.5.7.dist-info/RECORD' 2025-05-07T20:10:47.3127990Z INFO:root:removing _skbuild/linux-x86_64-3.12/setuptools/bdist.linux-x86_64/wheel 2025-05-07T20:10:47.4736913Z ╒════════════════════════════╤════════════════════════════════════════════════╕ 2025-05-07T20:10:47.4737443Z │ │ Version │ 2025-05-07T20:10:47.4738000Z ╞════════════════════════════╪════════════════════════════════════════════════╡ 2025-05-07T20:10:47.4738785Z │ PyTorch │ 2.8.0.dev20250507+cu126 │ 2025-05-07T20:10:47.4739321Z ├────────────────────────────┼────────────────────────────────────────────────┤ 2025-05-07T20:10:47.4739865Z │ CUDA (Declared by PyTorch) │ 12.6 │ 2025-05-07T20:10:47.4740448Z ├────────────────────────────┼────────────────────────────────────────────────┤ 2025-05-07T20:10:47.4741036Z │ CUDA (Actual) │ nvcc: NVIDIA (R) Cuda compiler driver │ 2025-05-07T20:10:47.4741568Z │ │ Copyright (c) 2005-2024 NVIDIA Corporation │ 2025-05-07T20:10:47.4742060Z │ │ Built on Tue_Oct_29_23:50:19_PDT_2024 │ 2025-05-07T20:10:47.4742543Z │ │ Cuda compilation tools, release 12.6, V12.6.85 │ 2025-05-07T20:10:47.4743072Z │ │ Build cuda_12.6.r12.6/compiler.35059454_0 │ 2025-05-07T20:10:47.4743627Z ╘════════════════════════════╧════════════════════════════════════════════════╛ 2025-05-07T20:10:47.7633354Z Successfully built fbgemm_gpu_nightly-2025.5.7-cp312-cp312-manylinux_2_28_x86_64.whl 2025-05-07T20:10:47.8498169Z 2025-05-07T20:10:47.8649646Z ################################################################################ 2025-05-07T20:10:47.8650196Z [CHECK] BUILT LIBRARY: ./_skbuild/linux-x86_64-3.12/cmake-build/asmjit.so 2025-05-07T20:10:47.8650676Z [CHECK] Listing out library size: 2025-05-07T20:10:47.8651085Z + du -h --block-size=1M ./_skbuild/linux-x86_64-3.12/cmake-build/asmjit.so 2025-05-07T20:10:47.8651398Z 2025-05-07T20:10:47.8660420Z 1 ./_skbuild/linux-x86_64-3.12/cmake-build/asmjit.so 2025-05-07T20:10:47.8664199Z 2025-05-07T20:10:47.8665798Z [CHECK] Listing out the GLIBC versions referenced by: ./_skbuild/linux-x86_64-3.12/cmake-build/asmjit.so 2025-05-07T20:10:47.8666745Z + objdump -TC ./_skbuild/linux-x86_64-3.12/cmake-build/asmjit.so | grep GLIBC_ | sed 's/.*GLIBC_\([.0-9]*\).*/GLIBC_\1/g' | sort -Vu | cat 2025-05-07T20:10:47.8667321Z 2025-05-07T20:10:47.8732485Z GLIBC_2.2.5 2025-05-07T20:10:47.8733159Z GLIBC_2.14 2025-05-07T20:10:47.8733532Z 2025-05-07T20:10:47.8733546Z 2025-05-07T20:10:47.8734574Z [CHECK] Listing out the GLIBCXX versions referenced by: ./_skbuild/linux-x86_64-3.12/cmake-build/asmjit.so 2025-05-07T20:10:47.8736517Z + objdump -TC ./_skbuild/linux-x86_64-3.12/cmake-build/asmjit.so | grep GLIBCXX_ | sed 's/.*GLIBCXX_\([.0-9]*\).*/GLIBCXX_\1/g' | sort -Vu | cat 2025-05-07T20:10:47.8737163Z 2025-05-07T20:10:47.8794958Z GLIBCXX_3.4 2025-05-07T20:10:47.8795618Z 2025-05-07T20:10:47.8795670Z 2025-05-07T20:10:47.8817298Z + nm -gDC ./_skbuild/linux-x86_64-3.12/cmake-build/asmjit.so > /tmp/tmp.ZlIJj0RZQ7.symbols.txt 2025-05-07T20:10:47.8818601Z 2025-05-07T20:10:47.8851938Z 2025-05-07T20:10:47.8883755Z [CHECK] Total Number of symbols: 841 2025-05-07T20:10:47.8899356Z [CHECK] Number of fbgemm symbols: 0 2025-05-07T20:10:47.8914902Z + nm -gDCu ./_skbuild/linux-x86_64-3.12/cmake-build/asmjit.so > /tmp/tmp.2AHTHdZQHq.usymbols.txt 2025-05-07T20:10:47.8916562Z 2025-05-07T20:10:47.8933714Z 2025-05-07T20:10:47.8968877Z [CHECK] Listing out undefined symbols (51 total): 2025-05-07T20:10:47.8993511Z U _Unwind_Resume@GCC_3.0 2025-05-07T20:10:47.8994569Z U __cxa_begin_catch@CXXABI_1.3 2025-05-07T20:10:47.8995782Z U __cxa_guard_acquire@CXXABI_1.3 2025-05-07T20:10:47.8996759Z U __cxa_guard_release@CXXABI_1.3 2025-05-07T20:10:47.8997690Z U __errno_location@GLIBC_2.2.5 2025-05-07T20:10:47.8998608Z U __gxx_personality_v0@CXXABI_1.3 2025-05-07T20:10:47.8998935Z U abort@GLIBC_2.2.5 2025-05-07T20:10:47.8999249Z U bcmp@GLIBC_2.2.5 2025-05-07T20:10:47.8999537Z U close@GLIBC_2.2.5 2025-05-07T20:10:47.8999842Z U fputs@GLIBC_2.2.5 2025-05-07T20:10:47.9000132Z U free@GLIBC_2.2.5 2025-05-07T20:10:47.9000440Z U ftruncate64@GLIBC_2.2.5 2025-05-07T20:10:47.9000767Z U fwrite@GLIBC_2.2.5 2025-05-07T20:10:47.9001059Z U getenv@GLIBC_2.2.5 2025-05-07T20:10:47.9001376Z U getpagesize@GLIBC_2.2.5 2025-05-07T20:10:47.9001687Z U madvise@GLIBC_2.2.5 2025-05-07T20:10:47.9001997Z U malloc@GLIBC_2.2.5 2025-05-07T20:10:47.9002293Z U memcmp@GLIBC_2.2.5 2025-05-07T20:10:47.9002775Z U memcpy@GLIBC_2.14 2025-05-07T20:10:47.9003073Z U memmove@GLIBC_2.2.5 2025-05-07T20:10:47.9003386Z U memset@GLIBC_2.2.5 2025-05-07T20:10:47.9003694Z U mmap@GLIBC_2.2.5 2025-05-07T20:10:47.9003984Z U mprotect@GLIBC_2.2.5 2025-05-07T20:10:47.9004306Z U munmap@GLIBC_2.2.5 2025-05-07T20:10:47.9004599Z U open64@GLIBC_2.2.5 2025-05-07T20:10:47.9004997Z U operator delete(void*)@GLIBCXX_3.4 2025-05-07T20:10:47.9005357Z U pthread_mutex_destroy@GLIBC_2.2.5 2025-05-07T20:10:47.9005721Z U pthread_mutex_lock@GLIBC_2.2.5 2025-05-07T20:10:47.9006063Z U pthread_mutex_unlock@GLIBC_2.2.5 2025-05-07T20:10:47.9006407Z U read@GLIBC_2.2.5 2025-05-07T20:10:47.9006704Z U realloc@GLIBC_2.2.5 2025-05-07T20:10:47.9007025Z U shm_open@GLIBC_2.2.5 2025-05-07T20:10:47.9007346Z U shm_unlink@GLIBC_2.2.5 2025-05-07T20:10:47.9007650Z U snprintf@GLIBC_2.2.5 2025-05-07T20:10:47.9008002Z U std::terminate()@GLIBCXX_3.4 2025-05-07T20:10:47.9008435Z U stderr@GLIBC_2.2.5 2025-05-07T20:10:47.9008737Z U strcmp@GLIBC_2.2.5 2025-05-07T20:10:47.9009022Z U strlen@GLIBC_2.2.5 2025-05-07T20:10:47.9009323Z U strtol@GLIBC_2.2.5 2025-05-07T20:10:47.9009611Z U syscall@GLIBC_2.2.5 2025-05-07T20:10:47.9009917Z U sysconf@GLIBC_2.2.5 2025-05-07T20:10:47.9010218Z U uname@GLIBC_2.2.5 2025-05-07T20:10:47.9010501Z U unlink@GLIBC_2.2.5 2025-05-07T20:10:47.9010811Z U vsnprintf@GLIBC_2.2.5 2025-05-07T20:10:47.9011167Z U vtable for __cxxabiv1::__class_type_info@CXXABI_1.3 2025-05-07T20:10:47.9011608Z U vtable for __cxxabiv1::__si_class_type_info@CXXABI_1.3 2025-05-07T20:10:47.9012098Z U vtable for __cxxabiv1::__vmi_class_type_info@CXXABI_1.3 2025-05-07T20:10:47.9012505Z w _ITM_deregisterTMCloneTable 2025-05-07T20:10:47.9012850Z w _ITM_registerTMCloneTable 2025-05-07T20:10:47.9013170Z w __cxa_finalize@GLIBC_2.2.5 2025-05-07T20:10:47.9013493Z w __gmon_start__ 2025-05-07T20:10:47.9013831Z [CHECK] Listing out external shared libraries linked: 2025-05-07T20:10:47.9014255Z + ldd ./_skbuild/linux-x86_64-3.12/cmake-build/asmjit.so 2025-05-07T20:10:47.9014515Z 2025-05-07T20:10:47.9051391Z linux-vdso.so.1 (0x00007fffd4186000) 2025-05-07T20:10:47.9052213Z libtorch.so => not found 2025-05-07T20:10:47.9052588Z libc10.so => not found 2025-05-07T20:10:47.9052892Z libnvrtc.so.12 => not found 2025-05-07T20:10:47.9053196Z libc10_cuda.so => not found 2025-05-07T20:10:47.9053550Z libnccl.so.2 => not found 2025-05-07T20:10:47.9053853Z libcuda.so.1 => not found 2025-05-07T20:10:47.9054120Z libnvidia-ml.so.1 => not found 2025-05-07T20:10:47.9054430Z libtorch_cpu.so => not found 2025-05-07T20:10:47.9054709Z libtorch_cuda.so => not found 2025-05-07T20:10:47.9055073Z libstdc++.so.6 => /lib64/libstdc++.so.6 (0x00007f45b0c27000) 2025-05-07T20:10:47.9055511Z libgomp.so.1 => /lib64/libgomp.so.1 (0x00007f45b0bd1000) 2025-05-07T20:10:47.9055929Z librt.so.1 => /lib64/librt.so.1 (0x00007f45b0bca000) 2025-05-07T20:10:47.9056325Z libgcc_s.so.1 => /lib64/libgcc_s.so.1 (0x00007f45b0b9c000) 2025-05-07T20:10:47.9056788Z libpthread.so.0 => /lib64/libpthread.so.0 (0x00007f45b0b97000) 2025-05-07T20:10:47.9057226Z libc.so.6 => /lib64/libc.so.6 (0x00007f45b098f000) 2025-05-07T20:10:47.9057588Z libm.so.6 => /lib64/libm.so.6 (0x00007f45b08b4000) 2025-05-07T20:10:47.9057973Z /lib64/ld-linux-x86-64.so.2 (0x00007f45b0f08000) 2025-05-07T20:10:47.9058224Z 2025-05-07T20:10:47.9058343Z [CHECK] Displaying ELF information: 2025-05-07T20:10:47.9058748Z + readelf -d ./_skbuild/linux-x86_64-3.12/cmake-build/asmjit.so 2025-05-07T20:10:47.9059044Z 2025-05-07T20:10:47.9098517Z 2025-05-07T20:10:47.9098870Z Dynamic section at offset 0x75898 contains 39 entries: 2025-05-07T20:10:47.9099347Z Tag Type Name/Value 2025-05-07T20:10:47.9099803Z 0x0000000000000001 (NEEDED) Shared library: [libtorch.so] 2025-05-07T20:10:47.9100340Z 0x0000000000000001 (NEEDED) Shared library: [libc10.so] 2025-05-07T20:10:47.9100856Z 0x0000000000000001 (NEEDED) Shared library: [libnvrtc.so.12] 2025-05-07T20:10:47.9101468Z 0x0000000000000001 (NEEDED) Shared library: [libc10_cuda.so] 2025-05-07T20:10:47.9102003Z 0x0000000000000001 (NEEDED) Shared library: [libnccl.so.2] 2025-05-07T20:10:47.9102537Z 0x0000000000000001 (NEEDED) Shared library: [libcuda.so.1] 2025-05-07T20:10:47.9103088Z 0x0000000000000001 (NEEDED) Shared library: [libnvidia-ml.so.1] 2025-05-07T20:10:47.9103621Z 0x0000000000000001 (NEEDED) Shared library: [libtorch_cpu.so] 2025-05-07T20:10:47.9104173Z 0x0000000000000001 (NEEDED) Shared library: [libtorch_cuda.so] 2025-05-07T20:10:47.9104701Z 0x0000000000000001 (NEEDED) Shared library: [libstdc++.so.6] 2025-05-07T20:10:47.9105235Z 0x0000000000000001 (NEEDED) Shared library: [libgomp.so.1] 2025-05-07T20:10:47.9105746Z 0x0000000000000001 (NEEDED) Shared library: [librt.so.1] 2025-05-07T20:10:47.9106365Z 0x0000000000000001 (NEEDED) Shared library: [libgcc_s.so.1] 2025-05-07T20:10:47.9106860Z 0x0000000000000001 (NEEDED) Shared library: [libpthread.so.0] 2025-05-07T20:10:47.9107340Z 0x0000000000000001 (NEEDED) Shared library: [libc.so.6] 2025-05-07T20:10:47.9107817Z 0x000000000000000e (SONAME) Library soname: [asmjit.so] 2025-05-07T20:10:47.9108207Z 0x000000000000000c (INIT) 0x19000 2025-05-07T20:10:47.9108532Z 0x000000000000000d (FINI) 0x56a1c 2025-05-07T20:10:47.9108914Z 0x0000000000000019 (INIT_ARRAY) 0x74ac0 2025-05-07T20:10:47.9109261Z 0x000000000000001b (INIT_ARRAYSZ) 8 (bytes) 2025-05-07T20:10:47.9109610Z 0x000000000000001a (FINI_ARRAY) 0x74ac8 2025-05-07T20:10:47.9109934Z 0x000000000000001c (FINI_ARRAYSZ) 8 (bytes) 2025-05-07T20:10:47.9110282Z 0x000000006ffffef5 (GNU_HASH) 0x200 2025-05-07T20:10:47.9110593Z 0x0000000000000005 (STRTAB) 0x6980 2025-05-07T20:10:47.9110916Z 0x0000000000000006 (SYMTAB) 0x1a90 2025-05-07T20:10:47.9111360Z 0x000000000000000a (STRSZ) 48829 (bytes) 2025-05-07T20:10:47.9111964Z 0x000000000000000b (SYMENT) 24 (bytes) 2025-05-07T20:10:47.9112366Z 0x0000000000000003 (PLTGOT) 0x75fe8 2025-05-07T20:10:47.9112741Z 0x0000000000000002 (PLTRELSZ) 8472 (bytes) 2025-05-07T20:10:47.9113110Z 0x0000000000000014 (PLTREL) RELA 2025-05-07T20:10:47.9113438Z 0x0000000000000017 (JMPREL) 0x162e0 2025-05-07T20:10:47.9113785Z 0x0000000000000007 (RELA) 0x12f98 2025-05-07T20:10:47.9114134Z 0x0000000000000008 (RELASZ) 13128 (bytes) 2025-05-07T20:10:47.9114511Z 0x0000000000000009 (RELAENT) 24 (bytes) 2025-05-07T20:10:47.9114859Z 0x000000006ffffffe (VERNEED) 0x12ed8 2025-05-07T20:10:47.9115211Z 0x000000006fffffff (VERNEEDNUM) 5 2025-05-07T20:10:47.9115542Z 0x000000006ffffff0 (VERSYM) 0x1283e 2025-05-07T20:10:47.9115890Z 0x000000006ffffff9 (RELACOUNT) 3 2025-05-07T20:10:47.9116222Z 0x0000000000000000 (NULL) 0x0 2025-05-07T20:10:47.9116441Z 2025-05-07T20:10:47.9116557Z ################################################################################ 2025-05-07T20:10:47.9116803Z 2025-05-07T20:10:47.9116807Z 2025-05-07T20:10:47.9116922Z ################################################################################ 2025-05-07T20:10:47.9117401Z [CHECK] BUILT LIBRARY: ./_skbuild/linux-x86_64-3.12/cmake-build/fbgemm_gpu_config.so 2025-05-07T20:10:47.9117900Z [CHECK] Listing out library size: 2025-05-07T20:10:47.9118401Z + du -h --block-size=1M ./_skbuild/linux-x86_64-3.12/cmake-build/fbgemm_gpu_config.so 2025-05-07T20:10:47.9118762Z 2025-05-07T20:10:47.9118947Z 1 ./_skbuild/linux-x86_64-3.12/cmake-build/fbgemm_gpu_config.so 2025-05-07T20:10:47.9119260Z 2025-05-07T20:10:47.9119633Z [CHECK] Listing out the GLIBC versions referenced by: ./_skbuild/linux-x86_64-3.12/cmake-build/fbgemm_gpu_config.so 2025-05-07T20:10:47.9120633Z + objdump -TC ./_skbuild/linux-x86_64-3.12/cmake-build/fbgemm_gpu_config.so | grep GLIBC_ | sed 's/.*GLIBC_\([.0-9]*\).*/GLIBC_\1/g' | sort -Vu | cat 2025-05-07T20:10:47.9121212Z 2025-05-07T20:10:47.9174044Z GLIBC_2.2.5 2025-05-07T20:10:47.9174308Z GLIBC_2.14 2025-05-07T20:10:47.9176045Z 2025-05-07T20:10:47.9176204Z 2025-05-07T20:10:47.9176798Z [CHECK] Listing out the GLIBCXX versions referenced by: ./_skbuild/linux-x86_64-3.12/cmake-build/fbgemm_gpu_config.so 2025-05-07T20:10:47.9177845Z + objdump -TC ./_skbuild/linux-x86_64-3.12/cmake-build/fbgemm_gpu_config.so | grep GLIBCXX_ | sed 's/.*GLIBCXX_\([.0-9]*\).*/GLIBCXX_\1/g' | sort -Vu | cat 2025-05-07T20:10:47.9178461Z 2025-05-07T20:10:47.9234032Z GLIBCXX_3.4 2025-05-07T20:10:47.9234284Z GLIBCXX_3.4.9 2025-05-07T20:10:47.9234521Z GLIBCXX_3.4.21 2025-05-07T20:10:47.9234776Z 2025-05-07T20:10:47.9234840Z 2025-05-07T20:10:47.9256496Z + nm -gDC ./_skbuild/linux-x86_64-3.12/cmake-build/fbgemm_gpu_config.so > /tmp/tmp.4yHZK1ZJI7.symbols.txt 2025-05-07T20:10:47.9257001Z 2025-05-07T20:10:47.9279471Z 2025-05-07T20:10:47.9313829Z [CHECK] Total Number of symbols: 116 2025-05-07T20:10:47.9335037Z [CHECK] Number of fbgemm symbols: 4 2025-05-07T20:10:47.9353591Z + nm -gDCu ./_skbuild/linux-x86_64-3.12/cmake-build/fbgemm_gpu_config.so > /tmp/tmp.jgmuX0FLOr.usymbols.txt 2025-05-07T20:10:47.9354104Z 2025-05-07T20:10:47.9367628Z 2025-05-07T20:10:47.9394350Z [CHECK] Listing out undefined symbols (55 total): 2025-05-07T20:10:47.9412256Z U VTT for std::__cxx11::basic_ostringstream, std::allocator >@GLIBCXX_3.4.21 2025-05-07T20:10:47.9414257Z U _Unwind_Resume@GCC_3.0 2025-05-07T20:10:47.9415248Z U __cxa_atexit@GLIBC_2.2.5 2025-05-07T20:10:47.9416162Z U __cxa_begin_catch@CXXABI_1.3 2025-05-07T20:10:47.9416994Z U __cxa_end_catch@CXXABI_1.3 2025-05-07T20:10:47.9417347Z U __cxa_guard_acquire@CXXABI_1.3 2025-05-07T20:10:47.9417677Z U __cxa_guard_release@CXXABI_1.3 2025-05-07T20:10:47.9418019Z U __cxa_rethrow@CXXABI_1.3 2025-05-07T20:10:47.9418481Z U __errno_location@GLIBC_2.2.5 2025-05-07T20:10:47.9418833Z U __gxx_personality_v0@CXXABI_1.3 2025-05-07T20:10:47.9419191Z U c10::BoolType::get() 2025-05-07T20:10:47.9419499Z U c10::StringType::get() 2025-05-07T20:10:47.9419852Z U c10::UndefinedTensorImpl::_singleton 2025-05-07T20:10:47.9420653Z U c10::detail::infer_schema::make_function_schema(c10::ArrayRef, c10::ArrayRef) 2025-05-07T20:10:47.9421894Z U c10::detail::torchInternalAssertFail(char const*, char const*, unsigned int, char const*, std::__cxx11::basic_string, std::allocator > const&) 2025-05-07T20:10:47.9422723Z U getenv@GLIBC_2.2.5 2025-05-07T20:10:47.9423040Z U memcmp@GLIBC_2.2.5 2025-05-07T20:10:47.9423445Z U memcpy@GLIBC_2.14 2025-05-07T20:10:47.9423754Z U memmove@GLIBC_2.2.5 2025-05-07T20:10:47.9424048Z U memset@GLIBC_2.2.5 2025-05-07T20:10:47.9424375Z U operator delete(void*)@GLIBCXX_3.4 2025-05-07T20:10:47.9424823Z U operator new(unsigned long)@GLIBCXX_3.4 2025-05-07T20:10:47.9425237Z U std::_Rb_tree_decrement(std::_Rb_tree_node_base*)@GLIBCXX_3.4 2025-05-07T20:10:47.9425953Z U std::_Rb_tree_insert_and_rebalance(bool, std::_Rb_tree_node_base*, std::_Rb_tree_node_base*, std::_Rb_tree_node_base&)@GLIBCXX_3.4 2025-05-07T20:10:47.9426745Z U std::__cxx11::basic_ostringstream, std::allocator >::basic_ostringstream() 2025-05-07T20:10:47.9427559Z U std::__cxx11::basic_ostringstream, std::allocator >::~basic_ostringstream()@GLIBCXX_3.4.21 2025-05-07T20:10:47.9428199Z U std::__throw_bad_alloc()@GLIBCXX_3.4 2025-05-07T20:10:47.9428582Z U std::__throw_invalid_argument(char const*)@GLIBCXX_3.4 2025-05-07T20:10:47.9428994Z U std::__throw_length_error(char const*)@GLIBCXX_3.4 2025-05-07T20:10:47.9429368Z U std::__throw_logic_error(char const*)@GLIBCXX_3.4 2025-05-07T20:10:47.9429755Z U std::__throw_out_of_range(char const*)@GLIBCXX_3.4 2025-05-07T20:10:47.9430226Z U std::basic_ios >::clear(std::_Ios_Iostate)@GLIBCXX_3.4 2025-05-07T20:10:47.9431125Z U std::basic_ostream >& std::__ostream_insert >(std::basic_ostream >&, char const*, long)@GLIBCXX_3.4.9 2025-05-07T20:10:47.9432222Z U std::ios_base::Init::Init()@GLIBCXX_3.4 2025-05-07T20:10:47.9432593Z U std::ios_base::Init::~Init()@GLIBCXX_3.4 2025-05-07T20:10:47.9432975Z U std::ios_base::~ios_base()@GLIBCXX_3.4 2025-05-07T20:10:47.9433322Z U std::locale::~locale()@GLIBCXX_3.4 2025-05-07T20:10:47.9433673Z U std::terminate()@GLIBCXX_3.4 2025-05-07T20:10:47.9434010Z U strlen@GLIBC_2.2.5 2025-05-07T20:10:47.9434304Z U strtol@GLIBC_2.2.5 2025-05-07T20:10:47.9434640Z U torch::CppFunction::~CppFunction() 2025-05-07T20:10:47.9435518Z U torch::Library::Library(torch::Library::Kind, std::__cxx11::basic_string, std::allocator >, std::optional, char const*, unsigned int) 2025-05-07T20:10:47.9436757Z U torch::Library::_def(std::variant&&, torch::CppFunction&&, std::vector > const&) & 2025-05-07T20:10:47.9437904Z U torch::jit::parseSchemaOrName(std::__cxx11::basic_string, std::allocator > const&, bool) 2025-05-07T20:10:47.9438516Z U typeinfo for std::invalid_argument@GLIBCXX_3.4 2025-05-07T20:10:47.9438946Z U vtable for __cxxabiv1::__class_type_info@CXXABI_1.3 2025-05-07T20:10:47.9439362Z U vtable for __cxxabiv1::__function_type_info@CXXABI_1.3 2025-05-07T20:10:47.9439769Z U vtable for __cxxabiv1::__si_class_type_info@CXXABI_1.3 2025-05-07T20:10:47.9440356Z U vtable for std::__cxx11::basic_stringbuf, std::allocator >@GLIBCXX_3.4.21 2025-05-07T20:10:47.9440987Z U vtable for std::basic_streambuf >@GLIBCXX_3.4 2025-05-07T20:10:47.9441431Z w _ITM_deregisterTMCloneTable 2025-05-07T20:10:47.9441762Z w _ITM_registerTMCloneTable 2025-05-07T20:10:47.9442060Z w __cxa_finalize@GLIBC_2.2.5 2025-05-07T20:10:47.9442359Z w __gmon_start__ 2025-05-07T20:10:47.9442639Z w __pthread_key_create@GLIBC_2.2.5 2025-05-07T20:10:47.9443003Z [CHECK] Listing out external shared libraries linked: 2025-05-07T20:10:47.9443422Z + ldd ./_skbuild/linux-x86_64-3.12/cmake-build/fbgemm_gpu_config.so 2025-05-07T20:10:47.9443723Z 2025-05-07T20:10:47.9453747Z linux-vdso.so.1 (0x00007ffd6cf11000) 2025-05-07T20:10:47.9454727Z libtorch.so => not found 2025-05-07T20:10:47.9455508Z libc10.so => not found 2025-05-07T20:10:47.9456287Z libnvrtc.so.12 => not found 2025-05-07T20:10:47.9457060Z libc10_cuda.so => not found 2025-05-07T20:10:47.9458039Z libnccl.so.2 => not found 2025-05-07T20:10:47.9458797Z libcuda.so.1 => not found 2025-05-07T20:10:47.9459415Z libnvidia-ml.so.1 => not found 2025-05-07T20:10:47.9459705Z libtorch_cpu.so => not found 2025-05-07T20:10:47.9460002Z libtorch_cuda.so => not found 2025-05-07T20:10:47.9460362Z libstdc++.so.6 => /lib64/libstdc++.so.6 (0x00007f3d20bbe000) 2025-05-07T20:10:47.9461004Z libgomp.so.1 => /lib64/libgomp.so.1 (0x00007f3d20b68000) 2025-05-07T20:10:47.9461452Z libgcc_s.so.1 => /lib64/libgcc_s.so.1 (0x00007f3d20b38000) 2025-05-07T20:10:47.9461872Z libpthread.so.0 => /lib64/libpthread.so.0 (0x00007f3d20b33000) 2025-05-07T20:10:47.9462278Z libc.so.6 => /lib64/libc.so.6 (0x00007f3d2092b000) 2025-05-07T20:10:47.9462617Z libm.so.6 => /lib64/libm.so.6 (0x00007f3d20850000) 2025-05-07T20:10:47.9462976Z /lib64/ld-linux-x86-64.so.2 (0x00007f3d20e33000) 2025-05-07T20:10:47.9463206Z 2025-05-07T20:10:47.9463314Z [CHECK] Displaying ELF information: 2025-05-07T20:10:47.9463725Z + readelf -d ./_skbuild/linux-x86_64-3.12/cmake-build/fbgemm_gpu_config.so 2025-05-07T20:10:47.9464038Z 2025-05-07T20:10:47.9493342Z 2025-05-07T20:10:47.9493710Z Dynamic section at offset 0x8c98 contains 38 entries: 2025-05-07T20:10:47.9494114Z Tag Type Name/Value 2025-05-07T20:10:47.9494762Z 0x0000000000000001 (NEEDED) Shared library: [libtorch.so] 2025-05-07T20:10:47.9495295Z 0x0000000000000001 (NEEDED) Shared library: [libc10.so] 2025-05-07T20:10:47.9495822Z 0x0000000000000001 (NEEDED) Shared library: [libnvrtc.so.12] 2025-05-07T20:10:47.9496365Z 0x0000000000000001 (NEEDED) Shared library: [libc10_cuda.so] 2025-05-07T20:10:47.9496881Z 0x0000000000000001 (NEEDED) Shared library: [libnccl.so.2] 2025-05-07T20:10:47.9497415Z 0x0000000000000001 (NEEDED) Shared library: [libcuda.so.1] 2025-05-07T20:10:47.9498098Z 0x0000000000000001 (NEEDED) Shared library: [libnvidia-ml.so.1] 2025-05-07T20:10:47.9498652Z 0x0000000000000001 (NEEDED) Shared library: [libtorch_cpu.so] 2025-05-07T20:10:47.9499195Z 0x0000000000000001 (NEEDED) Shared library: [libtorch_cuda.so] 2025-05-07T20:10:47.9499724Z 0x0000000000000001 (NEEDED) Shared library: [libstdc++.so.6] 2025-05-07T20:10:47.9500361Z 0x0000000000000001 (NEEDED) Shared library: [libgomp.so.1] 2025-05-07T20:10:47.9500858Z 0x0000000000000001 (NEEDED) Shared library: [libgcc_s.so.1] 2025-05-07T20:10:47.9501483Z 0x0000000000000001 (NEEDED) Shared library: [libpthread.so.0] 2025-05-07T20:10:47.9502017Z 0x0000000000000001 (NEEDED) Shared library: [libc.so.6] 2025-05-07T20:10:47.9502517Z 0x000000000000000e (SONAME) Library soname: [fbgemm_gpu_config.so] 2025-05-07T20:10:47.9502949Z 0x000000000000000c (INIT) 0x4000 2025-05-07T20:10:47.9503259Z 0x000000000000000d (FINI) 0x6f80 2025-05-07T20:10:47.9503590Z 0x0000000000000019 (INIT_ARRAY) 0x9bb0 2025-05-07T20:10:47.9503913Z 0x000000000000001b (INIT_ARRAYSZ) 16 (bytes) 2025-05-07T20:10:47.9504261Z 0x000000000000001a (FINI_ARRAY) 0x9bc0 2025-05-07T20:10:47.9504583Z 0x000000000000001c (FINI_ARRAYSZ) 8 (bytes) 2025-05-07T20:10:47.9504924Z 0x000000006ffffef5 (GNU_HASH) 0x200 2025-05-07T20:10:47.9505248Z 0x0000000000000005 (STRTAB) 0xed0 2025-05-07T20:10:47.9505547Z 0x0000000000000006 (SYMTAB) 0x3d8 2025-05-07T20:10:47.9505908Z 0x000000000000000a (STRSZ) 7795 (bytes) 2025-05-07T20:10:47.9506247Z 0x000000000000000b (SYMENT) 24 (bytes) 2025-05-07T20:10:47.9506755Z 0x0000000000000003 (PLTGOT) 0x9fe8 2025-05-07T20:10:47.9507103Z 0x0000000000000002 (PLTRELSZ) 1632 (bytes) 2025-05-07T20:10:47.9507466Z 0x0000000000000014 (PLTREL) RELA 2025-05-07T20:10:47.9507799Z 0x0000000000000017 (JMPREL) 0x33a0 2025-05-07T20:10:47.9508174Z 0x0000000000000007 (RELA) 0x2ef0 2025-05-07T20:10:47.9508711Z 0x0000000000000008 (RELASZ) 1200 (bytes) 2025-05-07T20:10:47.9509075Z 0x0000000000000009 (RELAENT) 24 (bytes) 2025-05-07T20:10:47.9509447Z 0x000000006ffffffe (VERNEED) 0x2e30 2025-05-07T20:10:47.9509789Z 0x000000006fffffff (VERNEEDNUM) 4 2025-05-07T20:10:47.9510138Z 0x000000006ffffff0 (VERSYM) 0x2d44 2025-05-07T20:10:47.9510477Z 0x000000006ffffff9 (RELACOUNT) 4 2025-05-07T20:10:47.9512883Z 0x0000000000000000 (NULL) 0x0 2025-05-07T20:10:47.9513134Z 2025-05-07T20:10:47.9513273Z ################################################################################ 2025-05-07T20:10:47.9513505Z 2025-05-07T20:10:47.9513510Z 2025-05-07T20:10:47.9513627Z ################################################################################ 2025-05-07T20:10:47.9514087Z [CHECK] BUILT LIBRARY: ./_skbuild/linux-x86_64-3.12/cmake-build/fbgemm.so 2025-05-07T20:10:47.9514525Z [CHECK] Listing out library size: 2025-05-07T20:10:47.9514956Z + du -h --block-size=1M ./_skbuild/linux-x86_64-3.12/cmake-build/fbgemm.so 2025-05-07T20:10:47.9515278Z 2025-05-07T20:10:47.9515455Z 6 ./_skbuild/linux-x86_64-3.12/cmake-build/fbgemm.so 2025-05-07T20:10:47.9515703Z 2025-05-07T20:10:47.9516038Z [CHECK] Listing out the GLIBC versions referenced by: ./_skbuild/linux-x86_64-3.12/cmake-build/fbgemm.so 2025-05-07T20:10:47.9516934Z + objdump -TC ./_skbuild/linux-x86_64-3.12/cmake-build/fbgemm.so | grep GLIBC_ | sed 's/.*GLIBC_\([.0-9]*\).*/GLIBC_\1/g' | sort -Vu | cat 2025-05-07T20:10:47.9517475Z 2025-05-07T20:10:47.9777223Z GLIBC_2.2.5 2025-05-07T20:10:47.9777851Z GLIBC_2.3 2025-05-07T20:10:47.9778430Z GLIBC_2.14 2025-05-07T20:10:47.9778806Z 2025-05-07T20:10:47.9778821Z 2025-05-07T20:10:47.9779758Z [CHECK] Listing out the GLIBCXX versions referenced by: ./_skbuild/linux-x86_64-3.12/cmake-build/fbgemm.so 2025-05-07T20:10:47.9780813Z + objdump -TC ./_skbuild/linux-x86_64-3.12/cmake-build/fbgemm.so | grep GLIBCXX_ | sed 's/.*GLIBCXX_\([.0-9]*\).*/GLIBCXX_\1/g' | sort -Vu | cat 2025-05-07T20:10:47.9781614Z 2025-05-07T20:10:48.0046114Z GLIBCXX_3.4 2025-05-07T20:10:48.0046768Z GLIBCXX_3.4.9 2025-05-07T20:10:48.0047453Z GLIBCXX_3.4.11 2025-05-07T20:10:48.0048068Z GLIBCXX_3.4.14 2025-05-07T20:10:48.0048690Z GLIBCXX_3.4.15 2025-05-07T20:10:48.0049279Z GLIBCXX_3.4.18 2025-05-07T20:10:48.0049904Z GLIBCXX_3.4.21 2025-05-07T20:10:48.0050032Z 2025-05-07T20:10:48.0050037Z 2025-05-07T20:10:48.0067476Z + nm -gDC ./_skbuild/linux-x86_64-3.12/cmake-build/fbgemm.so > /tmp/tmp.Woh2f3ClLt.symbols.txt 2025-05-07T20:10:48.0068250Z 2025-05-07T20:10:48.0314362Z 2025-05-07T20:10:48.0348074Z [CHECK] Total Number of symbols: 4951 2025-05-07T20:10:48.0382286Z [CHECK] Number of fbgemm symbols: 3554 2025-05-07T20:10:48.0400859Z + nm -gDCu ./_skbuild/linux-x86_64-3.12/cmake-build/fbgemm.so > /tmp/tmp.2vY9Tt1QGm.usymbols.txt 2025-05-07T20:10:48.0402169Z 2025-05-07T20:10:48.0433008Z 2025-05-07T20:10:48.0466380Z [CHECK] Listing out undefined symbols (133 total): 2025-05-07T20:10:48.0477980Z U _Unwind_Resume@GCC_3.0 2025-05-07T20:10:48.0479084Z U __cxa_allocate_exception@CXXABI_1.3 2025-05-07T20:10:48.0480085Z U __cxa_atexit@GLIBC_2.2.5 2025-05-07T20:10:48.0480642Z U __cxa_begin_catch@CXXABI_1.3 2025-05-07T20:10:48.0480992Z U __cxa_end_catch@CXXABI_1.3 2025-05-07T20:10:48.0481323Z U __cxa_free_exception@CXXABI_1.3 2025-05-07T20:10:48.0481693Z U __cxa_guard_abort@CXXABI_1.3 2025-05-07T20:10:48.0482019Z U __cxa_guard_acquire@CXXABI_1.3 2025-05-07T20:10:48.0482374Z U __cxa_guard_release@CXXABI_1.3 2025-05-07T20:10:48.0482747Z U __cxa_init_primary_exception@CXXABI_1.3.11 2025-05-07T20:10:48.0483105Z U __cxa_rethrow@CXXABI_1.3 2025-05-07T20:10:48.0483440Z U __cxa_thread_atexit@CXXABI_1.3.7 2025-05-07T20:10:48.0483773Z U __cxa_throw@CXXABI_1.3 2025-05-07T20:10:48.0485072Z U __extendhfsf2 2025-05-07T20:10:48.0485389Z U __gxx_personality_v0@CXXABI_1.3 2025-05-07T20:10:48.0485750Z U __once_proxy@GLIBCXX_3.4.11 2025-05-07T20:10:48.0486074Z U __tls_get_addr@GLIBC_2.3 2025-05-07T20:10:48.0486388Z U __truncsfhf2 2025-05-07T20:10:48.0486679Z U abort@GLIBC_2.2.5 2025-05-07T20:10:48.0487230Z U asmjit::_abi_1_13::BaseEmitter::_emitI(unsigned int, asmjit::_abi_1_13::Operand_ const&) 2025-05-07T20:10:48.0488037Z U asmjit::_abi_1_13::BaseEmitter::_emitI(unsigned int, asmjit::_abi_1_13::Operand_ const&, asmjit::_abi_1_13::Operand_ const&) 2025-05-07T20:10:48.0489029Z U asmjit::_abi_1_13::BaseEmitter::_emitI(unsigned int, asmjit::_abi_1_13::Operand_ const&, asmjit::_abi_1_13::Operand_ const&, asmjit::_abi_1_13::Operand_ const&) 2025-05-07T20:10:48.0490252Z U asmjit::_abi_1_13::BaseEmitter::_emitI(unsigned int, asmjit::_abi_1_13::Operand_ const&, asmjit::_abi_1_13::Operand_ const&, asmjit::_abi_1_13::Operand_ const&, asmjit::_abi_1_13::Operand_ const&) 2025-05-07T20:10:48.0491421Z U asmjit::_abi_1_13::BaseEmitter::emitArgsAssignment(asmjit::_abi_1_13::FuncFrame const&, asmjit::_abi_1_13::FuncArgsAssignment const&) 2025-05-07T20:10:48.0492229Z U asmjit::_abi_1_13::BaseEmitter::emitEpilog(asmjit::_abi_1_13::FuncFrame const&) 2025-05-07T20:10:48.0492926Z U asmjit::_abi_1_13::BaseEmitter::emitProlog(asmjit::_abi_1_13::FuncFrame const&) 2025-05-07T20:10:48.0493526Z U asmjit::_abi_1_13::CodeHolder::CodeHolder(asmjit::_abi_1_13::Support::Temporary const*) 2025-05-07T20:10:48.0494133Z U asmjit::_abi_1_13::CodeHolder::init(asmjit::_abi_1_13::Environment const&, unsigned long) 2025-05-07T20:10:48.0494637Z U asmjit::_abi_1_13::CodeHolder::~CodeHolder() 2025-05-07T20:10:48.0495206Z U asmjit::_abi_1_13::FuncArgsAssignment::updateFuncFrame(asmjit::_abi_1_13::FuncFrame&) const 2025-05-07T20:10:48.0495918Z U asmjit::_abi_1_13::FuncDetail::init(asmjit::_abi_1_13::FuncSignature const&, asmjit::_abi_1_13::Environment const&) 2025-05-07T20:10:48.0496495Z U asmjit::_abi_1_13::FuncFrame::finalize() 2025-05-07T20:10:48.0496914Z U asmjit::_abi_1_13::FuncFrame::init(asmjit::_abi_1_13::FuncDetail const&) 2025-05-07T20:10:48.0497522Z U asmjit::_abi_1_13::JitRuntime::JitRuntime(asmjit::_abi_1_13::JitAllocator::CreateParams const*) 2025-05-07T20:10:48.0498113Z U asmjit::_abi_1_13::JitRuntime::~JitRuntime() 2025-05-07T20:10:48.0498548Z U asmjit::_abi_1_13::x86::Assembler::Assembler(asmjit::_abi_1_13::CodeHolder*) 2025-05-07T20:10:48.0499011Z U asmjit::_abi_1_13::x86::Assembler::~Assembler() 2025-05-07T20:10:48.0499341Z U bcmp@GLIBC_2.2.5 2025-05-07T20:10:48.0499631Z U ceilf@GLIBC_2.2.5 2025-05-07T20:10:48.0499927Z U cpuinfo_get_packages 2025-05-07T20:10:48.0500223Z U cpuinfo_get_packages_count 2025-05-07T20:10:48.0500534Z U cpuinfo_initialize 2025-05-07T20:10:48.0500808Z U cpuinfo_isa 2025-05-07T20:10:48.0501072Z U floor@GLIBC_2.2.5 2025-05-07T20:10:48.0501341Z U fma@GLIBC_2.2.5 2025-05-07T20:10:48.0501615Z U fmaf@GLIBC_2.2.5 2025-05-07T20:10:48.0501881Z U free@GLIBC_2.2.5 2025-05-07T20:10:48.0502161Z U fwrite@GLIBC_2.2.5 2025-05-07T20:10:48.0502436Z U getenv@GLIBC_2.2.5 2025-05-07T20:10:48.0502722Z U ldexp@GLIBC_2.2.5 2025-05-07T20:10:48.0503002Z U log2@GLIBC_2.2.5 2025-05-07T20:10:48.0503268Z U log2f@GLIBC_2.2.5 2025-05-07T20:10:48.0503555Z U lrintf@GLIBC_2.2.5 2025-05-07T20:10:48.0503830Z U memcpy@GLIBC_2.14 2025-05-07T20:10:48.0504148Z U memmove@GLIBC_2.2.5 2025-05-07T20:10:48.0504431Z U memset@GLIBC_2.2.5 2025-05-07T20:10:48.0504727Z U nearbyint@GLIBC_2.2.5 2025-05-07T20:10:48.0505014Z U nearbyintf@GLIBC_2.2.5 2025-05-07T20:10:48.0505333Z U operator delete(void*)@GLIBCXX_3.4 2025-05-07T20:10:48.0505677Z U operator delete[](void*)@GLIBCXX_3.4 2025-05-07T20:10:48.0506012Z U operator new(unsigned long)@GLIBCXX_3.4 2025-05-07T20:10:48.0506394Z U operator new[](unsigned long)@GLIBCXX_3.4 2025-05-07T20:10:48.0506733Z U posix_memalign@GLIBC_2.2.5 2025-05-07T20:10:48.0507040Z U sqrtf@GLIBC_2.2.5 2025-05-07T20:10:48.0507439Z U std::_Hash_bytes(void const*, unsigned long, unsigned long)@CXXABI_1.3.5 2025-05-07T20:10:48.0507932Z U std::_Rb_tree_decrement(std::_Rb_tree_node_base*)@GLIBCXX_3.4 2025-05-07T20:10:48.0508392Z U std::_Rb_tree_increment(std::_Rb_tree_node_base*)@GLIBCXX_3.4 2025-05-07T20:10:48.0509027Z U std::_Rb_tree_insert_and_rebalance(bool, std::_Rb_tree_node_base*, std::_Rb_tree_node_base*, std::_Rb_tree_node_base&)@GLIBCXX_3.4 2025-05-07T20:10:48.0509753Z U std::__atomic_futex_unsigned_base::_M_futex_notify_all(unsigned int*)@GLIBCXX_3.4.21 2025-05-07T20:10:48.0510754Z U std::__atomic_futex_unsigned_base::_M_futex_wait_until(unsigned int*, unsigned int, bool, std::chrono::duration >, std::chrono::duration >)@GLIBCXX_3.4.21 2025-05-07T20:10:48.0512165Z U std::__detail::_Prime_rehash_policy::_M_need_rehash(unsigned long, unsigned long, unsigned long) const@GLIBCXX_3.4.18 2025-05-07T20:10:48.0512935Z U std::__detail::_Prime_rehash_policy::_M_next_bkt(unsigned long) const@GLIBCXX_3.4.18 2025-05-07T20:10:48.0513519Z U std::__exception_ptr::exception_ptr::_M_addref() 2025-05-07T20:10:48.0513934Z U std::__exception_ptr::exception_ptr::_M_release() 2025-05-07T20:10:48.0514439Z U std::__exception_ptr::exception_ptr::exception_ptr(void*)@CXXABI_1.3.11 2025-05-07T20:10:48.0514973Z U std::__future_base::_Result_base::_Result_base()@GLIBCXX_3.4.15 2025-05-07T20:10:48.0515671Z U std::__future_base::_Result_base::~_Result_base()@GLIBCXX_3.4.15 2025-05-07T20:10:48.0516107Z U std::__once_call@GLIBCXX_3.4.11 2025-05-07T20:10:48.0516457Z U std::__once_callable@GLIBCXX_3.4.11 2025-05-07T20:10:48.0516855Z U std::__throw_bad_alloc()@GLIBCXX_3.4 2025-05-07T20:10:48.0517206Z U std::__throw_bad_array_new_length() 2025-05-07T20:10:48.0517564Z U std::__throw_bad_cast()@GLIBCXX_3.4 2025-05-07T20:10:48.0517930Z U std::__throw_bad_function_call()@GLIBCXX_3.4.14 2025-05-07T20:10:48.0518347Z U std::__throw_future_error(int)@GLIBCXX_3.4.14 2025-05-07T20:10:48.0518762Z U std::__throw_length_error(char const*)@GLIBCXX_3.4 2025-05-07T20:10:48.0519156Z U std::__throw_system_error(int)@GLIBCXX_3.4.11 2025-05-07T20:10:48.0519554Z U std::bad_alloc::~bad_alloc()@GLIBCXX_3.4 2025-05-07T20:10:48.0520395Z U std::basic_ostream >& std::__ostream_insert >(std::basic_ostream >&, char const*, long)@GLIBCXX_3.4.9 2025-05-07T20:10:48.0521225Z U std::cerr@GLIBCXX_3.4 2025-05-07T20:10:48.0521551Z U std::cout@GLIBCXX_3.4 2025-05-07T20:10:48.0521914Z U std::ctype::_M_widen_init() const@GLIBCXX_3.4.11 2025-05-07T20:10:48.0522331Z U std::future_category()@GLIBCXX_3.4.15 2025-05-07T20:10:48.0522707Z U std::future_error::~future_error()@GLIBCXX_3.4.14 2025-05-07T20:10:48.0523130Z U std::ios_base::Init::Init()@GLIBCXX_3.4 2025-05-07T20:10:48.0523511Z U std::ios_base::Init::~Init()@GLIBCXX_3.4 2025-05-07T20:10:48.0524264Z U std::logic_error::logic_error(std::__cxx11::basic_string, std::allocator > const&)@GLIBCXX_3.4.21 2025-05-07T20:10:48.0525191Z U std::logic_error::logic_error(std::logic_error const&)@GLIBCXX_3.4.21 2025-05-07T20:10:48.0525919Z U std::ostream& std::ostream::_M_insert(double)@GLIBCXX_3.4.9 2025-05-07T20:10:48.0526427Z U std::ostream& std::ostream::_M_insert(long)@GLIBCXX_3.4.9 2025-05-07T20:10:48.0526991Z U std::ostream& std::ostream::_M_insert(unsigned long)@GLIBCXX_3.4.9 2025-05-07T20:10:48.0527471Z U std::ostream::flush()@GLIBCXX_3.4 2025-05-07T20:10:48.0527845Z U std::ostream::operator<<(int)@GLIBCXX_3.4 2025-05-07T20:10:48.0528227Z U std::ostream::put(char)@GLIBCXX_3.4 2025-05-07T20:10:48.0528686Z U std::rethrow_exception(std::__exception_ptr::exception_ptr)@CXXABI_1.3.3 2025-05-07T20:10:48.0529241Z U std::runtime_error::runtime_error(char const*)@GLIBCXX_3.4.21 2025-05-07T20:10:48.0529686Z U std::runtime_error::~runtime_error()@GLIBCXX_3.4 2025-05-07T20:10:48.0530068Z U std::terminate()@GLIBCXX_3.4 2025-05-07T20:10:48.0530393Z U stderr@GLIBC_2.2.5 2025-05-07T20:10:48.0530705Z U strcmp@GLIBC_2.2.5 2025-05-07T20:10:48.0531015Z U strlen@GLIBC_2.2.5 2025-05-07T20:10:48.0531305Z U strstr@GLIBC_2.2.5 2025-05-07T20:10:48.0531610Z U tolower@GLIBC_2.2.5 2025-05-07T20:10:48.0531907Z U toupper@GLIBC_2.2.5 2025-05-07T20:10:48.0532304Z U typeinfo for std::__future_base::_Result_base@GLIBCXX_3.4.15 2025-05-07T20:10:48.0532765Z U typeinfo for std::bad_alloc@GLIBCXX_3.4 2025-05-07T20:10:48.0533162Z U typeinfo for std::future_error@GLIBCXX_3.4.14 2025-05-07T20:10:48.0533570Z U typeinfo for std::runtime_error@GLIBCXX_3.4 2025-05-07T20:10:48.0533974Z U vtable for __cxxabiv1::__class_type_info@CXXABI_1.3 2025-05-07T20:10:48.0534423Z U vtable for __cxxabiv1::__si_class_type_info@CXXABI_1.3 2025-05-07T20:10:48.0534832Z U vtable for std::bad_alloc@GLIBCXX_3.4 2025-05-07T20:10:48.0535222Z U vtable for std::future_error@GLIBCXX_3.4.14 2025-05-07T20:10:48.0535615Z w _ITM_deregisterTMCloneTable 2025-05-07T20:10:48.0535961Z w _ITM_registerTMCloneTable 2025-05-07T20:10:48.0536295Z w __cxa_finalize@GLIBC_2.2.5 2025-05-07T20:10:48.0536604Z w __gmon_start__ 2025-05-07T20:10:48.0536904Z w __pthread_key_create 2025-05-07T20:10:48.0537222Z w pthread_mutex_lock@GLIBC_2.2.5 2025-05-07T20:10:48.0537577Z w pthread_mutex_unlock@GLIBC_2.2.5 2025-05-07T20:10:48.0537899Z w pthread_once 2025-05-07T20:10:48.0538192Z w pthread_rwlock_rdlock 2025-05-07T20:10:48.0538500Z w pthread_rwlock_unlock 2025-05-07T20:10:48.0538819Z w pthread_rwlock_wrlock 2025-05-07T20:10:48.0539143Z w pthread_self@GLIBC_2.2.5 2025-05-07T20:10:48.0539503Z [CHECK] Listing out external shared libraries linked: 2025-05-07T20:10:48.0539934Z + ldd ./_skbuild/linux-x86_64-3.12/cmake-build/fbgemm.so 2025-05-07T20:10:48.0540311Z 2025-05-07T20:10:48.0540441Z linux-vdso.so.1 (0x00007ffc54db9000) 2025-05-07T20:10:48.0540746Z libc10.so => not found 2025-05-07T20:10:48.0540991Z libnvrtc.so.12 => not found 2025-05-07T20:10:48.0541270Z libc10_cuda.so => not found 2025-05-07T20:10:48.0541547Z libnccl.so.2 => not found 2025-05-07T20:10:48.0541799Z libcuda.so.1 => not found 2025-05-07T20:10:48.0542375Z asmjit.so => /__w/FBGEMM/FBGEMM/fbgemm_gpu/./_skbuild/linux-x86_64-3.12/cmake-build/asmjit.so (0x00007f6b150a2000) 2025-05-07T20:10:48.0542952Z libnvidia-ml.so.1 => not found 2025-05-07T20:10:48.0543241Z libtorch.so => not found 2025-05-07T20:10:48.0543499Z libtorch_cpu.so => not found 2025-05-07T20:10:48.0543966Z libtorch_cuda.so => not found 2025-05-07T20:10:48.0544310Z libstdc++.so.6 => /lib64/libstdc++.so.6 (0x00007f6b1479c000) 2025-05-07T20:10:48.0544843Z libm.so.6 => /lib64/libm.so.6 (0x00007f6b14fc5000) 2025-05-07T20:10:48.0545264Z libgcc_s.so.1 => /lib64/libgcc_s.so.1 (0x00007f6b14f97000) 2025-05-07T20:10:48.0545646Z libc.so.6 => /lib64/libc.so.6 (0x00007f6b14594000) 2025-05-07T20:10:48.0546018Z /lib64/ld-linux-x86-64.so.2 (0x00007f6b1511f000) 2025-05-07T20:10:48.0546345Z libtorch.so => not found 2025-05-07T20:10:48.0546612Z libc10.so => not found 2025-05-07T20:10:48.0546863Z libnvrtc.so.12 => not found 2025-05-07T20:10:48.0547149Z libc10_cuda.so => not found 2025-05-07T20:10:48.0547413Z libnccl.so.2 => not found 2025-05-07T20:10:48.0547682Z libcuda.so.1 => not found 2025-05-07T20:10:48.0547943Z libnvidia-ml.so.1 => not found 2025-05-07T20:10:48.0548234Z libtorch_cpu.so => not found 2025-05-07T20:10:48.0548518Z libtorch_cuda.so => not found 2025-05-07T20:10:48.0548842Z libgomp.so.1 => /lib64/libgomp.so.1 (0x00007f6b1453e000) 2025-05-07T20:10:48.0549246Z librt.so.1 => /lib64/librt.so.1 (0x00007f6b14f8e000) 2025-05-07T20:10:48.0549659Z libpthread.so.0 => /lib64/libpthread.so.0 (0x00007f6b14f89000) 2025-05-07T20:10:48.0549959Z 2025-05-07T20:10:48.0550075Z [CHECK] Displaying ELF information: 2025-05-07T20:10:48.0550456Z + readelf -d ./_skbuild/linux-x86_64-3.12/cmake-build/fbgemm.so 2025-05-07T20:10:48.0550755Z 2025-05-07T20:10:48.0566263Z 2025-05-07T20:10:48.0566983Z Dynamic section at offset 0x54d6c8 contains 40 entries: 2025-05-07T20:10:48.0568249Z Tag Type Name/Value 2025-05-07T20:10:48.0568920Z 0x0000000000000001 (NEEDED) Shared library: [libc10.so] 2025-05-07T20:10:48.0569678Z 0x0000000000000001 (NEEDED) Shared library: [libnvrtc.so.12] 2025-05-07T20:10:48.0570256Z 0x0000000000000001 (NEEDED) Shared library: [libc10_cuda.so] 2025-05-07T20:10:48.0570800Z 0x0000000000000001 (NEEDED) Shared library: [libnccl.so.2] 2025-05-07T20:10:48.0571369Z 0x0000000000000001 (NEEDED) Shared library: [libcuda.so.1] 2025-05-07T20:10:48.0571903Z 0x0000000000000001 (NEEDED) Shared library: [asmjit.so] 2025-05-07T20:10:48.0572484Z 0x0000000000000001 (NEEDED) Shared library: [libnvidia-ml.so.1] 2025-05-07T20:10:48.0573098Z 0x0000000000000001 (NEEDED) Shared library: [libtorch.so] 2025-05-07T20:10:48.0573668Z 0x0000000000000001 (NEEDED) Shared library: [libtorch_cpu.so] 2025-05-07T20:10:48.0574252Z 0x0000000000000001 (NEEDED) Shared library: [libtorch_cuda.so] 2025-05-07T20:10:48.0574813Z 0x0000000000000001 (NEEDED) Shared library: [libstdc++.so.6] 2025-05-07T20:10:48.0575376Z 0x0000000000000001 (NEEDED) Shared library: [libm.so.6] 2025-05-07T20:10:48.0575890Z 0x0000000000000001 (NEEDED) Shared library: [libgcc_s.so.1] 2025-05-07T20:10:48.0576420Z 0x0000000000000001 (NEEDED) Shared library: [libc.so.6] 2025-05-07T20:10:48.0576948Z 0x0000000000000001 (NEEDED) Shared library: [ld-linux-x86-64.so.2] 2025-05-07T20:10:48.0577501Z 0x000000000000000e (SONAME) Library soname: [fbgemm.so] 2025-05-07T20:10:48.0578019Z 0x000000000000000f (RPATH) Library rpath: [$ORIGIN] 2025-05-07T20:10:48.0578440Z 0x000000000000000c (INIT) 0xff000 2025-05-07T20:10:48.0578805Z 0x000000000000000d (FINI) 0x4c1c58 2025-05-07T20:10:48.0579153Z 0x0000000000000019 (INIT_ARRAY) 0x54a1c0 2025-05-07T20:10:48.0579537Z 0x000000000000001b (INIT_ARRAYSZ) 1224 (bytes) 2025-05-07T20:10:48.0579898Z 0x000000000000001a (FINI_ARRAY) 0x54a688 2025-05-07T20:10:48.0580322Z 0x000000000000001c (FINI_ARRAYSZ) 8 (bytes) 2025-05-07T20:10:48.0580801Z 0x000000006ffffef5 (GNU_HASH) 0x238 2025-05-07T20:10:48.0581165Z 0x0000000000000005 (STRTAB) 0x26de0 2025-05-07T20:10:48.0581701Z 0x0000000000000006 (SYMTAB) 0x9da0 2025-05-07T20:10:48.0582075Z 0x000000000000000a (STRSZ) 754246 (bytes) 2025-05-07T20:10:48.0582485Z 0x000000000000000b (SYMENT) 24 (bytes) 2025-05-07T20:10:48.0582857Z 0x0000000000000003 (PLTGOT) 0x551fe8 2025-05-07T20:10:48.0583290Z 0x0000000000000002 (PLTRELSZ) 25992 (bytes) 2025-05-07T20:10:48.0583666Z 0x0000000000000014 (PLTREL) RELA 2025-05-07T20:10:48.0584038Z 0x0000000000000017 (JMPREL) 0xf8458 2025-05-07T20:10:48.0584417Z 0x0000000000000007 (RELA) 0xe1838 2025-05-07T20:10:48.0584791Z 0x0000000000000008 (RELASZ) 93216 (bytes) 2025-05-07T20:10:48.0585203Z 0x0000000000000009 (RELAENT) 24 (bytes) 2025-05-07T20:10:48.0585579Z 0x000000006ffffffe (VERNEED) 0xe16d8 2025-05-07T20:10:48.0585973Z 0x000000006fffffff (VERNEEDNUM) 5 2025-05-07T20:10:48.0586325Z 0x000000006ffffff0 (VERSYM) 0xdf026 2025-05-07T20:10:48.0586705Z 0x000000006ffffff9 (RELACOUNT) 155 2025-05-07T20:10:48.0587034Z 0x0000000000000000 (NULL) 0x0 2025-05-07T20:10:48.0587272Z 2025-05-07T20:10:48.0587395Z ################################################################################ 2025-05-07T20:10:48.0587637Z 2025-05-07T20:10:48.0587643Z 2025-05-07T20:10:48.0587786Z ################################################################################ 2025-05-07T20:10:48.0588303Z [CHECK] BUILT LIBRARY: ./_skbuild/linux-x86_64-3.12/cmake-build/fbgemm_gpu_tbe_common.so 2025-05-07T20:10:48.0588809Z [CHECK] Listing out library size: 2025-05-07T20:10:48.0589272Z + du -h --block-size=1M ./_skbuild/linux-x86_64-3.12/cmake-build/fbgemm_gpu_tbe_common.so 2025-05-07T20:10:48.0589697Z 2025-05-07T20:10:48.0589916Z 3 ./_skbuild/linux-x86_64-3.12/cmake-build/fbgemm_gpu_tbe_common.so 2025-05-07T20:10:48.0590227Z 2025-05-07T20:10:48.0590640Z [CHECK] Listing out the GLIBC versions referenced by: ./_skbuild/linux-x86_64-3.12/cmake-build/fbgemm_gpu_tbe_common.so 2025-05-07T20:10:48.0591740Z + objdump -TC ./_skbuild/linux-x86_64-3.12/cmake-build/fbgemm_gpu_tbe_common.so | grep GLIBC_ | sed 's/.*GLIBC_\([.0-9]*\).*/GLIBC_\1/g' | sort -Vu | cat 2025-05-07T20:10:48.0592353Z 2025-05-07T20:10:48.0637854Z GLIBC_2.2.5 2025-05-07T20:10:48.0638525Z GLIBC_2.14 2025-05-07T20:10:48.0639165Z 2025-05-07T20:10:48.0639381Z 2025-05-07T20:10:48.0641386Z [CHECK] Listing out the GLIBCXX versions referenced by: ./_skbuild/linux-x86_64-3.12/cmake-build/fbgemm_gpu_tbe_common.so 2025-05-07T20:10:48.0644470Z + objdump -TC ./_skbuild/linux-x86_64-3.12/cmake-build/fbgemm_gpu_tbe_common.so | grep GLIBCXX_ | sed 's/.*GLIBCXX_\([.0-9]*\).*/GLIBCXX_\1/g' | sort -Vu | cat 2025-05-07T20:10:48.0645905Z 2025-05-07T20:10:48.0703482Z GLIBCXX_3.4 2025-05-07T20:10:48.0703859Z GLIBCXX_3.4.9 2025-05-07T20:10:48.0704389Z GLIBCXX_3.4.14 2025-05-07T20:10:48.0704614Z GLIBCXX_3.4.20 2025-05-07T20:10:48.0704821Z GLIBCXX_3.4.21 2025-05-07T20:10:48.0704961Z 2025-05-07T20:10:48.0704966Z 2025-05-07T20:10:48.0720921Z + nm -gDC ./_skbuild/linux-x86_64-3.12/cmake-build/fbgemm_gpu_tbe_common.so > /tmp/tmp.9lQplR203H.symbols.txt 2025-05-07T20:10:48.0722350Z 2025-05-07T20:10:48.0751424Z 2025-05-07T20:10:48.0786727Z [CHECK] Total Number of symbols: 550 2025-05-07T20:10:48.0799727Z [CHECK] Number of fbgemm symbols: 48 2025-05-07T20:10:48.0820831Z + nm -gDCu ./_skbuild/linux-x86_64-3.12/cmake-build/fbgemm_gpu_tbe_common.so > /tmp/tmp.eiIxIXmB0l.usymbols.txt 2025-05-07T20:10:48.0822391Z 2025-05-07T20:10:48.0839077Z 2025-05-07T20:10:48.0867547Z [CHECK] Listing out undefined symbols (179 total): 2025-05-07T20:10:48.0887203Z U VTT for std::__cxx11::basic_ostringstream, std::allocator >@GLIBCXX_3.4.21 2025-05-07T20:10:48.0888150Z U _Unwind_Resume@GCC_3.0 2025-05-07T20:10:48.0888561Z U __cudaPopCallConfiguration@libcudart.so.12 2025-05-07T20:10:48.0888987Z U __cudaPushCallConfiguration@libcudart.so.12 2025-05-07T20:10:48.0889441Z U __cudaRegisterFatBinary@libcudart.so.12 2025-05-07T20:10:48.0889864Z U __cudaRegisterFatBinaryEnd@libcudart.so.12 2025-05-07T20:10:48.0890256Z U __cudaRegisterFunction@libcudart.so.12 2025-05-07T20:10:48.0890849Z U __cudaRegisterVar@libcudart.so.12 2025-05-07T20:10:48.0891234Z U __cudaUnregisterFatBinary@libcudart.so.12 2025-05-07T20:10:48.0891631Z U __cxa_allocate_exception@CXXABI_1.3 2025-05-07T20:10:48.0891964Z U __cxa_atexit@GLIBC_2.2.5 2025-05-07T20:10:48.0892314Z U __cxa_begin_catch@CXXABI_1.3 2025-05-07T20:10:48.0892668Z U __cxa_end_catch@CXXABI_1.3 2025-05-07T20:10:48.0893000Z U __cxa_free_exception@CXXABI_1.3 2025-05-07T20:10:48.0893360Z U __cxa_guard_abort@CXXABI_1.3 2025-05-07T20:10:48.0893692Z U __cxa_guard_acquire@CXXABI_1.3 2025-05-07T20:10:48.0894051Z U __cxa_guard_release@CXXABI_1.3 2025-05-07T20:10:48.0894386Z U __cxa_rethrow@CXXABI_1.3 2025-05-07T20:10:48.0894725Z U __cxa_throw@CXXABI_1.3 2025-05-07T20:10:48.0895046Z U __gxx_personality_v0@CXXABI_1.3 2025-05-07T20:10:48.0895662Z U at::Tensor::index_put_(std::initializer_list, at::Tensor const&) 2025-05-07T20:10:48.0896388Z U at::TensorBase::__dispatch_contiguous(c10::MemoryFormat) const 2025-05-07T20:10:48.0896836Z U at::_ops::copy_::call(at::Tensor&, at::Tensor const&, bool) 2025-05-07T20:10:48.0897780Z U at::_ops::empty_memory_format::call(c10::ArrayRef, std::optional, std::optional, std::optional, std::optional, std::optional) 2025-05-07T20:10:48.0898764Z U at::_ops::is_nonzero::call(at::Tensor const&) 2025-05-07T20:10:48.0899224Z U at::_ops::reshape::call(at::Tensor const&, c10::ArrayRef) 2025-05-07T20:10:48.0899702Z U at::_ops::select_int::call(at::Tensor const&, long, c10::SymInt) 2025-05-07T20:10:48.0900362Z U at::_ops::slice_Tensor::call(at::Tensor const&, long, std::optional, std::optional, c10::SymInt) 2025-05-07T20:10:48.0901520Z U at::_ops::to_dtype_layout::call(at::Tensor const&, std::optional, std::optional, std::optional, std::optional, bool, bool, std::optional) 2025-05-07T20:10:48.0902351Z U at::_ops::zero_::call(at::Tensor&) 2025-05-07T20:10:48.0903161Z U at::_ops::zeros_like::call(at::Tensor const&, std::optional, std::optional, std::optional, std::optional, std::optional) 2025-05-07T20:10:48.0903982Z U at::cuda::getCurrentDeviceProperties() 2025-05-07T20:10:48.0904329Z U at::get_num_threads() 2025-05-07T20:10:48.0904670Z U at::get_thread_num() 2025-05-07T20:10:48.0904985Z U at::internal::set_thread_num(int) 2025-05-07T20:10:48.0905373Z U at::toAccumulateType(c10::ScalarType, bool) 2025-05-07T20:10:48.0905720Z U c10::BoolType::get() 2025-05-07T20:10:48.0906107Z U c10::DeviceTypeName[abi:cxx11](c10::DeviceType, bool) 2025-05-07T20:10:48.0906762Z U c10::Error::Error(c10::SourceLocation, std::__cxx11::basic_string, std::allocator >) 2025-05-07T20:10:48.0907336Z U c10::Error::what() const 2025-05-07T20:10:48.0907725Z U c10::Half* at::TensorBase::data_ptr() const 2025-05-07T20:10:48.0908191Z U c10::Half* at::TensorBase::mutable_data_ptr() const 2025-05-07T20:10:48.0908651Z U c10::IValue::reportToTensorTypeError() const 2025-05-07T20:10:48.0909032Z U c10::IntType::get() 2025-05-07T20:10:48.0909394Z U c10::MessageLogger::MessageLogger(char const*, int, int) 2025-05-07T20:10:48.0909822Z U c10::MessageLogger::~MessageLogger() 2025-05-07T20:10:48.0910294Z U c10::OptionalType::get(c10::Type::SingletonOrSharedTypePtr) 2025-05-07T20:10:48.0910805Z U c10::ParallelGuard::ParallelGuard(bool) 2025-05-07T20:10:48.0911284Z U c10::ParallelGuard::~ParallelGuard() 2025-05-07T20:10:48.0911848Z U c10::SymBool::expect_true(char const*, long) const 2025-05-07T20:10:48.0912372Z U c10::SymBool::guard_bool(char const*, long) const 2025-05-07T20:10:48.0913074Z U c10::SymInt::SymInt(c10::intrusive_ptr >) 2025-05-07T20:10:48.0913793Z U c10::SymInt::guard_int(char const*, long) const 2025-05-07T20:10:48.0914243Z U c10::SymInt::operator/(c10::SymInt const&) const 2025-05-07T20:10:48.0914634Z U c10::SymInt::promote_to_negative() 2025-05-07T20:10:48.0915029Z U c10::SymInt::sym_ne(c10::SymInt const&) const 2025-05-07T20:10:48.0915401Z U c10::SymInt::toSymNode() const 2025-05-07T20:10:48.0915763Z U c10::SymIntType::get() 2025-05-07T20:10:48.0916144Z U c10::SymbolicShapeMeta::init_is_contiguous() const 2025-05-07T20:10:48.0916555Z U c10::TensorType::get() 2025-05-07T20:10:48.0916927Z U c10::UndefinedTensorImpl::_singleton 2025-05-07T20:10:48.0917999Z U c10::Warning::Warning(std::variant, c10::SourceLocation const&, std::__cxx11::basic_string, std::allocator >, bool) 2025-05-07T20:10:48.0918980Z U c10::cuda::CUDACachingAllocator::allocator 2025-05-07T20:10:48.0919365Z U c10::cuda::CUDAKernelLaunchRegistry::get_singleton_ref() 2025-05-07T20:10:48.0919889Z U c10::cuda::CUDAKernelLaunchRegistry::get_uvm_assertions_ptr_for_current_device() 2025-05-07T20:10:48.0920584Z U c10::cuda::CUDAKernelLaunchRegistry::insert(char const*, char const*, unsigned int, char const*, int) 2025-05-07T20:10:48.0921149Z U c10::cuda::CUDAStream::stream() const 2025-05-07T20:10:48.0921496Z U c10::cuda::ExchangeDevice(signed char) 2025-05-07T20:10:48.0921821Z U c10::cuda::GetDevice(signed char*) 2025-05-07T20:10:48.0922304Z U c10::cuda::MaybeSetDevice(signed char) 2025-05-07T20:10:48.0922646Z U c10::cuda::SetDevice(signed char) 2025-05-07T20:10:48.0923109Z U c10::cuda::c10_cuda_check_implementation(int, char const*, char const*, int, bool) 2025-05-07T20:10:48.0923587Z U c10::cuda::device_count() 2025-05-07T20:10:48.0923928Z U c10::cuda::getCurrentCUDAStream(signed char) 2025-05-07T20:10:48.0924332Z U c10::cuda::getDefaultCUDAStream(signed char) 2025-05-07T20:10:48.0924739Z U c10::cuda::getStreamFromPool(bool, signed char) 2025-05-07T20:10:48.0925118Z U c10::cuda::getStreamFromPool(int, signed char) 2025-05-07T20:10:48.0925545Z U c10::cuda::setCurrentCUDAStream(c10::cuda::CUDAStream) 2025-05-07T20:10:48.0925927Z U c10::cuda::warn_or_error_on_sync() 2025-05-07T20:10:48.0926666Z U c10::detail::infer_schema::make_function_schema(c10::ArrayRef, c10::ArrayRef) 2025-05-07T20:10:48.0927558Z U c10::detail::torchCheckFail(char const*, char const*, unsigned int, char const*) 2025-05-07T20:10:48.0928388Z U c10::detail::torchCheckFail(char const*, char const*, unsigned int, std::__cxx11::basic_string, std::allocator > const&) 2025-05-07T20:10:48.0929303Z U c10::detail::torchInternalAssertFail(char const*, char const*, unsigned int, char const*, char const*) 2025-05-07T20:10:48.0930329Z U c10::detail::torchInternalAssertFail(char const*, char const*, unsigned int, char const*, std::__cxx11::basic_string, std::allocator > const&) 2025-05-07T20:10:48.0931100Z U c10::impl::GPUTrace::gpuTraceState 2025-05-07T20:10:48.0931443Z U c10::impl::GPUTrace::haveState 2025-05-07T20:10:48.0931776Z U c10::impl::device_guard_impl_registry 2025-05-07T20:10:48.0932142Z U c10::operator-(c10::SymInt const&, int) 2025-05-07T20:10:48.0932514Z U c10::operator<(c10::SymInt const&, int) 2025-05-07T20:10:48.0932877Z U c10::operator<<(std::ostream&, c10::Device const&) 2025-05-07T20:10:48.0933281Z U c10::operator<<(std::ostream&, c10::DeviceType) 2025-05-07T20:10:48.0933658Z U c10::operator<<(std::ostream&, c10::SymInt const&) 2025-05-07T20:10:48.0934053Z U c10::operator>(c10::SymInt const&, int) 2025-05-07T20:10:48.0934400Z U c10::operator>=(c10::SymInt const&, int) 2025-05-07T20:10:48.0934761Z U c10::warn(c10::Warning const&) 2025-05-07T20:10:48.0935185Z U caffe2::TypeMeta::error_unsupported_typemeta(caffe2::TypeMeta) 2025-05-07T20:10:48.0935612Z U cudaDeviceSynchronize@libcudart.so.12 2025-05-07T20:10:48.0935997Z U cudaEventCreateWithFlags@libcudart.so.12 2025-05-07T20:10:48.0936355Z U cudaEventDestroy@libcudart.so.12 2025-05-07T20:10:48.0936753Z U cudaEventElapsedTime@libcudart.so.12 2025-05-07T20:10:48.0937094Z U cudaEventQuery@libcudart.so.12 2025-05-07T20:10:48.0937447Z U cudaEventRecord@libcudart.so.12 2025-05-07T20:10:48.0937810Z U cudaEventSynchronize@libcudart.so.12 2025-05-07T20:10:48.0938157Z U cudaGetErrorString@libcudart.so.12 2025-05-07T20:10:48.0938519Z U cudaGetLastError@libcudart.so.12 2025-05-07T20:10:48.0938854Z U cudaLaunchKernel@libcudart.so.12 2025-05-07T20:10:48.0939209Z U cudaStreamQuery@libcudart.so.12 2025-05-07T20:10:48.0939589Z U cudaStreamSynchronize@libcudart.so.12 2025-05-07T20:10:48.0939965Z U cudaStreamWaitEvent@libcudart.so.12 2025-05-07T20:10:48.0940937Z U fbgemm::EmbeddingSpMDMKernelSignature::Type fbgemm::GenerateEmbeddingSpMDMWithStrides(long, bool, bool, int, bool, bool, long, long, bool, bool, bool, bool) 2025-05-07T20:10:48.0942504Z U fbgemm::EmbeddingSpMDMKernelSignature::Type fbgemm::GenerateEmbeddingSpMDMWithStrides(long, bool, bool, int, bool, bool, long, long, bool, bool, bool, bool) 2025-05-07T20:10:48.0944090Z U fbgemm::EmbeddingSpMDMKernelSignature::Type fbgemm::GenerateEmbeddingSpMDMWithStrides(long, bool, bool, int, bool, bool, long, long, bool, bool, bool, bool) 2025-05-07T20:10:48.0945673Z U fbgemm::EmbeddingSpMDMKernelSignature::Type fbgemm::GenerateEmbeddingSpMDMWithStrides(long, bool, bool, int, bool, bool, long, long, bool, bool, bool, bool) 2025-05-07T20:10:48.0947345Z U fbgemm::EmbeddingSpMDMKernelSignature::Type fbgemm::GenerateEmbeddingSpMDMWithStrides(long, bool, bool, int, bool, bool, long, long, bool, bool, bool, bool) 2025-05-07T20:10:48.0949018Z U fbgemm::EmbeddingSpMDMKernelSignature::Type fbgemm::GenerateEmbeddingSpMDMWithStrides(long, bool, bool, int, bool, bool, long, long, bool, bool, bool, bool) 2025-05-07T20:10:48.0950816Z U fbgemm::EmbeddingSpMDMKernelSignature::Type fbgemm::GenerateEmbeddingSpMDMWithStrides(long, bool, bool, int, bool, bool, long, long, bool, bool, bool, bool) 2025-05-07T20:10:48.0952889Z U fbgemm::EmbeddingSpMDMKernelSignature::Type fbgemm::GenerateEmbeddingSpMDMWithStrides(long, bool, bool, int, bool, bool, long, long, bool, bool, bool, bool) 2025-05-07T20:10:48.0954755Z U fbgemm::EmbeddingSpMDMKernelSignature::Type fbgemm::GenerateEmbeddingSpMDMWithStrides(long, bool, bool, int, bool, bool, long, long, bool, bool, bool, bool) 2025-05-07T20:10:48.0956588Z U fbgemm::EmbeddingSpMDMKernelSignature::Type fbgemm::GenerateEmbeddingSpMDMWithStrides(long, bool, bool, int, bool, bool, long, long, bool, bool, bool, bool) 2025-05-07T20:10:48.0958507Z U fbgemm::EmbeddingSpMDMKernelSignature::Type fbgemm::GenerateEmbeddingSpMDMWithStrides(long, bool, bool, int, bool, bool, long, long, bool, bool, bool, bool) 2025-05-07T20:10:48.0960232Z U fbgemm::EmbeddingSpMDMKernelSignature::Type fbgemm::GenerateEmbeddingSpMDMWithStrides(long, bool, bool, int, bool, bool, long, long, bool, bool, bool, bool) 2025-05-07T20:10:48.0961414Z U fbgemm::fbgemmAlignedAlloc(unsigned long, unsigned long, bool) 2025-05-07T20:10:48.0961844Z U fbgemm::fbgemmAlignedFree(void*) 2025-05-07T20:10:48.0962339Z U fbgemm_gpu::config::is_feature_enabled(fbgemm_gpu::config::FeatureGateName const&) 2025-05-07T20:10:48.0962838Z U float* at::TensorBase::data_ptr() const 2025-05-07T20:10:48.0963265Z U float* at::TensorBase::mutable_data_ptr() const 2025-05-07T20:10:48.0963685Z U int* at::TensorBase::data_ptr() const 2025-05-07T20:10:48.0964089Z U int* at::TensorBase::mutable_data_ptr() const 2025-05-07T20:10:48.0964531Z U long c10::detail::maybe_wrap_dim_slow(long, long, bool) 2025-05-07T20:10:48.0965303Z U long* at::TensorBase::data_ptr() const 2025-05-07T20:10:48.0965823Z U long* at::TensorBase::mutable_data_ptr() const 2025-05-07T20:10:48.0966241Z U memcpy@GLIBC_2.14 2025-05-07T20:10:48.0966558Z U memmove@GLIBC_2.2.5 2025-05-07T20:10:48.0966907Z U memset@GLIBC_2.2.5 2025-05-07T20:10:48.0967235Z U omp_get_max_threads@OMP_1.0 2025-05-07T20:10:48.0967605Z U omp_get_thread_num@OMP_1.0 2025-05-07T20:10:48.0967960Z U operator delete(void*)@GLIBCXX_3.4 2025-05-07T20:10:48.0968375Z U operator new(unsigned long)@GLIBCXX_3.4 2025-05-07T20:10:48.0968989Z U std::__cxx11::basic_ostringstream, std::allocator >::basic_ostringstream() 2025-05-07T20:10:48.0969905Z U std::__cxx11::basic_ostringstream, std::allocator >::~basic_ostringstream()@GLIBCXX_3.4.21 2025-05-07T20:10:48.0970585Z U std::__throw_bad_alloc()@GLIBCXX_3.4 2025-05-07T20:10:48.0970982Z U std::__throw_bad_function_call()@GLIBCXX_3.4.14 2025-05-07T20:10:48.0971492Z U std::__throw_length_error(char const*)@GLIBCXX_3.4 2025-05-07T20:10:48.0971939Z U std::__throw_logic_error(char const*)@GLIBCXX_3.4 2025-05-07T20:10:48.0972394Z U std::__throw_out_of_range_fmt(char const*, ...)@GLIBCXX_3.4.20 2025-05-07T20:10:48.0972964Z U std::basic_ios >::clear(std::_Ios_Iostate)@GLIBCXX_3.4 2025-05-07T20:10:48.0973978Z U std::basic_ostream >& std::__ostream_insert >(std::basic_ostream >&, char const*, long)@GLIBCXX_3.4.9 2025-05-07T20:10:48.0974860Z U std::exception::~exception()@GLIBCXX_3.4 2025-05-07T20:10:48.0975276Z U std::ios_base::Init::Init()@GLIBCXX_3.4 2025-05-07T20:10:48.0975655Z U std::ios_base::Init::~Init()@GLIBCXX_3.4 2025-05-07T20:10:48.0976054Z U std::ios_base::~ios_base()@GLIBCXX_3.4 2025-05-07T20:10:48.0976417Z U std::locale::~locale()@GLIBCXX_3.4 2025-05-07T20:10:48.0976863Z U std::ostream& std::ostream::_M_insert(long)@GLIBCXX_3.4.9 2025-05-07T20:10:48.0977453Z U std::ostream& std::ostream::_M_insert(unsigned long)@GLIBCXX_3.4.9 2025-05-07T20:10:48.0978050Z U std::ostream::operator<<(int)@GLIBCXX_3.4 2025-05-07T20:10:48.0978591Z U std::pair fbgemm::radix_sort_parallel(int*, int*, int*, int*, long, long, bool) 2025-05-07T20:10:48.0979484Z U std::pair*> fbgemm::radix_sort_parallel >(int*, std::pair*, int*, std::pair*, long, long, bool) 2025-05-07T20:10:48.0980573Z U std::pair*> fbgemm::radix_sort_parallel >(int*, std::pair*, int*, std::pair*, long, long, bool) 2025-05-07T20:10:48.0981326Z U std::terminate()@GLIBCXX_3.4 2025-05-07T20:10:48.0981640Z U strlen@GLIBC_2.2.5 2025-05-07T20:10:48.0981977Z U torch::CppFunction::~CppFunction() 2025-05-07T20:10:48.0982786Z U torch::Library::Library(torch::Library::Kind, std::__cxx11::basic_string, std::allocator >, std::optional, char const*, unsigned int) 2025-05-07T20:10:48.0983905Z U torch::Library::_def(c10::FunctionSchema&&, c10::OperatorName*, std::vector > const&, torch::_RegisterOrVerify) & 2025-05-07T20:10:48.0984775Z U torch::Library::_impl(char const*, torch::CppFunction&&, torch::_RegisterOrVerify) & 2025-05-07T20:10:48.0985514Z U torch::jit::parseSchema(std::__cxx11::basic_string, std::allocator > const&, bool) 2025-05-07T20:10:48.0986085Z U typeinfo for c10::Error 2025-05-07T20:10:48.0986463Z U typeinfo for std::exception@GLIBCXX_3.4 2025-05-07T20:10:48.0986883Z U unsigned char* at::TensorBase::data_ptr() const 2025-05-07T20:10:48.0987358Z U vtable for __cxxabiv1::__class_type_info@CXXABI_1.3 2025-05-07T20:10:48.0987801Z U vtable for __cxxabiv1::__function_type_info@CXXABI_1.3 2025-05-07T20:10:48.0988220Z U vtable for __cxxabiv1::__si_class_type_info@CXXABI_1.3 2025-05-07T20:10:48.0988608Z U vtable for c10::Error 2025-05-07T20:10:48.0989137Z U vtable for std::__cxx11::basic_stringbuf, std::allocator >@GLIBCXX_3.4.21 2025-05-07T20:10:48.0989829Z U vtable for std::basic_streambuf >@GLIBCXX_3.4 2025-05-07T20:10:48.0990283Z w _ITM_deregisterTMCloneTable 2025-05-07T20:10:48.0990634Z w _ITM_registerTMCloneTable 2025-05-07T20:10:48.0990983Z w __cxa_finalize@GLIBC_2.2.5 2025-05-07T20:10:48.0991413Z w __gmon_start__ 2025-05-07T20:10:48.0991895Z w __pthread_key_create 2025-05-07T20:10:48.0992275Z [CHECK] Listing out external shared libraries linked: 2025-05-07T20:10:48.0992792Z + ldd ./_skbuild/linux-x86_64-3.12/cmake-build/fbgemm_gpu_tbe_common.so 2025-05-07T20:10:48.0993129Z 2025-05-07T20:10:48.0993316Z linux-vdso.so.1 (0x00007fff267b2000) 2025-05-07T20:10:48.0993633Z libc10.so => not found 2025-05-07T20:10:48.0993961Z libnvrtc.so.12 => not found 2025-05-07T20:10:48.0994251Z libc10_cuda.so => not found 2025-05-07T20:10:48.0994565Z libnccl.so.2 => not found 2025-05-07T20:10:48.0994839Z libcuda.so.1 => not found 2025-05-07T20:10:48.0995415Z fbgemm.so => /__w/FBGEMM/FBGEMM/fbgemm_gpu/./_skbuild/linux-x86_64-3.12/cmake-build/fbgemm.so (0x00007f931ca00000) 2025-05-07T20:10:48.0996382Z fbgemm_gpu_config.so => /__w/FBGEMM/FBGEMM/fbgemm_gpu/./_skbuild/linux-x86_64-3.12/cmake-build/fbgemm_gpu_config.so (0x00007f931d439000) 2025-05-07T20:10:48.0997067Z libnvidia-ml.so.1 => not found 2025-05-07T20:10:48.0997386Z libtorch.so => not found 2025-05-07T20:10:48.0997664Z libtorch_cpu.so => not found 2025-05-07T20:10:48.0997982Z libtorch_cuda.so => not found 2025-05-07T20:10:48.0998274Z libcudart.so.12 => not found 2025-05-07T20:10:48.0998653Z libstdc++.so.6 => /lib64/libstdc++.so.6 (0x00007f931c79c000) 2025-05-07T20:10:48.0999121Z libgomp.so.1 => /lib64/libgomp.so.1 (0x00007f931d3e1000) 2025-05-07T20:10:48.0999548Z libgcc_s.so.1 => /lib64/libgcc_s.so.1 (0x00007f931d3b3000) 2025-05-07T20:10:48.0999977Z libc.so.6 => /lib64/libc.so.6 (0x00007f931c594000) 2025-05-07T20:10:48.1000320Z libc10.so => not found 2025-05-07T20:10:48.1000607Z libnvrtc.so.12 => not found 2025-05-07T20:10:48.1000887Z libc10_cuda.so => not found 2025-05-07T20:10:48.1001184Z libnccl.so.2 => not found 2025-05-07T20:10:48.1001457Z libcuda.so.1 => not found 2025-05-07T20:10:48.1002057Z asmjit.so => /__w/FBGEMM/FBGEMM/fbgemm_gpu/./_skbuild/linux-x86_64-3.12/cmake-build/asmjit.so (0x00007f931d33a000) 2025-05-07T20:10:48.1002673Z libnvidia-ml.so.1 => not found 2025-05-07T20:10:48.1002971Z libtorch.so => not found 2025-05-07T20:10:48.1003266Z libtorch_cpu.so => not found 2025-05-07T20:10:48.1003552Z libtorch_cuda.so => not found 2025-05-07T20:10:48.1003997Z libm.so.6 => /lib64/libm.so.6 (0x00007f931c4b9000) 2025-05-07T20:10:48.1004351Z /lib64/ld-linux-x86-64.so.2 (0x00007f931d44a000) 2025-05-07T20:10:48.1004693Z libtorch.so => not found 2025-05-07T20:10:48.1004945Z libc10.so => not found 2025-05-07T20:10:48.1005218Z libnvrtc.so.12 => not found 2025-05-07T20:10:48.1005526Z libc10_cuda.so => not found 2025-05-07T20:10:48.1005823Z libnccl.so.2 => not found 2025-05-07T20:10:48.1006110Z libcuda.so.1 => not found 2025-05-07T20:10:48.1006371Z libnvidia-ml.so.1 => not found 2025-05-07T20:10:48.1006674Z libtorch_cpu.so => not found 2025-05-07T20:10:48.1006944Z libtorch_cuda.so => not found 2025-05-07T20:10:48.1007307Z libpthread.so.0 => /lib64/libpthread.so.0 (0x00007f931d32f000) 2025-05-07T20:10:48.1007679Z libtorch.so => not found 2025-05-07T20:10:48.1007954Z libc10.so => not found 2025-05-07T20:10:48.1008199Z libnvrtc.so.12 => not found 2025-05-07T20:10:48.1008486Z libc10_cuda.so => not found 2025-05-07T20:10:48.1008760Z libnccl.so.2 => not found 2025-05-07T20:10:48.1009044Z libcuda.so.1 => not found 2025-05-07T20:10:48.1009332Z libnvidia-ml.so.1 => not found 2025-05-07T20:10:48.1009694Z libtorch_cpu.so => not found 2025-05-07T20:10:48.1009992Z libtorch_cuda.so => not found 2025-05-07T20:10:48.1010298Z librt.so.1 => /lib64/librt.so.1 (0x00007f931d326000) 2025-05-07T20:10:48.1010543Z 2025-05-07T20:10:48.1010684Z [CHECK] Displaying ELF information: 2025-05-07T20:10:48.1011114Z + readelf -d ./_skbuild/linux-x86_64-3.12/cmake-build/fbgemm_gpu_tbe_common.so 2025-05-07T20:10:48.1011477Z 2025-05-07T20:10:48.1011480Z 2025-05-07T20:10:48.1011642Z Dynamic section at offset 0x2b5a90 contains 41 entries: 2025-05-07T20:10:48.1012085Z Tag Type Name/Value 2025-05-07T20:10:48.1012496Z 0x0000000000000001 (NEEDED) Shared library: [libc10.so] 2025-05-07T20:10:48.1013025Z 0x0000000000000001 (NEEDED) Shared library: [libnvrtc.so.12] 2025-05-07T20:10:48.1013531Z 0x0000000000000001 (NEEDED) Shared library: [libc10_cuda.so] 2025-05-07T20:10:48.1014061Z 0x0000000000000001 (NEEDED) Shared library: [libnccl.so.2] 2025-05-07T20:10:48.1014593Z 0x0000000000000001 (NEEDED) Shared library: [libcuda.so.1] 2025-05-07T20:10:48.1015111Z 0x0000000000000001 (NEEDED) Shared library: [fbgemm.so] 2025-05-07T20:10:48.1015650Z 0x0000000000000001 (NEEDED) Shared library: [fbgemm_gpu_config.so] 2025-05-07T20:10:48.1016231Z 0x0000000000000001 (NEEDED) Shared library: [libnvidia-ml.so.1] 2025-05-07T20:10:48.1016769Z 0x0000000000000001 (NEEDED) Shared library: [libtorch.so] 2025-05-07T20:10:48.1017273Z 0x0000000000000001 (NEEDED) Shared library: [libtorch_cpu.so] 2025-05-07T20:10:48.1017813Z 0x0000000000000001 (NEEDED) Shared library: [libtorch_cuda.so] 2025-05-07T20:10:48.1018328Z 0x0000000000000001 (NEEDED) Shared library: [libcudart.so.12] 2025-05-07T20:10:48.1018867Z 0x0000000000000001 (NEEDED) Shared library: [libstdc++.so.6] 2025-05-07T20:10:48.1019398Z 0x0000000000000001 (NEEDED) Shared library: [libgomp.so.1] 2025-05-07T20:10:48.1019899Z 0x0000000000000001 (NEEDED) Shared library: [libgcc_s.so.1] 2025-05-07T20:10:48.1020410Z 0x0000000000000001 (NEEDED) Shared library: [libc.so.6] 2025-05-07T20:10:48.1020913Z 0x000000000000000e (SONAME) Library soname: [fbgemm_gpu_tbe_common.so] 2025-05-07T20:10:48.1021436Z 0x000000000000000f (RPATH) Library rpath: [$ORIGIN] 2025-05-07T20:10:48.1021831Z 0x000000000000000c (INIT) 0x16000 2025-05-07T20:10:48.1022211Z 0x000000000000000d (FINI) 0x6243c 2025-05-07T20:10:48.1022564Z 0x0000000000000019 (INIT_ARRAY) 0x2b5a40 2025-05-07T20:10:48.1022903Z 0x000000000000001b (INIT_ARRAYSZ) 72 (bytes) 2025-05-07T20:10:48.1023261Z 0x000000000000001a (FINI_ARRAY) 0x2b5a88 2025-05-07T20:10:48.1023596Z 0x000000000000001c (FINI_ARRAYSZ) 8 (bytes) 2025-05-07T20:10:48.1023958Z 0x000000006ffffef5 (GNU_HASH) 0x200 2025-05-07T20:10:48.1024280Z 0x0000000000000005 (STRTAB) 0x40a0 2025-05-07T20:10:48.1024628Z 0x0000000000000006 (SYMTAB) 0xcf8 2025-05-07T20:10:48.1024965Z 0x000000000000000a (STRSZ) 48233 (bytes) 2025-05-07T20:10:48.1025369Z 0x000000000000000b (SYMENT) 24 (bytes) 2025-05-07T20:10:48.1025737Z 0x0000000000000003 (PLTGOT) 0x2b6fe8 2025-05-07T20:10:48.1026083Z 0x0000000000000002 (PLTRELSZ) 9240 (bytes) 2025-05-07T20:10:48.1026452Z 0x0000000000000014 (PLTREL) RELA 2025-05-07T20:10:48.1026774Z 0x0000000000000017 (JMPREL) 0x13a68 2025-05-07T20:10:48.1027123Z 0x0000000000000007 (RELA) 0x10258 2025-05-07T20:10:48.1027465Z 0x0000000000000008 (RELASZ) 14352 (bytes) 2025-05-07T20:10:48.1027845Z 0x0000000000000009 (RELAENT) 24 (bytes) 2025-05-07T20:10:48.1028209Z 0x000000006ffffffe (VERNEED) 0x10158 2025-05-07T20:10:48.1028537Z 0x000000006fffffff (VERNEEDNUM) 5 2025-05-07T20:10:48.1028884Z 0x000000006ffffff0 (VERSYM) 0xfd0a 2025-05-07T20:10:48.1029208Z 0x000000006ffffff9 (RELACOUNT) 337 2025-05-07T20:10:48.1029541Z 0x0000000000000000 (NULL) 0x0 2025-05-07T20:10:48.1029742Z 2025-05-07T20:10:48.1029860Z ################################################################################ 2025-05-07T20:10:48.1030100Z 2025-05-07T20:10:48.1030103Z 2025-05-07T20:10:48.1030217Z ################################################################################ 2025-05-07T20:10:48.1030712Z [CHECK] BUILT LIBRARY: ./_skbuild/linux-x86_64-3.12/cmake-build/fbgemm_gpu_tbe_cache.so 2025-05-07T20:10:48.1031277Z [CHECK] Listing out library size: 2025-05-07T20:10:48.1031751Z + du -h --block-size=1M ./_skbuild/linux-x86_64-3.12/cmake-build/fbgemm_gpu_tbe_cache.so 2025-05-07T20:10:48.1032297Z 2025-05-07T20:10:48.1032540Z 21 ./_skbuild/linux-x86_64-3.12/cmake-build/fbgemm_gpu_tbe_cache.so 2025-05-07T20:10:48.1032883Z 2025-05-07T20:10:48.1033276Z [CHECK] Listing out the GLIBC versions referenced by: ./_skbuild/linux-x86_64-3.12/cmake-build/fbgemm_gpu_tbe_cache.so 2025-05-07T20:10:48.1034351Z + objdump -TC ./_skbuild/linux-x86_64-3.12/cmake-build/fbgemm_gpu_tbe_cache.so | grep GLIBC_ | sed 's/.*GLIBC_\([.0-9]*\).*/GLIBC_\1/g' | sort -Vu | cat 2025-05-07T20:10:48.1034948Z 2025-05-07T20:10:48.1083028Z GLIBC_2.2.5 2025-05-07T20:10:48.1083716Z GLIBC_2.14 2025-05-07T20:10:48.1084080Z 2025-05-07T20:10:48.1084094Z 2025-05-07T20:10:48.1085025Z [CHECK] Listing out the GLIBCXX versions referenced by: ./_skbuild/linux-x86_64-3.12/cmake-build/fbgemm_gpu_tbe_cache.so 2025-05-07T20:10:48.1086124Z + objdump -TC ./_skbuild/linux-x86_64-3.12/cmake-build/fbgemm_gpu_tbe_cache.so | grep GLIBCXX_ | sed 's/.*GLIBCXX_\([.0-9]*\).*/GLIBCXX_\1/g' | sort -Vu | cat 2025-05-07T20:10:48.1086762Z 2025-05-07T20:10:48.1163369Z GLIBCXX_3.4 2025-05-07T20:10:48.1163665Z GLIBCXX_3.4.9 2025-05-07T20:10:48.1163949Z GLIBCXX_3.4.11 2025-05-07T20:10:48.1166431Z GLIBCXX_3.4.20 2025-05-07T20:10:48.1167140Z GLIBCXX_3.4.21 2025-05-07T20:10:48.1167573Z 2025-05-07T20:10:48.1167588Z 2025-05-07T20:10:48.1187902Z + nm -gDC ./_skbuild/linux-x86_64-3.12/cmake-build/fbgemm_gpu_tbe_cache.so > /tmp/tmp.6Ir8zssIA5.symbols.txt 2025-05-07T20:10:48.1189381Z 2025-05-07T20:10:48.1237336Z 2025-05-07T20:10:48.1267168Z [CHECK] Total Number of symbols: 783 2025-05-07T20:10:48.1281818Z [CHECK] Number of fbgemm symbols: 73 2025-05-07T20:10:48.1302850Z + nm -gDCu ./_skbuild/linux-x86_64-3.12/cmake-build/fbgemm_gpu_tbe_cache.so > /tmp/tmp.sEUE6cGN8N.usymbols.txt 2025-05-07T20:10:48.1304636Z 2025-05-07T20:10:48.1326726Z 2025-05-07T20:10:48.1353901Z [CHECK] Listing out undefined symbols (147 total): 2025-05-07T20:10:48.1375415Z U VTT for std::__cxx11::basic_ostringstream, std::allocator >@GLIBCXX_3.4.21 2025-05-07T20:10:48.1377236Z U _Unwind_Resume@GCC_3.0 2025-05-07T20:10:48.1378303Z U __cudaPopCallConfiguration@libcudart.so.12 2025-05-07T20:10:48.1379310Z U __cudaPushCallConfiguration@libcudart.so.12 2025-05-07T20:10:48.1379758Z U __cudaRegisterFatBinary@libcudart.so.12 2025-05-07T20:10:48.1380153Z U __cudaRegisterFatBinaryEnd@libcudart.so.12 2025-05-07T20:10:48.1380711Z U __cudaRegisterFunction@libcudart.so.12 2025-05-07T20:10:48.1381111Z U __cudaRegisterVar@libcudart.so.12 2025-05-07T20:10:48.1381494Z U __cudaUnregisterFatBinary@libcudart.so.12 2025-05-07T20:10:48.1381884Z U __cxa_atexit@GLIBC_2.2.5 2025-05-07T20:10:48.1382220Z U __cxa_begin_catch@CXXABI_1.3 2025-05-07T20:10:48.1382575Z U __cxa_end_catch@CXXABI_1.3 2025-05-07T20:10:48.1395486Z U __cxa_guard_abort@CXXABI_1.3 2025-05-07T20:10:48.1395852Z U __cxa_guard_acquire@CXXABI_1.3 2025-05-07T20:10:48.1396229Z U __cxa_guard_release@CXXABI_1.3 2025-05-07T20:10:48.1396570Z U __cxa_rethrow@CXXABI_1.3 2025-05-07T20:10:48.1396938Z U __gxx_personality_v0@CXXABI_1.3 2025-05-07T20:10:48.1397354Z U at::CUDAGeneratorImpl::device_type() 2025-05-07T20:10:48.1397785Z U at::CUDAGeneratorImpl::philox_cuda_state(unsigned long) 2025-05-07T20:10:48.1398587Z U at::_ops::arange::call(c10::Scalar const&, std::optional, std::optional, std::optional, std::optional) 2025-05-07T20:10:48.1399882Z U at::_ops::empty_like::call(at::Tensor const&, std::optional, std::optional, std::optional, std::optional, std::optional) 2025-05-07T20:10:48.1401297Z U at::_ops::empty_memory_format::call(c10::ArrayRef, std::optional, std::optional, std::optional, std::optional, std::optional) 2025-05-07T20:10:48.1403905Z U at::_ops::fill__Scalar::call(at::Tensor&, c10::Scalar const&) 2025-05-07T20:10:48.1404936Z U at::_ops::full_like::call(at::Tensor const&, c10::Scalar const&, std::optional, std::optional, std::optional, std::optional, std::optional) 2025-05-07T20:10:48.1405906Z U at::_ops::index_select::call(at::Tensor const&, long, at::Tensor const&) 2025-05-07T20:10:48.1406634Z U at::_ops::slice_Tensor::call(at::Tensor const&, long, std::optional, std::optional, c10::SymInt) 2025-05-07T20:10:48.1407693Z U at::_ops::zeros::call(c10::ArrayRef, std::optional, std::optional, std::optional, std::optional) 2025-05-07T20:10:48.1408802Z U at::_ops::zeros_like::call(at::Tensor const&, std::optional, std::optional, std::optional, std::optional, std::optional) 2025-05-07T20:10:48.1409649Z U at::cuda::detail::getDefaultCUDAGenerator(signed char) 2025-05-07T20:10:48.1410090Z U at::cuda::getDeviceProperties(signed char) 2025-05-07T20:10:48.1410436Z U c10::BoolType::get() 2025-05-07T20:10:48.1410816Z U c10::DeviceTypeName[abi:cxx11](c10::DeviceType, bool) 2025-05-07T20:10:48.1411203Z U c10::GeneratorImpl::device() const 2025-05-07T20:10:48.1411618Z U c10::Half* at::TensorBase::mutable_data_ptr() const 2025-05-07T20:10:48.1412110Z U c10::IValue::reportToTensorTypeError() const 2025-05-07T20:10:48.1412454Z U c10::IntType::get() 2025-05-07T20:10:48.1412877Z U c10::OptionalType::get(c10::Type::SingletonOrSharedTypePtr) 2025-05-07T20:10:48.1413356Z U c10::StorageImpl::throw_data_ptr_access_error() const 2025-05-07T20:10:48.1413789Z U c10::SymBool::guard_bool(char const*, long) const 2025-05-07T20:10:48.1414462Z U c10::SymInt::SymInt(c10::intrusive_ptr >) 2025-05-07T20:10:48.1415117Z U c10::SymInt::guard_int(char const*, long) const 2025-05-07T20:10:48.1415500Z U c10::SymInt::promote_to_negative() 2025-05-07T20:10:48.1415876Z U c10::TensorImpl::throw_data_ptr_access_error() const 2025-05-07T20:10:48.1416265Z U c10::TensorType::get() 2025-05-07T20:10:48.1416621Z U c10::UndefinedTensorImpl::_singleton 2025-05-07T20:10:48.1417528Z U c10::Warning::Warning(std::variant, c10::SourceLocation const&, std::__cxx11::basic_string, std::allocator >, bool) 2025-05-07T20:10:48.1418462Z U c10::cuda::CUDACachingAllocator::allocator 2025-05-07T20:10:48.1418812Z U c10::cuda::CUDAStream::stream() const 2025-05-07T20:10:48.1419179Z U c10::cuda::ExchangeDevice(signed char) 2025-05-07T20:10:48.1419544Z U c10::cuda::GetDevice(signed char*) 2025-05-07T20:10:48.1419879Z U c10::cuda::MaybeSetDevice(signed char) 2025-05-07T20:10:48.1420236Z U c10::cuda::SetDevice(signed char) 2025-05-07T20:10:48.1420695Z U c10::cuda::c10_cuda_check_implementation(int, char const*, char const*, int, bool) 2025-05-07T20:10:48.1421181Z U c10::cuda::current_device() 2025-05-07T20:10:48.1421514Z U c10::cuda::device_count() 2025-05-07T20:10:48.1421885Z U c10::cuda::getCurrentCUDAStream(signed char) 2025-05-07T20:10:48.1422282Z U c10::cuda::getDefaultCUDAStream(signed char) 2025-05-07T20:10:48.1422839Z U c10::cuda::getStreamFromPool(bool, signed char) 2025-05-07T20:10:48.1423272Z U c10::cuda::getStreamFromPool(int, signed char) 2025-05-07T20:10:48.1423713Z U c10::cuda::setCurrentCUDAStream(c10::cuda::CUDAStream) 2025-05-07T20:10:48.1424253Z U c10::cuda::warn_or_error_on_sync() 2025-05-07T20:10:48.1425205Z U c10::detail::infer_schema::make_function_schema(c10::ArrayRef, c10::ArrayRef) 2025-05-07T20:10:48.1426119Z U c10::detail::torchCheckFail(char const*, char const*, unsigned int, char const*) 2025-05-07T20:10:48.1427035Z U c10::detail::torchCheckFail(char const*, char const*, unsigned int, std::__cxx11::basic_string, std::allocator > const&) 2025-05-07T20:10:48.1428037Z U c10::detail::torchInternalAssertFail(char const*, char const*, unsigned int, char const*, char const*) 2025-05-07T20:10:48.1429096Z U c10::detail::torchInternalAssertFail(char const*, char const*, unsigned int, char const*, std::__cxx11::basic_string, std::allocator > const&) 2025-05-07T20:10:48.1429963Z U c10::impl::GPUTrace::gpuTraceState 2025-05-07T20:10:48.1430320Z U c10::impl::GPUTrace::haveState 2025-05-07T20:10:48.1430723Z U c10::impl::cow::is_cow_data_ptr(c10::DataPtr const&) 2025-05-07T20:10:48.1431195Z U c10::impl::cow::materialize_cow_storage(c10::StorageImpl&) 2025-05-07T20:10:48.1431712Z U c10::impl::device_guard_impl_registry 2025-05-07T20:10:48.1432170Z U c10::operator<<(std::ostream&, c10::Device const&) 2025-05-07T20:10:48.1432633Z U c10::operator<<(std::ostream&, c10::DeviceType) 2025-05-07T20:10:48.1433040Z U c10::throwNullDataPtrError() 2025-05-07T20:10:48.1433415Z U c10::warn(c10::Warning const&) 2025-05-07T20:10:48.1433761Z U c10::warnDeprecatedDataPtr() 2025-05-07T20:10:48.1434225Z U caffe2::TypeMeta::error_unsupported_typemeta(caffe2::TypeMeta) 2025-05-07T20:10:48.1434683Z U caffe2::TypeMeta::typeMetaDatas() 2025-05-07T20:10:48.1435114Z U cudaDeviceGetAttribute@libcudart.so.12 2025-05-07T20:10:48.1435502Z U cudaDeviceSynchronize@libcudart.so.12 2025-05-07T20:10:48.1435922Z U cudaEventCreateWithFlags@libcudart.so.12 2025-05-07T20:10:48.1436333Z U cudaEventDestroy@libcudart.so.12 2025-05-07T20:10:48.1436713Z U cudaEventElapsedTime@libcudart.so.12 2025-05-07T20:10:48.1437114Z U cudaEventQuery@libcudart.so.12 2025-05-07T20:10:48.1437465Z U cudaEventRecord@libcudart.so.12 2025-05-07T20:10:48.1437852Z U cudaEventSynchronize@libcudart.so.12 2025-05-07T20:10:48.1438230Z U cudaFuncGetAttributes@libcudart.so.12 2025-05-07T20:10:48.1438622Z U cudaGetDevice@libcudart.so.12 2025-05-07T20:10:48.1438986Z U cudaGetDeviceCount@libcudart.so.12 2025-05-07T20:10:48.1439385Z U cudaGetErrorString@libcudart.so.12 2025-05-07T20:10:48.1439767Z U cudaGetLastError@libcudart.so.12 2025-05-07T20:10:48.1440125Z U cudaLaunchKernel@libcudart.so.12 2025-05-07T20:10:48.1440507Z U cudaMemcpyAsync@libcudart.so.12 2025-05-07T20:10:48.1440861Z U cudaMemsetAsync@libcudart.so.12 2025-05-07T20:10:48.1441417Z U cudaOccupancyMaxActiveBlocksPerMultiprocessorWithFlags@libcudart.so.12 2025-05-07T20:10:48.1442014Z U cudaPeekAtLastError@libcudart.so.12 2025-05-07T20:10:48.1442401Z U cudaSetDevice@libcudart.so.12 2025-05-07T20:10:48.1442788Z U cudaStreamQuery@libcudart.so.12 2025-05-07T20:10:48.1443164Z U cudaStreamSynchronize@libcudart.so.12 2025-05-07T20:10:48.1443569Z U cudaStreamWaitEvent@libcudart.so.12 2025-05-07T20:10:48.1444069Z U float* at::TensorBase::mutable_data_ptr() const 2025-05-07T20:10:48.1444517Z U int* at::TensorBase::data_ptr() const 2025-05-07T20:10:48.1444899Z U int* at::TensorBase::mutable_data_ptr() const 2025-05-07T20:10:48.1445269Z U log2f@GLIBC_2.2.5 2025-05-07T20:10:48.1445657Z U long c10::detail::maybe_wrap_dim_slow(long, long, bool) 2025-05-07T20:10:48.1446069Z U long* at::TensorBase::data_ptr() const 2025-05-07T20:10:48.1446479Z U long* at::TensorBase::mutable_data_ptr() const 2025-05-07T20:10:48.1446839Z U memcpy@GLIBC_2.14 2025-05-07T20:10:48.1447159Z U memmove@GLIBC_2.2.5 2025-05-07T20:10:48.1447448Z U memset@GLIBC_2.2.5 2025-05-07T20:10:48.1447778Z U operator delete(void*)@GLIBCXX_3.4 2025-05-07T20:10:48.1448138Z U operator new(unsigned long)@GLIBCXX_3.4 2025-05-07T20:10:48.1448697Z U std::__cxx11::basic_ostringstream, std::allocator >::basic_ostringstream() 2025-05-07T20:10:48.1449528Z U std::__cxx11::basic_ostringstream, std::allocator >::~basic_ostringstream()@GLIBCXX_3.4.21 2025-05-07T20:10:48.1450128Z U std::__throw_bad_alloc()@GLIBCXX_3.4 2025-05-07T20:10:48.1450517Z U std::__throw_length_error(char const*)@GLIBCXX_3.4 2025-05-07T20:10:48.1450921Z U std::__throw_logic_error(char const*)@GLIBCXX_3.4 2025-05-07T20:10:48.1451360Z U std::__throw_out_of_range_fmt(char const*, ...)@GLIBCXX_3.4.20 2025-05-07T20:10:48.1451792Z U std::__throw_system_error(int)@GLIBCXX_3.4.11 2025-05-07T20:10:48.1452236Z U std::basic_ios >::clear(std::_Ios_Iostate)@GLIBCXX_3.4 2025-05-07T20:10:48.1453143Z U std::basic_ostream >& std::__ostream_insert >(std::basic_ostream >&, char const*, long)@GLIBCXX_3.4.9 2025-05-07T20:10:48.1453926Z U std::ios_base::Init::Init()@GLIBCXX_3.4 2025-05-07T20:10:48.1454297Z U std::ios_base::Init::~Init()@GLIBCXX_3.4 2025-05-07T20:10:48.1454668Z U std::ios_base::~ios_base()@GLIBCXX_3.4 2025-05-07T20:10:48.1455007Z U std::locale::~locale()@GLIBCXX_3.4 2025-05-07T20:10:48.1455426Z U std::ostream& std::ostream::_M_insert(long)@GLIBCXX_3.4.9 2025-05-07T20:10:48.1455975Z U std::ostream& std::ostream::_M_insert(unsigned long)@GLIBCXX_3.4.9 2025-05-07T20:10:48.1456415Z U std::terminate()@GLIBCXX_3.4 2025-05-07T20:10:48.1456748Z U strlen@GLIBC_2.2.5 2025-05-07T20:10:48.1457060Z U torch::CppFunction::~CppFunction() 2025-05-07T20:10:48.1457877Z U torch::Library::Library(torch::Library::Kind, std::__cxx11::basic_string, std::allocator >, std::optional, char const*, unsigned int) 2025-05-07T20:10:48.1459002Z U torch::Library::_def(c10::FunctionSchema&&, c10::OperatorName*, std::vector > const&, torch::_RegisterOrVerify) & 2025-05-07T20:10:48.1459797Z U torch::Library::_impl(char const*, torch::CppFunction&&, torch::_RegisterOrVerify) & 2025-05-07T20:10:48.1460735Z U torch::jit::parseSchema(std::__cxx11::basic_string, std::allocator > const&, bool) 2025-05-07T20:10:48.1461468Z U unsigned char* at::TensorBase::mutable_data_ptr() const 2025-05-07T20:10:48.1462141Z U vtable for __cxxabiv1::__class_type_info@CXXABI_1.3 2025-05-07T20:10:48.1462614Z U vtable for __cxxabiv1::__function_type_info@CXXABI_1.3 2025-05-07T20:10:48.1463066Z U vtable for __cxxabiv1::__si_class_type_info@CXXABI_1.3 2025-05-07T20:10:48.1463748Z U vtable for std::__cxx11::basic_stringbuf, std::allocator >@GLIBCXX_3.4.21 2025-05-07T20:10:48.1464473Z U vtable for std::basic_streambuf >@GLIBCXX_3.4 2025-05-07T20:10:48.1465151Z w _ITM_deregisterTMCloneTable 2025-05-07T20:10:48.1465533Z w _ITM_registerTMCloneTable 2025-05-07T20:10:48.1465902Z w __cxa_finalize@GLIBC_2.2.5 2025-05-07T20:10:48.1466261Z w __gmon_start__ 2025-05-07T20:10:48.1466586Z w __pthread_key_create 2025-05-07T20:10:48.1466912Z w pthread_mutex_lock@GLIBC_2.2.5 2025-05-07T20:10:48.1467275Z w pthread_mutex_unlock@GLIBC_2.2.5 2025-05-07T20:10:48.1467664Z [CHECK] Listing out external shared libraries linked: 2025-05-07T20:10:48.1468158Z + ldd ./_skbuild/linux-x86_64-3.12/cmake-build/fbgemm_gpu_tbe_cache.so 2025-05-07T20:10:48.1468482Z 2025-05-07T20:10:48.1468667Z linux-vdso.so.1 (0x00007ffc199fc000) 2025-05-07T20:10:48.1468985Z libtorch.so => not found 2025-05-07T20:10:48.1469284Z libc10.so => not found 2025-05-07T20:10:48.1469551Z libnvrtc.so.12 => not found 2025-05-07T20:10:48.1469856Z libc10_cuda.so => not found 2025-05-07T20:10:48.1470132Z libnccl.so.2 => not found 2025-05-07T20:10:48.1470437Z libcuda.so.1 => not found 2025-05-07T20:10:48.1470716Z libnvidia-ml.so.1 => not found 2025-05-07T20:10:48.1471088Z libtorch_cpu.so => not found 2025-05-07T20:10:48.1471475Z libtorch_cuda.so => not found 2025-05-07T20:10:48.1471764Z libcudart.so.12 => not found 2025-05-07T20:10:48.1472137Z libstdc++.so.6 => /lib64/libstdc++.so.6 (0x00007ff3caf9c000) 2025-05-07T20:10:48.1472555Z libm.so.6 => /lib64/libm.so.6 (0x00007ff3cc9a8000) 2025-05-07T20:10:48.1472982Z libgomp.so.1 => /lib64/libgomp.so.1 (0x00007ff3cc952000) 2025-05-07T20:10:48.1473404Z libgcc_s.so.1 => /lib64/libgcc_s.so.1 (0x00007ff3cc924000) 2025-05-07T20:10:48.1473829Z libc.so.6 => /lib64/libc.so.6 (0x00007ff3cad94000) 2025-05-07T20:10:48.1474209Z /lib64/ld-linux-x86-64.so.2 (0x00007ff3cca8b000) 2025-05-07T20:10:48.1474526Z 2025-05-07T20:10:48.1474646Z [CHECK] Displaying ELF information: 2025-05-07T20:10:48.1475111Z + readelf -d ./_skbuild/linux-x86_64-3.12/cmake-build/fbgemm_gpu_tbe_cache.so 2025-05-07T20:10:48.1475455Z 2025-05-07T20:10:48.1475460Z 2025-05-07T20:10:48.1475630Z Dynamic section at offset 0x14b76f0 contains 39 entries: 2025-05-07T20:10:48.1476052Z Tag Type Name/Value 2025-05-07T20:10:48.1476493Z 0x0000000000000001 (NEEDED) Shared library: [libtorch.so] 2025-05-07T20:10:48.1477036Z 0x0000000000000001 (NEEDED) Shared library: [libc10.so] 2025-05-07T20:10:48.1477581Z 0x0000000000000001 (NEEDED) Shared library: [libnvrtc.so.12] 2025-05-07T20:10:48.1478114Z 0x0000000000000001 (NEEDED) Shared library: [libc10_cuda.so] 2025-05-07T20:10:48.1478660Z 0x0000000000000001 (NEEDED) Shared library: [libnccl.so.2] 2025-05-07T20:10:48.1479183Z 0x0000000000000001 (NEEDED) Shared library: [libcuda.so.1] 2025-05-07T20:10:48.1479753Z 0x0000000000000001 (NEEDED) Shared library: [libnvidia-ml.so.1] 2025-05-07T20:10:48.1480298Z 0x0000000000000001 (NEEDED) Shared library: [libtorch_cpu.so] 2025-05-07T20:10:48.1480861Z 0x0000000000000001 (NEEDED) Shared library: [libtorch_cuda.so] 2025-05-07T20:10:48.1481426Z 0x0000000000000001 (NEEDED) Shared library: [libcudart.so.12] 2025-05-07T20:10:48.1481993Z 0x0000000000000001 (NEEDED) Shared library: [libstdc++.so.6] 2025-05-07T20:10:48.1482518Z 0x0000000000000001 (NEEDED) Shared library: [libm.so.6] 2025-05-07T20:10:48.1483035Z 0x0000000000000001 (NEEDED) Shared library: [libgomp.so.1] 2025-05-07T20:10:48.1483577Z 0x0000000000000001 (NEEDED) Shared library: [libgcc_s.so.1] 2025-05-07T20:10:48.1484095Z 0x0000000000000001 (NEEDED) Shared library: [libc.so.6] 2025-05-07T20:10:48.1484690Z 0x000000000000000e (SONAME) Library soname: [fbgemm_gpu_tbe_cache.so] 2025-05-07T20:10:48.1485180Z 0x000000000000000c (INIT) 0x2d000 2025-05-07T20:10:48.1485523Z 0x000000000000000d (FINI) 0xd6d2c 2025-05-07T20:10:48.1485895Z 0x0000000000000019 (INIT_ARRAY) 0x14b5318 2025-05-07T20:10:48.1486329Z 0x000000000000001b (INIT_ARRAYSZ) 208 (bytes) 2025-05-07T20:10:48.1486731Z 0x000000000000001a (FINI_ARRAY) 0x14b53e8 2025-05-07T20:10:48.1487103Z 0x000000000000001c (FINI_ARRAYSZ) 8 (bytes) 2025-05-07T20:10:48.1487476Z 0x000000006ffffef5 (GNU_HASH) 0x200 2025-05-07T20:10:48.1487825Z 0x0000000000000005 (STRTAB) 0x5fa8 2025-05-07T20:10:48.1488170Z 0x0000000000000006 (SYMTAB) 0x1628 2025-05-07T20:10:48.1488521Z 0x000000000000000a (STRSZ) 113302 (bytes) 2025-05-07T20:10:48.1488908Z 0x000000000000000b (SYMENT) 24 (bytes) 2025-05-07T20:10:48.1489261Z 0x0000000000000003 (PLTGOT) 0x14b7fe8 2025-05-07T20:10:48.1489638Z 0x0000000000000002 (PLTRELSZ) 10368 (bytes) 2025-05-07T20:10:48.1489989Z 0x0000000000000014 (PLTREL) RELA 2025-05-07T20:10:48.1490336Z 0x0000000000000017 (JMPREL) 0x29e58 2025-05-07T20:10:48.1490672Z 0x0000000000000007 (RELA) 0x22160 2025-05-07T20:10:48.1491016Z 0x0000000000000008 (RELASZ) 31992 (bytes) 2025-05-07T20:10:48.1491413Z 0x0000000000000009 (RELAENT) 24 (bytes) 2025-05-07T20:10:48.1491755Z 0x000000006ffffffe (VERNEED) 0x22060 2025-05-07T20:10:48.1492107Z 0x000000006fffffff (VERNEEDNUM) 5 2025-05-07T20:10:48.1492425Z 0x000000006ffffff0 (VERSYM) 0x21a3e 2025-05-07T20:10:48.1492768Z 0x000000006ffffff9 (RELACOUNT) 498 2025-05-07T20:10:48.1493078Z 0x0000000000000000 (NULL) 0x0 2025-05-07T20:10:48.1493304Z 2025-05-07T20:10:48.1493421Z ################################################################################ 2025-05-07T20:10:48.1493648Z 2025-05-07T20:10:48.1493653Z 2025-05-07T20:10:48.1493802Z ################################################################################ 2025-05-07T20:10:48.1494307Z [CHECK] BUILT LIBRARY: ./_skbuild/linux-x86_64-3.12/cmake-build/fbgemm_gpu_tbe_optimizers.so 2025-05-07T20:10:48.1494822Z [CHECK] Listing out library size: 2025-05-07T20:10:48.1495277Z + du -h --block-size=1M ./_skbuild/linux-x86_64-3.12/cmake-build/fbgemm_gpu_tbe_optimizers.so 2025-05-07T20:10:48.1495677Z 2025-05-07T20:10:48.1495893Z 9 ./_skbuild/linux-x86_64-3.12/cmake-build/fbgemm_gpu_tbe_optimizers.so 2025-05-07T20:10:48.1496208Z 2025-05-07T20:10:48.1496629Z [CHECK] Listing out the GLIBC versions referenced by: ./_skbuild/linux-x86_64-3.12/cmake-build/fbgemm_gpu_tbe_optimizers.so 2025-05-07T20:10:48.1497648Z + objdump -TC ./_skbuild/linux-x86_64-3.12/cmake-build/fbgemm_gpu_tbe_optimizers.so | grep GLIBC_ | sed 's/.*GLIBC_\([.0-9]*\).*/GLIBC_\1/g' | sort -Vu | cat 2025-05-07T20:10:48.1498272Z 2025-05-07T20:10:48.1541548Z GLIBC_2.2.5 2025-05-07T20:10:48.1541821Z GLIBC_2.3 2025-05-07T20:10:48.1542035Z GLIBC_2.14 2025-05-07T20:10:48.1542162Z 2025-05-07T20:10:48.1542180Z 2025-05-07T20:10:48.1542614Z [CHECK] Listing out the GLIBCXX versions referenced by: ./_skbuild/linux-x86_64-3.12/cmake-build/fbgemm_gpu_tbe_optimizers.so 2025-05-07T20:10:48.1543671Z + objdump -TC ./_skbuild/linux-x86_64-3.12/cmake-build/fbgemm_gpu_tbe_optimizers.so | grep GLIBCXX_ | sed 's/.*GLIBCXX_\([.0-9]*\).*/GLIBCXX_\1/g' | sort -Vu | cat 2025-05-07T20:10:48.1544335Z 2025-05-07T20:10:48.1600148Z GLIBCXX_3.4 2025-05-07T20:10:48.1600468Z GLIBCXX_3.4.9 2025-05-07T20:10:48.1601889Z GLIBCXX_3.4.11 2025-05-07T20:10:48.1602583Z GLIBCXX_3.4.18 2025-05-07T20:10:48.1603196Z GLIBCXX_3.4.21 2025-05-07T20:10:48.1603575Z 2025-05-07T20:10:48.1603591Z 2025-05-07T20:10:48.1624636Z + nm -gDC ./_skbuild/linux-x86_64-3.12/cmake-build/fbgemm_gpu_tbe_optimizers.so > /tmp/tmp.5pue5hcPcX.symbols.txt 2025-05-07T20:10:48.1626137Z 2025-05-07T20:10:48.1652453Z 2025-05-07T20:10:48.1677717Z [CHECK] Total Number of symbols: 347 2025-05-07T20:10:48.1693237Z [CHECK] Number of fbgemm symbols: 16 2025-05-07T20:10:48.1711007Z + nm -gDCu ./_skbuild/linux-x86_64-3.12/cmake-build/fbgemm_gpu_tbe_optimizers.so > /tmp/tmp.0wmPsRFaMc.usymbols.txt 2025-05-07T20:10:48.1712786Z 2025-05-07T20:10:48.1728041Z 2025-05-07T20:10:48.1753392Z [CHECK] Listing out undefined symbols (124 total): 2025-05-07T20:10:48.1773387Z U VTT for std::__cxx11::basic_ostringstream, std::allocator >@GLIBCXX_3.4.21 2025-05-07T20:10:48.1774244Z U VTT for std::__cxx11::basic_stringstream, std::allocator >@GLIBCXX_3.4.21 2025-05-07T20:10:48.1774816Z U _Unwind_Resume@GCC_3.0 2025-05-07T20:10:48.1775173Z U __cudaPopCallConfiguration@libcudart.so.12 2025-05-07T20:10:48.1775590Z U __cudaPushCallConfiguration@libcudart.so.12 2025-05-07T20:10:48.1775981Z U __cudaRegisterFatBinary@libcudart.so.12 2025-05-07T20:10:48.1776377Z U __cudaRegisterFatBinaryEnd@libcudart.so.12 2025-05-07T20:10:48.1776760Z U __cudaRegisterFunction@libcudart.so.12 2025-05-07T20:10:48.1777139Z U __cudaRegisterVar@libcudart.so.12 2025-05-07T20:10:48.1777540Z U __cudaUnregisterFatBinary@libcudart.so.12 2025-05-07T20:10:48.1777900Z U __cxa_atexit@GLIBC_2.2.5 2025-05-07T20:10:48.1778392Z U __cxa_begin_catch@CXXABI_1.3 2025-05-07T20:10:48.1778720Z U __cxa_end_catch@CXXABI_1.3 2025-05-07T20:10:48.1779058Z U __cxa_rethrow@CXXABI_1.3 2025-05-07T20:10:48.1779376Z U __cxa_thread_atexit@CXXABI_1.3.7 2025-05-07T20:10:48.1779733Z U __gxx_personality_v0@CXXABI_1.3 2025-05-07T20:10:48.1780059Z U __tls_get_addr@GLIBC_2.3 2025-05-07T20:10:48.1780416Z U at::CUDAGeneratorImpl::device_type() 2025-05-07T20:10:48.1780845Z U at::CUDAGeneratorImpl::philox_cuda_state(unsigned long) 2025-05-07T20:10:48.1781436Z U at::_ops::flatten_using_ints::call(at::Tensor const&, long, long) 2025-05-07T20:10:48.1782066Z U at::cuda::detail::getDefaultCUDAGenerator(signed char) 2025-05-07T20:10:48.1782447Z U c10::BoolType::get() 2025-05-07T20:10:48.1782892Z U c10::DeviceTypeName[abi:cxx11](c10::DeviceType, bool) 2025-05-07T20:10:48.1783270Z U c10::FloatType::get() 2025-05-07T20:10:48.1783593Z U c10::GeneratorImpl::device() const 2025-05-07T20:10:48.1784018Z U c10::Half* at::TensorBase::mutable_data_ptr() const 2025-05-07T20:10:48.1784477Z U c10::IValue::reportToTensorTypeError() const 2025-05-07T20:10:48.1784841Z U c10::IntType::get() 2025-05-07T20:10:48.1785236Z U c10::MessageLogger::MessageLogger(char const*, int, int) 2025-05-07T20:10:48.1785651Z U c10::MessageLogger::~MessageLogger() 2025-05-07T20:10:48.1786263Z U c10::SymBool::guard_bool(char const*, long) const 2025-05-07T20:10:48.1786751Z U c10::SymFloat::guard_float(char const*, long) const 2025-05-07T20:10:48.1787398Z U c10::SymInt::SymInt(c10::intrusive_ptr >) 2025-05-07T20:10:48.1788031Z U c10::SymInt::guard_int(char const*, long) const 2025-05-07T20:10:48.1788446Z U c10::SymInt::promote_to_negative() 2025-05-07T20:10:48.1788771Z U c10::TensorType::get() 2025-05-07T20:10:48.1789081Z U c10::UndefinedTensorImpl::_singleton 2025-05-07T20:10:48.1789986Z U c10::Warning::Warning(std::variant, c10::SourceLocation const&, std::__cxx11::basic_string, std::allocator >, bool) 2025-05-07T20:10:48.1791013Z U c10::cuda::CUDACachingAllocator::allocator 2025-05-07T20:10:48.1791443Z U c10::cuda::CUDAStream::stream() const 2025-05-07T20:10:48.1791957Z U c10::cuda::ExchangeDevice(signed char) 2025-05-07T20:10:48.1792327Z U c10::cuda::GetDevice(signed char*) 2025-05-07T20:10:48.1792699Z U c10::cuda::MaybeSetDevice(signed char) 2025-05-07T20:10:48.1793068Z U c10::cuda::SetDevice(signed char) 2025-05-07T20:10:48.1793553Z U c10::cuda::c10_cuda_check_implementation(int, char const*, char const*, int, bool) 2025-05-07T20:10:48.1794053Z U c10::cuda::device_count() 2025-05-07T20:10:48.1794425Z U c10::cuda::getCurrentCUDAStream(signed char) 2025-05-07T20:10:48.1794819Z U c10::cuda::getDefaultCUDAStream(signed char) 2025-05-07T20:10:48.1795234Z U c10::cuda::getStreamFromPool(bool, signed char) 2025-05-07T20:10:48.1795637Z U c10::cuda::getStreamFromPool(int, signed char) 2025-05-07T20:10:48.1796072Z U c10::cuda::setCurrentCUDAStream(c10::cuda::CUDAStream) 2025-05-07T20:10:48.1796477Z U c10::cuda::warn_or_error_on_sync() 2025-05-07T20:10:48.1797232Z U c10::detail::infer_schema::make_function_schema(c10::ArrayRef, c10::ArrayRef) 2025-05-07T20:10:48.1798180Z U c10::detail::torchCheckFail(char const*, char const*, unsigned int, char const*) 2025-05-07T20:10:48.1799054Z U c10::detail::torchCheckFail(char const*, char const*, unsigned int, std::__cxx11::basic_string, std::allocator > const&) 2025-05-07T20:10:48.1800026Z U c10::detail::torchInternalAssertFail(char const*, char const*, unsigned int, char const*, char const*) 2025-05-07T20:10:48.1801095Z U c10::detail::torchInternalAssertFail(char const*, char const*, unsigned int, char const*, std::__cxx11::basic_string, std::allocator > const&) 2025-05-07T20:10:48.1801952Z U c10::impl::GPUTrace::gpuTraceState 2025-05-07T20:10:48.1802312Z U c10::impl::GPUTrace::haveState 2025-05-07T20:10:48.1802670Z U c10::impl::device_guard_impl_registry 2025-05-07T20:10:48.1803049Z U c10::operator<<(std::ostream&, c10::Device const&) 2025-05-07T20:10:48.1803569Z U c10::operator<<(std::ostream&, c10::DeviceType) 2025-05-07T20:10:48.1804029Z U c10::warn(c10::Warning const&) 2025-05-07T20:10:48.1804576Z U caffe2::TypeMeta::error_unsupported_typemeta(caffe2::TypeMeta) 2025-05-07T20:10:48.1805009Z U cudaDeviceSynchronize@libcudart.so.12 2025-05-07T20:10:48.1805359Z U cudaEventCreateWithFlags@libcudart.so.12 2025-05-07T20:10:48.1805719Z U cudaEventDestroy@libcudart.so.12 2025-05-07T20:10:48.1806055Z U cudaEventElapsedTime@libcudart.so.12 2025-05-07T20:10:48.1806404Z U cudaEventQuery@libcudart.so.12 2025-05-07T20:10:48.1806722Z U cudaEventRecord@libcudart.so.12 2025-05-07T20:10:48.1807073Z U cudaEventSynchronize@libcudart.so.12 2025-05-07T20:10:48.1807425Z U cudaGetDeviceProperties_v2@libcudart.so.12 2025-05-07T20:10:48.1807790Z U cudaGetErrorString@libcudart.so.12 2025-05-07T20:10:48.1808160Z U cudaGetLastError@libcudart.so.12 2025-05-07T20:10:48.1808482Z U cudaLaunchKernel@libcudart.so.12 2025-05-07T20:10:48.1808817Z U cudaStreamQuery@libcudart.so.12 2025-05-07T20:10:48.1809145Z U cudaStreamSynchronize@libcudart.so.12 2025-05-07T20:10:48.1809496Z U cudaStreamWaitEvent@libcudart.so.12 2025-05-07T20:10:48.1809823Z U float at::Tensor::item() const 2025-05-07T20:10:48.1810229Z U float* at::TensorBase::mutable_data_ptr() const 2025-05-07T20:10:48.1810638Z U int* at::TensorBase::mutable_data_ptr() const 2025-05-07T20:10:48.1811023Z U long* at::TensorBase::mutable_data_ptr() const 2025-05-07T20:10:48.1811377Z U memcpy@GLIBC_2.14 2025-05-07T20:10:48.1811645Z U memmove@GLIBC_2.2.5 2025-05-07T20:10:48.1811928Z U memset@GLIBC_2.2.5 2025-05-07T20:10:48.1812214Z U operator delete(void*)@GLIBCXX_3.4 2025-05-07T20:10:48.1812548Z U operator new(unsigned long)@GLIBCXX_3.4 2025-05-07T20:10:48.1813103Z U std::__cxx11::basic_ostringstream, std::allocator >::basic_ostringstream() 2025-05-07T20:10:48.1813977Z U std::__cxx11::basic_ostringstream, std::allocator >::~basic_ostringstream()@GLIBCXX_3.4.21 2025-05-07T20:10:48.1814778Z U std::__cxx11::basic_stringstream, std::allocator >::basic_stringstream() 2025-05-07T20:10:48.1815549Z U std::__detail::_Prime_rehash_policy::_M_need_rehash(unsigned long, unsigned long, unsigned long) const@GLIBCXX_3.4.18 2025-05-07T20:10:48.1816337Z U std::__throw_bad_alloc()@GLIBCXX_3.4 2025-05-07T20:10:48.1816684Z U std::__throw_bad_array_new_length() 2025-05-07T20:10:48.1817081Z U std::__throw_length_error(char const*)@GLIBCXX_3.4 2025-05-07T20:10:48.1817480Z U std::__throw_logic_error(char const*)@GLIBCXX_3.4 2025-05-07T20:10:48.1817866Z U std::__throw_system_error(int)@GLIBCXX_3.4.11 2025-05-07T20:10:48.1818527Z U std::basic_ios >::clear(std::_Ios_Iostate)@GLIBCXX_3.4 2025-05-07T20:10:48.1819491Z U std::basic_ostream >& std::__ostream_insert >(std::basic_ostream >&, char const*, long)@GLIBCXX_3.4.9 2025-05-07T20:10:48.1820379Z U std::ios_base::Init::Init()@GLIBCXX_3.4 2025-05-07T20:10:48.1820749Z U std::ios_base::Init::~Init()@GLIBCXX_3.4 2025-05-07T20:10:48.1821117Z U std::ios_base::~ios_base()@GLIBCXX_3.4 2025-05-07T20:10:48.1821458Z U std::locale::~locale()@GLIBCXX_3.4 2025-05-07T20:10:48.1821881Z U std::ostream& std::ostream::_M_insert(long)@GLIBCXX_3.4.9 2025-05-07T20:10:48.1822427Z U std::ostream& std::ostream::_M_insert(unsigned long)@GLIBCXX_3.4.9 2025-05-07T20:10:48.1822928Z U std::ostream::operator<<(int)@GLIBCXX_3.4 2025-05-07T20:10:48.1823290Z U std::terminate()@GLIBCXX_3.4 2025-05-07T20:10:48.1823603Z U strlen@GLIBC_2.2.5 2025-05-07T20:10:48.1823934Z U torch::CppFunction::~CppFunction() 2025-05-07T20:10:48.1824768Z U torch::Library::Library(torch::Library::Kind, std::__cxx11::basic_string, std::allocator >, std::optional, char const*, unsigned int) 2025-05-07T20:10:48.1825955Z U torch::Library::_def(c10::FunctionSchema&&, c10::OperatorName*, std::vector > const&, torch::_RegisterOrVerify) & 2025-05-07T20:10:48.1826835Z U torch::Library::_impl(char const*, torch::CppFunction&&, torch::_RegisterOrVerify) & 2025-05-07T20:10:48.1827583Z U torch::jit::parseSchema(std::__cxx11::basic_string, std::allocator > const&, bool) 2025-05-07T20:10:48.1828229Z U vtable for __cxxabiv1::__class_type_info@CXXABI_1.3 2025-05-07T20:10:48.1828656Z U vtable for __cxxabiv1::__function_type_info@CXXABI_1.3 2025-05-07T20:10:48.1829101Z U vtable for __cxxabiv1::__si_class_type_info@CXXABI_1.3 2025-05-07T20:10:48.1830761Z U vtable for std::__cxx11::basic_stringbuf, std::allocator >@GLIBCXX_3.4.21 2025-05-07T20:10:48.1831541Z U vtable for std::basic_streambuf >@GLIBCXX_3.4 2025-05-07T20:10:48.1832021Z w _ITM_deregisterTMCloneTable 2025-05-07T20:10:48.1832393Z w _ITM_registerTMCloneTable 2025-05-07T20:10:48.1832724Z w __cxa_finalize@GLIBC_2.2.5 2025-05-07T20:10:48.1833038Z w __gmon_start__ 2025-05-07T20:10:48.1833315Z w __pthread_key_create 2025-05-07T20:10:48.1833633Z w pthread_mutex_lock@GLIBC_2.2.5 2025-05-07T20:10:48.1833964Z w pthread_mutex_unlock@GLIBC_2.2.5 2025-05-07T20:10:48.1834349Z [CHECK] Listing out external shared libraries linked: 2025-05-07T20:10:48.1834823Z + ldd ./_skbuild/linux-x86_64-3.12/cmake-build/fbgemm_gpu_tbe_optimizers.so 2025-05-07T20:10:48.1835171Z 2025-05-07T20:10:48.1835310Z linux-vdso.so.1 (0x00007ffec379d000) 2025-05-07T20:10:48.1835627Z libtorch.so => not found 2025-05-07T20:10:48.1835878Z libc10.so => not found 2025-05-07T20:10:48.1836141Z libnvrtc.so.12 => not found 2025-05-07T20:10:48.1836403Z libc10_cuda.so => not found 2025-05-07T20:10:48.1836678Z libnccl.so.2 => not found 2025-05-07T20:10:48.1836931Z libcuda.so.1 => not found 2025-05-07T20:10:48.1837203Z libnvidia-ml.so.1 => not found 2025-05-07T20:10:48.1837518Z libtorch_cpu.so => not found 2025-05-07T20:10:48.1837801Z libtorch_cuda.so => not found 2025-05-07T20:10:48.1838074Z libcudart.so.12 => not found 2025-05-07T20:10:48.1838420Z libstdc++.so.6 => /lib64/libstdc++.so.6 (0x00007f0b5a79c000) 2025-05-07T20:10:48.1838859Z libgomp.so.1 => /lib64/libgomp.so.1 (0x00007f0b5b34f000) 2025-05-07T20:10:48.1839268Z libgcc_s.so.1 => /lib64/libgcc_s.so.1 (0x00007f0b5b321000) 2025-05-07T20:10:48.1839669Z libc.so.6 => /lib64/libc.so.6 (0x00007f0b5a594000) 2025-05-07T20:10:48.1840037Z /lib64/ld-linux-x86-64.so.2 (0x00007f0b5b3ad000) 2025-05-07T20:10:48.1840414Z libm.so.6 => /lib64/libm.so.6 (0x00007f0b5a4b9000) 2025-05-07T20:10:48.1840684Z 2025-05-07T20:10:48.1840801Z [CHECK] Displaying ELF information: 2025-05-07T20:10:48.1841266Z + readelf -d ./_skbuild/linux-x86_64-3.12/cmake-build/fbgemm_gpu_tbe_optimizers.so 2025-05-07T20:10:48.1841628Z 2025-05-07T20:10:48.1853177Z 2025-05-07T20:10:48.1853859Z Dynamic section at offset 0x8a7a10 contains 39 entries: 2025-05-07T20:10:48.1854261Z Tag Type Name/Value 2025-05-07T20:10:48.1854701Z 0x0000000000000001 (NEEDED) Shared library: [libtorch.so] 2025-05-07T20:10:48.1855280Z 0x0000000000000001 (NEEDED) Shared library: [libc10.so] 2025-05-07T20:10:48.1855787Z 0x0000000000000001 (NEEDED) Shared library: [libnvrtc.so.12] 2025-05-07T20:10:48.1856318Z 0x0000000000000001 (NEEDED) Shared library: [libc10_cuda.so] 2025-05-07T20:10:48.1856835Z 0x0000000000000001 (NEEDED) Shared library: [libnccl.so.2] 2025-05-07T20:10:48.1857363Z 0x0000000000000001 (NEEDED) Shared library: [libcuda.so.1] 2025-05-07T20:10:48.1857892Z 0x0000000000000001 (NEEDED) Shared library: [libnvidia-ml.so.1] 2025-05-07T20:10:48.1858432Z 0x0000000000000001 (NEEDED) Shared library: [libtorch_cpu.so] 2025-05-07T20:10:48.1858984Z 0x0000000000000001 (NEEDED) Shared library: [libtorch_cuda.so] 2025-05-07T20:10:48.1859560Z 0x0000000000000001 (NEEDED) Shared library: [libcudart.so.12] 2025-05-07T20:10:48.1860100Z 0x0000000000000001 (NEEDED) Shared library: [libstdc++.so.6] 2025-05-07T20:10:48.1860622Z 0x0000000000000001 (NEEDED) Shared library: [libgomp.so.1] 2025-05-07T20:10:48.1861156Z 0x0000000000000001 (NEEDED) Shared library: [libgcc_s.so.1] 2025-05-07T20:10:48.1861682Z 0x0000000000000001 (NEEDED) Shared library: [libc.so.6] 2025-05-07T20:10:48.1862233Z 0x0000000000000001 (NEEDED) Shared library: [ld-linux-x86-64.so.2] 2025-05-07T20:10:48.1862826Z 0x000000000000000e (SONAME) Library soname: [fbgemm_gpu_tbe_optimizers.so] 2025-05-07T20:10:48.1863302Z 0x000000000000000c (INIT) 0x10000 2025-05-07T20:10:48.1863657Z 0x000000000000000d (FINI) 0x333cc 2025-05-07T20:10:48.1863999Z 0x0000000000000019 (INIT_ARRAY) 0x8a71f8 2025-05-07T20:10:48.1864371Z 0x000000000000001b (INIT_ARRAYSZ) 48 (bytes) 2025-05-07T20:10:48.1864883Z 0x000000000000001a (FINI_ARRAY) 0x8a7228 2025-05-07T20:10:48.1865254Z 0x000000000000001c (FINI_ARRAYSZ) 8 (bytes) 2025-05-07T20:10:48.1865618Z 0x000000006ffffef5 (GNU_HASH) 0x238 2025-05-07T20:10:48.1866039Z 0x0000000000000005 (STRTAB) 0x2a78 2025-05-07T20:10:48.1866387Z 0x0000000000000006 (SYMTAB) 0x9d8 2025-05-07T20:10:48.1866754Z 0x000000000000000a (STRSZ) 38407 (bytes) 2025-05-07T20:10:48.1867140Z 0x000000000000000b (SYMENT) 24 (bytes) 2025-05-07T20:10:48.1867493Z 0x0000000000000003 (PLTGOT) 0x8a7fe8 2025-05-07T20:10:48.1867876Z 0x0000000000000002 (PLTRELSZ) 4728 (bytes) 2025-05-07T20:10:48.1868250Z 0x0000000000000014 (PLTREL) RELA 2025-05-07T20:10:48.1868578Z 0x0000000000000017 (JMPREL) 0xe230 2025-05-07T20:10:48.1868918Z 0x0000000000000007 (RELA) 0xc448 2025-05-07T20:10:48.1869265Z 0x0000000000000008 (RELASZ) 7656 (bytes) 2025-05-07T20:10:48.1869702Z 0x0000000000000009 (RELAENT) 24 (bytes) 2025-05-07T20:10:48.1870045Z 0x000000006ffffffe (VERNEED) 0xc338 2025-05-07T20:10:48.1870394Z 0x000000006fffffff (VERNEEDNUM) 5 2025-05-07T20:10:48.1870719Z 0x000000006ffffff0 (VERSYM) 0xc080 2025-05-07T20:10:48.1871064Z 0x000000006ffffff9 (RELACOUNT) 136 2025-05-07T20:10:48.1871487Z 0x0000000000000000 (NULL) 0x0 2025-05-07T20:10:48.1871692Z 2025-05-07T20:10:48.1871808Z ################################################################################ 2025-05-07T20:10:48.1872086Z 2025-05-07T20:10:48.1872103Z 2025-05-07T20:10:48.1872216Z ################################################################################ 2025-05-07T20:10:48.1872710Z [CHECK] BUILT LIBRARY: ./_skbuild/linux-x86_64-3.12/cmake-build/fbgemm_gpu_tbe_utils.so 2025-05-07T20:10:48.1873219Z [CHECK] Listing out library size: 2025-05-07T20:10:48.1873695Z + du -h --block-size=1M ./_skbuild/linux-x86_64-3.12/cmake-build/fbgemm_gpu_tbe_utils.so 2025-05-07T20:10:48.1874073Z 2025-05-07T20:10:48.1874287Z 17 ./_skbuild/linux-x86_64-3.12/cmake-build/fbgemm_gpu_tbe_utils.so 2025-05-07T20:10:48.1874609Z 2025-05-07T20:10:48.1875001Z [CHECK] Listing out the GLIBC versions referenced by: ./_skbuild/linux-x86_64-3.12/cmake-build/fbgemm_gpu_tbe_utils.so 2025-05-07T20:10:48.1875993Z + objdump -TC ./_skbuild/linux-x86_64-3.12/cmake-build/fbgemm_gpu_tbe_utils.so | grep GLIBC_ | sed 's/.*GLIBC_\([.0-9]*\).*/GLIBC_\1/g' | sort -Vu | cat 2025-05-07T20:10:48.1876602Z 2025-05-07T20:10:48.1924571Z GLIBC_2.2.5 2025-05-07T20:10:48.1925238Z GLIBC_2.14 2025-05-07T20:10:48.1925616Z 2025-05-07T20:10:48.1925630Z 2025-05-07T20:10:48.1926843Z [CHECK] Listing out the GLIBCXX versions referenced by: ./_skbuild/linux-x86_64-3.12/cmake-build/fbgemm_gpu_tbe_utils.so 2025-05-07T20:10:48.1929876Z + objdump -TC ./_skbuild/linux-x86_64-3.12/cmake-build/fbgemm_gpu_tbe_utils.so | grep GLIBCXX_ | sed 's/.*GLIBCXX_\([.0-9]*\).*/GLIBCXX_\1/g' | sort -Vu | cat 2025-05-07T20:10:48.1931819Z 2025-05-07T20:10:48.1987237Z GLIBCXX_3.4 2025-05-07T20:10:48.1987962Z GLIBCXX_3.4.9 2025-05-07T20:10:48.1988226Z GLIBCXX_3.4.20 2025-05-07T20:10:48.1988479Z GLIBCXX_3.4.21 2025-05-07T20:10:48.1988611Z 2025-05-07T20:10:48.1988616Z 2025-05-07T20:10:48.2003602Z + nm -gDC ./_skbuild/linux-x86_64-3.12/cmake-build/fbgemm_gpu_tbe_utils.so > /tmp/tmp.DrK8s2hdcQ.symbols.txt 2025-05-07T20:10:48.2004992Z 2025-05-07T20:10:48.2032223Z 2025-05-07T20:10:48.2057033Z [CHECK] Total Number of symbols: 452 2025-05-07T20:10:48.2068992Z [CHECK] Number of fbgemm symbols: 13 2025-05-07T20:10:48.2083314Z + nm -gDCu ./_skbuild/linux-x86_64-3.12/cmake-build/fbgemm_gpu_tbe_utils.so > /tmp/tmp.BKaCp1JwLG.usymbols.txt 2025-05-07T20:10:48.2084736Z 2025-05-07T20:10:48.2102493Z 2025-05-07T20:10:48.2128232Z [CHECK] Listing out undefined symbols (149 total): 2025-05-07T20:10:48.2143434Z U VTT for std::__cxx11::basic_ostringstream, std::allocator >@GLIBCXX_3.4.21 2025-05-07T20:10:48.2144080Z U _Unwind_Resume@GCC_3.0 2025-05-07T20:10:48.2144452Z U __cudaPopCallConfiguration@libcudart.so.12 2025-05-07T20:10:48.2144854Z U __cudaPushCallConfiguration@libcudart.so.12 2025-05-07T20:10:48.2145293Z U __cudaRegisterFatBinary@libcudart.so.12 2025-05-07T20:10:48.2145686Z U __cudaRegisterFatBinaryEnd@libcudart.so.12 2025-05-07T20:10:48.2146062Z U __cudaRegisterFunction@libcudart.so.12 2025-05-07T20:10:48.2146439Z U __cudaRegisterVar@libcudart.so.12 2025-05-07T20:10:48.2146823Z U __cudaUnregisterFatBinary@libcudart.so.12 2025-05-07T20:10:48.2147293Z U __cxa_allocate_exception@CXXABI_1.3 2025-05-07T20:10:48.2147625Z U __cxa_atexit@GLIBC_2.2.5 2025-05-07T20:10:48.2147928Z U __cxa_begin_catch@CXXABI_1.3 2025-05-07T20:10:48.2148385Z U __cxa_end_catch@CXXABI_1.3 2025-05-07T20:10:48.2148693Z U __cxa_guard_abort@CXXABI_1.3 2025-05-07T20:10:48.2149024Z U __cxa_guard_acquire@CXXABI_1.3 2025-05-07T20:10:48.2149340Z U __cxa_guard_release@CXXABI_1.3 2025-05-07T20:10:48.2149760Z U __cxa_rethrow@CXXABI_1.3 2025-05-07T20:10:48.2150055Z U __cxa_throw@CXXABI_1.3 2025-05-07T20:10:48.2150343Z U __gxx_personality_v0@CXXABI_1.3 2025-05-07T20:10:48.2150796Z U at::_ops::add_Tensor::call(at::Tensor const&, at::Tensor const&, c10::Scalar const&) 2025-05-07T20:10:48.2151427Z U at::_ops::copy_::call(at::Tensor&, at::Tensor const&, bool) 2025-05-07T20:10:48.2152464Z U at::_ops::empty_like::call(at::Tensor const&, std::optional, std::optional, std::optional, std::optional, std::optional) 2025-05-07T20:10:48.2153837Z U at::_ops::empty_memory_format::call(c10::ArrayRef, std::optional, std::optional, std::optional, std::optional, std::optional) 2025-05-07T20:10:48.2154826Z U at::_ops::fill__Scalar::call(at::Tensor&, c10::Scalar const&) 2025-05-07T20:10:48.2155284Z U at::_ops::mul_Scalar::call(at::Tensor const&, c10::Scalar const&) 2025-05-07T20:10:48.2155771Z U at::_ops::select_int::call(at::Tensor const&, long, c10::SymInt) 2025-05-07T20:10:48.2156288Z U at::_ops::sub__Tensor::call(at::Tensor&, at::Tensor const&, c10::Scalar const&) 2025-05-07T20:10:48.2156757Z U at::_ops::zero_::call(at::Tensor&) 2025-05-07T20:10:48.2157480Z U at::_ops::zeros::call(c10::ArrayRef, std::optional, std::optional, std::optional, std::optional) 2025-05-07T20:10:48.2158744Z U at::_ops::zeros_like::call(at::Tensor const&, std::optional, std::optional, std::optional, std::optional, std::optional) 2025-05-07T20:10:48.2159502Z U c10::BoolType::get() 2025-05-07T20:10:48.2159855Z U c10::DeviceTypeName[abi:cxx11](c10::DeviceType, bool) 2025-05-07T20:10:48.2160249Z U c10::IValue::reportToTensorTypeError() const 2025-05-07T20:10:48.2160594Z U c10::IntType::get() 2025-05-07T20:10:48.2160968Z U c10::MessageLogger::MessageLogger(char const*, int, int) 2025-05-07T20:10:48.2161367Z U c10::MessageLogger::~MessageLogger() 2025-05-07T20:10:48.2161789Z U c10::OptionalType::get(c10::Type::SingletonOrSharedTypePtr) 2025-05-07T20:10:48.2162270Z U c10::StorageImpl::throw_data_ptr_access_error() const 2025-05-07T20:10:48.2162671Z U c10::SymBool::guard_bool(char const*, long) const 2025-05-07T20:10:48.2163294Z U c10::SymInt::SymInt(c10::intrusive_ptr >) 2025-05-07T20:10:48.2163916Z U c10::SymInt::guard_int(char const*, long) const 2025-05-07T20:10:48.2164258Z U c10::SymInt::promote_to_negative() 2025-05-07T20:10:48.2164585Z U c10::SymInt::toSymNode() const 2025-05-07T20:10:48.2165222Z U c10::SymIntType::get() 2025-05-07T20:10:48.2165620Z U c10::SymbolicShapeMeta::init_is_contiguous() const 2025-05-07T20:10:48.2166067Z U c10::TensorImpl::throw_data_ptr_access_error() const 2025-05-07T20:10:48.2166447Z U c10::TensorType::get() 2025-05-07T20:10:48.2166798Z U c10::UndefinedTensorImpl::_singleton 2025-05-07T20:10:48.2167775Z U c10::Warning::Warning(std::variant, c10::SourceLocation const&, std::__cxx11::basic_string, std::allocator >, bool) 2025-05-07T20:10:48.2168801Z U c10::cuda::CUDACachingAllocator::allocator 2025-05-07T20:10:48.2169185Z U c10::cuda::CUDAStream::stream() const 2025-05-07T20:10:48.2169542Z U c10::cuda::ExchangeDevice(signed char) 2025-05-07T20:10:48.2169910Z U c10::cuda::GetDevice(signed char*) 2025-05-07T20:10:48.2170276Z U c10::cuda::MaybeSetDevice(signed char) 2025-05-07T20:10:48.2170629Z U c10::cuda::SetDevice(signed char) 2025-05-07T20:10:48.2171128Z U c10::cuda::c10_cuda_check_implementation(int, char const*, char const*, int, bool) 2025-05-07T20:10:48.2171731Z U c10::cuda::current_device() 2025-05-07T20:10:48.2172046Z U c10::cuda::device_count() 2025-05-07T20:10:48.2172375Z U c10::cuda::getCurrentCUDAStream(signed char) 2025-05-07T20:10:48.2172752Z U c10::cuda::getDefaultCUDAStream(signed char) 2025-05-07T20:10:48.2173139Z U c10::cuda::getStreamFromPool(bool, signed char) 2025-05-07T20:10:48.2173512Z U c10::cuda::getStreamFromPool(int, signed char) 2025-05-07T20:10:48.2173912Z U c10::cuda::setCurrentCUDAStream(c10::cuda::CUDAStream) 2025-05-07T20:10:48.2174276Z U c10::cuda::warn_or_error_on_sync() 2025-05-07T20:10:48.2174990Z U c10::detail::infer_schema::make_function_schema(c10::ArrayRef, c10::ArrayRef) 2025-05-07T20:10:48.2175838Z U c10::detail::torchCheckFail(char const*, char const*, unsigned int, char const*) 2025-05-07T20:10:48.2176652Z U c10::detail::torchCheckFail(char const*, char const*, unsigned int, std::__cxx11::basic_string, std::allocator > const&) 2025-05-07T20:10:48.2177564Z U c10::detail::torchInternalAssertFail(char const*, char const*, unsigned int, char const*, char const*) 2025-05-07T20:10:48.2178580Z U c10::detail::torchInternalAssertFail(char const*, char const*, unsigned int, char const*, std::__cxx11::basic_string, std::allocator > const&) 2025-05-07T20:10:48.2179513Z U c10::impl::GPUTrace::gpuTraceState 2025-05-07T20:10:48.2179889Z U c10::impl::GPUTrace::haveState 2025-05-07T20:10:48.2180248Z U c10::impl::cow::is_cow_data_ptr(c10::DataPtr const&) 2025-05-07T20:10:48.2180713Z U c10::impl::cow::materialize_cow_storage(c10::StorageImpl&) 2025-05-07T20:10:48.2181124Z U c10::impl::device_guard_impl_registry 2025-05-07T20:10:48.2181489Z U c10::operator<<(std::ostream&, c10::Device const&) 2025-05-07T20:10:48.2181885Z U c10::operator<<(std::ostream&, c10::DeviceType) 2025-05-07T20:10:48.2182241Z U c10::throwNullDataPtrError() 2025-05-07T20:10:48.2182799Z U c10::warn(c10::Warning const&) 2025-05-07T20:10:48.2183144Z U c10::warnDeprecatedDataPtr() 2025-05-07T20:10:48.2183559Z U caffe2::TypeMeta::error_unsupported_typemeta(caffe2::TypeMeta) 2025-05-07T20:10:48.2184007Z U caffe2::TypeMeta::typeMetaDatas() 2025-05-07T20:10:48.2184366Z U cudaDeviceGetAttribute@libcudart.so.12 2025-05-07T20:10:48.2184754Z U cudaDeviceSynchronize@libcudart.so.12 2025-05-07T20:10:48.2185125Z U cudaEventCreateWithFlags@libcudart.so.12 2025-05-07T20:10:48.2185511Z U cudaEventDestroy@libcudart.so.12 2025-05-07T20:10:48.2185881Z U cudaEventElapsedTime@libcudart.so.12 2025-05-07T20:10:48.2186231Z U cudaEventQuery@libcudart.so.12 2025-05-07T20:10:48.2186584Z U cudaEventRecord@libcudart.so.12 2025-05-07T20:10:48.2186937Z U cudaEventSynchronize@libcudart.so.12 2025-05-07T20:10:48.2187348Z U cudaFuncGetAttributes@libcudart.so.12 2025-05-07T20:10:48.2187699Z U cudaGetDevice@libcudart.so.12 2025-05-07T20:10:48.2188060Z U cudaGetDeviceCount@libcudart.so.12 2025-05-07T20:10:48.2188426Z U cudaGetDeviceProperties_v2@libcudart.so.12 2025-05-07T20:10:48.2188811Z U cudaGetErrorString@libcudart.so.12 2025-05-07T20:10:48.2189176Z U cudaGetLastError@libcudart.so.12 2025-05-07T20:10:48.2189727Z U cudaLaunchKernel@libcudart.so.12 2025-05-07T20:10:48.2190065Z U cudaMemcpyAsync@libcudart.so.12 2025-05-07T20:10:48.2190405Z U cudaMemsetAsync@libcudart.so.12 2025-05-07T20:10:48.2190896Z U cudaOccupancyMaxActiveBlocksPerMultiprocessorWithFlags@libcudart.so.12 2025-05-07T20:10:48.2191449Z U cudaPeekAtLastError@libcudart.so.12 2025-05-07T20:10:48.2191967Z U cudaSetDevice@libcudart.so.12 2025-05-07T20:10:48.2192344Z U cudaStreamQuery@libcudart.so.12 2025-05-07T20:10:48.2192700Z U cudaStreamSynchronize@libcudart.so.12 2025-05-07T20:10:48.2193076Z U cudaStreamWaitEvent@libcudart.so.12 2025-05-07T20:10:48.2193438Z U int* at::TensorBase::data_ptr() const 2025-05-07T20:10:48.2193838Z U int* at::TensorBase::mutable_data_ptr() const 2025-05-07T20:10:48.2194190Z U log2@GLIBC_2.2.5 2025-05-07T20:10:48.2194574Z U long c10::detail::maybe_wrap_dim_slow(long, long, bool) 2025-05-07T20:10:48.2195019Z U long* at::TensorBase::data_ptr() const 2025-05-07T20:10:48.2195418Z U long* at::TensorBase::mutable_data_ptr() const 2025-05-07T20:10:48.2195792Z U memcpy@GLIBC_2.14 2025-05-07T20:10:48.2196082Z U memmove@GLIBC_2.2.5 2025-05-07T20:10:48.2196387Z U memset@GLIBC_2.2.5 2025-05-07T20:10:48.2196694Z U operator delete(void*)@GLIBCXX_3.4 2025-05-07T20:10:48.2197088Z U operator new(unsigned long)@GLIBCXX_3.4 2025-05-07T20:10:48.2197796Z U std::__cxx11::basic_ostringstream, std::allocator >::basic_ostringstream() 2025-05-07T20:10:48.2198590Z U std::__cxx11::basic_ostringstream, std::allocator >::~basic_ostringstream()@GLIBCXX_3.4.21 2025-05-07T20:10:48.2199195Z U std::__throw_bad_alloc()@GLIBCXX_3.4 2025-05-07T20:10:48.2199566Z U std::__throw_length_error(char const*)@GLIBCXX_3.4 2025-05-07T20:10:48.2199950Z U std::__throw_logic_error(char const*)@GLIBCXX_3.4 2025-05-07T20:10:48.2200358Z U std::__throw_out_of_range_fmt(char const*, ...)@GLIBCXX_3.4.20 2025-05-07T20:10:48.2200850Z U std::basic_ios >::clear(std::_Ios_Iostate)@GLIBCXX_3.4 2025-05-07T20:10:48.2201737Z U std::basic_ostream >& std::__ostream_insert >(std::basic_ostream >&, char const*, long)@GLIBCXX_3.4.9 2025-05-07T20:10:48.2202496Z U std::exception::~exception()@GLIBCXX_3.4 2025-05-07T20:10:48.2202845Z U std::ios_base::Init::Init()@GLIBCXX_3.4 2025-05-07T20:10:48.2203187Z U std::ios_base::Init::~Init()@GLIBCXX_3.4 2025-05-07T20:10:48.2203513Z U std::ios_base::~ios_base()@GLIBCXX_3.4 2025-05-07T20:10:48.2203845Z U std::locale::~locale()@GLIBCXX_3.4 2025-05-07T20:10:48.2204223Z U std::ostream& std::ostream::_M_insert(long)@GLIBCXX_3.4.9 2025-05-07T20:10:48.2204743Z U std::ostream& std::ostream::_M_insert(unsigned long)@GLIBCXX_3.4.9 2025-05-07T20:10:48.2205204Z U std::ostream::operator<<(int)@GLIBCXX_3.4 2025-05-07T20:10:48.2206310Z U std::terminate()@GLIBCXX_3.4 2025-05-07T20:10:48.2206630Z U strlen@GLIBC_2.2.5 2025-05-07T20:10:48.2206930Z U torch::CppFunction::~CppFunction() 2025-05-07T20:10:48.2207721Z U torch::Library::Library(torch::Library::Kind, std::__cxx11::basic_string, std::allocator >, std::optional, char const*, unsigned int) 2025-05-07T20:10:48.2208825Z U torch::Library::_def(c10::FunctionSchema&&, c10::OperatorName*, std::vector > const&, torch::_RegisterOrVerify) & 2025-05-07T20:10:48.2209636Z U torch::Library::_impl(char const*, torch::CppFunction&&, torch::_RegisterOrVerify) & 2025-05-07T20:10:48.2210343Z U torch::jit::parseSchema(std::__cxx11::basic_string, std::allocator > const&, bool) 2025-05-07T20:10:48.2210925Z U typeinfo for std::exception@GLIBCXX_3.4 2025-05-07T20:10:48.2211297Z U vtable for __cxxabiv1::__class_type_info@CXXABI_1.3 2025-05-07T20:10:48.2211709Z U vtable for __cxxabiv1::__function_type_info@CXXABI_1.3 2025-05-07T20:10:48.2212291Z U vtable for __cxxabiv1::__si_class_type_info@CXXABI_1.3 2025-05-07T20:10:48.2212900Z U vtable for std::__cxx11::basic_stringbuf, std::allocator >@GLIBCXX_3.4.21 2025-05-07T20:10:48.2213563Z U vtable for std::basic_streambuf >@GLIBCXX_3.4 2025-05-07T20:10:48.2214226Z w _ITM_deregisterTMCloneTable 2025-05-07T20:10:48.2214578Z w _ITM_registerTMCloneTable 2025-05-07T20:10:48.2214891Z w __cxa_finalize@GLIBC_2.2.5 2025-05-07T20:10:48.2215254Z w __gmon_start__ 2025-05-07T20:10:48.2215524Z w __pthread_key_create 2025-05-07T20:10:48.2215884Z [CHECK] Listing out external shared libraries linked: 2025-05-07T20:10:48.2216368Z + ldd ./_skbuild/linux-x86_64-3.12/cmake-build/fbgemm_gpu_tbe_utils.so 2025-05-07T20:10:48.2216696Z 2025-05-07T20:10:48.2216834Z linux-vdso.so.1 (0x00007fffce9f3000) 2025-05-07T20:10:48.2217145Z libtorch.so => not found 2025-05-07T20:10:48.2217392Z libc10.so => not found 2025-05-07T20:10:48.2217654Z libnvrtc.so.12 => not found 2025-05-07T20:10:48.2217919Z libc10_cuda.so => not found 2025-05-07T20:10:48.2218177Z libnccl.so.2 => not found 2025-05-07T20:10:48.2218441Z libcuda.so.1 => not found 2025-05-07T20:10:48.2218697Z libnvidia-ml.so.1 => not found 2025-05-07T20:10:48.2219013Z libtorch_cpu.so => not found 2025-05-07T20:10:48.2219288Z libtorch_cuda.so => not found 2025-05-07T20:10:48.2219571Z libcudart.so.12 => not found 2025-05-07T20:10:48.2219900Z libstdc++.so.6 => /lib64/libstdc++.so.6 (0x00007f2ced39c000) 2025-05-07T20:10:48.2220302Z libm.so.6 => /lib64/libm.so.6 (0x00007f2cee7e5000) 2025-05-07T20:10:48.2220690Z libgomp.so.1 => /lib64/libgomp.so.1 (0x00007f2cee78f000) 2025-05-07T20:10:48.2221199Z libgcc_s.so.1 => /lib64/libgcc_s.so.1 (0x00007f2cee761000) 2025-05-07T20:10:48.2221583Z libc.so.6 => /lib64/libc.so.6 (0x00007f2ced194000) 2025-05-07T20:10:48.2221929Z /lib64/ld-linux-x86-64.so.2 (0x00007f2cee8c8000) 2025-05-07T20:10:48.2222171Z 2025-05-07T20:10:48.2222274Z [CHECK] Displaying ELF information: 2025-05-07T20:10:48.2222685Z + readelf -d ./_skbuild/linux-x86_64-3.12/cmake-build/fbgemm_gpu_tbe_utils.so 2025-05-07T20:10:48.2223027Z 2025-05-07T20:10:48.2223031Z 2025-05-07T20:10:48.2223185Z Dynamic section at offset 0x104fa28 contains 39 entries: 2025-05-07T20:10:48.2223557Z Tag Type Name/Value 2025-05-07T20:10:48.2223960Z 0x0000000000000001 (NEEDED) Shared library: [libtorch.so] 2025-05-07T20:10:48.2224453Z 0x0000000000000001 (NEEDED) Shared library: [libc10.so] 2025-05-07T20:10:48.2224943Z 0x0000000000000001 (NEEDED) Shared library: [libnvrtc.so.12] 2025-05-07T20:10:48.2225470Z 0x0000000000000001 (NEEDED) Shared library: [libc10_cuda.so] 2025-05-07T20:10:48.2225963Z 0x0000000000000001 (NEEDED) Shared library: [libnccl.so.2] 2025-05-07T20:10:48.2226464Z 0x0000000000000001 (NEEDED) Shared library: [libcuda.so.1] 2025-05-07T20:10:48.2226978Z 0x0000000000000001 (NEEDED) Shared library: [libnvidia-ml.so.1] 2025-05-07T20:10:48.2227487Z 0x0000000000000001 (NEEDED) Shared library: [libtorch_cpu.so] 2025-05-07T20:10:48.2227999Z 0x0000000000000001 (NEEDED) Shared library: [libtorch_cuda.so] 2025-05-07T20:10:48.2228508Z 0x0000000000000001 (NEEDED) Shared library: [libcudart.so.12] 2025-05-07T20:10:48.2229051Z 0x0000000000000001 (NEEDED) Shared library: [libstdc++.so.6] 2025-05-07T20:10:48.2229709Z 0x0000000000000001 (NEEDED) Shared library: [libm.so.6] 2025-05-07T20:10:48.2230223Z 0x0000000000000001 (NEEDED) Shared library: [libgomp.so.1] 2025-05-07T20:10:48.2230735Z 0x0000000000000001 (NEEDED) Shared library: [libgcc_s.so.1] 2025-05-07T20:10:48.2231297Z 0x0000000000000001 (NEEDED) Shared library: [libc.so.6] 2025-05-07T20:10:48.2231831Z 0x000000000000000e (SONAME) Library soname: [fbgemm_gpu_tbe_utils.so] 2025-05-07T20:10:48.2232347Z 0x000000000000000c (INIT) 0x11000 2025-05-07T20:10:48.2232687Z 0x000000000000000d (FINI) 0x8746c 2025-05-07T20:10:48.2233024Z 0x0000000000000019 (INIT_ARRAY) 0x104ff20 2025-05-07T20:10:48.2233390Z 0x000000000000001b (INIT_ARRAYSZ) 96 (bytes) 2025-05-07T20:10:48.2233754Z 0x000000000000001a (FINI_ARRAY) 0x104ff80 2025-05-07T20:10:48.2234105Z 0x000000000000001c (FINI_ARRAYSZ) 8 (bytes) 2025-05-07T20:10:48.2234454Z 0x000000006ffffef5 (GNU_HASH) 0x200 2025-05-07T20:10:48.2234777Z 0x0000000000000005 (STRTAB) 0x3660 2025-05-07T20:10:48.2235105Z 0x0000000000000006 (SYMTAB) 0xbe8 2025-05-07T20:10:48.2235445Z 0x000000000000000a (STRSZ) 35790 (bytes) 2025-05-07T20:10:48.2235839Z 0x000000000000000b (SYMENT) 24 (bytes) 2025-05-07T20:10:48.2236183Z 0x0000000000000003 (PLTGOT) 0x1050fe8 2025-05-07T20:10:48.2236548Z 0x0000000000000002 (PLTRELSZ) 6480 (bytes) 2025-05-07T20:10:48.2236897Z 0x0000000000000014 (PLTREL) RELA 2025-05-07T20:10:48.2237216Z 0x0000000000000017 (JMPREL) 0xf060 2025-05-07T20:10:48.2237545Z 0x0000000000000007 (RELA) 0xc6a8 2025-05-07T20:10:48.2237914Z 0x0000000000000008 (RELASZ) 10680 (bytes) 2025-05-07T20:10:48.2238282Z 0x0000000000000009 (RELAENT) 24 (bytes) 2025-05-07T20:10:48.2238620Z 0x000000006ffffffe (VERNEED) 0xc5b8 2025-05-07T20:10:48.2238956Z 0x000000006fffffff (VERNEEDNUM) 5 2025-05-07T20:10:48.2239287Z 0x000000006ffffff0 (VERSYM) 0xc22e 2025-05-07T20:10:48.2239609Z 0x000000006ffffff9 (RELACOUNT) 116 2025-05-07T20:10:48.2239926Z 0x0000000000000000 (NULL) 0x0 2025-05-07T20:10:48.2240129Z 2025-05-07T20:10:48.2240244Z ################################################################################ 2025-05-07T20:10:48.2240483Z 2025-05-07T20:10:48.2240487Z 2025-05-07T20:10:48.2240599Z ################################################################################ 2025-05-07T20:10:48.2241125Z [CHECK] BUILT LIBRARY: ./_skbuild/linux-x86_64-3.12/cmake-build/fbgemm_gpu_sparse_async_cumsum.so 2025-05-07T20:10:48.2241651Z [CHECK] Listing out library size: 2025-05-07T20:10:48.2242141Z + du -h --block-size=1M ./_skbuild/linux-x86_64-3.12/cmake-build/fbgemm_gpu_sparse_async_cumsum.so 2025-05-07T20:10:48.2242544Z 2025-05-07T20:10:48.2242788Z 2 ./_skbuild/linux-x86_64-3.12/cmake-build/fbgemm_gpu_sparse_async_cumsum.so 2025-05-07T20:10:48.2243132Z 2025-05-07T20:10:48.2243554Z [CHECK] Listing out the GLIBC versions referenced by: ./_skbuild/linux-x86_64-3.12/cmake-build/fbgemm_gpu_sparse_async_cumsum.so 2025-05-07T20:10:48.2244699Z + objdump -TC ./_skbuild/linux-x86_64-3.12/cmake-build/fbgemm_gpu_sparse_async_cumsum.so | grep GLIBC_ | sed 's/.*GLIBC_\([.0-9]*\).*/GLIBC_\1/g' | sort -Vu | cat 2025-05-07T20:10:48.2245274Z 2025-05-07T20:10:48.2277855Z GLIBC_2.2.5 2025-05-07T20:10:48.2278505Z GLIBC_2.14 2025-05-07T20:10:48.2279168Z 2025-05-07T20:10:48.2279232Z 2025-05-07T20:10:48.2280656Z [CHECK] Listing out the GLIBCXX versions referenced by: ./_skbuild/linux-x86_64-3.12/cmake-build/fbgemm_gpu_sparse_async_cumsum.so 2025-05-07T20:10:48.2283990Z + objdump -TC ./_skbuild/linux-x86_64-3.12/cmake-build/fbgemm_gpu_sparse_async_cumsum.so | grep GLIBCXX_ | sed 's/.*GLIBCXX_\([.0-9]*\).*/GLIBCXX_\1/g' | sort -Vu | cat 2025-05-07T20:10:48.2284777Z 2025-05-07T20:10:48.2336021Z GLIBCXX_3.4 2025-05-07T20:10:48.2336650Z GLIBCXX_3.4.9 2025-05-07T20:10:48.2337176Z GLIBCXX_3.4.21 2025-05-07T20:10:48.2337301Z 2025-05-07T20:10:48.2337305Z 2025-05-07T20:10:48.2353801Z + nm -gDC ./_skbuild/linux-x86_64-3.12/cmake-build/fbgemm_gpu_sparse_async_cumsum.so > /tmp/tmp.dkfM7a13Li.symbols.txt 2025-05-07T20:10:48.2354338Z 2025-05-07T20:10:48.2373189Z 2025-05-07T20:10:48.2398420Z [CHECK] Total Number of symbols: 277 2025-05-07T20:10:48.2413467Z [CHECK] Number of fbgemm symbols: 44 2025-05-07T20:10:48.2431137Z + nm -gDCu ./_skbuild/linux-x86_64-3.12/cmake-build/fbgemm_gpu_sparse_async_cumsum.so > /tmp/tmp.IVfu3brBQu.usymbols.txt 2025-05-07T20:10:48.2432943Z 2025-05-07T20:10:48.2448724Z 2025-05-07T20:10:48.2475196Z [CHECK] Listing out undefined symbols (127 total): 2025-05-07T20:10:48.2497417Z U VTT for std::__cxx11::basic_ostringstream, std::allocator >@GLIBCXX_3.4.21 2025-05-07T20:10:48.2498046Z U _Unwind_Resume@GCC_3.0 2025-05-07T20:10:48.2498422Z U __cudaPopCallConfiguration@libcudart.so.12 2025-05-07T20:10:48.2498846Z U __cudaPushCallConfiguration@libcudart.so.12 2025-05-07T20:10:48.2499228Z U __cudaRegisterFatBinary@libcudart.so.12 2025-05-07T20:10:48.2499616Z U __cudaRegisterFatBinaryEnd@libcudart.so.12 2025-05-07T20:10:48.2500123Z U __cudaRegisterFunction@libcudart.so.12 2025-05-07T20:10:48.2500504Z U __cudaRegisterVar@libcudart.so.12 2025-05-07T20:10:48.2500870Z U __cudaUnregisterFatBinary@libcudart.so.12 2025-05-07T20:10:48.2501264Z U __cxa_atexit@GLIBC_2.2.5 2025-05-07T20:10:48.2501603Z U __cxa_begin_catch@CXXABI_1.3 2025-05-07T20:10:48.2501939Z U __cxa_guard_abort@CXXABI_1.3 2025-05-07T20:10:48.2502350Z U __cxa_guard_acquire@CXXABI_1.3 2025-05-07T20:10:48.2502720Z U __cxa_guard_release@CXXABI_1.3 2025-05-07T20:10:48.2503062Z U __gxx_personality_v0@CXXABI_1.3 2025-05-07T20:10:48.2503511Z U at::_ops::contiguous::call(at::Tensor const&, c10::MemoryFormat) 2025-05-07T20:10:48.2504428Z U at::_ops::empty_like::call(at::Tensor const&, std::optional, std::optional, std::optional, std::optional, std::optional) 2025-05-07T20:10:48.2505823Z U at::_ops::empty_memory_format::call(c10::ArrayRef, std::optional, std::optional, std::optional, std::optional, std::optional) 2025-05-07T20:10:48.2507006Z U at::_ops::select_int::call(at::Tensor const&, long, c10::SymInt) 2025-05-07T20:10:48.2507404Z U at::_ops::zero_::call(at::Tensor&) 2025-05-07T20:10:48.2508103Z U at::_ops::zeros::call(c10::ArrayRef, std::optional, std::optional, std::optional, std::optional) 2025-05-07T20:10:48.2508775Z U at::get_thread_num() 2025-05-07T20:10:48.2509083Z U at::internal::set_thread_num(int) 2025-05-07T20:10:48.2509838Z U at::native::empty_like(at::Tensor const&, std::optional, std::optional, std::optional, std::optional, std::optional) 2025-05-07T20:10:48.2510790Z U at::native::resize_(at::Tensor const&, c10::ArrayRef, std::optional) 2025-05-07T20:10:48.2511413Z U c10::BFloat16* at::TensorBase::data_ptr() const 2025-05-07T20:10:48.2512032Z U c10::DeviceTypeName[abi:cxx11](c10::DeviceType, bool) 2025-05-07T20:10:48.2512532Z U c10::Half* at::TensorBase::data_ptr() const 2025-05-07T20:10:48.2512956Z U c10::IValue::reportToTensorTypeError() const 2025-05-07T20:10:48.2513414Z U c10::ParallelGuard::ParallelGuard(bool) 2025-05-07T20:10:48.2513793Z U c10::ParallelGuard::~ParallelGuard() 2025-05-07T20:10:48.2514193Z U c10::StorageImpl::throw_data_ptr_access_error() const 2025-05-07T20:10:48.2514606Z U c10::SymBool::guard_bool(char const*, long) const 2025-05-07T20:10:48.2514995Z U c10::SymInt::promote_to_negative() 2025-05-07T20:10:48.2515376Z U c10::SymbolicShapeMeta::init_is_contiguous() const 2025-05-07T20:10:48.2515818Z U c10::TensorImpl::throw_data_ptr_access_error() const 2025-05-07T20:10:48.2516216Z U c10::TensorType::get() 2025-05-07T20:10:48.2516346Z U c10::UndefinedTensorImpl::_singleton 2025-05-07T20:10:48.2517080Z U c10::Warning::Warning(std::variant, c10::SourceLocation const&, std::__cxx11::basic_string, std::allocator >, bool) 2025-05-07T20:10:48.2517246Z U c10::cuda::CUDACachingAllocator::allocator 2025-05-07T20:10:48.2517374Z U c10::cuda::CUDAStream::stream() const 2025-05-07T20:10:48.2517505Z U c10::cuda::ExchangeDevice(signed char) 2025-05-07T20:10:48.2517651Z U c10::cuda::GetDevice(signed char*) 2025-05-07T20:10:48.2517807Z U c10::cuda::MaybeSetDevice(signed char) 2025-05-07T20:10:48.2517931Z U c10::cuda::SetDevice(signed char) 2025-05-07T20:10:48.2518214Z U c10::cuda::c10_cuda_check_implementation(int, char const*, char const*, int, bool) 2025-05-07T20:10:48.2518330Z U c10::cuda::device_count() 2025-05-07T20:10:48.2518473Z U c10::cuda::getCurrentCUDAStream(signed char) 2025-05-07T20:10:48.2518645Z U c10::cuda::getDefaultCUDAStream(signed char) 2025-05-07T20:10:48.2518826Z U c10::cuda::getStreamFromPool(bool, signed char) 2025-05-07T20:10:48.2518978Z U c10::cuda::getStreamFromPool(int, signed char) 2025-05-07T20:10:48.2519145Z U c10::cuda::setCurrentCUDAStream(c10::cuda::CUDAStream) 2025-05-07T20:10:48.2519286Z U c10::cuda::warn_or_error_on_sync() 2025-05-07T20:10:48.2519822Z U c10::detail::infer_schema::make_function_schema(c10::ArrayRef, c10::ArrayRef) 2025-05-07T20:10:48.2520092Z U c10::detail::torchCheckFail(char const*, char const*, unsigned int, char const*) 2025-05-07T20:10:48.2520627Z U c10::detail::torchCheckFail(char const*, char const*, unsigned int, std::__cxx11::basic_string, std::allocator > const&) 2025-05-07T20:10:48.2520981Z U c10::detail::torchInternalAssertFail(char const*, char const*, unsigned int, char const*, char const*) 2025-05-07T20:10:48.2521141Z U c10::impl::GPUTrace::gpuTraceState 2025-05-07T20:10:48.2521262Z U c10::impl::GPUTrace::haveState 2025-05-07T20:10:48.2521417Z U c10::impl::cow::is_cow_data_ptr(c10::DataPtr const&) 2025-05-07T20:10:48.2521622Z U c10::impl::cow::materialize_cow_storage(c10::StorageImpl&) 2025-05-07T20:10:48.2521780Z U c10::impl::device_guard_impl_registry 2025-05-07T20:10:48.2521933Z U c10::operator<<(std::ostream&, c10::Device const&) 2025-05-07T20:10:48.2522080Z U c10::operator<<(std::ostream&, c10::DeviceType) 2025-05-07T20:10:48.2522227Z U c10::throwNullDataPtrError() 2025-05-07T20:10:48.2522339Z U c10::warn(c10::Warning const&) 2025-05-07T20:10:48.2522462Z U c10::warnDeprecatedDataPtr() 2025-05-07T20:10:48.2522686Z U caffe2::TypeMeta::error_unsupported_typemeta(caffe2::TypeMeta) 2025-05-07T20:10:48.2522836Z U caffe2::TypeMeta::typeMetaDatas() 2025-05-07T20:10:48.2522978Z U cudaDeviceGetAttribute@libcudart.so.12 2025-05-07T20:10:48.2523118Z U cudaDeviceSynchronize@libcudart.so.12 2025-05-07T20:10:48.2523256Z U cudaEventCreateWithFlags@libcudart.so.12 2025-05-07T20:10:48.2523373Z U cudaEventDestroy@libcudart.so.12 2025-05-07T20:10:48.2523511Z U cudaEventElapsedTime@libcudart.so.12 2025-05-07T20:10:48.2523626Z U cudaEventQuery@libcudart.so.12 2025-05-07T20:10:48.2523742Z U cudaEventRecord@libcudart.so.12 2025-05-07T20:10:48.2523869Z U cudaEventSynchronize@libcudart.so.12 2025-05-07T20:10:48.2524110Z U cudaFuncGetAttributes@libcudart.so.12 2025-05-07T20:10:48.2524217Z U cudaGetDevice@libcudart.so.12 2025-05-07T20:10:48.2524333Z U cudaGetDeviceCount@libcudart.so.12 2025-05-07T20:10:48.2524459Z U cudaGetErrorString@libcudart.so.12 2025-05-07T20:10:48.2524566Z U cudaGetLastError@libcudart.so.12 2025-05-07T20:10:48.2524673Z U cudaLaunchKernel@libcudart.so.12 2025-05-07T20:10:48.2524950Z U cudaOccupancyMaxActiveBlocksPerMultiprocessorWithFlags@libcudart.so.12 2025-05-07T20:10:48.2525064Z U cudaPeekAtLastError@libcudart.so.12 2025-05-07T20:10:48.2525233Z U cudaSetDevice@libcudart.so.12 2025-05-07T20:10:48.2525343Z U cudaStreamQuery@libcudart.so.12 2025-05-07T20:10:48.2525473Z U cudaStreamSynchronize@libcudart.so.12 2025-05-07T20:10:48.2525587Z U cudaStreamWaitEvent@libcudart.so.12 2025-05-07T20:10:48.2525717Z U float* at::TensorBase::data_ptr() const 2025-05-07T20:10:48.2525850Z U int* at::TensorBase::data_ptr() const 2025-05-07T20:10:48.2526040Z U long c10::detail::maybe_wrap_dim_slow(long, long, bool) 2025-05-07T20:10:48.2526171Z U long* at::TensorBase::data_ptr() const 2025-05-07T20:10:48.2526283Z U memcpy@GLIBC_2.14 2025-05-07T20:10:48.2526376Z U memmove@GLIBC_2.2.5 2025-05-07T20:10:48.2526467Z U memset@GLIBC_2.2.5 2025-05-07T20:10:48.2526577Z U operator delete(void*)@GLIBCXX_3.4 2025-05-07T20:10:48.2526714Z U operator new(unsigned long)@GLIBCXX_3.4 2025-05-07T20:10:48.2527119Z U std::__cxx11::basic_ostringstream, std::allocator >::basic_ostringstream() 2025-05-07T20:10:48.2527528Z U std::__cxx11::basic_ostringstream, std::allocator >::~basic_ostringstream()@GLIBCXX_3.4.21 2025-05-07T20:10:48.2527650Z U std::__throw_bad_alloc()@GLIBCXX_3.4 2025-05-07T20:10:48.2527788Z U std::__throw_length_error(char const*)@GLIBCXX_3.4 2025-05-07T20:10:48.2527920Z U std::__throw_logic_error(char const*)@GLIBCXX_3.4 2025-05-07T20:10:48.2528159Z U std::basic_ios >::clear(std::_Ios_Iostate)@GLIBCXX_3.4 2025-05-07T20:10:48.2528705Z U std::basic_ostream >& std::__ostream_insert >(std::basic_ostream >&, char const*, long)@GLIBCXX_3.4.9 2025-05-07T20:10:48.2528863Z U std::ios_base::Init::Init()@GLIBCXX_3.4 2025-05-07T20:10:48.2528979Z U std::ios_base::Init::~Init()@GLIBCXX_3.4 2025-05-07T20:10:48.2529092Z U std::ios_base::~ios_base()@GLIBCXX_3.4 2025-05-07T20:10:48.2529201Z U std::locale::~locale()@GLIBCXX_3.4 2025-05-07T20:10:48.2529389Z U std::ostream& std::ostream::_M_insert(long)@GLIBCXX_3.4.9 2025-05-07T20:10:48.2529495Z U std::terminate()@GLIBCXX_3.4 2025-05-07T20:10:48.2529592Z U strlen@GLIBC_2.2.5 2025-05-07T20:10:48.2529745Z U torch::CppFunction::~CppFunction() 2025-05-07T20:10:48.2530303Z U torch::Library::Library(torch::Library::Kind, std::__cxx11::basic_string, std::allocator >, std::optional, char const*, unsigned int) 2025-05-07T20:10:48.2530743Z U torch::Library::_def(c10::FunctionSchema&&, c10::OperatorName*, std::vector > const&, torch::_RegisterOrVerify) & 2025-05-07T20:10:48.2531006Z U torch::Library::_impl(char const*, torch::CppFunction&&, torch::_RegisterOrVerify) & 2025-05-07T20:10:48.2531350Z U torch::jit::parseSchema(std::__cxx11::basic_string, std::allocator > const&, bool) 2025-05-07T20:10:48.2531509Z U vtable for __cxxabiv1::__class_type_info@CXXABI_1.3 2025-05-07T20:10:48.2531665Z U vtable for __cxxabiv1::__function_type_info@CXXABI_1.3 2025-05-07T20:10:48.2531817Z U vtable for __cxxabiv1::__si_class_type_info@CXXABI_1.3 2025-05-07T20:10:48.2532144Z U vtable for std::__cxx11::basic_stringbuf, std::allocator >@GLIBCXX_3.4.21 2025-05-07T20:10:48.2532359Z U vtable for std::basic_streambuf >@GLIBCXX_3.4 2025-05-07T20:10:48.2532471Z w _ITM_deregisterTMCloneTable 2025-05-07T20:10:48.2532597Z w _ITM_registerTMCloneTable 2025-05-07T20:10:48.2532714Z w __cxa_finalize@GLIBC_2.2.5 2025-05-07T20:10:48.2532801Z w __gmon_start__ 2025-05-07T20:10:48.2532896Z w __pthread_key_create 2025-05-07T20:10:48.2533053Z [CHECK] Listing out external shared libraries linked: 2025-05-07T20:10:48.2533279Z + ldd ./_skbuild/linux-x86_64-3.12/cmake-build/fbgemm_gpu_sparse_async_cumsum.so 2025-05-07T20:10:48.2533286Z 2025-05-07T20:10:48.2545170Z linux-vdso.so.1 (0x00007ffe2b517000) 2025-05-07T20:10:48.2545485Z libc10.so => not found 2025-05-07T20:10:48.2545778Z libnvrtc.so.12 => not found 2025-05-07T20:10:48.2546092Z libc10_cuda.so => not found 2025-05-07T20:10:48.2546382Z libnccl.so.2 => not found 2025-05-07T20:10:48.2546645Z libcuda.so.1 => not found 2025-05-07T20:10:48.2547515Z fbgemm_gpu_tbe_utils.so => /__w/FBGEMM/FBGEMM/fbgemm_gpu/./_skbuild/linux-x86_64-3.12/cmake-build/fbgemm_gpu_tbe_utils.so (0x00007f7d60600000) 2025-05-07T20:10:48.2547636Z libnvidia-ml.so.1 => not found 2025-05-07T20:10:48.2547732Z libtorch.so => not found 2025-05-07T20:10:48.2547832Z libtorch_cpu.so => not found 2025-05-07T20:10:48.2547933Z libtorch_cuda.so => not found 2025-05-07T20:10:48.2548048Z libcudart.so.12 => not found 2025-05-07T20:10:48.2548214Z libstdc++.so.6 => /lib64/libstdc++.so.6 (0x00007f7d6039c000) 2025-05-07T20:10:48.2548369Z libgcc_s.so.1 => /lib64/libgcc_s.so.1 (0x00007f7d617e9000) 2025-05-07T20:10:48.2548513Z libc.so.6 => /lib64/libc.so.6 (0x00007f7d60194000) 2025-05-07T20:10:48.2548612Z libtorch.so => not found 2025-05-07T20:10:48.2548703Z libc10.so => not found 2025-05-07T20:10:48.2548806Z libnvrtc.so.12 => not found 2025-05-07T20:10:48.2548917Z libc10_cuda.so => not found 2025-05-07T20:10:48.2549011Z libnccl.so.2 => not found 2025-05-07T20:10:48.2549103Z libcuda.so.1 => not found 2025-05-07T20:10:48.2549253Z libnvidia-ml.so.1 => not found 2025-05-07T20:10:48.2549353Z libtorch_cpu.so => not found 2025-05-07T20:10:48.2549456Z libtorch_cuda.so => not found 2025-05-07T20:10:48.2549554Z libcudart.so.12 => not found 2025-05-07T20:10:48.2549702Z libm.so.6 => /lib64/libm.so.6 (0x00007f7d6170a000) 2025-05-07T20:10:48.2549944Z libgomp.so.1 => /lib64/libgomp.so.1 (0x00007f7d6013e000) 2025-05-07T20:10:48.2550079Z /lib64/ld-linux-x86-64.so.2 (0x00007f7d619c6000) 2025-05-07T20:10:48.2550185Z 2025-05-07T20:10:48.2550313Z [CHECK] Displaying ELF information: 2025-05-07T20:10:48.2550590Z + readelf -d ./_skbuild/linux-x86_64-3.12/cmake-build/fbgemm_gpu_sparse_async_cumsum.so 2025-05-07T20:10:48.2550628Z 2025-05-07T20:10:48.2587647Z 2025-05-07T20:10:48.2588102Z Dynamic section at offset 0x16eba8 contains 39 entries: 2025-05-07T20:10:48.2588259Z Tag Type Name/Value 2025-05-07T20:10:48.2588486Z 0x0000000000000001 (NEEDED) Shared library: [libc10.so] 2025-05-07T20:10:48.2588746Z 0x0000000000000001 (NEEDED) Shared library: [libnvrtc.so.12] 2025-05-07T20:10:48.2588974Z 0x0000000000000001 (NEEDED) Shared library: [libc10_cuda.so] 2025-05-07T20:10:48.2589178Z 0x0000000000000001 (NEEDED) Shared library: [libnccl.so.2] 2025-05-07T20:10:48.2589405Z 0x0000000000000001 (NEEDED) Shared library: [libcuda.so.1] 2025-05-07T20:10:48.2589647Z 0x0000000000000001 (NEEDED) Shared library: [fbgemm_gpu_tbe_utils.so] 2025-05-07T20:10:48.2589862Z 0x0000000000000001 (NEEDED) Shared library: [libnvidia-ml.so.1] 2025-05-07T20:10:48.2590063Z 0x0000000000000001 (NEEDED) Shared library: [libtorch.so] 2025-05-07T20:10:48.2590283Z 0x0000000000000001 (NEEDED) Shared library: [libtorch_cpu.so] 2025-05-07T20:10:48.2590506Z 0x0000000000000001 (NEEDED) Shared library: [libtorch_cuda.so] 2025-05-07T20:10:48.2590709Z 0x0000000000000001 (NEEDED) Shared library: [libcudart.so.12] 2025-05-07T20:10:48.2590925Z 0x0000000000000001 (NEEDED) Shared library: [libstdc++.so.6] 2025-05-07T20:10:48.2591337Z 0x0000000000000001 (NEEDED) Shared library: [libgcc_s.so.1] 2025-05-07T20:10:48.2591547Z 0x0000000000000001 (NEEDED) Shared library: [libc.so.6] 2025-05-07T20:10:48.2591819Z 0x000000000000000e (SONAME) Library soname: [fbgemm_gpu_sparse_async_cumsum.so] 2025-05-07T20:10:48.2592006Z 0x000000000000000f (RPATH) Library rpath: [$ORIGIN] 2025-05-07T20:10:48.2592121Z 0x000000000000000c (INIT) 0xa000 2025-05-07T20:10:48.2592286Z 0x000000000000000d (FINI) 0x1a14c 2025-05-07T20:10:48.2592422Z 0x0000000000000019 (INIT_ARRAY) 0x16f890 2025-05-07T20:10:48.2592557Z 0x000000000000001b (INIT_ARRAYSZ) 32 (bytes) 2025-05-07T20:10:48.2592675Z 0x000000000000001a (FINI_ARRAY) 0x16f8b0 2025-05-07T20:10:48.2592812Z 0x000000000000001c (FINI_ARRAYSZ) 8 (bytes) 2025-05-07T20:10:48.2592929Z 0x000000006ffffef5 (GNU_HASH) 0x200 2025-05-07T20:10:48.2593044Z 0x0000000000000005 (STRTAB) 0x2108 2025-05-07T20:10:48.2593169Z 0x0000000000000006 (SYMTAB) 0x6f8 2025-05-07T20:10:48.2593304Z 0x000000000000000a (STRSZ) 20443 (bytes) 2025-05-07T20:10:48.2593428Z 0x000000000000000b (SYMENT) 24 (bytes) 2025-05-07T20:10:48.2593551Z 0x0000000000000003 (PLTGOT) 0x16ffe8 2025-05-07T20:10:48.2593702Z 0x0000000000000002 (PLTRELSZ) 3936 (bytes) 2025-05-07T20:10:48.2593811Z 0x0000000000000014 (PLTREL) RELA 2025-05-07T20:10:48.2593928Z 0x0000000000000017 (JMPREL) 0x8150 2025-05-07T20:10:48.2594056Z 0x0000000000000007 (RELA) 0x73d0 2025-05-07T20:10:48.2594187Z 0x0000000000000008 (RELASZ) 3456 (bytes) 2025-05-07T20:10:48.2594309Z 0x0000000000000009 (RELAENT) 24 (bytes) 2025-05-07T20:10:48.2594441Z 0x000000006ffffffe (VERNEED) 0x7310 2025-05-07T20:10:48.2594551Z 0x000000006fffffff (VERNEEDNUM) 4 2025-05-07T20:10:48.2594707Z 0x000000006ffffff0 (VERSYM) 0x70e4 2025-05-07T20:10:48.2594816Z 0x000000006ffffff9 (RELACOUNT) 7 2025-05-07T20:10:48.2594935Z 0x0000000000000000 (NULL) 0x0 2025-05-07T20:10:48.2594950Z 2025-05-07T20:10:48.2595068Z ################################################################################ 2025-05-07T20:10:48.2595073Z 2025-05-07T20:10:48.2595077Z 2025-05-07T20:10:48.2595191Z ################################################################################ 2025-05-07T20:10:48.2595642Z [CHECK] BUILT LIBRARY: ./_skbuild/linux-x86_64-3.12/cmake-build/fbgemm_gpu_tbe_training_backward_pt2.so 2025-05-07T20:10:48.2595781Z [CHECK] Listing out library size: 2025-05-07T20:10:48.2596081Z + du -h --block-size=1M ./_skbuild/linux-x86_64-3.12/cmake-build/fbgemm_gpu_tbe_training_backward_pt2.so 2025-05-07T20:10:48.2596085Z 2025-05-07T20:10:48.2610320Z 11 ./_skbuild/linux-x86_64-3.12/cmake-build/fbgemm_gpu_tbe_training_backward_pt2.so 2025-05-07T20:10:48.2613169Z 2025-05-07T20:10:48.2613657Z [CHECK] Listing out the GLIBC versions referenced by: ./_skbuild/linux-x86_64-3.12/cmake-build/fbgemm_gpu_tbe_training_backward_pt2.so 2025-05-07T20:10:48.2614231Z + objdump -TC ./_skbuild/linux-x86_64-3.12/cmake-build/fbgemm_gpu_tbe_training_backward_pt2.so | grep GLIBC_ | sed 's/.*GLIBC_\([.0-9]*\).*/GLIBC_\1/g' | sort -Vu | cat 2025-05-07T20:10:48.2614236Z 2025-05-07T20:10:48.3074109Z GLIBC_2.2.5 2025-05-07T20:10:48.3074210Z GLIBC_2.3 2025-05-07T20:10:48.3074309Z GLIBC_2.14 2025-05-07T20:10:48.3075735Z 2025-05-07T20:10:48.3075880Z 2025-05-07T20:10:48.3076708Z [CHECK] Listing out the GLIBCXX versions referenced by: ./_skbuild/linux-x86_64-3.12/cmake-build/fbgemm_gpu_tbe_training_backward_pt2.so 2025-05-07T20:10:48.3077309Z + objdump -TC ./_skbuild/linux-x86_64-3.12/cmake-build/fbgemm_gpu_tbe_training_backward_pt2.so | grep GLIBCXX_ | sed 's/.*GLIBCXX_\([.0-9]*\).*/GLIBCXX_\1/g' | sort -Vu | cat 2025-05-07T20:10:48.3077316Z 2025-05-07T20:10:48.3539187Z GLIBCXX_3.4 2025-05-07T20:10:48.3539341Z GLIBCXX_3.4.9 2025-05-07T20:10:48.3541924Z GLIBCXX_3.4.11 2025-05-07T20:10:48.3542083Z GLIBCXX_3.4.15 2025-05-07T20:10:48.3542168Z GLIBCXX_3.4.18 2025-05-07T20:10:48.3542251Z GLIBCXX_3.4.20 2025-05-07T20:10:48.3542335Z GLIBCXX_3.4.21 2025-05-07T20:10:48.3542341Z 2025-05-07T20:10:48.3542346Z 2025-05-07T20:10:48.3557390Z + nm -gDC ./_skbuild/linux-x86_64-3.12/cmake-build/fbgemm_gpu_tbe_training_backward_pt2.so > /tmp/tmp.swZg8YLAIJ.symbols.txt 2025-05-07T20:10:48.3557397Z 2025-05-07T20:10:48.3963397Z 2025-05-07T20:10:48.3994199Z [CHECK] Total Number of symbols: 4395 2025-05-07T20:10:48.4034850Z [CHECK] Number of fbgemm symbols: 4 2025-05-07T20:10:48.4058558Z + nm -gDCu ./_skbuild/linux-x86_64-3.12/cmake-build/fbgemm_gpu_tbe_training_backward_pt2.so > /tmp/tmp.o9dma5qSm3.usymbols.txt 2025-05-07T20:10:48.4058616Z 2025-05-07T20:10:48.4096015Z 2025-05-07T20:10:48.4130324Z [CHECK] Listing out undefined symbols (185 total): 2025-05-07T20:10:48.4148733Z U VTT for std::__cxx11::basic_ostringstream, std::allocator >@GLIBCXX_3.4.21 2025-05-07T20:10:48.4149593Z U VTT for std::__cxx11::basic_stringstream, std::allocator >@GLIBCXX_3.4.21 2025-05-07T20:10:48.4149783Z U _Unwind_Resume@GCC_3.0 2025-05-07T20:10:48.4149917Z U __cxa_allocate_exception@CXXABI_1.3 2025-05-07T20:10:48.4150043Z U __cxa_atexit@GLIBC_2.2.5 2025-05-07T20:10:48.4150178Z U __cxa_begin_catch@CXXABI_1.3 2025-05-07T20:10:48.4150305Z U __cxa_end_catch@CXXABI_1.3 2025-05-07T20:10:48.4150444Z U __cxa_free_exception@CXXABI_1.3 2025-05-07T20:10:48.4150557Z U __cxa_guard_abort@CXXABI_1.3 2025-05-07T20:10:48.4150682Z U __cxa_guard_acquire@CXXABI_1.3 2025-05-07T20:10:48.4150791Z U __cxa_guard_release@CXXABI_1.3 2025-05-07T20:10:48.4150914Z U __cxa_rethrow@CXXABI_1.3 2025-05-07T20:10:48.4151195Z U __cxa_throw@CXXABI_1.3 2025-05-07T20:10:48.4151420Z U __gxx_personality_v0@CXXABI_1.3 2025-05-07T20:10:48.4151527Z U __tls_get_addr@GLIBC_2.3 2025-05-07T20:10:48.4151733Z U at::RecordFunction::RecordFunction(at::StepCallbacks&&) 2025-05-07T20:10:48.4151869Z U at::RecordFunction::currentThreadId() 2025-05-07T20:10:48.4151982Z U at::RecordFunction::end() 2025-05-07T20:10:48.4152113Z U at::RecordFunction::~RecordFunction() 2025-05-07T20:10:48.4152282Z U at::SavedTensorDefaultHooks::set_tracing(bool) 2025-05-07T20:10:48.4152663Z U at::Tensor fbgemm_gpu::reshape_vbe_offsets(at::Tensor const&, at::Tensor const&, long, int) 2025-05-07T20:10:48.4152976Z U at::Tensor fbgemm_gpu::reshape_vbe_offsets(at::Tensor const&, at::Tensor const&, long, int) 2025-05-07T20:10:48.4153327Z U at::_ops::clamp::call(at::Tensor const&, std::optional const&, std::optional const&) 2025-05-07T20:10:48.4153548Z U at::_ops::clone::call(at::Tensor const&, std::optional) 2025-05-07T20:10:48.4154222Z U at::_ops::empty_memory_format::call(c10::ArrayRef, std::optional, std::optional, std::optional, std::optional, std::optional) 2025-05-07T20:10:48.4154421Z U at::_ops::flatten_using_ints::call(at::Tensor const&, long, long) 2025-05-07T20:10:48.4154590Z U at::_ops::view_as::call(at::Tensor const&, at::Tensor const&) 2025-05-07T20:10:48.4154738Z U at::functorch::functorchTLSAccessor() 2025-05-07T20:10:48.4154903Z U at::getStepCallbacksUnlessEmpty(at::RecordScope) 2025-05-07T20:10:48.4155034Z U at::sequence_number::get_and_increment() 2025-05-07T20:10:48.4155153Z U bcmp@GLIBC_2.2.5 2025-05-07T20:10:48.4155297Z U c10::AnyType::get() 2025-05-07T20:10:48.4155400Z U c10::BoolType::get() 2025-05-07T20:10:48.4155670Z U c10::DeviceTypeName[abi:cxx11](c10::DeviceType, bool) 2025-05-07T20:10:48.4155859Z U c10::Dispatcher::findSchemaOrThrow(char const*, char const*) 2025-05-07T20:10:48.4155969Z U c10::Dispatcher::realSingleton() 2025-05-07T20:10:48.4156529Z U c10::Dispatcher::runRecordFunction(at::RecordFunction&, std::reference_wrapper, c10::DispatchKey, c10::DispatchKeySet) 2025-05-07T20:10:48.4157143Z U c10::Dispatcher::runRecordFunction(at::RecordFunction&, std::reference_wrapper, c10::DispatchKey, c10::DispatchKeySet, c10::ArrayRef) 2025-05-07T20:10:48.4157496Z U c10::Error::Error(c10::SourceLocation, std::__cxx11::basic_string, std::allocator >) 2025-05-07T20:10:48.4157617Z U c10::Error::what() const 2025-05-07T20:10:48.4157716Z U c10::FloatType::get() 2025-05-07T20:10:48.4157820Z U c10::GradMode::is_enabled() 2025-05-07T20:10:48.4157929Z U c10::GradMode::set_enabled(bool) 2025-05-07T20:10:48.4158100Z U c10::IValue::TagType::get(c10::IValue const&) 2025-05-07T20:10:48.4158213Z U c10::IValue::hash(c10::IValue const&) 2025-05-07T20:10:48.4158323Z U c10::IValue::isBoolList() const 2025-05-07T20:10:48.4158454Z U c10::IValue::isDoubleList() const 2025-05-07T20:10:48.4158563Z U c10::IValue::isIntList() const 2025-05-07T20:10:48.4158671Z U c10::IValue::isSymIntList() const 2025-05-07T20:10:48.4158790Z U c10::IValue::isTensorList() const 2025-05-07T20:10:48.4158926Z U c10::IValue::reportToTensorTypeError() const 2025-05-07T20:10:48.4159043Z U c10::IntType::get() 2025-05-07T20:10:48.4159513Z U c10::ListType::get(std::__cxx11::basic_string, std::allocator > const&, c10::Type::SingletonOrSharedTypePtr) 2025-05-07T20:10:48.4159732Z U c10::MessageLogger::MessageLogger(char const*, int, int) 2025-05-07T20:10:48.4159847Z U c10::MessageLogger::~MessageLogger() 2025-05-07T20:10:48.4159969Z U c10::OperatorHandle::~OperatorHandle() 2025-05-07T20:10:48.4160092Z U c10::OperatorHandle::~OperatorHandle() 2025-05-07T20:10:48.4160344Z U c10::OptionalType::get(c10::Type::SingletonOrSharedTypePtr) 2025-05-07T20:10:48.4160610Z U c10::SmallVectorBase::grow_pod(void const*, unsigned long, unsigned long) 2025-05-07T20:10:48.4160713Z U c10::StringType::get() 2025-05-07T20:10:48.4160867Z U c10::SymBool::expect_true(char const*, long) const 2025-05-07T20:10:48.4161006Z U c10::SymBool::guard_bool(char const*, long) const 2025-05-07T20:10:48.4161147Z U c10::SymFloat::guard_float(char const*, long) const 2025-05-07T20:10:48.4161309Z U c10::SymFloat::operator/(c10::SymFloat const&) const 2025-05-07T20:10:48.4161692Z U c10::SymInt::SymInt(c10::intrusive_ptr >) 2025-05-07T20:10:48.4161825Z U c10::SymInt::guard_int(char const*, long) const 2025-05-07T20:10:48.4161964Z U c10::SymInt::operator c10::SymFloat() const 2025-05-07T20:10:48.4162098Z U c10::SymInt::operator/(c10::SymInt const&) const 2025-05-07T20:10:48.4162215Z U c10::SymInt::promote_to_negative() 2025-05-07T20:10:48.4162353Z U c10::SymInt::sym_le(c10::SymInt const&) const 2025-05-07T20:10:48.4162458Z U c10::SymInt::toSymNode() const 2025-05-07T20:10:48.4162559Z U c10::SymIntType::get() 2025-05-07T20:10:48.4162714Z U c10::TensorImpl::requires_grad() const 2025-05-07T20:10:48.4162812Z U c10::TensorType::get() 2025-05-07T20:10:48.4162928Z U c10::UndefinedTensorImpl::_singleton 2025-05-07T20:10:48.4163346Z U c10::detail::ListImpl::ListImpl(std::vector >, c10::Type::SingletonOrSharedTypePtr) 2025-05-07T20:10:48.4163860Z U c10::detail::infer_schema::make_function_schema(c10::ArrayRef, c10::ArrayRef) 2025-05-07T20:10:48.4164108Z U c10::detail::torchCheckFail(char const*, char const*, unsigned int, char const*) 2025-05-07T20:10:48.4164586Z U c10::detail::torchCheckFail(char const*, char const*, unsigned int, std::__cxx11::basic_string, std::allocator > const&) 2025-05-07T20:10:48.4165271Z U c10::detail::torchInternalAssertFail(char const*, char const*, unsigned int, char const*, char const*) 2025-05-07T20:10:48.4165871Z U c10::detail::torchInternalAssertFail(char const*, char const*, unsigned int, char const*, std::__cxx11::basic_string, std::allocator > const&) 2025-05-07T20:10:48.4166220Z U c10::impl::OperatorEntry::assertSignatureIsCorrect(c10::impl::CppSignature const&, bool) const 2025-05-07T20:10:48.4166410Z U c10::impl::OperatorEntry::reportError(c10::DispatchKey) const 2025-05-07T20:10:48.4166556Z U c10::impl::raw_local_dispatch_key_set 2025-05-07T20:10:48.4166714Z U c10::initializeFunctionalityOffsetsAndMasks() 2025-05-07T20:10:48.4167094Z U c10::ivalue::ConstantString::create(std::__cxx11::basic_string, std::allocator >) 2025-05-07T20:10:48.4167281Z U c10::operator-(c10::SymInt const&, int) 2025-05-07T20:10:48.4167449Z U c10::operator<<(std::ostream&, c10::OperatorName const&) 2025-05-07T20:10:48.4167602Z U c10::operator<<(std::ostream&, c10::SymFloat const&) 2025-05-07T20:10:48.4167751Z U c10::operator<<(std::ostream&, c10::SymInt const&) 2025-05-07T20:10:48.4167962Z U caffe2::TypeMeta::error_unsupported_typemeta(caffe2::TypeMeta) 2025-05-07T20:10:48.4168088Z U caffe2::TypeMeta::typeMetaDatas() 2025-05-07T20:10:48.4168353Z U fbgemm_gpu::config::is_feature_enabled(fbgemm_gpu::config::FeatureGateName const&) 2025-05-07T20:10:48.4168698Z U fbgemm_gpu::reshape_vbe_output(at::Tensor const&, long, at::Tensor const&, at::Tensor const&) 2025-05-07T20:10:48.4168797Z U free@GLIBC_2.2.5 2025-05-07T20:10:48.4168980Z U long c10::detail::maybe_wrap_dim_slow(long, long, bool) 2025-05-07T20:10:48.4169095Z U memcmp@GLIBC_2.2.5 2025-05-07T20:10:48.4169193Z U memcpy@GLIBC_2.14 2025-05-07T20:10:48.4169294Z U memmove@GLIBC_2.2.5 2025-05-07T20:10:48.4169403Z U memset@GLIBC_2.2.5 2025-05-07T20:10:48.4169526Z U operator delete(void*)@GLIBCXX_3.4 2025-05-07T20:10:48.4169653Z U operator new(unsigned long)@GLIBCXX_3.4 2025-05-07T20:10:48.4169753Z U realloc@GLIBC_2.2.5 2025-05-07T20:10:48.4169990Z U std::_Hash_bytes(void const*, unsigned long, unsigned long)@CXXABI_1.3.5 2025-05-07T20:10:48.4170338Z U std::__cxx11::basic_ostringstream, std::allocator >::basic_ostringstream() 2025-05-07T20:10:48.4170755Z U std::__cxx11::basic_ostringstream, std::allocator >::~basic_ostringstream()@GLIBCXX_3.4.21 2025-05-07T20:10:48.4171095Z U std::__cxx11::basic_stringstream, std::allocator >::basic_stringstream() 2025-05-07T20:10:48.4171619Z U std::__cxx11::basic_stringstream, std::allocator >::~basic_stringstream()@GLIBCXX_3.4.21 2025-05-07T20:10:48.4171987Z U std::__detail::_Prime_rehash_policy::_M_need_rehash(unsigned long, unsigned long, unsigned long) const@GLIBCXX_3.4.18 2025-05-07T20:10:48.4172104Z U std::__throw_bad_alloc()@GLIBCXX_3.4 2025-05-07T20:10:48.4172217Z U std::__throw_bad_array_new_length() 2025-05-07T20:10:48.4172403Z U std::__throw_length_error(char const*)@GLIBCXX_3.4 2025-05-07T20:10:48.4172543Z U std::__throw_logic_error(char const*)@GLIBCXX_3.4 2025-05-07T20:10:48.4172708Z U std::__throw_out_of_range_fmt(char const*, ...)@GLIBCXX_3.4.20 2025-05-07T20:10:48.4172851Z U std::__throw_system_error(int)@GLIBCXX_3.4.11 2025-05-07T20:10:48.4172989Z U std::bad_weak_ptr::~bad_weak_ptr()@GLIBCXX_3.4.15 2025-05-07T20:10:48.4173221Z U std::basic_ios >::clear(std::_Ios_Iostate)@GLIBCXX_3.4 2025-05-07T20:10:48.4173778Z U std::basic_ostream >& std::__ostream_insert >(std::basic_ostream >&, char const*, long)@GLIBCXX_3.4.9 2025-05-07T20:10:48.4173903Z U std::exception::~exception()@GLIBCXX_3.4 2025-05-07T20:10:48.4174022Z U std::ios_base::Init::Init()@GLIBCXX_3.4 2025-05-07T20:10:48.4174151Z U std::ios_base::Init::~Init()@GLIBCXX_3.4 2025-05-07T20:10:48.4174268Z U std::ios_base::~ios_base()@GLIBCXX_3.4 2025-05-07T20:10:48.4174377Z U std::locale::~locale()@GLIBCXX_3.4 2025-05-07T20:10:48.4174563Z U std::ostream& std::ostream::_M_insert(long)@GLIBCXX_3.4.9 2025-05-07T20:10:48.4174789Z U std::ostream& std::ostream::_M_insert(unsigned long)@GLIBCXX_3.4.9 2025-05-07T20:10:48.4174938Z U std::ostream::operator<<(int)@GLIBCXX_3.4 2025-05-07T20:10:48.4175112Z U std::out_of_range::out_of_range(char const*)@GLIBCXX_3.4.21 2025-05-07T20:10:48.4175239Z U std::out_of_range::~out_of_range()@GLIBCXX_3.4 2025-05-07T20:10:48.4175647Z U std::runtime_error::runtime_error(std::__cxx11::basic_string, std::allocator > const&)@GLIBCXX_3.4.21 2025-05-07T20:10:48.4175797Z U std::runtime_error::~runtime_error()@GLIBCXX_3.4 2025-05-07T20:10:48.4175929Z U std::terminate()@GLIBCXX_3.4 2025-05-07T20:10:48.4176026Z U strcmp@GLIBC_2.2.5 2025-05-07T20:10:48.4176120Z U strlen@GLIBC_2.2.5 2025-05-07T20:10:48.4176256Z U torch::CppFunction::~CppFunction() 2025-05-07T20:10:48.4176813Z U torch::Library::Library(torch::Library::Kind, std::__cxx11::basic_string, std::allocator >, std::optional, char const*, unsigned int) 2025-05-07T20:10:48.4177267Z U torch::Library::_def(c10::FunctionSchema&&, c10::OperatorName*, std::vector > const&, torch::_RegisterOrVerify) & 2025-05-07T20:10:48.4177515Z U torch::Library::_impl(char const*, torch::CppFunction&&, torch::_RegisterOrVerify) & 2025-05-07T20:10:48.4177637Z U torch::autograd::AnomalyMode::_enabled 2025-05-07T20:10:48.4177937Z U torch::autograd::AutogradContext::AutogradContext(torch::dynamo::autograd::PackedArgs&) 2025-05-07T20:10:48.4178117Z U torch::autograd::AutogradContext::get_and_bump_dirty() const 2025-05-07T20:10:48.4178314Z U torch::autograd::AutogradContext::get_non_differentiable() const 2025-05-07T20:10:48.4178506Z U torch::autograd::AutogradContext::get_saved_variables() const 2025-05-07T20:10:48.4178862Z U torch::autograd::AutogradContext::save_for_backward(std::vector >) 2025-05-07T20:10:48.4179014Z U torch::autograd::AutogradContext::save_variables() 2025-05-07T20:10:48.4179209Z U torch::autograd::ForwardADLevel::try_get_by_idx(unsigned long) 2025-05-07T20:10:48.4179382Z U torch::autograd::InputMetadata::shape_as_dim_vector() const 2025-05-07T20:10:48.4179503Z U torch::autograd::Node::assign_parent() 2025-05-07T20:10:48.4179655Z U torch::autograd::Node::metadata() 2025-05-07T20:10:48.4179792Z U torch::autograd::Node::name[abi:cxx11]() const 2025-05-07T20:10:48.4193069Z U torch::autograd::SavedVariable::SavedVariable(at::Tensor const&, bool, bool) 2025-05-07T20:10:48.4193448Z U torch::autograd::SavedVariable::unpack(std::shared_ptr) const 2025-05-07T20:10:48.4193617Z U torch::autograd::VariableInfo::VariableInfo() 2025-05-07T20:10:48.4193863Z U torch::autograd::VariableInfo::VariableInfo(at::Tensor const&, bool) 2025-05-07T20:10:48.4194093Z U torch::autograd::VariableInfo::zeros(c10::OptionalDeviceGuard&) const 2025-05-07T20:10:48.4196816Z U torch::autograd::_wrap_outputs(std::vector > const&, std::unordered_set, std::equal_to, std::allocator > const&, std::unordered_set, std::equal_to, std::allocator > const&, c10::ArrayRef >, std::shared_ptr const&, std::function > (std::vector >, std::vector >)> const&, std::unordered_set, std::equal_to, std::allocator > const&, std::function const&) 2025-05-07T20:10:48.4197076Z U torch::autograd::deleteNode(torch::autograd::Node*) 2025-05-07T20:10:48.4197234Z U torch::autograd::get_current_graph_task_exec_info() 2025-05-07T20:10:48.4197425Z U torch::autograd::impl::gradient_edge(at::Tensor const&) 2025-05-07T20:10:48.4198299Z U torch::autograd::profiler::record_function_enter_new(std::__cxx11::basic_string, std::allocator > const&, std::optional, std::allocator > > const&) 2025-05-07T20:10:48.4198502Z U torch::dynamo::autograd::getPyCompilerInterface() 2025-05-07T20:10:48.4198892Z U torch::dynamo::autograd::get_input_metadata(std::vector > const&) 2025-05-07T20:10:48.4199241Z U torch::jit::parseSchema(std::__cxx11::basic_string, std::allocator > const&, bool) 2025-05-07T20:10:48.4199360Z U typeinfo for c10::Error 2025-05-07T20:10:48.4199500Z U typeinfo for std::bad_weak_ptr@GLIBCXX_3.4.15 2025-05-07T20:10:48.4199624Z U typeinfo for std::exception@GLIBCXX_3.4 2025-05-07T20:10:48.4199767Z U typeinfo for std::out_of_range@GLIBCXX_3.4 2025-05-07T20:10:48.4199897Z U typeinfo for std::runtime_error@GLIBCXX_3.4 2025-05-07T20:10:48.4200018Z U typeinfo for torch::autograd::Node 2025-05-07T20:10:48.4200166Z U vtable for __cxxabiv1::__class_type_info@CXXABI_1.3 2025-05-07T20:10:48.4200340Z U vtable for __cxxabiv1::__function_type_info@CXXABI_1.3 2025-05-07T20:10:48.4200494Z U vtable for __cxxabiv1::__pointer_type_info@CXXABI_1.3 2025-05-07T20:10:48.4200676Z U vtable for __cxxabiv1::__si_class_type_info@CXXABI_1.3 2025-05-07T20:10:48.4200794Z U vtable for c10::Error 2025-05-07T20:10:48.4201113Z U vtable for std::__cxx11::basic_stringbuf, std::allocator >@GLIBCXX_3.4.21 2025-05-07T20:10:48.4201244Z U vtable for std::bad_weak_ptr@GLIBCXX_3.4.15 2025-05-07T20:10:48.4201479Z U vtable for std::basic_streambuf >@GLIBCXX_3.4 2025-05-07T20:10:48.4201623Z U vtable for torch::autograd::Node 2025-05-07T20:10:48.4201799Z w TLS init function for c10::impl::raw_local_dispatch_key_set 2025-05-07T20:10:48.4201923Z w _ITM_deregisterTMCloneTable 2025-05-07T20:10:48.4202030Z w _ITM_registerTMCloneTable 2025-05-07T20:10:48.4202133Z w __cxa_finalize@GLIBC_2.2.5 2025-05-07T20:10:48.4202237Z w __gmon_start__ 2025-05-07T20:10:48.4202334Z w __pthread_key_create 2025-05-07T20:10:48.4202447Z w pthread_mutex_lock@GLIBC_2.2.5 2025-05-07T20:10:48.4202557Z w pthread_mutex_unlock@GLIBC_2.2.5 2025-05-07T20:10:48.4202718Z [CHECK] Listing out external shared libraries linked: 2025-05-07T20:10:48.4202966Z + ldd ./_skbuild/linux-x86_64-3.12/cmake-build/fbgemm_gpu_tbe_training_backward_pt2.so 2025-05-07T20:10:48.4202973Z 2025-05-07T20:10:48.4213569Z linux-vdso.so.1 (0x00007fffe6154000) 2025-05-07T20:10:48.4213809Z libc10.so => not found 2025-05-07T20:10:48.4213966Z libnvrtc.so.12 => not found 2025-05-07T20:10:48.4214128Z libc10_cuda.so => not found 2025-05-07T20:10:48.4214244Z libnccl.so.2 => not found 2025-05-07T20:10:48.4214344Z libcuda.so.1 => not found 2025-05-07T20:10:48.4214867Z fbgemm_gpu_tbe_cache.so => /__w/FBGEMM/FBGEMM/fbgemm_gpu/./_skbuild/linux-x86_64-3.12/cmake-build/fbgemm_gpu_tbe_cache.so (0x00007f2c98c00000) 2025-05-07T20:10:48.4215378Z fbgemm_gpu_tbe_common.so => /__w/FBGEMM/FBGEMM/fbgemm_gpu/./_skbuild/linux-x86_64-3.12/cmake-build/fbgemm_gpu_tbe_common.so (0x00007f2c98800000) 2025-05-07T20:10:48.4216041Z fbgemm_gpu_sparse_async_cumsum.so => /__w/FBGEMM/FBGEMM/fbgemm_gpu/./_skbuild/linux-x86_64-3.12/cmake-build/fbgemm_gpu_sparse_async_cumsum.so (0x00007f2c98659000) 2025-05-07T20:10:48.4216152Z libnvidia-ml.so.1 => not found 2025-05-07T20:10:48.4216275Z libtorch.so => not found 2025-05-07T20:10:48.4216734Z fbgemm_gpu_config.so => /__w/FBGEMM/FBGEMM/fbgemm_gpu/./_skbuild/linux-x86_64-3.12/cmake-build/fbgemm_gpu_config.so (0x00007f2c9afd1000) 2025-05-07T20:10:48.4216838Z libtorch_cpu.so => not found 2025-05-07T20:10:48.4218310Z libtorch_cuda.so => not found 2025-05-07T20:10:48.4218483Z libstdc++.so.6 => /lib64/libstdc++.so.6 (0x00007f2c983f5000) 2025-05-07T20:10:48.4218641Z libgcc_s.so.1 => /lib64/libgcc_s.so.1 (0x00007f2c9afa3000) 2025-05-07T20:10:48.4218776Z libc.so.6 => /lib64/libc.so.6 (0x00007f2c981ed000) 2025-05-07T20:10:48.4218930Z /lib64/ld-linux-x86-64.so.2 (0x00007f2c9afe4000) 2025-05-07T20:10:48.4219138Z libtorch.so => not found 2025-05-07T20:10:48.4219230Z libc10.so => not found 2025-05-07T20:10:48.4219347Z libnvrtc.so.12 => not found 2025-05-07T20:10:48.4219579Z libc10_cuda.so => not found 2025-05-07T20:10:48.4219674Z libnccl.so.2 => not found 2025-05-07T20:10:48.4219781Z libcuda.so.1 => not found 2025-05-07T20:10:48.4219878Z libnvidia-ml.so.1 => not found 2025-05-07T20:10:48.4219973Z libtorch_cpu.so => not found 2025-05-07T20:10:48.4220083Z libtorch_cuda.so => not found 2025-05-07T20:10:48.4220181Z libcudart.so.12 => not found 2025-05-07T20:10:48.4220306Z libm.so.6 => /lib64/libm.so.6 (0x00007f2c9aec4000) 2025-05-07T20:10:48.4220472Z libgomp.so.1 => /lib64/libgomp.so.1 (0x00007f2c9ae6c000) 2025-05-07T20:10:48.4220561Z libc10.so => not found 2025-05-07T20:10:48.4220656Z libnvrtc.so.12 => not found 2025-05-07T20:10:48.4220744Z libc10_cuda.so => not found 2025-05-07T20:10:48.4220853Z libnccl.so.2 => not found 2025-05-07T20:10:48.4220948Z libcuda.so.1 => not found 2025-05-07T20:10:48.4221332Z fbgemm.so => /__w/FBGEMM/FBGEMM/fbgemm_gpu/./_skbuild/linux-x86_64-3.12/cmake-build/fbgemm.so (0x00007f2c97c00000) 2025-05-07T20:10:48.4221450Z libnvidia-ml.so.1 => not found 2025-05-07T20:10:48.4221545Z libtorch.so => not found 2025-05-07T20:10:48.4221638Z libtorch_cpu.so => not found 2025-05-07T20:10:48.4221735Z libtorch_cuda.so => not found 2025-05-07T20:10:48.4221838Z libcudart.so.12 => not found 2025-05-07T20:10:48.4221927Z libc10.so => not found 2025-05-07T20:10:48.4222021Z libnvrtc.so.12 => not found 2025-05-07T20:10:48.4222156Z libc10_cuda.so => not found 2025-05-07T20:10:48.4222250Z libnccl.so.2 => not found 2025-05-07T20:10:48.4222337Z libcuda.so.1 => not found 2025-05-07T20:10:48.4222774Z fbgemm_gpu_tbe_utils.so => /__w/FBGEMM/FBGEMM/fbgemm_gpu/./_skbuild/linux-x86_64-3.12/cmake-build/fbgemm_gpu_tbe_utils.so (0x00007f2c96a00000) 2025-05-07T20:10:48.4222888Z libnvidia-ml.so.1 => not found 2025-05-07T20:10:48.4222979Z libtorch.so => not found 2025-05-07T20:10:48.4223074Z libtorch_cpu.so => not found 2025-05-07T20:10:48.4223188Z libtorch_cuda.so => not found 2025-05-07T20:10:48.4223281Z libcudart.so.12 => not found 2025-05-07T20:10:48.4223371Z libtorch.so => not found 2025-05-07T20:10:48.4223458Z libc10.so => not found 2025-05-07T20:10:48.4223565Z libnvrtc.so.12 => not found 2025-05-07T20:10:48.4223656Z libc10_cuda.so => not found 2025-05-07T20:10:48.4223746Z libnccl.so.2 => not found 2025-05-07T20:10:48.4223852Z libcuda.so.1 => not found 2025-05-07T20:10:48.4223950Z libnvidia-ml.so.1 => not found 2025-05-07T20:10:48.4224045Z libtorch_cpu.so => not found 2025-05-07T20:10:48.4224140Z libtorch_cuda.so => not found 2025-05-07T20:10:48.4224329Z libpthread.so.0 => /lib64/libpthread.so.0 (0x00007f2c9ae5b000) 2025-05-07T20:10:48.4224416Z libc10.so => not found 2025-05-07T20:10:48.4224508Z libnvrtc.so.12 => not found 2025-05-07T20:10:48.4224616Z libc10_cuda.so => not found 2025-05-07T20:10:48.4224735Z libnccl.so.2 => not found 2025-05-07T20:10:48.4224825Z libcuda.so.1 => not found 2025-05-07T20:10:48.4225169Z asmjit.so => /__w/FBGEMM/FBGEMM/fbgemm_gpu/./_skbuild/linux-x86_64-3.12/cmake-build/asmjit.so (0x00007f2c9a389000) 2025-05-07T20:10:48.4225282Z libnvidia-ml.so.1 => not found 2025-05-07T20:10:48.4225374Z libtorch.so => not found 2025-05-07T20:10:48.4225469Z libtorch_cpu.so => not found 2025-05-07T20:10:48.4225584Z libtorch_cuda.so => not found 2025-05-07T20:10:48.4225674Z libtorch.so => not found 2025-05-07T20:10:48.4225762Z libc10.so => not found 2025-05-07T20:10:48.4225874Z libnvrtc.so.12 => not found 2025-05-07T20:10:48.4225965Z libc10_cuda.so => not found 2025-05-07T20:10:48.4226082Z libnccl.so.2 => not found 2025-05-07T20:10:48.4226172Z libcuda.so.1 => not found 2025-05-07T20:10:48.4226285Z libnvidia-ml.so.1 => not found 2025-05-07T20:10:48.4226379Z libtorch_cpu.so => not found 2025-05-07T20:10:48.4226475Z libtorch_cuda.so => not found 2025-05-07T20:10:48.4226586Z libcudart.so.12 => not found 2025-05-07T20:10:48.4226679Z libtorch.so => not found 2025-05-07T20:10:48.4226767Z libc10.so => not found 2025-05-07T20:10:48.4226861Z libnvrtc.so.12 => not found 2025-05-07T20:10:48.4226969Z libc10_cuda.so => not found 2025-05-07T20:10:48.4227060Z libnccl.so.2 => not found 2025-05-07T20:10:48.4227151Z libcuda.so.1 => not found 2025-05-07T20:10:48.4227261Z libnvidia-ml.so.1 => not found 2025-05-07T20:10:48.4227353Z libtorch_cpu.so => not found 2025-05-07T20:10:48.4227447Z libtorch_cuda.so => not found 2025-05-07T20:10:48.4227577Z librt.so.1 => /lib64/librt.so.1 (0x00007f2c9a384000) 2025-05-07T20:10:48.4227583Z 2025-05-07T20:10:48.4227712Z [CHECK] Displaying ELF information: 2025-05-07T20:10:48.4227988Z + readelf -d ./_skbuild/linux-x86_64-3.12/cmake-build/fbgemm_gpu_tbe_training_backward_pt2.so 2025-05-07T20:10:48.4227993Z 2025-05-07T20:10:48.4262712Z 2025-05-07T20:10:48.4263576Z Dynamic section at offset 0xa44058 contains 42 entries: 2025-05-07T20:10:48.4263748Z Tag Type Name/Value 2025-05-07T20:10:48.4264208Z 0x0000000000000001 (NEEDED) Shared library: [libc10.so] 2025-05-07T20:10:48.4264473Z 0x0000000000000001 (NEEDED) Shared library: [libnvrtc.so.12] 2025-05-07T20:10:48.4264970Z 0x0000000000000001 (NEEDED) Shared library: [libc10_cuda.so] 2025-05-07T20:10:48.4265197Z 0x0000000000000001 (NEEDED) Shared library: [libnccl.so.2] 2025-05-07T20:10:48.4265448Z 0x0000000000000001 (NEEDED) Shared library: [libcuda.so.1] 2025-05-07T20:10:48.4265881Z 0x0000000000000001 (NEEDED) Shared library: [fbgemm_gpu_tbe_cache.so] 2025-05-07T20:10:48.4266129Z 0x0000000000000001 (NEEDED) Shared library: [fbgemm_gpu_tbe_common.so] 2025-05-07T20:10:48.4266425Z 0x0000000000000001 (NEEDED) Shared library: [fbgemm_gpu_sparse_async_cumsum.so] 2025-05-07T20:10:48.4266654Z 0x0000000000000001 (NEEDED) Shared library: [libnvidia-ml.so.1] 2025-05-07T20:10:48.4266867Z 0x0000000000000001 (NEEDED) Shared library: [libtorch.so] 2025-05-07T20:10:48.4267129Z 0x0000000000000001 (NEEDED) Shared library: [fbgemm_gpu_config.so] 2025-05-07T20:10:48.4267352Z 0x0000000000000001 (NEEDED) Shared library: [libtorch_cpu.so] 2025-05-07T20:10:48.4267571Z 0x0000000000000001 (NEEDED) Shared library: [libtorch_cuda.so] 2025-05-07T20:10:48.4267788Z 0x0000000000000001 (NEEDED) Shared library: [libstdc++.so.6] 2025-05-07T20:10:48.4268024Z 0x0000000000000001 (NEEDED) Shared library: [libgcc_s.so.1] 2025-05-07T20:10:48.4268225Z 0x0000000000000001 (NEEDED) Shared library: [libc.so.6] 2025-05-07T20:10:48.4268509Z 0x0000000000000001 (NEEDED) Shared library: [ld-linux-x86-64.so.2] 2025-05-07T20:10:48.4268805Z 0x000000000000000e (SONAME) Library soname: [fbgemm_gpu_tbe_training_backward_pt2.so] 2025-05-07T20:10:48.4269007Z 0x000000000000000f (RPATH) Library rpath: [$ORIGIN] 2025-05-07T20:10:48.4269199Z 0x000000000000000c (INIT) 0x190000 2025-05-07T20:10:48.4269538Z 0x000000000000000d (FINI) 0x8ac368 2025-05-07T20:10:48.4269664Z 0x0000000000000019 (INIT_ARRAY) 0xa37c40 2025-05-07T20:10:48.4269809Z 0x000000000000001b (INIT_ARRAYSZ) 256 (bytes) 2025-05-07T20:10:48.4269970Z 0x000000000000001a (FINI_ARRAY) 0xa37d40 2025-05-07T20:10:48.4270102Z 0x000000000000001c (FINI_ARRAYSZ) 8 (bytes) 2025-05-07T20:10:48.4270233Z 0x000000006ffffef5 (GNU_HASH) 0x200 2025-05-07T20:10:48.4270394Z 0x0000000000000005 (STRTAB) 0x23008 2025-05-07T20:10:48.4270523Z 0x0000000000000006 (SYMTAB) 0x93e8 2025-05-07T20:10:48.4270739Z 0x000000000000000a (STRSZ) 1248185 (bytes) 2025-05-07T20:10:48.4270878Z 0x000000000000000b (SYMENT) 24 (bytes) 2025-05-07T20:10:48.4271035Z 0x0000000000000003 (PLTGOT) 0xa47fe8 2025-05-07T20:10:48.4271190Z 0x0000000000000002 (PLTRELSZ) 42648 (bytes) 2025-05-07T20:10:48.4271413Z 0x0000000000000014 (PLTREL) RELA 2025-05-07T20:10:48.4271574Z 0x0000000000000017 (JMPREL) 0x184d90 2025-05-07T20:10:48.4271704Z 0x0000000000000007 (RELA) 0x155f30 2025-05-07T20:10:48.4271849Z 0x0000000000000008 (RELASZ) 192096 (bytes) 2025-05-07T20:10:48.4272013Z 0x0000000000000009 (RELAENT) 24 (bytes) 2025-05-07T20:10:48.4272150Z 0x000000006ffffffe (VERNEED) 0x155e20 2025-05-07T20:10:48.4272272Z 0x000000006fffffff (VERNEEDNUM) 4 2025-05-07T20:10:48.4272407Z 0x000000006ffffff0 (VERSYM) 0x153bc2 2025-05-07T20:10:48.4272556Z 0x000000006ffffff9 (RELACOUNT) 34 2025-05-07T20:10:48.4272674Z 0x0000000000000000 (NULL) 0x0 2025-05-07T20:10:48.4272681Z 2025-05-07T20:10:48.4272804Z ################################################################################ 2025-05-07T20:10:48.4272809Z 2025-05-07T20:10:48.4272813Z 2025-05-07T20:10:48.4272952Z ################################################################################ 2025-05-07T20:10:48.4273358Z [CHECK] BUILT LIBRARY: ./_skbuild/linux-x86_64-3.12/cmake-build/fbgemm_gpu_tbe_inference.so 2025-05-07T20:10:48.4273479Z [CHECK] Listing out library size: 2025-05-07T20:10:48.4273790Z + du -h --block-size=1M ./_skbuild/linux-x86_64-3.12/cmake-build/fbgemm_gpu_tbe_inference.so 2025-05-07T20:10:48.4273795Z 2025-05-07T20:10:48.4277525Z 429 ./_skbuild/linux-x86_64-3.12/cmake-build/fbgemm_gpu_tbe_inference.so 2025-05-07T20:10:48.4278356Z 2025-05-07T20:10:48.4281011Z [CHECK] Listing out the GLIBC versions referenced by: ./_skbuild/linux-x86_64-3.12/cmake-build/fbgemm_gpu_tbe_inference.so 2025-05-07T20:10:48.4282566Z + objdump -TC ./_skbuild/linux-x86_64-3.12/cmake-build/fbgemm_gpu_tbe_inference.so | grep GLIBC_ | sed 's/.*GLIBC_\([.0-9]*\).*/GLIBC_\1/g' | sort -Vu | cat 2025-05-07T20:10:48.4282598Z 2025-05-07T20:10:48.4682150Z GLIBC_2.2.5 2025-05-07T20:10:48.4682448Z GLIBC_2.14 2025-05-07T20:10:48.4683401Z 2025-05-07T20:10:48.4683446Z 2025-05-07T20:10:48.4684906Z [CHECK] Listing out the GLIBCXX versions referenced by: ./_skbuild/linux-x86_64-3.12/cmake-build/fbgemm_gpu_tbe_inference.so 2025-05-07T20:10:48.4685468Z + objdump -TC ./_skbuild/linux-x86_64-3.12/cmake-build/fbgemm_gpu_tbe_inference.so | grep GLIBCXX_ | sed 's/.*GLIBCXX_\([.0-9]*\).*/GLIBCXX_\1/g' | sort -Vu | cat 2025-05-07T20:10:48.4685474Z 2025-05-07T20:10:48.5084435Z GLIBCXX_3.4 2025-05-07T20:10:48.5084705Z GLIBCXX_3.4.9 2025-05-07T20:10:48.5084997Z GLIBCXX_3.4.11 2025-05-07T20:10:48.5085248Z GLIBCXX_3.4.14 2025-05-07T20:10:48.5085508Z GLIBCXX_3.4.18 2025-05-07T20:10:48.5085745Z GLIBCXX_3.4.20 2025-05-07T20:10:48.5086037Z GLIBCXX_3.4.21 2025-05-07T20:10:48.5086052Z 2025-05-07T20:10:48.5086057Z 2025-05-07T20:10:48.5108172Z + nm -gDC ./_skbuild/linux-x86_64-3.12/cmake-build/fbgemm_gpu_tbe_inference.so > /tmp/tmp.17Gu8DfxEP.symbols.txt 2025-05-07T20:10:48.5108218Z 2025-05-07T20:10:48.5461882Z 2025-05-07T20:10:48.5489724Z [CHECK] Total Number of symbols: 5083 2025-05-07T20:10:48.5513853Z [CHECK] Number of fbgemm symbols: 3788 2025-05-07T20:10:48.5532227Z + nm -gDCu ./_skbuild/linux-x86_64-3.12/cmake-build/fbgemm_gpu_tbe_inference.so > /tmp/tmp.5bMRZESWkj.usymbols.txt 2025-05-07T20:10:48.5532259Z 2025-05-07T20:10:48.5564005Z 2025-05-07T20:10:48.5597041Z [CHECK] Listing out undefined symbols (246 total): 2025-05-07T20:10:48.5622438Z U VTT for std::__cxx11::basic_ostringstream, std::allocator >@GLIBCXX_3.4.21 2025-05-07T20:10:48.5623541Z U VTT for std::__cxx11::basic_stringstream, std::allocator >@GLIBCXX_3.4.21 2025-05-07T20:10:48.5624183Z U _Unwind_Resume@GCC_3.0 2025-05-07T20:10:48.5624617Z U __cudaPopCallConfiguration@libcudart.so.12 2025-05-07T20:10:48.5625229Z U __cudaPushCallConfiguration@libcudart.so.12 2025-05-07T20:10:48.5625614Z U __cudaRegisterFatBinary@libcudart.so.12 2025-05-07T20:10:48.5626026Z U __cudaRegisterFatBinaryEnd@libcudart.so.12 2025-05-07T20:10:48.5626404Z U __cudaRegisterFunction@libcudart.so.12 2025-05-07T20:10:48.5626782Z U __cudaRegisterVar@libcudart.so.12 2025-05-07T20:10:48.5627181Z U __cudaUnregisterFatBinary@libcudart.so.12 2025-05-07T20:10:48.5627543Z U __cxa_allocate_exception@CXXABI_1.3 2025-05-07T20:10:48.5627670Z U __cxa_atexit@GLIBC_2.2.5 2025-05-07T20:10:48.5627786Z U __cxa_begin_catch@CXXABI_1.3 2025-05-07T20:10:48.5627901Z U __cxa_end_catch@CXXABI_1.3 2025-05-07T20:10:48.5628043Z U __cxa_free_exception@CXXABI_1.3 2025-05-07T20:10:48.5628155Z U __cxa_guard_abort@CXXABI_1.3 2025-05-07T20:10:48.5628268Z U __cxa_guard_acquire@CXXABI_1.3 2025-05-07T20:10:48.5628379Z U __cxa_guard_release@CXXABI_1.3 2025-05-07T20:10:48.5628498Z U __cxa_rethrow@CXXABI_1.3 2025-05-07T20:10:48.5628606Z U __cxa_throw@CXXABI_1.3 2025-05-07T20:10:48.5628803Z U __gxx_personality_v0@CXXABI_1.3 2025-05-07T20:10:48.5629035Z U at::TensorBase::__dispatch_contiguous(c10::MemoryFormat) const 2025-05-07T20:10:48.5629604Z U at::_ops::arange_start::call(c10::Scalar const&, c10::Scalar const&, std::optional, std::optional, std::optional, std::optional) 2025-05-07T20:10:48.5630254Z U at::_ops::empty_like::call(at::Tensor const&, std::optional, std::optional, std::optional, std::optional, std::optional) 2025-05-07T20:10:48.5630948Z U at::_ops::empty_memory_format::call(c10::ArrayRef, std::optional, std::optional, std::optional, std::optional, std::optional) 2025-05-07T20:10:48.5631129Z U at::_ops::fill__Scalar::call(at::Tensor&, c10::Scalar const&) 2025-05-07T20:10:48.5631789Z U at::_ops::scalar_tensor::call(c10::Scalar const&, std::optional, std::optional, std::optional, std::optional) 2025-05-07T20:10:48.5631980Z U at::_ops::select_int::call(at::Tensor const&, long, c10::SymInt) 2025-05-07T20:10:48.5632394Z U at::_ops::to_dtype::call(at::Tensor const&, c10::ScalarType, bool, bool, std::optional) 2025-05-07T20:10:48.5633063Z U at::_ops::to_dtype_layout::call(at::Tensor const&, std::optional, std::optional, std::optional, std::optional, bool, bool, std::optional) 2025-05-07T20:10:48.5633553Z U at::_ops::zeros::call(c10::ArrayRef, std::optional, std::optional, std::optional, std::optional) 2025-05-07T20:10:48.5633720Z U at::detail::getCUDAHooks() 2025-05-07T20:10:48.5633856Z U at::detail::getHIPHooks() 2025-05-07T20:10:48.5633965Z U at::get_thread_num() 2025-05-07T20:10:48.5634077Z U at::globalContext() 2025-05-07T20:10:48.5634223Z U at::internal::set_thread_num(int) 2025-05-07T20:10:48.5634405Z U c10::AutogradMetaInterface::~AutogradMetaInterface() 2025-05-07T20:10:48.5634596Z U c10::BFloat16* at::TensorBase::data_ptr() const 2025-05-07T20:10:48.5634833Z U c10::BFloat16* at::TensorBase::mutable_data_ptr() const 2025-05-07T20:10:48.5635089Z U c10::ClassType::addMethod(torch::jit::Function*) 2025-05-07T20:10:48.5635475Z U c10::ClassType::getMethod(std::__cxx11::basic_string, std::allocator > const&) const 2025-05-07T20:10:48.5635670Z U c10::DeviceTypeName[abi:cxx11](c10::DeviceType, bool) 2025-05-07T20:10:48.5636349Z U c10::DictType::get(std::__cxx11::basic_string, std::allocator > const&, c10::Type::SingletonOrSharedTypePtr, c10::Type::SingletonOrSharedTypePtr) 2025-05-07T20:10:48.5636733Z U c10::Error::Error(c10::SourceLocation, std::__cxx11::basic_string, std::allocator >) 2025-05-07T20:10:48.5636866Z U c10::Error::what() const 2025-05-07T20:10:48.5636982Z U c10::GradMode::is_enabled() 2025-05-07T20:10:48.5637107Z U c10::GradMode::set_enabled(bool) 2025-05-07T20:10:48.5637288Z U c10::Half* at::TensorBase::data_ptr() const 2025-05-07T20:10:48.5637470Z U c10::Half* at::TensorBase::mutable_data_ptr() const 2025-05-07T20:10:48.5637635Z U c10::IValue::TagType::get(c10::IValue const&) 2025-05-07T20:10:48.5637903Z U c10::IValue::is(c10::IValue const&) const 2025-05-07T20:10:48.5638044Z U c10::IValue::isTensorList() const 2025-05-07T20:10:48.5638185Z U c10::IValue::reportToTensorTypeError() const 2025-05-07T20:10:48.5638303Z U c10::IntType::get() 2025-05-07T20:10:48.5638758Z U c10::ListType::get(std::__cxx11::basic_string, std::allocator > const&, c10::Type::SingletonOrSharedTypePtr) 2025-05-07T20:10:48.5638946Z U c10::MessageLogger::MessageLogger(char const*, int, int) 2025-05-07T20:10:48.5639085Z U c10::MessageLogger::~MessageLogger() 2025-05-07T20:10:48.5639188Z U c10::NoneType::get() 2025-05-07T20:10:48.5639407Z U c10::OptionalType::get(c10::Type::SingletonOrSharedTypePtr) 2025-05-07T20:10:48.5639557Z U c10::ParallelGuard::ParallelGuard(bool) 2025-05-07T20:10:48.5639678Z U c10::ParallelGuard::~ParallelGuard() 2025-05-07T20:10:48.5639841Z U c10::StorageImpl::throw_data_ptr_access_error() const 2025-05-07T20:10:48.5639965Z U c10::StringType::get() 2025-05-07T20:10:48.5640106Z U c10::SymBool::guard_bool(char const*, long) const 2025-05-07T20:10:48.5640253Z U c10::SymFloat::guard_float(char const*, long) const 2025-05-07T20:10:48.5640660Z U c10::SymInt::SymInt(c10::intrusive_ptr >) 2025-05-07T20:10:48.5640798Z U c10::SymInt::guard_int(char const*, long) const 2025-05-07T20:10:48.5640918Z U c10::SymInt::promote_to_negative() 2025-05-07T20:10:48.5641085Z U c10::SymbolicShapeMeta::init_is_contiguous() const 2025-05-07T20:10:48.5641505Z U c10::TensorImpl::set_autograd_meta(std::unique_ptr >) 2025-05-07T20:10:48.5641635Z U c10::TensorType::get() 2025-05-07T20:10:48.5642399Z U c10::TupleType::TupleType(std::vector, std::allocator > >, std::optional, std::shared_ptr) 2025-05-07T20:10:48.5642522Z U c10::UndefinedTensorImpl::_singleton 2025-05-07T20:10:48.5643196Z U c10::Warning::Warning(std::variant, c10::SourceLocation const&, std::__cxx11::basic_string, std::allocator >, bool) 2025-05-07T20:10:48.5643448Z U c10::_fastEqualsForContainer(c10::IValue const&, c10::IValue const&) 2025-05-07T20:10:48.5643578Z U c10::cuda::CUDACachingAllocator::allocator 2025-05-07T20:10:48.5643697Z U c10::cuda::CUDAStream::stream() const 2025-05-07T20:10:48.5643835Z U c10::cuda::ExchangeDevice(signed char) 2025-05-07T20:10:48.5643952Z U c10::cuda::GetDevice(signed char*) 2025-05-07T20:10:48.5644072Z U c10::cuda::MaybeSetDevice(signed char) 2025-05-07T20:10:48.5644201Z U c10::cuda::SetDevice(signed char) 2025-05-07T20:10:48.5644442Z U c10::cuda::c10_cuda_check_implementation(int, char const*, char const*, int, bool) 2025-05-07T20:10:48.5644549Z U c10::cuda::device_count() 2025-05-07T20:10:48.5644703Z U c10::cuda::getCurrentCUDAStream(signed char) 2025-05-07T20:10:48.5644838Z U c10::cuda::getDefaultCUDAStream(signed char) 2025-05-07T20:10:48.5644981Z U c10::cuda::getStreamFromPool(bool, signed char) 2025-05-07T20:10:48.5645134Z U c10::cuda::getStreamFromPool(int, signed char) 2025-05-07T20:10:48.5645289Z U c10::cuda::setCurrentCUDAStream(c10::cuda::CUDAStream) 2025-05-07T20:10:48.5645404Z U c10::cuda::warn_or_error_on_sync() 2025-05-07T20:10:48.5645861Z U c10::detail::ListImpl::ListImpl(std::vector >, c10::Type::SingletonOrSharedTypePtr) 2025-05-07T20:10:48.5646357Z U c10::detail::infer_schema::make_function_schema(c10::ArrayRef, c10::ArrayRef) 2025-05-07T20:10:48.5647359Z U c10::detail::infer_schema::make_function_schema(std::__cxx11::basic_string, std::allocator >&&, std::__cxx11::basic_string, std::allocator >&&, c10::ArrayRef, c10::ArrayRef) 2025-05-07T20:10:48.5647626Z U c10::detail::torchCheckFail(char const*, char const*, unsigned int, char const*) 2025-05-07T20:10:48.5648097Z U c10::detail::torchCheckFail(char const*, char const*, unsigned int, std::__cxx11::basic_string, std::allocator > const&) 2025-05-07T20:10:48.5648440Z U c10::detail::torchInternalAssertFail(char const*, char const*, unsigned int, char const*, char const*) 2025-05-07T20:10:48.5648991Z U c10::detail::torchInternalAssertFail(char const*, char const*, unsigned int, char const*, std::__cxx11::basic_string, std::allocator > const&) 2025-05-07T20:10:48.5649151Z U c10::getCustomClassTypeImpl(std::type_index const&) 2025-05-07T20:10:48.5649275Z U c10::get_default_dtype() 2025-05-07T20:10:48.5649543Z U c10::impl::ExcludeDispatchKeyGuard::ExcludeDispatchKeyGuard(c10::DispatchKeySet) 2025-05-07T20:10:48.5649734Z U c10::impl::ExcludeDispatchKeyGuard::~ExcludeDispatchKeyGuard() 2025-05-07T20:10:48.5649869Z U c10::impl::GPUTrace::gpuTraceState 2025-05-07T20:10:48.5650005Z U c10::impl::GPUTrace::haveState 2025-05-07T20:10:48.5650129Z U c10::impl::device_guard_impl_registry 2025-05-07T20:10:48.5650503Z U c10::ivalue::ConstantString::create(std::__cxx11::basic_string, std::allocator >) 2025-05-07T20:10:48.5650667Z U c10::ivalue::Future::extractStorages(c10::IValue const&) 2025-05-07T20:10:48.5650814Z U c10::ivalue::Object::resizeObject(unsigned long) 2025-05-07T20:10:48.5651071Z U c10::ivalue::checkCustomClassType(c10::ClassType const*, c10::Type const*) 2025-05-07T20:10:48.5651245Z U c10::operator<<(std::ostream&, c10::Device const&) 2025-05-07T20:10:48.5651378Z U c10::operator<<(std::ostream&, c10::DeviceType) 2025-05-07T20:10:48.5651554Z U c10::operator<<(std::ostream&, c10::FunctionSchema const&) 2025-05-07T20:10:48.5651662Z U c10::warn(c10::Warning const&) 2025-05-07T20:10:48.5651853Z U caffe2::TypeMeta::error_unsupported_typemeta(caffe2::TypeMeta) 2025-05-07T20:10:48.5651974Z U caffe2::TypeMeta::typeMetaDatas() 2025-05-07T20:10:48.5652120Z U cudaDeviceSynchronize@libcudart.so.12 2025-05-07T20:10:48.5652253Z U cudaEventCreateWithFlags@libcudart.so.12 2025-05-07T20:10:48.5652370Z U cudaEventDestroy@libcudart.so.12 2025-05-07T20:10:48.5652509Z U cudaEventElapsedTime@libcudart.so.12 2025-05-07T20:10:48.5652627Z U cudaEventQuery@libcudart.so.12 2025-05-07T20:10:48.5652741Z U cudaEventRecord@libcudart.so.12 2025-05-07T20:10:48.5652884Z U cudaEventSynchronize@libcudart.so.12 2025-05-07T20:10:48.5653003Z U cudaGetErrorString@libcudart.so.12 2025-05-07T20:10:48.5653116Z U cudaGetLastError@libcudart.so.12 2025-05-07T20:10:48.5653227Z U cudaLaunchKernel@libcudart.so.12 2025-05-07T20:10:48.5653358Z U cudaStreamQuery@libcudart.so.12 2025-05-07T20:10:48.5653513Z U cudaStreamSynchronize@libcudart.so.12 2025-05-07T20:10:48.5653636Z U cudaStreamWaitEvent@libcudart.so.12 2025-05-07T20:10:48.5654368Z U fbgemm::EmbeddingSpMDMKernelSignature::Type fbgemm::GenerateEmbeddingSpMDMWithStrides(long, bool, bool, int, bool, bool, long, long, bool, bool, bool, bool) 2025-05-07T20:10:48.5655164Z U fbgemm::EmbeddingSpMDMKernelSignature::Type fbgemm::GenerateEmbeddingSpMDMWithStrides(long, bool, bool, int, bool, bool, long, long, bool, bool, bool, bool) 2025-05-07T20:10:48.5655972Z U fbgemm::EmbeddingSpMDMKernelSignature::Type fbgemm::GenerateEmbeddingSpMDMWithStrides(long, bool, bool, int, bool, bool, long, long, bool, bool, bool, bool) 2025-05-07T20:10:48.5656701Z U fbgemm::EmbeddingSpMDMKernelSignature::Type fbgemm::GenerateEmbeddingSpMDMWithStrides(long, bool, bool, int, bool, bool, long, long, bool, bool, bool, bool) 2025-05-07T20:10:48.5657500Z U fbgemm::EmbeddingSpMDMKernelSignature::Type fbgemm::GenerateEmbeddingSpMDMWithStrides(long, bool, bool, int, bool, bool, long, long, bool, bool, bool, bool) 2025-05-07T20:10:48.5658306Z U fbgemm::EmbeddingSpMDMKernelSignature::Type fbgemm::GenerateEmbeddingSpMDMWithStrides(long, bool, bool, int, bool, bool, long, long, bool, bool, bool, bool) 2025-05-07T20:10:48.5658973Z U fbgemm::EmbeddingSpMDMKernelSignature::Type fbgemm::GenerateEmbeddingSpMDMFP8WithStrides(long, bool, bool, bool, long, long, int, int, bool) 2025-05-07T20:10:48.5659745Z U fbgemm::EmbeddingSpMDMKernelSignature::Type fbgemm::GenerateEmbeddingSpMDMNBitWithStrides(int, long, bool, bool, int, bool, bool, long, long, bool, bool, bool, int) 2025-05-07T20:10:48.5660525Z U fbgemm::EmbeddingSpMDMKernelSignature::Type fbgemm::GenerateEmbeddingSpMDMWithStrides(long, bool, bool, int, bool, bool, long, long, bool, bool, bool, bool) 2025-05-07T20:10:48.5661334Z U fbgemm::EmbeddingSpMDMKernelSignature::Type fbgemm::GenerateEmbeddingSpMDMNBitWithStrides(int, long, bool, bool, int, bool, bool, long, long, bool, bool, bool, int) 2025-05-07T20:10:48.5662199Z U fbgemm::EmbeddingSpMDMKernelSignature::Type fbgemm::GenerateEmbeddingSpMDMWithStrides(long, bool, bool, int, bool, bool, long, long, bool, bool, bool, bool) 2025-05-07T20:10:48.5662944Z U fbgemm::EmbeddingSpMDMKernelSignature::Type fbgemm::GenerateEmbeddingSpMDMFP8WithStrides(long, bool, bool, bool, long, long, int, int, bool) 2025-05-07T20:10:48.5663776Z U fbgemm::EmbeddingSpMDMKernelSignature::Type fbgemm::GenerateEmbeddingSpMDMNBitWithStrides(int, long, bool, bool, int, bool, bool, long, long, bool, bool, bool, int) 2025-05-07T20:10:48.5664622Z U fbgemm::EmbeddingSpMDMKernelSignature::Type fbgemm::GenerateEmbeddingSpMDMWithStrides(long, bool, bool, int, bool, bool, long, long, bool, bool, bool, bool) 2025-05-07T20:10:48.5665788Z U fbgemm::EmbeddingSpMDMKernelSignature::Type fbgemm::GenerateEmbeddingSpMDMFP8WithStrides(long, bool, bool, bool, long, long, int, int, bool) 2025-05-07T20:10:48.5666611Z U fbgemm::EmbeddingSpMDMKernelSignature::Type fbgemm::GenerateEmbeddingSpMDMNBitWithStrides(int, long, bool, bool, int, bool, bool, long, long, bool, bool, bool, int) 2025-05-07T20:10:48.5667497Z U fbgemm::EmbeddingSpMDMKernelSignature::Type fbgemm::GenerateEmbeddingSpMDMWithStrides(long, bool, bool, int, bool, bool, long, long, bool, bool, bool, bool) 2025-05-07T20:10:48.5668404Z U fbgemm::EmbeddingSpMDMKernelSignature::Type fbgemm::GenerateEmbeddingSpMDMNBitWithStrides(int, long, bool, bool, int, bool, bool, long, long, bool, bool, bool, int) 2025-05-07T20:10:48.5669324Z U fbgemm::EmbeddingSpMDMKernelSignature::Type fbgemm::GenerateEmbeddingSpMDMWithStrides(long, bool, bool, int, bool, bool, long, long, bool, bool, bool, bool) 2025-05-07T20:10:48.5670150Z U fbgemm::EmbeddingSpMDMKernelSignature::Type fbgemm::GenerateEmbeddingSpMDMFP8WithStrides(long, bool, bool, bool, long, long, int, int, bool) 2025-05-07T20:10:48.5671054Z U fbgemm::EmbeddingSpMDMKernelSignature::Type fbgemm::GenerateEmbeddingSpMDMNBitWithStrides(int, long, bool, bool, int, bool, bool, long, long, bool, bool, bool, int) 2025-05-07T20:10:48.5672082Z U fbgemm::EmbeddingSpMDMKernelSignature::Type fbgemm::GenerateEmbeddingSpMDMWithStrides(long, bool, bool, int, bool, bool, long, long, bool, bool, bool, bool) 2025-05-07T20:10:48.5672991Z U fbgemm::EmbeddingSpMDMKernelSignature::Type fbgemm::GenerateEmbeddingSpMDMWithStrides(long, bool, bool, int, bool, bool, long, long, bool, bool, bool, bool) 2025-05-07T20:10:48.5673919Z U fbgemm::EmbeddingSpMDMKernelSignature::Type fbgemm::GenerateEmbeddingSpMDMWithStrides(long, bool, bool, int, bool, bool, long, long, bool, bool, bool, bool) 2025-05-07T20:10:48.5674875Z U fbgemm::EmbeddingSpMDMKernelSignature::Type fbgemm::GenerateEmbeddingSpMDMWithStrides(long, bool, bool, int, bool, bool, long, long, bool, bool, bool, bool) 2025-05-07T20:10:48.5675750Z U fbgemm::EmbeddingSpMDMKernelSignature::Type fbgemm::GenerateEmbeddingSpMDMWithStrides(long, bool, bool, int, bool, bool, long, long, bool, bool, bool, bool) 2025-05-07T20:10:48.5676678Z U fbgemm::EmbeddingSpMDMKernelSignature::Type fbgemm::GenerateEmbeddingSpMDMWithStrides(long, bool, bool, int, bool, bool, long, long, bool, bool, bool, bool) 2025-05-07T20:10:48.5677644Z U fbgemm::EmbeddingSpMDMKernelSignature::Type fbgemm::GenerateEmbeddingSpMDMWithStrides(long, bool, bool, int, bool, bool, long, long, bool, bool, bool, bool) 2025-05-07T20:10:48.5677801Z U float* at::TensorBase::data_ptr() const 2025-05-07T20:10:48.5677971Z U float* at::TensorBase::mutable_data_ptr() const 2025-05-07T20:10:48.5678146Z U int* at::TensorBase::data_ptr() const 2025-05-07T20:10:48.5678301Z U int* at::TensorBase::mutable_data_ptr() const 2025-05-07T20:10:48.5678724Z U linearize_cache_indices_cuda(at::Tensor const&, at::Tensor const&, at::Tensor const&, std::optional const&, long, long) 2025-05-07T20:10:48.5678922Z U long c10::detail::maybe_wrap_dim_slow(long, long, bool) 2025-05-07T20:10:48.5679088Z U long* at::TensorBase::data_ptr() const 2025-05-07T20:10:48.5679242Z U long* at::TensorBase::mutable_data_ptr() const 2025-05-07T20:10:48.5679854Z U lru_cache_populate_byte_cuda(at::Tensor, at::Tensor, long, at::Tensor, at::Tensor, at::Tensor, at::Tensor, at::Tensor, at::Tensor, at::Tensor, long, at::Tensor, long, bool, std::optional) 2025-05-07T20:10:48.5680291Z U lxu_cache_lookup_cuda(at::Tensor, at::Tensor, long, bool, std::optional, std::optional, std::optional) 2025-05-07T20:10:48.5680396Z U memchr@GLIBC_2.2.5 2025-05-07T20:10:48.5680511Z U memcpy@GLIBC_2.14 2025-05-07T20:10:48.5680614Z U memmove@GLIBC_2.2.5 2025-05-07T20:10:48.5680708Z U memset@GLIBC_2.2.5 2025-05-07T20:10:48.5680828Z U operator delete(void*)@GLIBCXX_3.4 2025-05-07T20:10:48.5680966Z U operator new(unsigned long)@GLIBCXX_3.4 2025-05-07T20:10:48.5681193Z U std::_Hash_bytes(void const*, unsigned long, unsigned long)@CXXABI_1.3.5 2025-05-07T20:10:48.5681546Z U std::__cxx11::basic_ostringstream, std::allocator >::basic_ostringstream() 2025-05-07T20:10:48.5681956Z U std::__cxx11::basic_ostringstream, std::allocator >::~basic_ostringstream()@GLIBCXX_3.4.21 2025-05-07T20:10:48.5682320Z U std::__cxx11::basic_stringstream, std::allocator >::basic_stringstream() 2025-05-07T20:10:48.5683091Z U std::__cxx11::basic_stringstream, std::allocator >::basic_stringstream(std::__cxx11::basic_string, std::allocator > const&, std::_Ios_Openmode)@GLIBCXX_3.4.21 2025-05-07T20:10:48.5683475Z U std::__cxx11::basic_stringstream, std::allocator >::~basic_stringstream()@GLIBCXX_3.4.21 2025-05-07T20:10:48.5683859Z U std::__detail::_Prime_rehash_policy::_M_need_rehash(unsigned long, unsigned long, unsigned long) const@GLIBCXX_3.4.18 2025-05-07T20:10:48.5684051Z U std::__exception_ptr::exception_ptr::_M_addref() 2025-05-07T20:10:48.5684195Z U std::__exception_ptr::exception_ptr::_M_release() 2025-05-07T20:10:48.5684322Z U std::__throw_bad_alloc()@GLIBCXX_3.4 2025-05-07T20:10:48.5684460Z U std::__throw_bad_array_new_length() 2025-05-07T20:10:48.5684597Z U std::__throw_bad_function_call()@GLIBCXX_3.4.14 2025-05-07T20:10:48.5684736Z U std::__throw_length_error(char const*)@GLIBCXX_3.4 2025-05-07T20:10:48.5684883Z U std::__throw_logic_error(char const*)@GLIBCXX_3.4 2025-05-07T20:10:48.5685057Z U std::__throw_out_of_range_fmt(char const*, ...)@GLIBCXX_3.4.20 2025-05-07T20:10:48.5685188Z U std::__throw_system_error(int)@GLIBCXX_3.4.11 2025-05-07T20:10:48.5685437Z U std::basic_ios >::clear(std::_Ios_Iostate)@GLIBCXX_3.4 2025-05-07T20:10:48.5686016Z U std::basic_ostream >& std::__ostream_insert >(std::basic_ostream >&, char const*, long)@GLIBCXX_3.4.9 2025-05-07T20:10:48.5686201Z U std::condition_variable::condition_variable()@GLIBCXX_3.4.11 2025-05-07T20:10:48.5686388Z U std::condition_variable::notify_all()@GLIBCXX_3.4.11 2025-05-07T20:10:48.5686572Z U std::condition_variable::~condition_variable()@GLIBCXX_3.4.11 2025-05-07T20:10:48.5686693Z U std::current_exception()@CXXABI_1.3.3 2025-05-07T20:10:48.5686827Z U std::exception::~exception()@GLIBCXX_3.4 2025-05-07T20:10:48.5686947Z U std::ios_base::Init::Init()@GLIBCXX_3.4 2025-05-07T20:10:48.5687086Z U std::ios_base::Init::~Init()@GLIBCXX_3.4 2025-05-07T20:10:48.5687211Z U std::ios_base::~ios_base()@GLIBCXX_3.4 2025-05-07T20:10:48.5687401Z U std::istream& std::istream::_M_extract(long&)@GLIBCXX_3.4.9 2025-05-07T20:10:48.5687513Z U std::locale::~locale()@GLIBCXX_3.4 2025-05-07T20:10:48.5687941Z U std::logic_error::logic_error(std::__cxx11::basic_string, std::allocator > const&)@GLIBCXX_3.4.21 2025-05-07T20:10:48.5688073Z U std::logic_error::~logic_error()@GLIBCXX_3.4 2025-05-07T20:10:48.5688256Z U std::ostream& std::ostream::_M_insert(long)@GLIBCXX_3.4.9 2025-05-07T20:10:48.5688507Z U std::ostream& std::ostream::_M_insert(unsigned long)@GLIBCXX_3.4.9 2025-05-07T20:10:48.5688630Z U std::ostream::operator<<(int)@GLIBCXX_3.4 2025-05-07T20:10:48.5688794Z U std::out_of_range::out_of_range(char const*)@GLIBCXX_3.4.21 2025-05-07T20:10:48.5688939Z U std::out_of_range::~out_of_range()@GLIBCXX_3.4 2025-05-07T20:10:48.5689168Z U std::rethrow_exception(std::__exception_ptr::exception_ptr)@CXXABI_1.3.3 2025-05-07T20:10:48.5689346Z U std::runtime_error::runtime_error(char const*)@GLIBCXX_3.4.21 2025-05-07T20:10:48.5689493Z U std::runtime_error::~runtime_error()@GLIBCXX_3.4 2025-05-07T20:10:48.5689623Z U std::terminate()@GLIBCXX_3.4 2025-05-07T20:10:48.5689721Z U strcmp@GLIBC_2.2.5 2025-05-07T20:10:48.5689815Z U strlen@GLIBC_2.2.5 2025-05-07T20:10:48.5689946Z U torch::CppFunction::~CppFunction() 2025-05-07T20:10:48.5690539Z U torch::Library::Library(torch::Library::Kind, std::__cxx11::basic_string, std::allocator >, std::optional, char const*, unsigned int) 2025-05-07T20:10:48.5691019Z U torch::Library::_def(c10::FunctionSchema&&, c10::OperatorName*, std::vector > const&, torch::_RegisterOrVerify) & 2025-05-07T20:10:48.5691308Z U torch::Library::_impl(char const*, torch::CppFunction&&, torch::_RegisterOrVerify) & 2025-05-07T20:10:48.5692390Z U torch::detail::class_base::class_base(std::__cxx11::basic_string, std::allocator > const&, std::__cxx11::basic_string, std::allocator > const&, std::__cxx11::basic_string, std::allocator >, std::type_info const&, std::type_info const&) 2025-05-07T20:10:48.5692768Z U torch::detail::class_base::withNewArguments(c10::FunctionSchema const&, std::initializer_list) 2025-05-07T20:10:48.5693135Z U torch::jit::parseSchema(std::__cxx11::basic_string, std::allocator > const&, bool) 2025-05-07T20:10:48.5693665Z U torch::registerCustomClassMethod(std::unique_ptr >) 2025-05-07T20:10:48.5693815Z U torch::serialize::InputArchive::InputArchive() 2025-05-07T20:10:48.5694123Z U torch::serialize::InputArchive::load_from(char const*, unsigned long, std::optional) 2025-05-07T20:10:48.5694608Z U torch::serialize::InputArchive::read(std::__cxx11::basic_string, std::allocator > const&, at::Tensor&, bool) 2025-05-07T20:10:48.5694923Z U torch::serialize::OutputArchive::OutputArchive(std::shared_ptr) 2025-05-07T20:10:48.5695202Z U torch::serialize::OutputArchive::save_to(std::ostream&) 2025-05-07T20:10:48.5695664Z U torch::serialize::OutputArchive::write(std::__cxx11::basic_string, std::allocator > const&, at::Tensor const&, bool) 2025-05-07T20:10:48.5695784Z U typeinfo for c10::Error 2025-05-07T20:10:48.5695905Z U typeinfo for std::exception@GLIBCXX_3.4 2025-05-07T20:10:48.5696031Z U typeinfo for std::logic_error@GLIBCXX_3.4 2025-05-07T20:10:48.5696153Z U typeinfo for std::out_of_range@GLIBCXX_3.4 2025-05-07T20:10:48.5696274Z U typeinfo for std::runtime_error@GLIBCXX_3.4 2025-05-07T20:10:48.5696457Z U unsigned char* at::TensorBase::data_ptr() const 2025-05-07T20:10:48.5696661Z U unsigned char* at::TensorBase::mutable_data_ptr() const 2025-05-07T20:10:48.5696804Z U vtable for __cxxabiv1::__class_type_info@CXXABI_1.3 2025-05-07T20:10:48.5696957Z U vtable for __cxxabiv1::__function_type_info@CXXABI_1.3 2025-05-07T20:10:48.5697102Z U vtable for __cxxabiv1::__si_class_type_info@CXXABI_1.3 2025-05-07T20:10:48.5697195Z U vtable for c10::Error 2025-05-07T20:10:48.5697510Z U vtable for std::__cxx11::basic_stringbuf, std::allocator >@GLIBCXX_3.4.21 2025-05-07T20:10:48.5697728Z U vtable for std::basic_streambuf >@GLIBCXX_3.4 2025-05-07T20:10:48.5697850Z U vtable for torch::autograd::AutogradMeta 2025-05-07T20:10:48.5697965Z w _ITM_deregisterTMCloneTable 2025-05-07T20:10:48.5698090Z w _ITM_registerTMCloneTable 2025-05-07T20:10:48.5698191Z w __cxa_finalize@GLIBC_2.2.5 2025-05-07T20:10:48.5698275Z w __gmon_start__ 2025-05-07T20:10:48.5698372Z w __pthread_key_create 2025-05-07T20:10:48.5698475Z w pthread_mutex_lock@GLIBC_2.2.5 2025-05-07T20:10:48.5698578Z w pthread_mutex_unlock@GLIBC_2.2.5 2025-05-07T20:10:48.5698722Z [CHECK] Listing out external shared libraries linked: 2025-05-07T20:10:48.5698920Z + ldd ./_skbuild/linux-x86_64-3.12/cmake-build/fbgemm_gpu_tbe_inference.so 2025-05-07T20:10:48.5698927Z 2025-05-07T20:10:48.5699077Z linux-vdso.so.1 (0x00007ffd73514000) 2025-05-07T20:10:48.5699171Z libc10.so => not found 2025-05-07T20:10:48.5699259Z libnvrtc.so.12 => not found 2025-05-07T20:10:48.5699345Z libc10_cuda.so => not found 2025-05-07T20:10:48.5699445Z libnccl.so.2 => not found 2025-05-07T20:10:48.5699529Z libcuda.so.1 => not found 2025-05-07T20:10:48.5699875Z fbgemm.so => /__w/FBGEMM/FBGEMM/fbgemm_gpu/./_skbuild/linux-x86_64-3.12/cmake-build/fbgemm.so (0x00007f546ce00000) 2025-05-07T20:10:48.5700309Z fbgemm_gpu_tbe_cache.so => /__w/FBGEMM/FBGEMM/fbgemm_gpu/./_skbuild/linux-x86_64-3.12/cmake-build/fbgemm_gpu_tbe_cache.so (0x00007f546b600000) 2025-05-07T20:10:48.5700734Z fbgemm_gpu_config.so => /__w/FBGEMM/FBGEMM/fbgemm_gpu/./_skbuild/linux-x86_64-3.12/cmake-build/fbgemm_gpu_config.so (0x00007f54885ad000) 2025-05-07T20:10:48.5700833Z libnvidia-ml.so.1 => not found 2025-05-07T20:10:48.5700919Z libtorch.so => not found 2025-05-07T20:10:48.5701018Z libtorch_cpu.so => not found 2025-05-07T20:10:48.5701104Z libtorch_cuda.so => not found 2025-05-07T20:10:48.5701193Z libcudart.so.12 => not found 2025-05-07T20:10:48.5701351Z libstdc++.so.6 => /lib64/libstdc++.so.6 (0x00007f546b39c000) 2025-05-07T20:10:48.5701488Z libgcc_s.so.1 => /lib64/libgcc_s.so.1 (0x00007f548857d000) 2025-05-07T20:10:48.5701604Z libc.so.6 => /lib64/libc.so.6 (0x00007f546b194000) 2025-05-07T20:10:48.5701687Z libc10.so => not found 2025-05-07T20:10:48.5701782Z libnvrtc.so.12 => not found 2025-05-07T20:10:48.5701893Z libc10_cuda.so => not found 2025-05-07T20:10:48.5701975Z libnccl.so.2 => not found 2025-05-07T20:10:48.5702066Z libcuda.so.1 => not found 2025-05-07T20:10:48.5702397Z asmjit.so => /__w/FBGEMM/FBGEMM/fbgemm_gpu/./_skbuild/linux-x86_64-3.12/cmake-build/asmjit.so (0x00007f546d389000) 2025-05-07T20:10:48.5702489Z libnvidia-ml.so.1 => not found 2025-05-07T20:10:48.5702574Z libtorch.so => not found 2025-05-07T20:10:48.5702670Z libtorch_cpu.so => not found 2025-05-07T20:10:48.5702795Z libtorch_cuda.so => not found 2025-05-07T20:10:48.5702915Z libm.so.6 => /lib64/libm.so.6 (0x00007f546cd25000) 2025-05-07T20:10:48.5703036Z /lib64/ld-linux-x86-64.so.2 (0x00007f54885be000) 2025-05-07T20:10:48.5703120Z libtorch.so => not found 2025-05-07T20:10:48.5703199Z libc10.so => not found 2025-05-07T20:10:48.5703283Z libnvrtc.so.12 => not found 2025-05-07T20:10:48.5703372Z libc10_cuda.so => not found 2025-05-07T20:10:48.5703458Z libnccl.so.2 => not found 2025-05-07T20:10:48.5703545Z libcuda.so.1 => not found 2025-05-07T20:10:48.5703650Z libnvidia-ml.so.1 => not found 2025-05-07T20:10:48.5703737Z libtorch_cpu.so => not found 2025-05-07T20:10:48.5703830Z libtorch_cuda.so => not found 2025-05-07T20:10:48.5703917Z libcudart.so.12 => not found 2025-05-07T20:10:48.5704063Z libgomp.so.1 => /lib64/libgomp.so.1 (0x00007f548851f000) 2025-05-07T20:10:48.5704148Z libtorch.so => not found 2025-05-07T20:10:48.5704230Z libc10.so => not found 2025-05-07T20:10:48.5704330Z libnvrtc.so.12 => not found 2025-05-07T20:10:48.5704415Z libc10_cuda.so => not found 2025-05-07T20:10:48.5704501Z libnccl.so.2 => not found 2025-05-07T20:10:48.5704596Z libcuda.so.1 => not found 2025-05-07T20:10:48.5704685Z libnvidia-ml.so.1 => not found 2025-05-07T20:10:48.5704775Z libtorch_cpu.so => not found 2025-05-07T20:10:48.5704864Z libtorch_cuda.so => not found 2025-05-07T20:10:48.5705035Z libpthread.so.0 => /lib64/libpthread.so.0 (0x00007f546d384000) 2025-05-07T20:10:48.5705146Z libtorch.so => not found 2025-05-07T20:10:48.5705231Z libc10.so => not found 2025-05-07T20:10:48.5705322Z libnvrtc.so.12 => not found 2025-05-07T20:10:48.5705403Z libc10_cuda.so => not found 2025-05-07T20:10:48.5705485Z libnccl.so.2 => not found 2025-05-07T20:10:48.5705565Z libcuda.so.1 => not found 2025-05-07T20:10:48.5705664Z libnvidia-ml.so.1 => not found 2025-05-07T20:10:48.5705755Z libtorch_cpu.so => not found 2025-05-07T20:10:48.5705843Z libtorch_cuda.so => not found 2025-05-07T20:10:48.5705972Z librt.so.1 => /lib64/librt.so.1 (0x00007f546d37b000) 2025-05-07T20:10:48.5705977Z 2025-05-07T20:10:48.5706102Z [CHECK] Displaying ELF information: 2025-05-07T20:10:48.5706321Z + readelf -d ./_skbuild/linux-x86_64-3.12/cmake-build/fbgemm_gpu_tbe_inference.so 2025-05-07T20:10:48.5706326Z 2025-05-07T20:10:48.5731057Z 2025-05-07T20:10:48.5731425Z Dynamic section at offset 0x1ac7bfc8 contains 41 entries: 2025-05-07T20:10:48.5731640Z Tag Type Name/Value 2025-05-07T20:10:48.5733219Z 0x0000000000000001 (NEEDED) Shared library: [libc10.so] 2025-05-07T20:10:48.5733886Z 0x0000000000000001 (NEEDED) Shared library: [libnvrtc.so.12] 2025-05-07T20:10:48.5734507Z 0x0000000000000001 (NEEDED) Shared library: [libc10_cuda.so] 2025-05-07T20:10:48.5735105Z 0x0000000000000001 (NEEDED) Shared library: [libnccl.so.2] 2025-05-07T20:10:48.5735686Z 0x0000000000000001 (NEEDED) Shared library: [libcuda.so.1] 2025-05-07T20:10:48.5736258Z 0x0000000000000001 (NEEDED) Shared library: [fbgemm.so] 2025-05-07T20:10:48.5736976Z 0x0000000000000001 (NEEDED) Shared library: [fbgemm_gpu_tbe_cache.so] 2025-05-07T20:10:48.5737612Z 0x0000000000000001 (NEEDED) Shared library: [fbgemm_gpu_config.so] 2025-05-07T20:10:48.5738009Z 0x0000000000000001 (NEEDED) Shared library: [libnvidia-ml.so.1] 2025-05-07T20:10:48.5738228Z 0x0000000000000001 (NEEDED) Shared library: [libtorch.so] 2025-05-07T20:10:48.5738551Z 0x0000000000000001 (NEEDED) Shared library: [libtorch_cpu.so] 2025-05-07T20:10:48.5738755Z 0x0000000000000001 (NEEDED) Shared library: [libtorch_cuda.so] 2025-05-07T20:10:48.5738986Z 0x0000000000000001 (NEEDED) Shared library: [libcudart.so.12] 2025-05-07T20:10:48.5739178Z 0x0000000000000001 (NEEDED) Shared library: [libstdc++.so.6] 2025-05-07T20:10:48.5739369Z 0x0000000000000001 (NEEDED) Shared library: [libgcc_s.so.1] 2025-05-07T20:10:48.5739636Z 0x0000000000000001 (NEEDED) Shared library: [libc.so.6] 2025-05-07T20:10:48.5739864Z 0x000000000000000e (SONAME) Library soname: [fbgemm_gpu_tbe_inference.so] 2025-05-07T20:10:48.5740039Z 0x000000000000000f (RPATH) Library rpath: [$ORIGIN] 2025-05-07T20:10:48.5740162Z 0x000000000000000c (INIT) 0x1a0000 2025-05-07T20:10:48.5740296Z 0x000000000000000d (FINI) 0x74838c 2025-05-07T20:10:48.5740422Z 0x0000000000000019 (INIT_ARRAY) 0x1ac7aca0 2025-05-07T20:10:48.5740555Z 0x000000000000001b (INIT_ARRAYSZ) 392 (bytes) 2025-05-07T20:10:48.5740698Z 0x000000000000001a (FINI_ARRAY) 0x1ac7ae28 2025-05-07T20:10:48.5740819Z 0x000000000000001c (FINI_ARRAYSZ) 8 (bytes) 2025-05-07T20:10:48.5740941Z 0x000000006ffffef5 (GNU_HASH) 0x200 2025-05-07T20:10:48.5741091Z 0x0000000000000005 (STRTAB) 0x27a50 2025-05-07T20:10:48.5741211Z 0x0000000000000006 (SYMTAB) 0x9db0 2025-05-07T20:10:48.5741360Z 0x000000000000000a (STRSZ) 1387089 (bytes) 2025-05-07T20:10:48.5741489Z 0x000000000000000b (SYMENT) 24 (bytes) 2025-05-07T20:10:48.5741650Z 0x0000000000000003 (PLTGOT) 0x1ac84fe8 2025-05-07T20:10:48.5741798Z 0x0000000000000002 (PLTRELSZ) 20568 (bytes) 2025-05-07T20:10:48.5741915Z 0x0000000000000014 (PLTREL) RELA 2025-05-07T20:10:48.5742068Z 0x0000000000000017 (JMPREL) 0x19af18 2025-05-07T20:10:48.5742234Z 0x0000000000000007 (RELA) 0x17cd80 2025-05-07T20:10:48.5742376Z 0x0000000000000008 (RELASZ) 123288 (bytes) 2025-05-07T20:10:48.5742504Z 0x0000000000000009 (RELAENT) 24 (bytes) 2025-05-07T20:10:48.5742663Z 0x000000006ffffffe (VERNEED) 0x17cc60 2025-05-07T20:10:48.5742783Z 0x000000006fffffff (VERNEEDNUM) 4 2025-05-07T20:10:48.5742910Z 0x000000006ffffff0 (VERSYM) 0x17a4a2 2025-05-07T20:10:48.5743055Z 0x000000006ffffff9 (RELACOUNT) 539 2025-05-07T20:10:48.5743172Z 0x0000000000000000 (NULL) 0x0 2025-05-07T20:10:48.5743236Z 2025-05-07T20:10:48.5743362Z ################################################################################ 2025-05-07T20:10:48.5743367Z 2025-05-07T20:10:48.5743370Z 2025-05-07T20:10:48.5743522Z ################################################################################ 2025-05-07T20:10:48.5743882Z [CHECK] BUILT LIBRARY: ./_skbuild/linux-x86_64-3.12/cmake-build/fbgemm_gpu_tbe_training_backward_split_host.so 2025-05-07T20:10:48.5744006Z [CHECK] Listing out library size: 2025-05-07T20:10:48.5744387Z + du -h --block-size=1M ./_skbuild/linux-x86_64-3.12/cmake-build/fbgemm_gpu_tbe_training_backward_split_host.so 2025-05-07T20:10:48.5744391Z 2025-05-07T20:10:48.5752211Z 5 ./_skbuild/linux-x86_64-3.12/cmake-build/fbgemm_gpu_tbe_training_backward_split_host.so 2025-05-07T20:10:48.5752230Z 2025-05-07T20:10:48.5753722Z [CHECK] Listing out the GLIBC versions referenced by: ./_skbuild/linux-x86_64-3.12/cmake-build/fbgemm_gpu_tbe_training_backward_split_host.so 2025-05-07T20:10:48.5755587Z + objdump -TC ./_skbuild/linux-x86_64-3.12/cmake-build/fbgemm_gpu_tbe_training_backward_split_host.so | grep GLIBC_ | sed 's/.*GLIBC_\([.0-9]*\).*/GLIBC_\1/g' | sort -Vu | cat 2025-05-07T20:10:48.5755600Z 2025-05-07T20:10:48.6023654Z GLIBC_2.2.5 2025-05-07T20:10:48.6023977Z GLIBC_2.3 2025-05-07T20:10:48.6024224Z GLIBC_2.14 2025-05-07T20:10:48.6024241Z 2025-05-07T20:10:48.6024279Z 2025-05-07T20:10:48.6026089Z [CHECK] Listing out the GLIBCXX versions referenced by: ./_skbuild/linux-x86_64-3.12/cmake-build/fbgemm_gpu_tbe_training_backward_split_host.so 2025-05-07T20:10:48.6027940Z + objdump -TC ./_skbuild/linux-x86_64-3.12/cmake-build/fbgemm_gpu_tbe_training_backward_split_host.so | grep GLIBCXX_ | sed 's/.*GLIBCXX_\([.0-9]*\).*/GLIBCXX_\1/g' | sort -Vu | cat 2025-05-07T20:10:48.6027957Z 2025-05-07T20:10:48.6297498Z GLIBCXX_3.4 2025-05-07T20:10:48.6297882Z GLIBCXX_3.4.9 2025-05-07T20:10:48.6298725Z GLIBCXX_3.4.11 2025-05-07T20:10:48.6299016Z GLIBCXX_3.4.15 2025-05-07T20:10:48.6299488Z GLIBCXX_3.4.18 2025-05-07T20:10:48.6299722Z GLIBCXX_3.4.20 2025-05-07T20:10:48.6299955Z GLIBCXX_3.4.21 2025-05-07T20:10:48.6300087Z 2025-05-07T20:10:48.6300106Z 2025-05-07T20:10:48.6315318Z + nm -gDC ./_skbuild/linux-x86_64-3.12/cmake-build/fbgemm_gpu_tbe_training_backward_split_host.so > /tmp/tmp.SBa1zqwChG.symbols.txt 2025-05-07T20:10:48.6315884Z 2025-05-07T20:10:48.6543663Z 2025-05-07T20:10:48.6574588Z [CHECK] Total Number of symbols: 2987 2025-05-07T20:10:48.6595984Z [CHECK] Number of fbgemm symbols: 1 2025-05-07T20:10:48.6615740Z + nm -gDCu ./_skbuild/linux-x86_64-3.12/cmake-build/fbgemm_gpu_tbe_training_backward_split_host.so > /tmp/tmp.6GnDoPlLDI.usymbols.txt 2025-05-07T20:10:48.6617462Z 2025-05-07T20:10:48.6637308Z 2025-05-07T20:10:48.6666621Z [CHECK] Listing out undefined symbols (189 total): 2025-05-07T20:10:48.6690194Z U VTT for std::__cxx11::basic_ostringstream, std::allocator >@GLIBCXX_3.4.21 2025-05-07T20:10:48.6691076Z U VTT for std::__cxx11::basic_stringstream, std::allocator >@GLIBCXX_3.4.21 2025-05-07T20:10:48.6691646Z U _Unwind_Resume@GCC_3.0 2025-05-07T20:10:48.6691992Z U __cxa_allocate_exception@CXXABI_1.3 2025-05-07T20:10:48.6692345Z U __cxa_atexit@GLIBC_2.2.5 2025-05-07T20:10:48.6692665Z U __cxa_begin_catch@CXXABI_1.3 2025-05-07T20:10:48.6693157Z U __cxa_end_catch@CXXABI_1.3 2025-05-07T20:10:48.6693483Z U __cxa_free_exception@CXXABI_1.3 2025-05-07T20:10:48.6693823Z U __cxa_guard_abort@CXXABI_1.3 2025-05-07T20:10:48.6694142Z U __cxa_guard_acquire@CXXABI_1.3 2025-05-07T20:10:48.6694479Z U __cxa_guard_release@CXXABI_1.3 2025-05-07T20:10:48.6694795Z U __cxa_rethrow@CXXABI_1.3 2025-05-07T20:10:48.6695108Z U __cxa_throw@CXXABI_1.3 2025-05-07T20:10:48.6695440Z U __gxx_personality_v0@CXXABI_1.3 2025-05-07T20:10:48.6695832Z U __tls_get_addr@GLIBC_2.3 2025-05-07T20:10:48.6696155Z U adjust_info_B_num_bits(int, int) 2025-05-07T20:10:48.6696553Z U at::RecordFunction::RecordFunction(at::StepCallbacks&&) 2025-05-07T20:10:48.6696984Z U at::RecordFunction::currentThreadId() 2025-05-07T20:10:48.6697454Z U at::RecordFunction::end() 2025-05-07T20:10:48.6697909Z U at::RecordFunction::~RecordFunction() 2025-05-07T20:10:48.6698278Z U at::SavedTensorDefaultHooks::set_tracing(bool) 2025-05-07T20:10:48.6699214Z U at::_ops::_sparse_coo_tensor_unsafe::call(at::Tensor const&, at::Tensor const&, c10::ArrayRef, std::optional, std::optional, std::optional, std::optional, std::optional) 2025-05-07T20:10:48.6700339Z U at::_ops::clamp::call(at::Tensor const&, std::optional const&, std::optional const&) 2025-05-07T20:10:48.6701304Z U at::_ops::empty_like::call(at::Tensor const&, std::optional, std::optional, std::optional, std::optional, std::optional) 2025-05-07T20:10:48.6702613Z U at::_ops::empty_memory_format::call(c10::ArrayRef, std::optional, std::optional, std::optional, std::optional, std::optional) 2025-05-07T20:10:48.6703494Z U at::_ops::fill__Scalar::call(at::Tensor&, c10::Scalar const&) 2025-05-07T20:10:48.6703946Z U at::_ops::flatten_using_ints::call(at::Tensor const&, long, long) 2025-05-07T20:10:48.6704407Z U at::_ops::reshape::call(at::Tensor const&, c10::ArrayRef) 2025-05-07T20:10:48.6704846Z U at::_ops::unsqueeze::call(at::Tensor const&, long) 2025-05-07T20:10:48.6705281Z U at::_ops::view_as::call(at::Tensor const&, at::Tensor const&) 2025-05-07T20:10:48.6705681Z U at::functorch::functorchTLSAccessor() 2025-05-07T20:10:48.6706060Z U at::getStepCallbacksUnlessEmpty(at::RecordScope) 2025-05-07T20:10:48.6706429Z U at::sequence_number::get_and_increment() 2025-05-07T20:10:48.6706761Z U bcmp@GLIBC_2.2.5 2025-05-07T20:10:48.6707035Z U c10::AnyType::get() 2025-05-07T20:10:48.6707330Z U c10::BoolType::get() 2025-05-07T20:10:48.6707686Z U c10::Dispatcher::findSchemaOrThrow(char const*, char const*) 2025-05-07T20:10:48.6708087Z U c10::Dispatcher::realSingleton() 2025-05-07T20:10:48.6708798Z U c10::Dispatcher::runRecordFunction(at::RecordFunction&, std::reference_wrapper, c10::DispatchKey, c10::DispatchKeySet) 2025-05-07T20:10:48.6709978Z U c10::Dispatcher::runRecordFunction(at::RecordFunction&, std::reference_wrapper, c10::DispatchKey, c10::DispatchKeySet, c10::ArrayRef) 2025-05-07T20:10:48.6711040Z U c10::Error::Error(c10::SourceLocation, std::__cxx11::basic_string, std::allocator >) 2025-05-07T20:10:48.6711885Z U c10::Error::what() const 2025-05-07T20:10:48.6712198Z U c10::FloatType::get() 2025-05-07T20:10:48.6712570Z U c10::GradMode::is_enabled() 2025-05-07T20:10:48.6712893Z U c10::GradMode::set_enabled(bool) 2025-05-07T20:10:48.6713298Z U c10::IValue::TagType::get(c10::IValue const&) 2025-05-07T20:10:48.6713691Z U c10::IValue::hash(c10::IValue const&) 2025-05-07T20:10:48.6714030Z U c10::IValue::isBoolList() const 2025-05-07T20:10:48.6714357Z U c10::IValue::isIntList() const 2025-05-07T20:10:48.6714703Z U c10::IValue::isSymIntList() const 2025-05-07T20:10:48.6715038Z U c10::IValue::isTensorList() const 2025-05-07T20:10:48.6715449Z U c10::IValue::reportToTensorTypeError() const 2025-05-07T20:10:48.6715815Z U c10::IntType::get() 2025-05-07T20:10:48.6716505Z U c10::ListType::get(std::__cxx11::basic_string, std::allocator > const&, c10::Type::SingletonOrSharedTypePtr) 2025-05-07T20:10:48.6717284Z U c10::MessageLogger::MessageLogger(char const*, int, int) 2025-05-07T20:10:48.6717688Z U c10::MessageLogger::~MessageLogger() 2025-05-07T20:10:48.6718155Z U c10::OperatorHandle::~OperatorHandle() 2025-05-07T20:10:48.6718508Z U c10::OperatorHandle::~OperatorHandle() 2025-05-07T20:10:48.6718932Z U c10::OptionalType::get(c10::Type::SingletonOrSharedTypePtr) 2025-05-07T20:10:48.6719524Z U c10::SmallVectorBase::grow_pod(void const*, unsigned long, unsigned long) 2025-05-07T20:10:48.6719989Z U c10::StringType::get() 2025-05-07T20:10:48.6720326Z U c10::SymBool::expect_true(char const*, long) const 2025-05-07T20:10:48.6720716Z U c10::SymBool::guard_bool(char const*, long) const 2025-05-07T20:10:48.6721154Z U c10::SymBool::guard_size_oblivious(char const*, long) const 2025-05-07T20:10:48.6721599Z U c10::SymFloat::guard_float(char const*, long) const 2025-05-07T20:10:48.6722002Z U c10::SymFloat::operator/(c10::SymFloat const&) const 2025-05-07T20:10:48.6722631Z U c10::SymInt::SymInt(c10::intrusive_ptr >) 2025-05-07T20:10:48.6723256Z U c10::SymInt::guard_int(char const*, long) const 2025-05-07T20:10:48.6723616Z U c10::SymInt::operator c10::SymFloat() const 2025-05-07T20:10:48.6724002Z U c10::SymInt::operator*(c10::SymInt const&) const 2025-05-07T20:10:48.6724375Z U c10::SymInt::operator/(c10::SymInt const&) const 2025-05-07T20:10:48.6724712Z U c10::SymInt::promote_to_negative() 2025-05-07T20:10:48.6725054Z U c10::SymInt::sym_eq(c10::SymInt const&) const 2025-05-07T20:10:48.6725401Z U c10::SymInt::sym_gt(c10::SymInt const&) const 2025-05-07T20:10:48.6725746Z U c10::SymInt::sym_le(c10::SymInt const&) const 2025-05-07T20:10:48.6726080Z U c10::SymInt::toSymNode() const 2025-05-07T20:10:48.6726374Z U c10::SymIntType::get() 2025-05-07T20:10:48.6726681Z U c10::TensorImpl::requires_grad() const 2025-05-07T20:10:48.6726986Z U c10::TensorType::get() 2025-05-07T20:10:48.6727298Z U c10::UndefinedTensorImpl::_singleton 2025-05-07T20:10:48.6727925Z U c10::detail::ListImpl::ListImpl(std::vector >, c10::Type::SingletonOrSharedTypePtr) 2025-05-07T20:10:48.6728918Z U c10::detail::infer_schema::make_function_schema(c10::ArrayRef, c10::ArrayRef) 2025-05-07T20:10:48.6729746Z U c10::detail::torchCheckFail(char const*, char const*, unsigned int, char const*) 2025-05-07T20:10:48.6730599Z U c10::detail::torchCheckFail(char const*, char const*, unsigned int, std::__cxx11::basic_string, std::allocator > const&) 2025-05-07T20:10:48.6731485Z U c10::detail::torchInternalAssertFail(char const*, char const*, unsigned int, char const*, char const*) 2025-05-07T20:10:48.6732455Z U c10::detail::torchInternalAssertFail(char const*, char const*, unsigned int, char const*, std::__cxx11::basic_string, std::allocator > const&) 2025-05-07T20:10:48.6733415Z U c10::impl::OperatorEntry::assertSignatureIsCorrect(c10::impl::CppSignature const&, bool) const 2025-05-07T20:10:48.6734031Z U c10::impl::OperatorEntry::reportError(c10::DispatchKey) const 2025-05-07T20:10:48.6734433Z U c10::impl::raw_local_dispatch_key_set 2025-05-07T20:10:48.6734787Z U c10::initializeFunctionalityOffsetsAndMasks() 2025-05-07T20:10:48.6735574Z U c10::ivalue::ConstantString::create(std::__cxx11::basic_string, std::allocator >) 2025-05-07T20:10:48.6736179Z U c10::operator-(c10::SymInt const&, int) 2025-05-07T20:10:48.6736569Z U c10::operator<<(std::ostream&, c10::OperatorName const&) 2025-05-07T20:10:48.6737188Z U c10::operator<<(std::ostream&, c10::SymFloat const&) 2025-05-07T20:10:48.6737598Z U c10::operator<<(std::ostream&, c10::SymInt const&) 2025-05-07T20:10:48.6738078Z U caffe2::TypeMeta::error_unsupported_typemeta(caffe2::TypeMeta) 2025-05-07T20:10:48.6738528Z U caffe2::TypeMeta::typeMetaDatas() 2025-05-07T20:10:48.6739019Z U fbgemm_gpu::config::is_feature_enabled(fbgemm_gpu::config::FeatureGateName const&) 2025-05-07T20:10:48.6739507Z U free@GLIBC_2.2.5 2025-05-07T20:10:48.6739869Z U long c10::detail::maybe_wrap_dim_slow(long, long, bool) 2025-05-07T20:10:48.6740274Z U memcmp@GLIBC_2.2.5 2025-05-07T20:10:48.6740661Z U memcpy@GLIBC_2.14 2025-05-07T20:10:48.6740955Z U memmove@GLIBC_2.2.5 2025-05-07T20:10:48.6741260Z U memset@GLIBC_2.2.5 2025-05-07T20:10:48.6741570Z U operator delete(void*)@GLIBCXX_3.4 2025-05-07T20:10:48.6741933Z U operator new(unsigned long)@GLIBCXX_3.4 2025-05-07T20:10:48.6742267Z U realloc@GLIBC_2.2.5 2025-05-07T20:10:48.6742723Z U std::_Hash_bytes(void const*, unsigned long, unsigned long)@CXXABI_1.3.5 2025-05-07T20:10:48.6743421Z U std::__cxx11::basic_ostringstream, std::allocator >::basic_ostringstream() 2025-05-07T20:10:48.6744271Z U std::__cxx11::basic_ostringstream, std::allocator >::~basic_ostringstream()@GLIBCXX_3.4.21 2025-05-07T20:10:48.6745125Z U std::__cxx11::basic_stringstream, std::allocator >::basic_stringstream() 2025-05-07T20:10:48.6746069Z U std::__cxx11::basic_stringstream, std::allocator >::~basic_stringstream()@GLIBCXX_3.4.21 2025-05-07T20:10:48.6747001Z U std::__detail::_Prime_rehash_policy::_M_need_rehash(unsigned long, unsigned long, unsigned long) const@GLIBCXX_3.4.18 2025-05-07T20:10:48.6747579Z U std::__throw_bad_alloc()@GLIBCXX_3.4 2025-05-07T20:10:48.6747899Z U std::__throw_bad_array_new_length() 2025-05-07T20:10:48.6748254Z U std::__throw_length_error(char const*)@GLIBCXX_3.4 2025-05-07T20:10:48.6748626Z U std::__throw_logic_error(char const*)@GLIBCXX_3.4 2025-05-07T20:10:48.6749030Z U std::__throw_out_of_range_fmt(char const*, ...)@GLIBCXX_3.4.20 2025-05-07T20:10:48.6749431Z U std::__throw_system_error(int)@GLIBCXX_3.4.11 2025-05-07T20:10:48.6749821Z U std::bad_weak_ptr::~bad_weak_ptr()@GLIBCXX_3.4.15 2025-05-07T20:10:48.6750293Z U std::basic_ios >::clear(std::_Ios_Iostate)@GLIBCXX_3.4 2025-05-07T20:10:48.6751174Z U std::basic_ostream >& std::__ostream_insert >(std::basic_ostream >&, char const*, long)@GLIBCXX_3.4.9 2025-05-07T20:10:48.6752220Z U std::exception::~exception()@GLIBCXX_3.4 2025-05-07T20:10:48.6752596Z U std::ios_base::Init::Init()@GLIBCXX_3.4 2025-05-07T20:10:48.6752983Z U std::ios_base::Init::~Init()@GLIBCXX_3.4 2025-05-07T20:10:48.6753346Z U std::ios_base::~ios_base()@GLIBCXX_3.4 2025-05-07T20:10:48.6753700Z U std::locale::~locale()@GLIBCXX_3.4 2025-05-07T20:10:48.6754106Z U std::ostream& std::ostream::_M_insert(long)@GLIBCXX_3.4.9 2025-05-07T20:10:48.6754663Z U std::ostream& std::ostream::_M_insert(unsigned long)@GLIBCXX_3.4.9 2025-05-07T20:10:48.6755144Z U std::ostream::operator<<(int)@GLIBCXX_3.4 2025-05-07T20:10:48.6755564Z U std::out_of_range::out_of_range(char const*)@GLIBCXX_3.4.21 2025-05-07T20:10:48.6755980Z U std::out_of_range::~out_of_range()@GLIBCXX_3.4 2025-05-07T20:10:48.6756682Z U std::runtime_error::runtime_error(std::__cxx11::basic_string, std::allocator > const&)@GLIBCXX_3.4.21 2025-05-07T20:10:48.6757390Z U std::runtime_error::~runtime_error()@GLIBCXX_3.4 2025-05-07T20:10:48.6757749Z U std::terminate()@GLIBCXX_3.4 2025-05-07T20:10:48.6758162Z U strcmp@GLIBC_2.2.5 2025-05-07T20:10:48.6758427Z U strlen@GLIBC_2.2.5 2025-05-07T20:10:48.6758730Z U torch::CppFunction::~CppFunction() 2025-05-07T20:10:48.6759537Z U torch::Library::Library(torch::Library::Kind, std::__cxx11::basic_string, std::allocator >, std::optional, char const*, unsigned int) 2025-05-07T20:10:48.6760616Z U torch::Library::_def(c10::FunctionSchema&&, c10::OperatorName*, std::vector > const&, torch::_RegisterOrVerify) & 2025-05-07T20:10:48.6761404Z U torch::Library::_impl(char const*, torch::CppFunction&&, torch::_RegisterOrVerify) & 2025-05-07T20:10:48.6761907Z U torch::autograd::AnomalyMode::_enabled 2025-05-07T20:10:48.6762403Z U torch::autograd::AutogradContext::AutogradContext(torch::dynamo::autograd::PackedArgs&) 2025-05-07T20:10:48.6762972Z U torch::autograd::AutogradContext::get_and_bump_dirty() const 2025-05-07T20:10:48.6763439Z U torch::autograd::AutogradContext::get_non_differentiable() const 2025-05-07T20:10:48.6763935Z U torch::autograd::AutogradContext::get_saved_variables() const 2025-05-07T20:10:48.6764579Z U torch::autograd::AutogradContext::save_for_backward(std::vector >) 2025-05-07T20:10:48.6765523Z U torch::autograd::AutogradContext::save_variables() 2025-05-07T20:10:48.6766025Z U torch::autograd::ForwardADLevel::try_get_by_idx(unsigned long) 2025-05-07T20:10:48.6766535Z U torch::autograd::InputMetadata::shape_as_dim_vector() const 2025-05-07T20:10:48.6766994Z U torch::autograd::Node::assign_parent() 2025-05-07T20:10:48.6767383Z U torch::autograd::Node::metadata() 2025-05-07T20:10:48.6767757Z U torch::autograd::Node::name[abi:cxx11]() const 2025-05-07T20:10:48.6768293Z U torch::autograd::SavedVariable::SavedVariable(at::Tensor const&, bool, bool) 2025-05-07T20:10:48.6768956Z U torch::autograd::SavedVariable::unpack(std::shared_ptr) const 2025-05-07T20:10:48.6769582Z U torch::autograd::VariableInfo::VariableInfo() 2025-05-07T20:10:48.6770097Z U torch::autograd::VariableInfo::VariableInfo(at::Tensor const&, bool) 2025-05-07T20:10:48.6770671Z U torch::autograd::VariableInfo::zeros(c10::OptionalDeviceGuard&) const 2025-05-07T20:10:48.6773704Z U torch::autograd::_wrap_outputs(std::vector > const&, std::unordered_set, std::equal_to, std::allocator > const&, std::unordered_set, std::equal_to, std::allocator > const&, c10::ArrayRef >, std::shared_ptr const&, std::function > (std::vector >, std::vector >)> const&, std::unordered_set, std::equal_to, std::allocator > const&, std::function const&) 2025-05-07T20:10:48.6776513Z U torch::autograd::deleteNode(torch::autograd::Node*) 2025-05-07T20:10:48.6776908Z U torch::autograd::get_current_graph_task_exec_info() 2025-05-07T20:10:48.6777309Z U torch::autograd::impl::gradient_edge(at::Tensor const&) 2025-05-07T20:10:48.6778339Z U torch::autograd::profiler::record_function_enter_new(std::__cxx11::basic_string, std::allocator > const&, std::optional, std::allocator > > const&) 2025-05-07T20:10:48.6779337Z U torch::dynamo::autograd::getPyCompilerInterface() 2025-05-07T20:10:48.6780151Z U torch::dynamo::autograd::get_input_metadata(std::vector > const&) 2025-05-07T20:10:48.6781074Z U torch::jit::parseSchema(std::__cxx11::basic_string, std::allocator > const&, bool) 2025-05-07T20:10:48.6781846Z U typeinfo for c10::Error 2025-05-07T20:10:48.6782194Z U typeinfo for std::bad_weak_ptr@GLIBCXX_3.4.15 2025-05-07T20:10:48.6782586Z U typeinfo for std::exception@GLIBCXX_3.4 2025-05-07T20:10:48.6782955Z U typeinfo for std::out_of_range@GLIBCXX_3.4 2025-05-07T20:10:48.6783391Z U typeinfo for std::runtime_error@GLIBCXX_3.4 2025-05-07T20:10:48.6783781Z U typeinfo for torch::autograd::Node 2025-05-07T20:10:48.6784165Z U vtable for __cxxabiv1::__class_type_info@CXXABI_1.3 2025-05-07T20:10:48.6784618Z U vtable for __cxxabiv1::__function_type_info@CXXABI_1.3 2025-05-07T20:10:48.6785064Z U vtable for __cxxabiv1::__pointer_type_info@CXXABI_1.3 2025-05-07T20:10:48.6785514Z U vtable for __cxxabiv1::__si_class_type_info@CXXABI_1.3 2025-05-07T20:10:48.6785924Z U vtable for c10::Error 2025-05-07T20:10:48.6786477Z U vtable for std::__cxx11::basic_stringbuf, std::allocator >@GLIBCXX_3.4.21 2025-05-07T20:10:48.6787097Z U vtable for std::bad_weak_ptr@GLIBCXX_3.4.15 2025-05-07T20:10:48.6787578Z U vtable for std::basic_streambuf >@GLIBCXX_3.4 2025-05-07T20:10:48.6788065Z U vtable for torch::autograd::Node 2025-05-07T20:10:48.6788499Z w TLS init function for c10::impl::raw_local_dispatch_key_set 2025-05-07T20:10:48.6788914Z w _ITM_deregisterTMCloneTable 2025-05-07T20:10:48.6789256Z w _ITM_registerTMCloneTable 2025-05-07T20:10:48.6789579Z w __cxa_finalize@GLIBC_2.2.5 2025-05-07T20:10:48.6790061Z w __gmon_start__ 2025-05-07T20:10:48.6790352Z w __pthread_key_create 2025-05-07T20:10:48.6790690Z w pthread_mutex_lock@GLIBC_2.2.5 2025-05-07T20:10:48.6791027Z w pthread_mutex_unlock@GLIBC_2.2.5 2025-05-07T20:10:48.6791495Z [CHECK] Listing out external shared libraries linked: 2025-05-07T20:10:48.6792051Z + ldd ./_skbuild/linux-x86_64-3.12/cmake-build/fbgemm_gpu_tbe_training_backward_split_host.so 2025-05-07T20:10:48.6792453Z 2025-05-07T20:10:48.6792597Z linux-vdso.so.1 (0x00007ffd3afdd000) 2025-05-07T20:10:48.6792903Z libc10.so => not found 2025-05-07T20:10:48.6793164Z libnvrtc.so.12 => not found 2025-05-07T20:10:48.6793460Z libc10_cuda.so => not found 2025-05-07T20:10:48.6793733Z libnccl.so.2 => not found 2025-05-07T20:10:48.6793990Z libcuda.so.1 => not found 2025-05-07T20:10:48.6794612Z fbgemm_gpu_config.so => /__w/FBGEMM/FBGEMM/fbgemm_gpu/./_skbuild/linux-x86_64-3.12/cmake-build/fbgemm_gpu_config.so (0x00007f2d1134a000) 2025-05-07T20:10:48.6795646Z fbgemm_gpu_tbe_utils.so => /__w/FBGEMM/FBGEMM/fbgemm_gpu/./_skbuild/linux-x86_64-3.12/cmake-build/fbgemm_gpu_tbe_utils.so (0x00007f2d0fc00000) 2025-05-07T20:10:48.6796328Z libnvidia-ml.so.1 => not found 2025-05-07T20:10:48.6796614Z libtorch.so => not found 2025-05-07T20:10:48.6796871Z libtorch_cpu.so => not found 2025-05-07T20:10:48.6797150Z libtorch_cuda.so => not found 2025-05-07T20:10:48.6797480Z libstdc++.so.6 => /lib64/libstdc++.so.6 (0x00007f2d0f99c000) 2025-05-07T20:10:48.6797914Z libgcc_s.so.1 => /lib64/libgcc_s.so.1 (0x00007f2d1131a000) 2025-05-07T20:10:48.6798297Z libc.so.6 => /lib64/libc.so.6 (0x00007f2d0f794000) 2025-05-07T20:10:48.6798673Z /lib64/ld-linux-x86-64.so.2 (0x00007f2d1135b000) 2025-05-07T20:10:48.6799002Z libtorch.so => not found 2025-05-07T20:10:48.6799256Z libc10.so => not found 2025-05-07T20:10:48.6799501Z libnvrtc.so.12 => not found 2025-05-07T20:10:48.6799774Z libc10_cuda.so => not found 2025-05-07T20:10:48.6800047Z libnccl.so.2 => not found 2025-05-07T20:10:48.6800301Z libcuda.so.1 => not found 2025-05-07T20:10:48.6800606Z libnvidia-ml.so.1 => not found 2025-05-07T20:10:48.6801067Z libtorch_cpu.so => not found 2025-05-07T20:10:48.6801348Z libtorch_cuda.so => not found 2025-05-07T20:10:48.6801670Z libgomp.so.1 => /lib64/libgomp.so.1 (0x00007f2d112c0000) 2025-05-07T20:10:48.6802114Z libpthread.so.0 => /lib64/libpthread.so.0 (0x00007f2d10dfb000) 2025-05-07T20:10:48.6802487Z libtorch.so => not found 2025-05-07T20:10:48.6802744Z libc10.so => not found 2025-05-07T20:10:48.6802986Z libnvrtc.so.12 => not found 2025-05-07T20:10:48.6803287Z libc10_cuda.so => not found 2025-05-07T20:10:48.6803565Z libnccl.so.2 => not found 2025-05-07T20:10:48.6803820Z libcuda.so.1 => not found 2025-05-07T20:10:48.6804091Z libnvidia-ml.so.1 => not found 2025-05-07T20:10:48.6804366Z libtorch_cpu.so => not found 2025-05-07T20:10:48.6804649Z libtorch_cuda.so => not found 2025-05-07T20:10:48.6804917Z libcudart.so.12 => not found 2025-05-07T20:10:48.6805231Z libm.so.6 => /lib64/libm.so.6 (0x00007f2d10d1c000) 2025-05-07T20:10:48.6805474Z 2025-05-07T20:10:48.6805590Z [CHECK] Displaying ELF information: 2025-05-07T20:10:48.6806120Z + readelf -d ./_skbuild/linux-x86_64-3.12/cmake-build/fbgemm_gpu_tbe_training_backward_split_host.so 2025-05-07T20:10:48.6806553Z 2025-05-07T20:10:48.6806557Z 2025-05-07T20:10:48.6806728Z Dynamic section at offset 0x4b5fc8 contains 40 entries: 2025-05-07T20:10:48.6807105Z Tag Type Name/Value 2025-05-07T20:10:48.6807534Z 0x0000000000000001 (NEEDED) Shared library: [libc10.so] 2025-05-07T20:10:48.6808046Z 0x0000000000000001 (NEEDED) Shared library: [libnvrtc.so.12] 2025-05-07T20:10:48.6808571Z 0x0000000000000001 (NEEDED) Shared library: [libc10_cuda.so] 2025-05-07T20:10:48.6809093Z 0x0000000000000001 (NEEDED) Shared library: [libnccl.so.2] 2025-05-07T20:10:48.6809709Z 0x0000000000000001 (NEEDED) Shared library: [libcuda.so.1] 2025-05-07T20:10:48.6810255Z 0x0000000000000001 (NEEDED) Shared library: [fbgemm_gpu_config.so] 2025-05-07T20:10:48.6810788Z 0x0000000000000001 (NEEDED) Shared library: [fbgemm_gpu_tbe_utils.so] 2025-05-07T20:10:48.6811336Z 0x0000000000000001 (NEEDED) Shared library: [libnvidia-ml.so.1] 2025-05-07T20:10:48.6811860Z 0x0000000000000001 (NEEDED) Shared library: [libtorch.so] 2025-05-07T20:10:48.6812360Z 0x0000000000000001 (NEEDED) Shared library: [libtorch_cpu.so] 2025-05-07T20:10:48.6812890Z 0x0000000000000001 (NEEDED) Shared library: [libtorch_cuda.so] 2025-05-07T20:10:48.6813400Z 0x0000000000000001 (NEEDED) Shared library: [libstdc++.so.6] 2025-05-07T20:10:48.6813962Z 0x0000000000000001 (NEEDED) Shared library: [libgcc_s.so.1] 2025-05-07T20:10:48.6814450Z 0x0000000000000001 (NEEDED) Shared library: [libc.so.6] 2025-05-07T20:10:48.6814969Z 0x0000000000000001 (NEEDED) Shared library: [ld-linux-x86-64.so.2] 2025-05-07T20:10:48.6815607Z 0x000000000000000e (SONAME) Library soname: [fbgemm_gpu_tbe_training_backward_split_host.so] 2025-05-07T20:10:48.6816198Z 0x000000000000000f (RPATH) Library rpath: [$ORIGIN] 2025-05-07T20:10:48.6816609Z 0x000000000000000c (INIT) 0xd6000 2025-05-07T20:10:48.6816936Z 0x000000000000000d (FINI) 0x3f64b8 2025-05-07T20:10:48.6817284Z 0x0000000000000019 (INIT_ARRAY) 0x4add80 2025-05-07T20:10:48.6817631Z 0x000000000000001b (INIT_ARRAYSZ) 304 (bytes) 2025-05-07T20:10:48.6817992Z 0x000000000000001a (FINI_ARRAY) 0x4adeb0 2025-05-07T20:10:48.6818341Z 0x000000000000001c (FINI_ARRAYSZ) 8 (bytes) 2025-05-07T20:10:48.6818680Z 0x000000006ffffef5 (GNU_HASH) 0x200 2025-05-07T20:10:48.6819019Z 0x0000000000000005 (STRTAB) 0x16e00 2025-05-07T20:10:48.6819336Z 0x0000000000000006 (SYMTAB) 0x55e0 2025-05-07T20:10:48.6819698Z 0x000000000000000a (STRSZ) 609767 (bytes) 2025-05-07T20:10:48.6820056Z 0x000000000000000b (SYMENT) 24 (bytes) 2025-05-07T20:10:48.6820440Z 0x0000000000000003 (PLTGOT) 0x4b8fe8 2025-05-07T20:10:48.6820808Z 0x0000000000000002 (PLTRELSZ) 31704 (bytes) 2025-05-07T20:10:48.6821153Z 0x0000000000000014 (PLTREL) RELA 2025-05-07T20:10:48.6821484Z 0x0000000000000017 (JMPREL) 0xcdaf0 2025-05-07T20:10:48.6821807Z 0x0000000000000007 (RELA) 0xad450 2025-05-07T20:10:48.6822160Z 0x0000000000000008 (RELASZ) 132768 (bytes) 2025-05-07T20:10:48.6822538Z 0x0000000000000009 (RELAENT) 24 (bytes) 2025-05-07T20:10:48.6822891Z 0x000000006ffffffe (VERNEED) 0xad340 2025-05-07T20:10:48.6823217Z 0x000000006fffffff (VERNEEDNUM) 4 2025-05-07T20:10:48.6823546Z 0x000000006ffffff0 (VERSYM) 0xabbe8 2025-05-07T20:10:48.6823883Z 0x000000006ffffff9 (RELACOUNT) 40 2025-05-07T20:10:48.6824181Z 0x0000000000000000 (NULL) 0x0 2025-05-07T20:10:48.6824381Z 2025-05-07T20:10:48.6824509Z ################################################################################ 2025-05-07T20:10:48.6824730Z 2025-05-07T20:10:48.6824734Z 2025-05-07T20:10:48.6824843Z ################################################################################ 2025-05-07T20:10:48.6825393Z [CHECK] BUILT LIBRARY: ./_skbuild/linux-x86_64-3.12/cmake-build/fbgemm_gpu_tbe_training_forward.so 2025-05-07T20:10:48.6825948Z [CHECK] Listing out library size: 2025-05-07T20:10:48.6826439Z + du -h --block-size=1M ./_skbuild/linux-x86_64-3.12/cmake-build/fbgemm_gpu_tbe_training_forward.so 2025-05-07T20:10:48.6826846Z 2025-05-07T20:10:48.6827118Z 339 ./_skbuild/linux-x86_64-3.12/cmake-build/fbgemm_gpu_tbe_training_forward.so 2025-05-07T20:10:48.6827465Z 2025-05-07T20:10:48.6827887Z [CHECK] Listing out the GLIBC versions referenced by: ./_skbuild/linux-x86_64-3.12/cmake-build/fbgemm_gpu_tbe_training_forward.so 2025-05-07T20:10:48.6829040Z + objdump -TC ./_skbuild/linux-x86_64-3.12/cmake-build/fbgemm_gpu_tbe_training_forward.so | grep GLIBC_ | sed 's/.*GLIBC_\([.0-9]*\).*/GLIBC_\1/g' | sort -Vu | cat 2025-05-07T20:10:48.6829662Z 2025-05-07T20:10:48.7758565Z GLIBC_2.2.5 2025-05-07T20:10:48.7758821Z GLIBC_2.3 2025-05-07T20:10:48.7759064Z GLIBC_2.14 2025-05-07T20:10:48.7759188Z 2025-05-07T20:10:48.7759193Z 2025-05-07T20:10:48.7759664Z [CHECK] Listing out the GLIBCXX versions referenced by: ./_skbuild/linux-x86_64-3.12/cmake-build/fbgemm_gpu_tbe_training_forward.so 2025-05-07T20:10:48.7760813Z + objdump -TC ./_skbuild/linux-x86_64-3.12/cmake-build/fbgemm_gpu_tbe_training_forward.so | grep GLIBCXX_ | sed 's/.*GLIBCXX_\([.0-9]*\).*/GLIBCXX_\1/g' | sort -Vu | cat 2025-05-07T20:10:48.7761645Z 2025-05-07T20:10:48.8713564Z GLIBCXX_3.4 2025-05-07T20:10:48.8714195Z GLIBCXX_3.4.9 2025-05-07T20:10:48.8714821Z GLIBCXX_3.4.20 2025-05-07T20:10:48.8715335Z GLIBCXX_3.4.21 2025-05-07T20:10:48.8715638Z 2025-05-07T20:10:48.8715657Z 2025-05-07T20:10:48.8735978Z + nm -gDC ./_skbuild/linux-x86_64-3.12/cmake-build/fbgemm_gpu_tbe_training_forward.so > /tmp/tmp.hnkJGQR78d.symbols.txt 2025-05-07T20:10:48.8736599Z 2025-05-07T20:10:48.9664576Z 2025-05-07T20:10:48.9706966Z [CHECK] Total Number of symbols: 12626 2025-05-07T20:10:48.9754528Z [CHECK] Number of fbgemm symbols: 5267 2025-05-07T20:10:48.9771428Z + nm -gDCu ./_skbuild/linux-x86_64-3.12/cmake-build/fbgemm_gpu_tbe_training_forward.so > /tmp/tmp.Axs0ybCdll.usymbols.txt 2025-05-07T20:10:48.9773005Z 2025-05-07T20:10:48.9822558Z 2025-05-07T20:10:48.9848307Z [CHECK] Listing out undefined symbols (171 total): 2025-05-07T20:10:48.9865108Z U VTT for std::__cxx11::basic_ostringstream, std::allocator >@GLIBCXX_3.4.21 2025-05-07T20:10:48.9865846Z U _Unwind_Resume@GCC_3.0 2025-05-07T20:10:48.9866237Z U __cudaPopCallConfiguration@libcudart.so.12 2025-05-07T20:10:48.9866678Z U __cudaPushCallConfiguration@libcudart.so.12 2025-05-07T20:10:48.9867117Z U __cudaRegisterFatBinary@libcudart.so.12 2025-05-07T20:10:48.9867717Z U __cudaRegisterFatBinaryEnd@libcudart.so.12 2025-05-07T20:10:48.9868130Z U __cudaRegisterFunction@libcudart.so.12 2025-05-07T20:10:48.9868498Z U __cudaRegisterVar@libcudart.so.12 2025-05-07T20:10:48.9868892Z U __cudaUnregisterFatBinary@libcudart.so.12 2025-05-07T20:10:48.9869271Z U __cxa_allocate_exception@CXXABI_1.3 2025-05-07T20:10:48.9869627Z U __cxa_atexit@GLIBC_2.2.5 2025-05-07T20:10:48.9870026Z U __cxa_begin_catch@CXXABI_1.3 2025-05-07T20:10:48.9870367Z U __cxa_end_catch@CXXABI_1.3 2025-05-07T20:10:48.9870720Z U __cxa_free_exception@CXXABI_1.3 2025-05-07T20:10:48.9871052Z U __cxa_guard_abort@CXXABI_1.3 2025-05-07T20:10:48.9871495Z U __cxa_guard_acquire@CXXABI_1.3 2025-05-07T20:10:48.9871823Z U __cxa_guard_release@CXXABI_1.3 2025-05-07T20:10:48.9872168Z U __cxa_rethrow@CXXABI_1.3 2025-05-07T20:10:48.9872547Z U __cxa_throw@CXXABI_1.3 2025-05-07T20:10:48.9872880Z U __gxx_personality_v0@CXXABI_1.3 2025-05-07T20:10:48.9873200Z U __tls_get_addr@GLIBC_2.3 2025-05-07T20:10:48.9873610Z U at::RecordFunction::RecordFunction(at::StepCallbacks&&) 2025-05-07T20:10:48.9874050Z U at::RecordFunction::~RecordFunction() 2025-05-07T20:10:48.9874593Z U at::Tensor fbgemm_gpu::reshape_vbe_offsets(at::Tensor const&, at::Tensor const&, long, int) 2025-05-07T20:10:48.9875325Z U at::Tensor fbgemm_gpu::reshape_vbe_offsets(at::Tensor const&, at::Tensor const&, long, int) 2025-05-07T20:10:48.9875986Z U at::Tensor::index(std::initializer_list) const 2025-05-07T20:10:48.9876615Z U at::Tensor::index_put_(std::initializer_list, at::Tensor const&) 2025-05-07T20:10:48.9877876Z U at::_ops::empty_memory_format::call(c10::ArrayRef, std::optional, std::optional, std::optional, std::optional, std::optional) 2025-05-07T20:10:48.9878778Z U at::_ops::flatten_using_ints::call(at::Tensor const&, long, long) 2025-05-07T20:10:48.9879268Z U at::_ops::reshape::call(at::Tensor const&, c10::ArrayRef) 2025-05-07T20:10:48.9879743Z U at::_ops::select_int::call(at::Tensor const&, long, c10::SymInt) 2025-05-07T20:10:48.9880173Z U at::getStepCallbacksUnlessEmpty(at::RecordScope) 2025-05-07T20:10:48.9880663Z U c10::BFloat16* at::TensorBase::data_ptr() const 2025-05-07T20:10:48.9881146Z U c10::BFloat16* at::TensorBase::mutable_data_ptr() const 2025-05-07T20:10:48.9881545Z U c10::BoolType::get() 2025-05-07T20:10:48.9881901Z U c10::DeviceTypeName[abi:cxx11](c10::DeviceType, bool) 2025-05-07T20:10:48.9882325Z U c10::Dispatcher::findSchemaOrThrow(char const*, char const*) 2025-05-07T20:10:48.9882734Z U c10::Dispatcher::realSingleton() 2025-05-07T20:10:48.9883434Z U c10::Dispatcher::runRecordFunction(at::RecordFunction&, std::reference_wrapper, c10::DispatchKey, c10::DispatchKeySet) 2025-05-07T20:10:48.9884643Z U c10::Dispatcher::runRecordFunction(at::RecordFunction&, std::reference_wrapper, c10::DispatchKey, c10::DispatchKeySet, c10::ArrayRef) 2025-05-07T20:10:48.9885716Z U c10::Error::Error(c10::SourceLocation, std::__cxx11::basic_string, std::allocator >) 2025-05-07T20:10:48.9886272Z U c10::Error::what() const 2025-05-07T20:10:48.9886582Z U c10::FloatType::get() 2025-05-07T20:10:48.9886937Z U c10::Half* at::TensorBase::data_ptr() const 2025-05-07T20:10:48.9887387Z U c10::Half* at::TensorBase::mutable_data_ptr() const 2025-05-07T20:10:48.9887816Z U c10::IValue::reportToTensorTypeError() const 2025-05-07T20:10:48.9888158Z U c10::IntType::get() 2025-05-07T20:10:48.9888531Z U c10::MessageLogger::MessageLogger(char const*, int, int) 2025-05-07T20:10:48.9888939Z U c10::MessageLogger::~MessageLogger() 2025-05-07T20:10:48.9890861Z U c10::OperatorHandle::~OperatorHandle() 2025-05-07T20:10:48.9891251Z U c10::OperatorHandle::~OperatorHandle() 2025-05-07T20:10:48.9891621Z U c10::SymBool::expect_true(char const*, long) const 2025-05-07T20:10:48.9892029Z U c10::SymBool::guard_bool(char const*, long) const 2025-05-07T20:10:48.9892412Z U c10::SymFloat::guard_float(char const*, long) const 2025-05-07T20:10:48.9893071Z U c10::SymInt::SymInt(c10::intrusive_ptr >) 2025-05-07T20:10:48.9893719Z U c10::SymInt::guard_int(char const*, long) const 2025-05-07T20:10:48.9894082Z U c10::SymInt::operator+=(c10::SymInt const&) 2025-05-07T20:10:48.9894468Z U c10::SymInt::operator/(c10::SymInt const&) const 2025-05-07T20:10:48.9894829Z U c10::SymInt::promote_to_negative() 2025-05-07T20:10:48.9895190Z U c10::SymInt::sym_eq(c10::SymInt const&) const 2025-05-07T20:10:48.9895570Z U c10::SymInt::sym_ge(c10::SymInt const&) const 2025-05-07T20:10:48.9895967Z U c10::SymInt::sym_gt(c10::SymInt const&) const 2025-05-07T20:10:48.9896339Z U c10::SymInt::sym_le(c10::SymInt const&) const 2025-05-07T20:10:48.9896710Z U c10::SymInt::sym_ne(c10::SymInt const&) const 2025-05-07T20:10:48.9897082Z U c10::SymInt::toSymNode() const 2025-05-07T20:10:48.9897415Z U c10::SymIntType::get() 2025-05-07T20:10:48.9897711Z U c10::TensorType::get() 2025-05-07T20:10:48.9898045Z U c10::UndefinedTensorImpl::_singleton 2025-05-07T20:10:48.9898956Z U c10::Warning::Warning(std::variant, c10::SourceLocation const&, std::__cxx11::basic_string, std::allocator >, bool) 2025-05-07T20:10:48.9899864Z U c10::cuda::CUDACachingAllocator::allocator 2025-05-07T20:10:48.9900260Z U c10::cuda::CUDAStream::stream() const 2025-05-07T20:10:48.9900586Z U c10::cuda::ExchangeDevice(signed char) 2025-05-07T20:10:48.9900932Z U c10::cuda::GetDevice(signed char*) 2025-05-07T20:10:48.9901272Z U c10::cuda::MaybeSetDevice(signed char) 2025-05-07T20:10:48.9901600Z U c10::cuda::SetDevice(signed char) 2025-05-07T20:10:48.9902074Z U c10::cuda::c10_cuda_check_implementation(int, char const*, char const*, int, bool) 2025-05-07T20:10:48.9902521Z U c10::cuda::device_count() 2025-05-07T20:10:48.9902869Z U c10::cuda::getCurrentCUDAStream(signed char) 2025-05-07T20:10:48.9903231Z U c10::cuda::getDefaultCUDAStream(signed char) 2025-05-07T20:10:48.9903625Z U c10::cuda::getStreamFromPool(bool, signed char) 2025-05-07T20:10:48.9904018Z U c10::cuda::getStreamFromPool(int, signed char) 2025-05-07T20:10:48.9904408Z U c10::cuda::setCurrentCUDAStream(c10::cuda::CUDAStream) 2025-05-07T20:10:48.9904792Z U c10::cuda::warn_or_error_on_sync() 2025-05-07T20:10:48.9905495Z U c10::detail::infer_schema::make_function_schema(c10::ArrayRef, c10::ArrayRef) 2025-05-07T20:10:48.9906376Z U c10::detail::torchCheckFail(char const*, char const*, unsigned int, char const*) 2025-05-07T20:10:48.9907227Z U c10::detail::torchCheckFail(char const*, char const*, unsigned int, std::__cxx11::basic_string, std::allocator > const&) 2025-05-07T20:10:48.9908235Z U c10::detail::torchInternalAssertFail(char const*, char const*, unsigned int, char const*, char const*) 2025-05-07T20:10:48.9909271Z U c10::detail::torchInternalAssertFail(char const*, char const*, unsigned int, char const*, std::__cxx11::basic_string, std::allocator > const&) 2025-05-07T20:10:48.9910046Z U c10::get_default_dtype() 2025-05-07T20:10:48.9910361Z U c10::impl::GPUTrace::gpuTraceState 2025-05-07T20:10:48.9910699Z U c10::impl::GPUTrace::haveState 2025-05-07T20:10:48.9911292Z U c10::impl::OperatorEntry::assertSignatureIsCorrect(c10::impl::CppSignature const&, bool) const 2025-05-07T20:10:48.9912118Z U c10::impl::OperatorEntry::reportError(c10::DispatchKey) const 2025-05-07T20:10:48.9912572Z U c10::impl::device_guard_impl_registry 2025-05-07T20:10:48.9912932Z U c10::impl::raw_local_dispatch_key_set 2025-05-07T20:10:48.9913343Z U c10::initializeFunctionalityOffsetsAndMasks() 2025-05-07T20:10:48.9913741Z U c10::operator%(c10::SymInt const&, int) 2025-05-07T20:10:48.9914129Z U c10::operator*(c10::SymInt const&, long) 2025-05-07T20:10:48.9914516Z U c10::operator-(c10::SymInt const&, int) 2025-05-07T20:10:48.9914876Z U c10::operator<(c10::SymInt const&, int) 2025-05-07T20:10:48.9915273Z U c10::operator<<(std::ostream&, c10::Device const&) 2025-05-07T20:10:48.9915672Z U c10::operator<<(std::ostream&, c10::DeviceType) 2025-05-07T20:10:48.9916110Z U c10::operator<<(std::ostream&, c10::OperatorName const&) 2025-05-07T20:10:48.9916590Z U c10::operator<<(std::ostream&, c10::SymInt const&) 2025-05-07T20:10:48.9916956Z U c10::warn(c10::Warning const&) 2025-05-07T20:10:48.9917387Z U caffe2::TypeMeta::error_unsupported_typemeta(caffe2::TypeMeta) 2025-05-07T20:10:48.9917836Z U cudaDeviceSynchronize@libcudart.so.12 2025-05-07T20:10:48.9918232Z U cudaEventCreateWithFlags@libcudart.so.12 2025-05-07T20:10:48.9918604Z U cudaEventDestroy@libcudart.so.12 2025-05-07T20:10:48.9918987Z U cudaEventElapsedTime@libcudart.so.12 2025-05-07T20:10:48.9919383Z U cudaEventQuery@libcudart.so.12 2025-05-07T20:10:48.9919747Z U cudaEventRecord@libcudart.so.12 2025-05-07T20:10:48.9920122Z U cudaEventSynchronize@libcudart.so.12 2025-05-07T20:10:48.9920482Z U cudaGetErrorString@libcudart.so.12 2025-05-07T20:10:48.9920885Z U cudaGetLastError@libcudart.so.12 2025-05-07T20:10:48.9921236Z U cudaLaunchKernel@libcudart.so.12 2025-05-07T20:10:48.9921603Z U cudaStreamQuery@libcudart.so.12 2025-05-07T20:10:48.9921966Z U cudaStreamSynchronize@libcudart.so.12 2025-05-07T20:10:48.9922343Z U cudaStreamWaitEvent@libcudart.so.12 2025-05-07T20:10:48.9922887Z U fbgemm_gpu::reshape_vbe_output(at::Tensor const&, long, at::Tensor const&, at::Tensor const&) 2025-05-07T20:10:48.9923415Z U float at::Tensor::item() const 2025-05-07T20:10:48.9923802Z U float* at::TensorBase::data_ptr() const 2025-05-07T20:10:48.9924221Z U float* at::TensorBase::mutable_data_ptr() const 2025-05-07T20:10:48.9924608Z U free@GLIBC_2.2.5 2025-05-07T20:10:48.9924915Z U int at::Tensor::item() const 2025-05-07T20:10:48.9925290Z U int* at::TensorBase::data_ptr() const 2025-05-07T20:10:48.9925751Z U int* at::TensorBase::mutable_data_ptr() const 2025-05-07T20:10:48.9926195Z U long c10::detail::maybe_wrap_dim_slow(long, long, bool) 2025-05-07T20:10:48.9926642Z U long* at::TensorBase::data_ptr() const 2025-05-07T20:10:48.9927047Z U long* at::TensorBase::mutable_data_ptr() const 2025-05-07T20:10:48.9927439Z U memcpy@GLIBC_2.14 2025-05-07T20:10:48.9927756Z U memmove@GLIBC_2.2.5 2025-05-07T20:10:48.9928084Z U memset@GLIBC_2.2.5 2025-05-07T20:10:48.9928416Z U operator delete(void*)@GLIBCXX_3.4 2025-05-07T20:10:48.9928776Z U operator new(unsigned long)@GLIBCXX_3.4 2025-05-07T20:10:48.9929382Z U std::__cxx11::basic_ostringstream, std::allocator >::basic_ostringstream() 2025-05-07T20:10:48.9930242Z U std::__cxx11::basic_ostringstream, std::allocator >::~basic_ostringstream()@GLIBCXX_3.4.21 2025-05-07T20:10:48.9930894Z U std::__throw_bad_alloc()@GLIBCXX_3.4 2025-05-07T20:10:48.9931284Z U std::__throw_length_error(char const*)@GLIBCXX_3.4 2025-05-07T20:10:48.9931710Z U std::__throw_logic_error(char const*)@GLIBCXX_3.4 2025-05-07T20:10:48.9932165Z U std::__throw_out_of_range_fmt(char const*, ...)@GLIBCXX_3.4.20 2025-05-07T20:10:48.9932717Z U std::basic_ios >::clear(std::_Ios_Iostate)@GLIBCXX_3.4 2025-05-07T20:10:48.9933694Z U std::basic_ostream >& std::__ostream_insert >(std::basic_ostream >&, char const*, long)@GLIBCXX_3.4.9 2025-05-07T20:10:48.9934554Z U std::exception::~exception()@GLIBCXX_3.4 2025-05-07T20:10:48.9935071Z U std::ios_base::Init::Init()@GLIBCXX_3.4 2025-05-07T20:10:48.9935592Z U std::ios_base::Init::~Init()@GLIBCXX_3.4 2025-05-07T20:10:48.9935956Z U std::ios_base::~ios_base()@GLIBCXX_3.4 2025-05-07T20:10:48.9936290Z U std::locale::~locale()@GLIBCXX_3.4 2025-05-07T20:10:48.9936703Z U std::ostream& std::ostream::_M_insert(long)@GLIBCXX_3.4.9 2025-05-07T20:10:48.9937223Z U std::ostream& std::ostream::_M_insert(unsigned long)@GLIBCXX_3.4.9 2025-05-07T20:10:48.9937706Z U std::ostream::operator<<(int)@GLIBCXX_3.4 2025-05-07T20:10:48.9938043Z U std::terminate()@GLIBCXX_3.4 2025-05-07T20:10:48.9938405Z U strlen@GLIBC_2.2.5 2025-05-07T20:10:48.9938736Z U torch::CppFunction::~CppFunction() 2025-05-07T20:10:48.9939528Z U torch::Library::Library(torch::Library::Kind, std::__cxx11::basic_string, std::allocator >, std::optional, char const*, unsigned int) 2025-05-07T20:10:48.9940656Z U torch::Library::_def(c10::FunctionSchema&&, c10::OperatorName*, std::vector > const&, torch::_RegisterOrVerify) & 2025-05-07T20:10:48.9941469Z U torch::Library::_impl(char const*, torch::CppFunction&&, torch::_RegisterOrVerify) & 2025-05-07T20:10:48.9942173Z U torch::jit::parseSchema(std::__cxx11::basic_string, std::allocator > const&, bool) 2025-05-07T20:10:48.9942762Z U typeinfo for c10::Error 2025-05-07T20:10:48.9943116Z U vtable for __cxxabiv1::__class_type_info@CXXABI_1.3 2025-05-07T20:10:48.9943555Z U vtable for __cxxabiv1::__function_type_info@CXXABI_1.3 2025-05-07T20:10:48.9943993Z U vtable for __cxxabiv1::__si_class_type_info@CXXABI_1.3 2025-05-07T20:10:48.9944356Z U vtable for c10::Error 2025-05-07T20:10:48.9944920Z U vtable for std::__cxx11::basic_stringbuf, std::allocator >@GLIBCXX_3.4.21 2025-05-07T20:10:48.9945555Z U vtable for std::basic_streambuf >@GLIBCXX_3.4 2025-05-07T20:10:48.9946064Z w TLS init function for c10::impl::raw_local_dispatch_key_set 2025-05-07T20:10:48.9946468Z w _ITM_deregisterTMCloneTable 2025-05-07T20:10:48.9946775Z w _ITM_registerTMCloneTable 2025-05-07T20:10:48.9947091Z w __cxa_finalize@GLIBC_2.2.5 2025-05-07T20:10:48.9947404Z w __gmon_start__ 2025-05-07T20:10:48.9947686Z w __pthread_key_create 2025-05-07T20:10:48.9948019Z [CHECK] Listing out external shared libraries linked: 2025-05-07T20:10:48.9948504Z + ldd ./_skbuild/linux-x86_64-3.12/cmake-build/fbgemm_gpu_tbe_training_forward.so 2025-05-07T20:10:48.9948842Z 2025-05-07T20:10:48.9949004Z linux-vdso.so.1 (0x00007fff7838f000) 2025-05-07T20:10:48.9949281Z libc10.so => not found 2025-05-07T20:10:48.9949543Z libnvrtc.so.12 => not found 2025-05-07T20:10:48.9949793Z libc10_cuda.so => not found 2025-05-07T20:10:48.9950065Z libnccl.so.2 => not found 2025-05-07T20:10:48.9950317Z libcuda.so.1 => not found 2025-05-07T20:10:48.9950954Z fbgemm_gpu_tbe_common.so => /__w/FBGEMM/FBGEMM/fbgemm_gpu/./_skbuild/linux-x86_64-3.12/cmake-build/fbgemm_gpu_tbe_common.so (0x00007fd73c600000) 2025-05-07T20:10:48.9951908Z libnvidia-ml.so.1 => not found 2025-05-07T20:10:48.9952190Z libtorch.so => not found 2025-05-07T20:10:48.9952482Z libtorch_cpu.so => not found 2025-05-07T20:10:48.9952764Z libtorch_cuda.so => not found 2025-05-07T20:10:48.9953060Z libcudart.so.12 => not found 2025-05-07T20:10:48.9953406Z libstdc++.so.6 => /lib64/libstdc++.so.6 (0x00007fd73c39c000) 2025-05-07T20:10:48.9953860Z libgcc_s.so.1 => /lib64/libgcc_s.so.1 (0x00007fd752671000) 2025-05-07T20:10:48.9954248Z libc.so.6 => /lib64/libc.so.6 (0x00007fd73c194000) 2025-05-07T20:10:48.9954674Z /lib64/ld-linux-x86-64.so.2 (0x00007fd7526a7000) 2025-05-07T20:10:48.9955022Z libc10.so => not found 2025-05-07T20:10:48.9955277Z libnvrtc.so.12 => not found 2025-05-07T20:10:48.9955568Z libc10_cuda.so => not found 2025-05-07T20:10:48.9955836Z libnccl.so.2 => not found 2025-05-07T20:10:48.9956117Z libcuda.so.1 => not found 2025-05-07T20:10:48.9956654Z fbgemm.so => /__w/FBGEMM/FBGEMM/fbgemm_gpu/./_skbuild/linux-x86_64-3.12/cmake-build/fbgemm.so (0x00007fd73bc00000) 2025-05-07T20:10:48.9957130Z fbgemm_gpu_config.so => /__w/FBGEMM/FBGEMM/fbgemm_gpu/./_skbuild/linux-x86_64-3.12/cmake-build/fbgemm_gpu_config.so (0x00007fd752664000) 2025-05-07T20:10:48.9957268Z libnvidia-ml.so.1 => not found 2025-05-07T20:10:48.9957367Z libtorch.so => not found 2025-05-07T20:10:48.9957487Z libtorch_cpu.so => not found 2025-05-07T20:10:48.9957590Z libtorch_cuda.so => not found 2025-05-07T20:10:48.9957694Z libcudart.so.12 => not found 2025-05-07T20:10:48.9957851Z libgomp.so.1 => /lib64/libgomp.so.1 (0x00007fd75260c000) 2025-05-07T20:10:48.9958000Z libm.so.6 => /lib64/libm.so.6 (0x00007fd752531000) 2025-05-07T20:10:48.9958099Z libc10.so => not found 2025-05-07T20:10:48.9958196Z libnvrtc.so.12 => not found 2025-05-07T20:10:48.9958314Z libc10_cuda.so => not found 2025-05-07T20:10:48.9958414Z libnccl.so.2 => not found 2025-05-07T20:10:48.9958512Z libcuda.so.1 => not found 2025-05-07T20:10:48.9958883Z asmjit.so => /__w/FBGEMM/FBGEMM/fbgemm_gpu/./_skbuild/linux-x86_64-3.12/cmake-build/asmjit.so (0x00007fd73c989000) 2025-05-07T20:10:48.9959004Z libnvidia-ml.so.1 => not found 2025-05-07T20:10:48.9959109Z libtorch.so => not found 2025-05-07T20:10:48.9959213Z libtorch_cpu.so => not found 2025-05-07T20:10:48.9959336Z libtorch_cuda.so => not found 2025-05-07T20:10:48.9959433Z libtorch.so => not found 2025-05-07T20:10:48.9959527Z libc10.so => not found 2025-05-07T20:10:48.9959631Z libnvrtc.so.12 => not found 2025-05-07T20:10:48.9959747Z libc10_cuda.so => not found 2025-05-07T20:10:48.9959847Z libnccl.so.2 => not found 2025-05-07T20:10:48.9959951Z libcuda.so.1 => not found 2025-05-07T20:10:48.9960105Z libnvidia-ml.so.1 => not found 2025-05-07T20:10:48.9960208Z libtorch_cpu.so => not found 2025-05-07T20:10:48.9960311Z libtorch_cuda.so => not found 2025-05-07T20:10:48.9960501Z libpthread.so.0 => /lib64/libpthread.so.0 (0x00007fd752524000) 2025-05-07T20:10:48.9960618Z libtorch.so => not found 2025-05-07T20:10:48.9960714Z libc10.so => not found 2025-05-07T20:10:48.9960816Z libnvrtc.so.12 => not found 2025-05-07T20:10:48.9960935Z libc10_cuda.so => not found 2025-05-07T20:10:48.9961066Z libnccl.so.2 => not found 2025-05-07T20:10:48.9961162Z libcuda.so.1 => not found 2025-05-07T20:10:48.9961276Z libnvidia-ml.so.1 => not found 2025-05-07T20:10:48.9961397Z libtorch_cpu.so => not found 2025-05-07T20:10:48.9961502Z libtorch_cuda.so => not found 2025-05-07T20:10:48.9961644Z librt.so.1 => /lib64/librt.so.1 (0x00007fd75251b000) 2025-05-07T20:10:48.9961651Z 2025-05-07T20:10:48.9961780Z [CHECK] Displaying ELF information: 2025-05-07T20:10:48.9962057Z + readelf -d ./_skbuild/linux-x86_64-3.12/cmake-build/fbgemm_gpu_tbe_training_forward.so 2025-05-07T20:10:48.9962062Z 2025-05-07T20:10:48.9980821Z 2025-05-07T20:10:48.9981565Z Dynamic section at offset 0x15292018 contains 40 entries: 2025-05-07T20:10:48.9982169Z Tag Type Name/Value 2025-05-07T20:10:48.9982963Z 0x0000000000000001 (NEEDED) Shared library: [libc10.so] 2025-05-07T20:10:48.9983593Z 0x0000000000000001 (NEEDED) Shared library: [libnvrtc.so.12] 2025-05-07T20:10:48.9984204Z 0x0000000000000001 (NEEDED) Shared library: [libc10_cuda.so] 2025-05-07T20:10:48.9984796Z 0x0000000000000001 (NEEDED) Shared library: [libnccl.so.2] 2025-05-07T20:10:48.9985370Z 0x0000000000000001 (NEEDED) Shared library: [libcuda.so.1] 2025-05-07T20:10:48.9986050Z 0x0000000000000001 (NEEDED) Shared library: [fbgemm_gpu_tbe_common.so] 2025-05-07T20:10:48.9986753Z 0x0000000000000001 (NEEDED) Shared library: [libnvidia-ml.so.1] 2025-05-07T20:10:48.9987121Z 0x0000000000000001 (NEEDED) Shared library: [libtorch.so] 2025-05-07T20:10:48.9987345Z 0x0000000000000001 (NEEDED) Shared library: [libtorch_cpu.so] 2025-05-07T20:10:48.9987553Z 0x0000000000000001 (NEEDED) Shared library: [libtorch_cuda.so] 2025-05-07T20:10:48.9987766Z 0x0000000000000001 (NEEDED) Shared library: [libcudart.so.12] 2025-05-07T20:10:48.9987982Z 0x0000000000000001 (NEEDED) Shared library: [libstdc++.so.6] 2025-05-07T20:10:48.9988186Z 0x0000000000000001 (NEEDED) Shared library: [libgcc_s.so.1] 2025-05-07T20:10:48.9988438Z 0x0000000000000001 (NEEDED) Shared library: [libc.so.6] 2025-05-07T20:10:48.9988680Z 0x0000000000000001 (NEEDED) Shared library: [ld-linux-x86-64.so.2] 2025-05-07T20:10:48.9988949Z 0x000000000000000e (SONAME) Library soname: [fbgemm_gpu_tbe_training_forward.so] 2025-05-07T20:10:48.9989139Z 0x000000000000000f (RPATH) Library rpath: [$ORIGIN] 2025-05-07T20:10:48.9989264Z 0x000000000000000c (INIT) 0x453000 2025-05-07T20:10:48.9989407Z 0x000000000000000d (FINI) 0x1fe941c 2025-05-07T20:10:48.9989536Z 0x0000000000000019 (INIT_ARRAY) 0x152889a8 2025-05-07T20:10:48.9989673Z 0x000000000000001b (INIT_ARRAYSZ) 752 (bytes) 2025-05-07T20:10:48.9989828Z 0x000000000000001a (FINI_ARRAY) 0x15288c98 2025-05-07T20:10:48.9989956Z 0x000000000000001c (FINI_ARRAYSZ) 8 (bytes) 2025-05-07T20:10:48.9990078Z 0x000000006ffffef5 (GNU_HASH) 0x200 2025-05-07T20:10:48.9990215Z 0x0000000000000005 (STRTAB) 0x624b8 2025-05-07T20:10:48.9990330Z 0x0000000000000006 (SYMTAB) 0x184f0 2025-05-07T20:10:48.9990475Z 0x000000000000000a (STRSZ) 3694099 (bytes) 2025-05-07T20:10:48.9990598Z 0x000000000000000b (SYMENT) 24 (bytes) 2025-05-07T20:10:48.9990736Z 0x0000000000000003 (PLTGOT) 0x152a8fe8 2025-05-07T20:10:48.9990880Z 0x0000000000000002 (PLTRELSZ) 14520 (bytes) 2025-05-07T20:10:48.9991037Z 0x0000000000000014 (PLTREL) RELA 2025-05-07T20:10:48.9991175Z 0x0000000000000017 (JMPREL) 0x44ece0 2025-05-07T20:10:48.9991428Z 0x0000000000000007 (RELA) 0x3ee668 2025-05-07T20:10:48.9991571Z 0x0000000000000008 (RELASZ) 394872 (bytes) 2025-05-07T20:10:48.9991697Z 0x0000000000000009 (RELAENT) 24 (bytes) 2025-05-07T20:10:48.9991853Z 0x000000006ffffffe (VERNEED) 0x3ee578 2025-05-07T20:10:48.9992018Z 0x000000006fffffff (VERNEEDNUM) 5 2025-05-07T20:10:48.9992149Z 0x000000006ffffff0 (VERSYM) 0x3e82cc 2025-05-07T20:10:48.9992278Z 0x000000006ffffff9 (RELACOUNT) 1976 2025-05-07T20:10:48.9992460Z 0x0000000000000000 (NULL) 0x0 2025-05-07T20:10:48.9992465Z 2025-05-07T20:10:48.9992588Z ################################################################################ 2025-05-07T20:10:48.9992595Z 2025-05-07T20:10:48.9992599Z 2025-05-07T20:10:48.9992739Z ################################################################################ 2025-05-07T20:10:48.9993067Z [CHECK] BUILT LIBRARY: ./_skbuild/linux-x86_64-3.12/cmake-build/fbgemm_gpu_embedding_inplace_ops.so 2025-05-07T20:10:48.9993181Z [CHECK] Listing out library size: 2025-05-07T20:10:48.9993521Z + du -h --block-size=1M ./_skbuild/linux-x86_64-3.12/cmake-build/fbgemm_gpu_embedding_inplace_ops.so 2025-05-07T20:10:48.9993526Z 2025-05-07T20:10:48.9993815Z 1 ./_skbuild/linux-x86_64-3.12/cmake-build/fbgemm_gpu_embedding_inplace_ops.so 2025-05-07T20:10:48.9995138Z 2025-05-07T20:10:48.9995879Z [CHECK] Listing out the GLIBC versions referenced by: ./_skbuild/linux-x86_64-3.12/cmake-build/fbgemm_gpu_embedding_inplace_ops.so 2025-05-07T20:10:48.9996435Z + objdump -TC ./_skbuild/linux-x86_64-3.12/cmake-build/fbgemm_gpu_embedding_inplace_ops.so | grep GLIBC_ | sed 's/.*GLIBC_\([.0-9]*\).*/GLIBC_\1/g' | sort -Vu | cat 2025-05-07T20:10:48.9996488Z 2025-05-07T20:10:49.0056090Z GLIBC_2.2.5 2025-05-07T20:10:49.0056575Z GLIBC_2.3 2025-05-07T20:10:49.0056834Z GLIBC_2.14 2025-05-07T20:10:49.0056876Z 2025-05-07T20:10:49.0056890Z 2025-05-07T20:10:49.0058270Z [CHECK] Listing out the GLIBCXX versions referenced by: ./_skbuild/linux-x86_64-3.12/cmake-build/fbgemm_gpu_embedding_inplace_ops.so 2025-05-07T20:10:49.0059981Z + objdump -TC ./_skbuild/linux-x86_64-3.12/cmake-build/fbgemm_gpu_embedding_inplace_ops.so | grep GLIBCXX_ | sed 's/.*GLIBCXX_\([.0-9]*\).*/GLIBCXX_\1/g' | sort -Vu | cat 2025-05-07T20:10:49.0059997Z 2025-05-07T20:10:49.0115930Z GLIBCXX_3.4 2025-05-07T20:10:49.0116191Z GLIBCXX_3.4.9 2025-05-07T20:10:49.0116769Z GLIBCXX_3.4.18 2025-05-07T20:10:49.0117009Z GLIBCXX_3.4.20 2025-05-07T20:10:49.0117243Z GLIBCXX_3.4.21 2025-05-07T20:10:49.0117280Z 2025-05-07T20:10:49.0117293Z 2025-05-07T20:10:49.0139809Z + nm -gDC ./_skbuild/linux-x86_64-3.12/cmake-build/fbgemm_gpu_embedding_inplace_ops.so > /tmp/tmp.ZUF6oO3w4e.symbols.txt 2025-05-07T20:10:49.0139881Z 2025-05-07T20:10:49.0166061Z 2025-05-07T20:10:49.0193793Z [CHECK] Total Number of symbols: 357 2025-05-07T20:10:49.0208046Z [CHECK] Number of fbgemm symbols: 57 2025-05-07T20:10:49.0224971Z + nm -gDCu ./_skbuild/linux-x86_64-3.12/cmake-build/fbgemm_gpu_embedding_inplace_ops.so > /tmp/tmp.KvtnaeKxLY.usymbols.txt 2025-05-07T20:10:49.0224987Z 2025-05-07T20:10:49.0238468Z 2025-05-07T20:10:49.0266517Z [CHECK] Listing out undefined symbols (118 total): 2025-05-07T20:10:49.0279664Z U VTT for std::__cxx11::basic_ostringstream, std::allocator >@GLIBCXX_3.4.21 2025-05-07T20:10:49.0280556Z U VTT for std::__cxx11::basic_stringstream, std::allocator >@GLIBCXX_3.4.21 2025-05-07T20:10:49.0280707Z U _Unwind_Resume@GCC_3.0 2025-05-07T20:10:49.0280866Z U __cudaPopCallConfiguration@libcudart.so.12 2025-05-07T20:10:49.0281051Z U __cudaPushCallConfiguration@libcudart.so.12 2025-05-07T20:10:49.0281408Z U __cudaRegisterFatBinary@libcudart.so.12 2025-05-07T20:10:49.0281643Z U __cudaRegisterFatBinaryEnd@libcudart.so.12 2025-05-07T20:10:49.0294725Z U __cudaRegisterFunction@libcudart.so.12 2025-05-07T20:10:49.0295024Z U __cudaRegisterVar@libcudart.so.12 2025-05-07T20:10:49.0295168Z U __cudaUnregisterFatBinary@libcudart.so.12 2025-05-07T20:10:49.0295292Z U __cxa_atexit@GLIBC_2.2.5 2025-05-07T20:10:49.0295422Z U __cxa_begin_catch@CXXABI_1.3 2025-05-07T20:10:49.0295694Z U __cxa_end_catch@CXXABI_1.3 2025-05-07T20:10:49.0295831Z U __cxa_guard_abort@CXXABI_1.3 2025-05-07T20:10:49.0295943Z U __cxa_guard_acquire@CXXABI_1.3 2025-05-07T20:10:49.0296052Z U __cxa_guard_release@CXXABI_1.3 2025-05-07T20:10:49.0296156Z U __cxa_rethrow@CXXABI_1.3 2025-05-07T20:10:49.0296286Z U __cxa_thread_atexit@CXXABI_1.3.7 2025-05-07T20:10:49.0296406Z U __gxx_personality_v0@CXXABI_1.3 2025-05-07T20:10:49.0296511Z U __tls_get_addr@GLIBC_2.3 2025-05-07T20:10:49.0297138Z U at::_ops::empty_like::call(at::Tensor const&, std::optional, std::optional, std::optional, std::optional, std::optional) 2025-05-07T20:10:49.0297781Z U at::_ops::empty_memory_format::call(c10::ArrayRef, std::optional, std::optional, std::optional, std::optional, std::optional) 2025-05-07T20:10:49.0297954Z U c10::DeviceTypeName[abi:cxx11](c10::DeviceType, bool) 2025-05-07T20:10:49.0298123Z U c10::IValue::reportToTensorTypeError() const 2025-05-07T20:10:49.0298226Z U c10::IntType::get() 2025-05-07T20:10:49.0298399Z U c10::MessageLogger::MessageLogger(char const*, int, int) 2025-05-07T20:10:49.0298588Z U c10::MessageLogger::~MessageLogger() 2025-05-07T20:10:49.0298813Z U c10::OptionalType::get(c10::Type::SingletonOrSharedTypePtr) 2025-05-07T20:10:49.0299217Z U c10::SymInt::SymInt(c10::intrusive_ptr >) 2025-05-07T20:10:49.0299375Z U c10::SymInt::guard_int(char const*, long) const 2025-05-07T20:10:49.0299497Z U c10::SymInt::promote_to_negative() 2025-05-07T20:10:49.0299606Z U c10::TensorType::get() 2025-05-07T20:10:49.0299794Z U c10::UndefinedTensorImpl::_singleton 2025-05-07T20:10:49.0300503Z U c10::Warning::Warning(std::variant, c10::SourceLocation const&, std::__cxx11::basic_string, std::allocator >, bool) 2025-05-07T20:10:49.0300642Z U c10::cuda::CUDACachingAllocator::allocator 2025-05-07T20:10:49.0300787Z U c10::cuda::CUDAStream::stream() const 2025-05-07T20:10:49.0300911Z U c10::cuda::ExchangeDevice(signed char) 2025-05-07T20:10:49.0301031Z U c10::cuda::GetDevice(signed char*) 2025-05-07T20:10:49.0301171Z U c10::cuda::MaybeSetDevice(signed char) 2025-05-07T20:10:49.0301287Z U c10::cuda::SetDevice(signed char) 2025-05-07T20:10:49.0301543Z U c10::cuda::c10_cuda_check_implementation(int, char const*, char const*, int, bool) 2025-05-07T20:10:49.0301667Z U c10::cuda::device_count() 2025-05-07T20:10:49.0301813Z U c10::cuda::getCurrentCUDAStream(signed char) 2025-05-07T20:10:49.0301948Z U c10::cuda::getDefaultCUDAStream(signed char) 2025-05-07T20:10:49.0302113Z U c10::cuda::getStreamFromPool(bool, signed char) 2025-05-07T20:10:49.0302252Z U c10::cuda::getStreamFromPool(int, signed char) 2025-05-07T20:10:49.0302443Z U c10::cuda::setCurrentCUDAStream(c10::cuda::CUDAStream) 2025-05-07T20:10:49.0302581Z U c10::cuda::warn_or_error_on_sync() 2025-05-07T20:10:49.0303094Z U c10::detail::infer_schema::make_function_schema(c10::ArrayRef, c10::ArrayRef) 2025-05-07T20:10:49.0303347Z U c10::detail::torchCheckFail(char const*, char const*, unsigned int, char const*) 2025-05-07T20:10:49.0303874Z U c10::detail::torchCheckFail(char const*, char const*, unsigned int, std::__cxx11::basic_string, std::allocator > const&) 2025-05-07T20:10:49.0304214Z U c10::detail::torchInternalAssertFail(char const*, char const*, unsigned int, char const*, char const*) 2025-05-07T20:10:49.0304794Z U c10::detail::torchInternalAssertFail(char const*, char const*, unsigned int, char const*, std::__cxx11::basic_string, std::allocator > const&) 2025-05-07T20:10:49.0304935Z U c10::impl::GPUTrace::gpuTraceState 2025-05-07T20:10:49.0305048Z U c10::impl::GPUTrace::haveState 2025-05-07T20:10:49.0305172Z U c10::impl::device_guard_impl_registry 2025-05-07T20:10:49.0305332Z U c10::operator<<(std::ostream&, c10::Device const&) 2025-05-07T20:10:49.0305467Z U c10::operator<<(std::ostream&, c10::DeviceType) 2025-05-07T20:10:49.0305581Z U c10::warn(c10::Warning const&) 2025-05-07T20:10:49.0305795Z U caffe2::TypeMeta::error_unsupported_typemeta(caffe2::TypeMeta) 2025-05-07T20:10:49.0305924Z U cudaDeviceSynchronize@libcudart.so.12 2025-05-07T20:10:49.0306060Z U cudaEventCreateWithFlags@libcudart.so.12 2025-05-07T20:10:49.0306200Z U cudaEventDestroy@libcudart.so.12 2025-05-07T20:10:49.0306356Z U cudaEventElapsedTime@libcudart.so.12 2025-05-07T20:10:49.0306475Z U cudaEventQuery@libcudart.so.12 2025-05-07T20:10:49.0306593Z U cudaEventRecord@libcudart.so.12 2025-05-07T20:10:49.0306739Z U cudaEventSynchronize@libcudart.so.12 2025-05-07T20:10:49.0306882Z U cudaGetDeviceProperties_v2@libcudart.so.12 2025-05-07T20:10:49.0307007Z U cudaGetErrorString@libcudart.so.12 2025-05-07T20:10:49.0307146Z U cudaGetLastError@libcudart.so.12 2025-05-07T20:10:49.0307270Z U cudaLaunchKernel@libcudart.so.12 2025-05-07T20:10:49.0307583Z U cudaStreamQuery@libcudart.so.12 2025-05-07T20:10:49.0307725Z U cudaStreamSynchronize@libcudart.so.12 2025-05-07T20:10:49.0307847Z U cudaStreamWaitEvent@libcudart.so.12 2025-05-07T20:10:49.0307991Z U int* at::TensorBase::mutable_data_ptr() const 2025-05-07T20:10:49.0308168Z U long c10::detail::maybe_wrap_dim_slow(long, long, bool) 2025-05-07T20:10:49.0308334Z U long* at::TensorBase::mutable_data_ptr() const 2025-05-07T20:10:49.0308429Z U memcpy@GLIBC_2.14 2025-05-07T20:10:49.0308527Z U memmove@GLIBC_2.2.5 2025-05-07T20:10:49.0308640Z U memset@GLIBC_2.2.5 2025-05-07T20:10:49.0308753Z U operator delete(void*)@GLIBCXX_3.4 2025-05-07T20:10:49.0308874Z U operator new(unsigned long)@GLIBCXX_3.4 2025-05-07T20:10:49.0309220Z U std::__cxx11::basic_ostringstream, std::allocator >::basic_ostringstream() 2025-05-07T20:10:49.0309594Z U std::__cxx11::basic_ostringstream, std::allocator >::~basic_ostringstream()@GLIBCXX_3.4.21 2025-05-07T20:10:49.0309909Z U std::__cxx11::basic_stringstream, std::allocator >::basic_stringstream() 2025-05-07T20:10:49.0310300Z U std::__detail::_Prime_rehash_policy::_M_need_rehash(unsigned long, unsigned long, unsigned long) const@GLIBCXX_3.4.18 2025-05-07T20:10:49.0310413Z U std::__throw_bad_alloc()@GLIBCXX_3.4 2025-05-07T20:10:49.0310528Z U std::__throw_bad_array_new_length() 2025-05-07T20:10:49.0310683Z U std::__throw_length_error(char const*)@GLIBCXX_3.4 2025-05-07T20:10:49.0310817Z U std::__throw_logic_error(char const*)@GLIBCXX_3.4 2025-05-07T20:10:49.0311004Z U std::__throw_out_of_range_fmt(char const*, ...)@GLIBCXX_3.4.20 2025-05-07T20:10:49.0311381Z U std::basic_ios >::clear(std::_Ios_Iostate)@GLIBCXX_3.4 2025-05-07T20:10:49.0312132Z U std::basic_ostream >& std::__ostream_insert >(std::basic_ostream >&, char const*, long)@GLIBCXX_3.4.9 2025-05-07T20:10:49.0312261Z U std::ios_base::Init::Init()@GLIBCXX_3.4 2025-05-07T20:10:49.0312415Z U std::ios_base::Init::~Init()@GLIBCXX_3.4 2025-05-07T20:10:49.0312626Z U std::ios_base::~ios_base()@GLIBCXX_3.4 2025-05-07T20:10:49.0312741Z U std::locale::~locale()@GLIBCXX_3.4 2025-05-07T20:10:49.0312947Z U std::ostream& std::ostream::_M_insert(long)@GLIBCXX_3.4.9 2025-05-07T20:10:49.0313199Z U std::ostream& std::ostream::_M_insert(unsigned long)@GLIBCXX_3.4.9 2025-05-07T20:10:49.0313332Z U std::ostream::operator<<(int)@GLIBCXX_3.4 2025-05-07T20:10:49.0313468Z U std::terminate()@GLIBCXX_3.4 2025-05-07T20:10:49.0313572Z U strlen@GLIBC_2.2.5 2025-05-07T20:10:49.0313696Z U torch::CppFunction::~CppFunction() 2025-05-07T20:10:49.0314323Z U torch::Library::Library(torch::Library::Kind, std::__cxx11::basic_string, std::allocator >, std::optional, char const*, unsigned int) 2025-05-07T20:10:49.0314829Z U torch::Library::_def(c10::FunctionSchema&&, c10::OperatorName*, std::vector > const&, torch::_RegisterOrVerify) & 2025-05-07T20:10:49.0315098Z U torch::Library::_impl(char const*, torch::CppFunction&&, torch::_RegisterOrVerify) & 2025-05-07T20:10:49.0315486Z U torch::jit::parseSchema(std::__cxx11::basic_string, std::allocator > const&, bool) 2025-05-07T20:10:49.0315732Z U unsigned char* at::TensorBase::mutable_data_ptr() const 2025-05-07T20:10:49.0315894Z U vtable for __cxxabiv1::__class_type_info@CXXABI_1.3 2025-05-07T20:10:49.0316078Z U vtable for __cxxabiv1::__function_type_info@CXXABI_1.3 2025-05-07T20:10:49.0316239Z U vtable for __cxxabiv1::__si_class_type_info@CXXABI_1.3 2025-05-07T20:10:49.0316583Z U vtable for std::__cxx11::basic_stringbuf, std::allocator >@GLIBCXX_3.4.21 2025-05-07T20:10:49.0316833Z U vtable for std::basic_streambuf >@GLIBCXX_3.4 2025-05-07T20:10:49.0316979Z w _ITM_deregisterTMCloneTable 2025-05-07T20:10:49.0317091Z w _ITM_registerTMCloneTable 2025-05-07T20:10:49.0317216Z w __cxa_finalize@GLIBC_2.2.5 2025-05-07T20:10:49.0317312Z w __gmon_start__ 2025-05-07T20:10:49.0317419Z w __pthread_key_create 2025-05-07T20:10:49.0317596Z [CHECK] Listing out external shared libraries linked: 2025-05-07T20:10:49.0317854Z + ldd ./_skbuild/linux-x86_64-3.12/cmake-build/fbgemm_gpu_embedding_inplace_ops.so 2025-05-07T20:10:49.0317861Z 2025-05-07T20:10:49.0320564Z linux-vdso.so.1 (0x00007ffccf8c6000) 2025-05-07T20:10:49.0321044Z libtorch.so => not found 2025-05-07T20:10:49.0321172Z libc10.so => not found 2025-05-07T20:10:49.0321416Z libnvrtc.so.12 => not found 2025-05-07T20:10:49.0321540Z libc10_cuda.so => not found 2025-05-07T20:10:49.0321643Z libnccl.so.2 => not found 2025-05-07T20:10:49.0321747Z libcuda.so.1 => not found 2025-05-07T20:10:49.0321875Z libnvidia-ml.so.1 => not found 2025-05-07T20:10:49.0321981Z libtorch_cpu.so => not found 2025-05-07T20:10:49.0322087Z libtorch_cuda.so => not found 2025-05-07T20:10:49.0322187Z libcudart.so.12 => not found 2025-05-07T20:10:49.0322397Z libstdc++.so.6 => /lib64/libstdc++.so.6 (0x00007f59a9094000) 2025-05-07T20:10:49.0322611Z libgomp.so.1 => /lib64/libgomp.so.1 (0x00007f59a903e000) 2025-05-07T20:10:49.0322782Z libgcc_s.so.1 => /lib64/libgcc_s.so.1 (0x00007f59a9010000) 2025-05-07T20:10:49.0322999Z libc.so.6 => /lib64/libc.so.6 (0x00007f59a8e08000) 2025-05-07T20:10:49.0323139Z /lib64/ld-linux-x86-64.so.2 (0x00007f59a9373000) 2025-05-07T20:10:49.0323269Z libm.so.6 => /lib64/libm.so.6 (0x00007f59a8d2d000) 2025-05-07T20:10:49.0323289Z 2025-05-07T20:10:49.0323424Z [CHECK] Displaying ELF information: 2025-05-07T20:10:49.0323779Z + readelf -d ./_skbuild/linux-x86_64-3.12/cmake-build/fbgemm_gpu_embedding_inplace_ops.so 2025-05-07T20:10:49.0323785Z 2025-05-07T20:10:49.0353882Z 2025-05-07T20:10:49.0354771Z Dynamic section at offset 0x71b10 contains 39 entries: 2025-05-07T20:10:49.0355170Z Tag Type Name/Value 2025-05-07T20:10:49.0355719Z 0x0000000000000001 (NEEDED) Shared library: [libtorch.so] 2025-05-07T20:10:49.0355936Z 0x0000000000000001 (NEEDED) Shared library: [libc10.so] 2025-05-07T20:10:49.0356174Z 0x0000000000000001 (NEEDED) Shared library: [libnvrtc.so.12] 2025-05-07T20:10:49.0356379Z 0x0000000000000001 (NEEDED) Shared library: [libc10_cuda.so] 2025-05-07T20:10:49.0356584Z 0x0000000000000001 (NEEDED) Shared library: [libnccl.so.2] 2025-05-07T20:10:49.0356806Z 0x0000000000000001 (NEEDED) Shared library: [libcuda.so.1] 2025-05-07T20:10:49.0358397Z 0x0000000000000001 (NEEDED) Shared library: [libnvidia-ml.so.1] 2025-05-07T20:10:49.0358608Z 0x0000000000000001 (NEEDED) Shared library: [libtorch_cpu.so] 2025-05-07T20:10:49.0358953Z 0x0000000000000001 (NEEDED) Shared library: [libtorch_cuda.so] 2025-05-07T20:10:49.0359272Z 0x0000000000000001 (NEEDED) Shared library: [libcudart.so.12] 2025-05-07T20:10:49.0359463Z 0x0000000000000001 (NEEDED) Shared library: [libstdc++.so.6] 2025-05-07T20:10:49.0359654Z 0x0000000000000001 (NEEDED) Shared library: [libgomp.so.1] 2025-05-07T20:10:49.0359914Z 0x0000000000000001 (NEEDED) Shared library: [libgcc_s.so.1] 2025-05-07T20:10:49.0360099Z 0x0000000000000001 (NEEDED) Shared library: [libc.so.6] 2025-05-07T20:10:49.0360308Z 0x0000000000000001 (NEEDED) Shared library: [ld-linux-x86-64.so.2] 2025-05-07T20:10:49.0360578Z 0x000000000000000e (SONAME) Library soname: [fbgemm_gpu_embedding_inplace_ops.so] 2025-05-07T20:10:49.0360699Z 0x000000000000000c (INIT) 0x10000 2025-05-07T20:10:49.0360811Z 0x000000000000000d (FINI) 0x316ac 2025-05-07T20:10:49.0360943Z 0x0000000000000019 (INIT_ARRAY) 0x71130 2025-05-07T20:10:49.0361066Z 0x000000000000001b (INIT_ARRAYSZ) 40 (bytes) 2025-05-07T20:10:49.0361180Z 0x000000000000001a (FINI_ARRAY) 0x71158 2025-05-07T20:10:49.0361298Z 0x000000000000001c (FINI_ARRAYSZ) 8 (bytes) 2025-05-07T20:10:49.0361432Z 0x000000006ffffef5 (GNU_HASH) 0x238 2025-05-07T20:10:49.0361545Z 0x0000000000000005 (STRTAB) 0x2ba8 2025-05-07T20:10:49.0361654Z 0x0000000000000006 (SYMTAB) 0xa18 2025-05-07T20:10:49.0361805Z 0x000000000000000a (STRSZ) 36158 (bytes) 2025-05-07T20:10:49.0361926Z 0x000000000000000b (SYMENT) 24 (bytes) 2025-05-07T20:10:49.0362038Z 0x0000000000000003 (PLTGOT) 0x71fe8 2025-05-07T20:10:49.0362190Z 0x0000000000000002 (PLTRELSZ) 5520 (bytes) 2025-05-07T20:10:49.0362332Z 0x0000000000000014 (PLTREL) RELA 2025-05-07T20:10:49.0362442Z 0x0000000000000017 (JMPREL) 0xdfa8 2025-05-07T20:10:49.0362552Z 0x0000000000000007 (RELA) 0xbcc8 2025-05-07T20:10:49.0362691Z 0x0000000000000008 (RELASZ) 8928 (bytes) 2025-05-07T20:10:49.0362808Z 0x0000000000000009 (RELAENT) 24 (bytes) 2025-05-07T20:10:49.0362922Z 0x000000006ffffffe (VERNEED) 0xbbb8 2025-05-07T20:10:49.0363050Z 0x000000006fffffff (VERNEEDNUM) 5 2025-05-07T20:10:49.0363194Z 0x000000006ffffff0 (VERSYM) 0xb8e6 2025-05-07T20:10:49.0363302Z 0x000000006ffffff9 (RELACOUNT) 162 2025-05-07T20:10:49.0363401Z 0x0000000000000000 (NULL) 0x0 2025-05-07T20:10:49.0363423Z 2025-05-07T20:10:49.0363535Z ################################################################################ 2025-05-07T20:10:49.0363540Z 2025-05-07T20:10:49.0363545Z 2025-05-07T20:10:49.0363654Z ################################################################################ 2025-05-07T20:10:49.0363947Z [CHECK] BUILT LIBRARY: ./_skbuild/linux-x86_64-3.12/cmake-build/fbgemm_gpu_tbe_index_select.so 2025-05-07T20:10:49.0364050Z [CHECK] Listing out library size: 2025-05-07T20:10:49.0364318Z + du -h --block-size=1M ./_skbuild/linux-x86_64-3.12/cmake-build/fbgemm_gpu_tbe_index_select.so 2025-05-07T20:10:49.0364323Z 2025-05-07T20:10:49.0367168Z 35 ./_skbuild/linux-x86_64-3.12/cmake-build/fbgemm_gpu_tbe_index_select.so 2025-05-07T20:10:49.0368380Z 2025-05-07T20:10:49.0369551Z [CHECK] Listing out the GLIBC versions referenced by: ./_skbuild/linux-x86_64-3.12/cmake-build/fbgemm_gpu_tbe_index_select.so 2025-05-07T20:10:49.0370111Z + objdump -TC ./_skbuild/linux-x86_64-3.12/cmake-build/fbgemm_gpu_tbe_index_select.so | grep GLIBC_ | sed 's/.*GLIBC_\([.0-9]*\).*/GLIBC_\1/g' | sort -Vu | cat 2025-05-07T20:10:49.0370118Z 2025-05-07T20:10:49.0485732Z GLIBC_2.2.5 2025-05-07T20:10:49.0486370Z GLIBC_2.3 2025-05-07T20:10:49.0486592Z GLIBC_2.14 2025-05-07T20:10:49.0486611Z 2025-05-07T20:10:49.0486633Z 2025-05-07T20:10:49.0487930Z [CHECK] Listing out the GLIBCXX versions referenced by: ./_skbuild/linux-x86_64-3.12/cmake-build/fbgemm_gpu_tbe_index_select.so 2025-05-07T20:10:49.0489549Z + objdump -TC ./_skbuild/linux-x86_64-3.12/cmake-build/fbgemm_gpu_tbe_index_select.so | grep GLIBCXX_ | sed 's/.*GLIBCXX_\([.0-9]*\).*/GLIBCXX_\1/g' | sort -Vu | cat 2025-05-07T20:10:49.0489565Z 2025-05-07T20:10:49.0610333Z GLIBCXX_3.4 2025-05-07T20:10:49.0610605Z GLIBCXX_3.4.9 2025-05-07T20:10:49.0610940Z GLIBCXX_3.4.11 2025-05-07T20:10:49.0611179Z GLIBCXX_3.4.15 2025-05-07T20:10:49.0611720Z GLIBCXX_3.4.18 2025-05-07T20:10:49.0611971Z GLIBCXX_3.4.20 2025-05-07T20:10:49.0612202Z GLIBCXX_3.4.21 2025-05-07T20:10:49.0612221Z 2025-05-07T20:10:49.0612235Z 2025-05-07T20:10:49.0632228Z + nm -gDC ./_skbuild/linux-x86_64-3.12/cmake-build/fbgemm_gpu_tbe_index_select.so > /tmp/tmp.hkUVMx9aDF.symbols.txt 2025-05-07T20:10:49.0632267Z 2025-05-07T20:10:49.0712806Z 2025-05-07T20:10:49.0738474Z [CHECK] Total Number of symbols: 1545 2025-05-07T20:10:49.0753722Z [CHECK] Number of fbgemm symbols: 211 2025-05-07T20:10:49.0769565Z + nm -gDCu ./_skbuild/linux-x86_64-3.12/cmake-build/fbgemm_gpu_tbe_index_select.so > /tmp/tmp.c7LL86L09G.usymbols.txt 2025-05-07T20:10:49.0769732Z 2025-05-07T20:10:49.0792130Z 2025-05-07T20:10:49.0830247Z [CHECK] Listing out undefined symbols (266 total): 2025-05-07T20:10:49.0853947Z U VTT for std::__cxx11::basic_ostringstream, std::allocator >@GLIBCXX_3.4.21 2025-05-07T20:10:49.0854990Z U VTT for std::__cxx11::basic_stringstream, std::allocator >@GLIBCXX_3.4.21 2025-05-07T20:10:49.0855487Z U _Unwind_Resume@GCC_3.0 2025-05-07T20:10:49.0855910Z U __cudaPopCallConfiguration@libcudart.so.12 2025-05-07T20:10:49.0856332Z U __cudaPushCallConfiguration@libcudart.so.12 2025-05-07T20:10:49.0856746Z U __cudaRegisterFatBinary@libcudart.so.12 2025-05-07T20:10:49.0857430Z U __cudaRegisterFatBinaryEnd@libcudart.so.12 2025-05-07T20:10:49.0857817Z U __cudaRegisterFunction@libcudart.so.12 2025-05-07T20:10:49.0858160Z U __cudaRegisterVar@libcudart.so.12 2025-05-07T20:10:49.0858561Z U __cudaUnregisterFatBinary@libcudart.so.12 2025-05-07T20:10:49.0858895Z U __cxa_allocate_exception@CXXABI_1.3 2025-05-07T20:10:49.0859190Z U __cxa_atexit@GLIBC_2.2.5 2025-05-07T20:10:49.0859620Z U __cxa_begin_catch@CXXABI_1.3 2025-05-07T20:10:49.0859905Z U __cxa_end_catch@CXXABI_1.3 2025-05-07T20:10:49.0860021Z U __cxa_free_exception@CXXABI_1.3 2025-05-07T20:10:49.0860141Z U __cxa_guard_abort@CXXABI_1.3 2025-05-07T20:10:49.0860250Z U __cxa_guard_acquire@CXXABI_1.3 2025-05-07T20:10:49.0860363Z U __cxa_guard_release@CXXABI_1.3 2025-05-07T20:10:49.0860475Z U __cxa_pure_virtual@CXXABI_1.3 2025-05-07T20:10:49.0860591Z U __cxa_rethrow@CXXABI_1.3 2025-05-07T20:10:49.0860707Z U __cxa_thread_atexit@CXXABI_1.3.7 2025-05-07T20:10:49.0860809Z U __cxa_throw@CXXABI_1.3 2025-05-07T20:10:49.0860934Z U __gxx_personality_v0@CXXABI_1.3 2025-05-07T20:10:49.0861035Z U __tls_get_addr@GLIBC_2.3 2025-05-07T20:10:49.0861154Z U adjust_info_B_num_bits(int, int) 2025-05-07T20:10:49.0861317Z U at::Context::deterministicAlgorithms() const 2025-05-07T20:10:49.0861496Z U at::RecordFunction::RecordFunction(at::StepCallbacks&&) 2025-05-07T20:10:49.0861627Z U at::RecordFunction::currentThreadId() 2025-05-07T20:10:49.0861752Z U at::RecordFunction::~RecordFunction() 2025-05-07T20:10:49.0861917Z U at::SavedTensorDefaultHooks::set_tracing(bool) 2025-05-07T20:10:49.0863234Z U at::TensorBase::__dispatch_contiguous(c10::MemoryFormat) const 2025-05-07T20:10:49.0863352Z U at::TensorMaker::make_tensor() 2025-05-07T20:10:49.0863493Z U at::_ops::all::call(at::Tensor const&) 2025-05-07T20:10:49.0863648Z U at::_ops::concat::call(c10::ArrayRef, long) 2025-05-07T20:10:49.0863814Z U at::_ops::copy_::call(at::Tensor&, at::Tensor const&, bool) 2025-05-07T20:10:49.0864435Z U at::_ops::empty_like::call(at::Tensor const&, std::optional, std::optional, std::optional, std::optional, std::optional) 2025-05-07T20:10:49.0865347Z U at::_ops::empty_memory_format::call(c10::ArrayRef, std::optional, std::optional, std::optional, std::optional, std::optional) 2025-05-07T20:10:49.0865548Z U at::_ops::eq_Scalar::call(at::Tensor const&, c10::Scalar const&) 2025-05-07T20:10:49.0865730Z U at::_ops::eq_Tensor::call(at::Tensor const&, at::Tensor const&) 2025-05-07T20:10:49.0865923Z U at::_ops::flatten_using_ints::call(at::Tensor const&, long, long) 2025-05-07T20:10:49.0866112Z U at::_ops::gt_Scalar::call(at::Tensor const&, c10::Scalar const&) 2025-05-07T20:10:49.0866427Z U at::_ops::index_add::call(at::Tensor const&, long, at::Tensor const&, at::Tensor const&, c10::Scalar const&) 2025-05-07T20:10:49.0866639Z U at::_ops::index_select::call(at::Tensor const&, long, at::Tensor const&) 2025-05-07T20:10:49.0866780Z U at::_ops::max::call(at::Tensor const&) 2025-05-07T20:10:49.0866961Z U at::_ops::mul_Tensor::call(at::Tensor const&, at::Tensor const&) 2025-05-07T20:10:49.0867163Z U at::_ops::narrow::call(at::Tensor const&, long, c10::SymInt, c10::SymInt) 2025-05-07T20:10:49.0867445Z U at::_ops::select_int::call(at::Tensor const&, long, c10::SymInt) 2025-05-07T20:10:49.0867699Z U at::_ops::split_with_sizes::call(at::Tensor const&, c10::ArrayRef, long) 2025-05-07T20:10:49.0868011Z U at::_ops::to_dtype::call(at::Tensor const&, c10::ScalarType, bool, bool, std::optional) 2025-05-07T20:10:49.0868714Z U at::_ops::to_dtype_layout::call(at::Tensor const&, std::optional, std::optional, std::optional, std::optional, bool, bool, std::optional) 2025-05-07T20:10:49.0868903Z U at::_ops::view::call(at::Tensor const&, c10::ArrayRef) 2025-05-07T20:10:49.0869070Z U at::_ops::view_as::call(at::Tensor const&, at::Tensor const&) 2025-05-07T20:10:49.0869564Z U at::_ops::zeros::call(c10::ArrayRef, std::optional, std::optional, std::optional, std::optional) 2025-05-07T20:10:49.0870159Z U at::_ops::zeros_like::call(at::Tensor const&, std::optional, std::optional, std::optional, std::optional, std::optional) 2025-05-07T20:10:49.0870312Z U at::cuda::getCurrentDeviceProperties() 2025-05-07T20:10:49.0870443Z U at::functorch::functorchTLSAccessor() 2025-05-07T20:10:49.0870608Z U at::getStepCallbacksUnlessEmpty(at::RecordScope) 2025-05-07T20:10:49.0870715Z U at::globalContext() 2025-05-07T20:10:49.0870872Z U at::has_internal_overlap(at::TensorBase const&) 2025-05-07T20:10:49.0871008Z U at::sequence_number::get_and_increment() 2025-05-07T20:10:49.0871107Z U bcmp@GLIBC_2.2.5 2025-05-07T20:10:49.0871356Z U bool at::Tensor::item() const 2025-05-07T20:10:49.0871463Z U c10::AnyType::get() 2025-05-07T20:10:49.0871679Z U c10::AutogradMetaInterface::~AutogradMetaInterface() 2025-05-07T20:10:49.0871905Z U c10::BFloat16* at::TensorBase::mutable_data_ptr() const 2025-05-07T20:10:49.0872011Z U c10::BoolType::get() 2025-05-07T20:10:49.0872196Z U c10::DeviceTypeName[abi:cxx11](c10::DeviceType, bool) 2025-05-07T20:10:49.0872395Z U c10::Dispatcher::findSchemaOrThrow(char const*, char const*) 2025-05-07T20:10:49.0872518Z U c10::Dispatcher::realSingleton() 2025-05-07T20:10:49.0873050Z U c10::Dispatcher::runRecordFunction(at::RecordFunction&, std::reference_wrapper, c10::DispatchKey, c10::DispatchKeySet) 2025-05-07T20:10:49.0873749Z U c10::Dispatcher::runRecordFunction(at::RecordFunction&, std::reference_wrapper, c10::DispatchKey, c10::DispatchKeySet, c10::ArrayRef) 2025-05-07T20:10:49.0874136Z U c10::Error::Error(c10::SourceLocation, std::__cxx11::basic_string, std::allocator >) 2025-05-07T20:10:49.0874244Z U c10::Error::what() const 2025-05-07T20:10:49.0874366Z U c10::GradMode::is_enabled() 2025-05-07T20:10:49.0874480Z U c10::GradMode::set_enabled(bool) 2025-05-07T20:10:49.0874657Z U c10::Half* at::TensorBase::mutable_data_ptr() const 2025-05-07T20:10:49.0874833Z U c10::IValue::TagType::get(c10::IValue const&) 2025-05-07T20:10:49.0874955Z U c10::IValue::hash(c10::IValue const&) 2025-05-07T20:10:49.0875072Z U c10::IValue::isBoolList() const 2025-05-07T20:10:49.0875197Z U c10::IValue::isIntList() const 2025-05-07T20:10:49.0875315Z U c10::IValue::isSymIntList() const 2025-05-07T20:10:49.0875432Z U c10::IValue::isTensorList() const 2025-05-07T20:10:49.0875592Z U c10::IValue::reportToTensorTypeError() const 2025-05-07T20:10:49.0875723Z U c10::IntType::get() 2025-05-07T20:10:49.0876216Z U c10::ListType::get(std::__cxx11::basic_string, std::allocator > const&, c10::Type::SingletonOrSharedTypePtr) 2025-05-07T20:10:49.0876406Z U c10::MessageLogger::MessageLogger(char const*, int, int) 2025-05-07T20:10:49.0876531Z U c10::MessageLogger::~MessageLogger() 2025-05-07T20:10:49.0876692Z U c10::OperatorHandle::~OperatorHandle() 2025-05-07T20:10:49.0876821Z U c10::OperatorHandle::~OperatorHandle() 2025-05-07T20:10:49.0877122Z U c10::SmallVectorBase::grow_pod(void const*, unsigned long, unsigned long) 2025-05-07T20:10:49.0877290Z U c10::StorageImpl::throw_data_ptr_access_error() const 2025-05-07T20:10:49.0877393Z U c10::StringType::get() 2025-05-07T20:10:49.0877559Z U c10::SymBool::guard_bool(char const*, long) const 2025-05-07T20:10:49.0877966Z U c10::SymInt::SymInt(c10::intrusive_ptr >) 2025-05-07T20:10:49.0878104Z U c10::SymInt::guard_int(char const*, long) const 2025-05-07T20:10:49.0878241Z U c10::SymInt::promote_to_negative() 2025-05-07T20:10:49.0878352Z U c10::SymInt::toSymNode() const 2025-05-07T20:10:49.0878460Z U c10::SymIntType::get() 2025-05-07T20:10:49.0878634Z U c10::SymbolicShapeMeta::init_is_contiguous() const 2025-05-07T20:10:49.0878757Z U c10::TensorImpl::requires_grad() const 2025-05-07T20:10:49.0879209Z U c10::TensorImpl::set_autograd_meta(std::unique_ptr >) 2025-05-07T20:10:49.0879387Z U c10::TensorImpl::throw_data_ptr_access_error() const 2025-05-07T20:10:49.0879566Z U c10::TensorType::get() 2025-05-07T20:10:49.0879765Z U c10::Type::isSubtypeOfExt(c10::Type const&, std::ostream*) const 2025-05-07T20:10:49.0879891Z U c10::Type::is_module() const 2025-05-07T20:10:49.0880017Z U c10::UndefinedTensorImpl::_singleton 2025-05-07T20:10:49.0880745Z U c10::Warning::Warning(std::variant, c10::SourceLocation const&, std::__cxx11::basic_string, std::allocator >, bool) 2025-05-07T20:10:49.0880926Z U c10::cuda::CUDACachingAllocator::allocator 2025-05-07T20:10:49.0881042Z U c10::cuda::CUDAStream::stream() const 2025-05-07T20:10:49.0881169Z U c10::cuda::ExchangeDevice(signed char) 2025-05-07T20:10:49.0881308Z U c10::cuda::GetDevice(signed char*) 2025-05-07T20:10:49.0881432Z U c10::cuda::MaybeSetDevice(signed char) 2025-05-07T20:10:49.0881550Z U c10::cuda::SetDevice(signed char) 2025-05-07T20:10:49.0881821Z U c10::cuda::c10_cuda_check_implementation(int, char const*, char const*, int, bool) 2025-05-07T20:10:49.0881927Z U c10::cuda::device_count() 2025-05-07T20:10:49.0882071Z U c10::cuda::getCurrentCUDAStream(signed char) 2025-05-07T20:10:49.0882228Z U c10::cuda::getDefaultCUDAStream(signed char) 2025-05-07T20:10:49.0882374Z U c10::cuda::getStreamFromPool(bool, signed char) 2025-05-07T20:10:49.0882518Z U c10::cuda::getStreamFromPool(int, signed char) 2025-05-07T20:10:49.0882692Z U c10::cuda::setCurrentCUDAStream(c10::cuda::CUDAStream) 2025-05-07T20:10:49.0882810Z U c10::cuda::warn_or_error_on_sync() 2025-05-07T20:10:49.0883251Z U c10::detail::ListImpl::ListImpl(std::vector >, c10::Type::SingletonOrSharedTypePtr) 2025-05-07T20:10:49.0883923Z U c10::detail::infer_schema::make_function_schema(c10::ArrayRef, c10::ArrayRef) 2025-05-07T20:10:49.0884169Z U c10::detail::torchCheckFail(char const*, char const*, unsigned int, char const*) 2025-05-07T20:10:49.0884650Z U c10::detail::torchCheckFail(char const*, char const*, unsigned int, std::__cxx11::basic_string, std::allocator > const&) 2025-05-07T20:10:49.0884982Z U c10::detail::torchInternalAssertFail(char const*, char const*, unsigned int, char const*, char const*) 2025-05-07T20:10:49.0885529Z U c10::detail::torchInternalAssertFail(char const*, char const*, unsigned int, char const*, std::__cxx11::basic_string, std::allocator > const&) 2025-05-07T20:10:49.0885793Z U c10::impl::ExcludeDispatchKeyGuard::ExcludeDispatchKeyGuard(c10::DispatchKeySet) 2025-05-07T20:10:49.0886063Z U c10::impl::ExcludeDispatchKeyGuard::ExcludeDispatchKeyGuard(c10::DispatchKeySet) 2025-05-07T20:10:49.0886255Z U c10::impl::ExcludeDispatchKeyGuard::~ExcludeDispatchKeyGuard() 2025-05-07T20:10:49.0886368Z U c10::impl::GPUTrace::gpuTraceState 2025-05-07T20:10:49.0886485Z U c10::impl::GPUTrace::haveState 2025-05-07T20:10:49.0886812Z U c10::impl::OperatorEntry::assertSignatureIsCorrect(c10::impl::CppSignature const&, bool) const 2025-05-07T20:10:49.0886988Z U c10::impl::OperatorEntry::reportError(c10::DispatchKey) const 2025-05-07T20:10:49.0887126Z U c10::impl::cow::is_cow_data_ptr(c10::DataPtr const&) 2025-05-07T20:10:49.0887294Z U c10::impl::cow::materialize_cow_storage(c10::StorageImpl&) 2025-05-07T20:10:49.0887408Z U c10::impl::device_guard_impl_registry 2025-05-07T20:10:49.0887548Z U c10::impl::raw_local_dispatch_key_set 2025-05-07T20:10:49.0887705Z U c10::initializeFunctionalityOffsetsAndMasks() 2025-05-07T20:10:49.0888055Z U c10::ivalue::ConstantString::create(std::__cxx11::basic_string, std::allocator >) 2025-05-07T20:10:49.0888189Z U c10::operator<<(std::ostream&, c10::Device const&) 2025-05-07T20:10:49.0888326Z U c10::operator<<(std::ostream&, c10::DeviceType) 2025-05-07T20:10:49.0888480Z U c10::operator<<(std::ostream&, c10::OperatorName const&) 2025-05-07T20:10:49.0888611Z U c10::throwNullDataPtrError() 2025-05-07T20:10:49.0888735Z U c10::typeKindToString(c10::TypeKind) 2025-05-07T20:10:49.0888838Z U c10::warn(c10::Warning const&) 2025-05-07T20:10:49.0888951Z U c10::warnDeprecatedDataPtr() 2025-05-07T20:10:49.0889135Z U caffe2::TypeMeta::error_unsupported_typemeta(caffe2::TypeMeta) 2025-05-07T20:10:49.0889262Z U caffe2::TypeMeta::typeMetaDatas() 2025-05-07T20:10:49.0889388Z U cudaDeviceGetAttribute@libcudart.so.12 2025-05-07T20:10:49.0889507Z U cudaDeviceSynchronize@libcudart.so.12 2025-05-07T20:10:49.0889644Z U cudaEventCreateWithFlags@libcudart.so.12 2025-05-07T20:10:49.0889757Z U cudaEventDestroy@libcudart.so.12 2025-05-07T20:10:49.0889874Z U cudaEventElapsedTime@libcudart.so.12 2025-05-07T20:10:49.0889991Z U cudaEventQuery@libcudart.so.12 2025-05-07T20:10:49.0890101Z U cudaEventRecord@libcudart.so.12 2025-05-07T20:10:49.0890220Z U cudaEventSynchronize@libcudart.so.12 2025-05-07T20:10:49.0890332Z U cudaFuncSetAttribute@libcudart.so.12 2025-05-07T20:10:49.0890475Z U cudaGetDeviceProperties_v2@libcudart.so.12 2025-05-07T20:10:49.0890592Z U cudaGetErrorString@libcudart.so.12 2025-05-07T20:10:49.0890729Z U cudaGetLastError@libcudart.so.12 2025-05-07T20:10:49.0890847Z U cudaLaunchKernel@libcudart.so.12 2025-05-07T20:10:49.0890954Z U cudaStreamQuery@libcudart.so.12 2025-05-07T20:10:49.0891072Z U cudaStreamSynchronize@libcudart.so.12 2025-05-07T20:10:49.0891196Z U cudaStreamWaitEvent@libcudart.so.12 2025-05-07T20:10:49.0891385Z U fbgemm_gpu::asynchronous_complete_cumsum_cpu(at::Tensor const&) 2025-05-07T20:10:49.0891576Z U float* at::TensorBase::mutable_data_ptr() const 2025-05-07T20:10:49.0891675Z U free@GLIBC_2.2.5 2025-05-07T20:10:49.0891813Z U int* at::TensorBase::mutable_data_ptr() const 2025-05-07T20:10:49.0891903Z U log2f@GLIBC_2.2.5 2025-05-07T20:10:49.0892010Z U long at::Tensor::item() const 2025-05-07T20:10:49.0892188Z U long c10::detail::maybe_wrap_dim_slow(long, long, bool) 2025-05-07T20:10:49.0892317Z U long* at::TensorBase::data_ptr() const 2025-05-07T20:10:49.0892458Z U long* at::TensorBase::mutable_data_ptr() const 2025-05-07T20:10:49.0892564Z U memcmp@GLIBC_2.2.5 2025-05-07T20:10:49.0892656Z U memcpy@GLIBC_2.14 2025-05-07T20:10:49.0892749Z U memmove@GLIBC_2.2.5 2025-05-07T20:10:49.0892846Z U memset@GLIBC_2.2.5 2025-05-07T20:10:49.0892959Z U operator delete(void*)@GLIBCXX_3.4 2025-05-07T20:10:49.0893079Z U operator new(unsigned long)@GLIBCXX_3.4 2025-05-07T20:10:49.0893170Z U realloc@GLIBC_2.2.5 2025-05-07T20:10:49.0893383Z U std::_Hash_bytes(void const*, unsigned long, unsigned long)@CXXABI_1.3.5 2025-05-07T20:10:49.0893706Z U std::__cxx11::basic_ostringstream, std::allocator >::basic_ostringstream() 2025-05-07T20:10:49.0894106Z U std::__cxx11::basic_ostringstream, std::allocator >::~basic_ostringstream()@GLIBCXX_3.4.21 2025-05-07T20:10:49.0894415Z U std::__cxx11::basic_stringstream, std::allocator >::basic_stringstream() 2025-05-07T20:10:49.0894765Z U std::__detail::_Prime_rehash_policy::_M_need_rehash(unsigned long, unsigned long, unsigned long) const@GLIBCXX_3.4.18 2025-05-07T20:10:49.0894889Z U std::__throw_bad_alloc()@GLIBCXX_3.4 2025-05-07T20:10:49.0895018Z U std::__throw_bad_array_new_length() 2025-05-07T20:10:49.0895153Z U std::__throw_length_error(char const*)@GLIBCXX_3.4 2025-05-07T20:10:49.0895300Z U std::__throw_logic_error(char const*)@GLIBCXX_3.4 2025-05-07T20:10:49.0895459Z U std::__throw_out_of_range_fmt(char const*, ...)@GLIBCXX_3.4.20 2025-05-07T20:10:49.0895585Z U std::__throw_system_error(int)@GLIBCXX_3.4.11 2025-05-07T20:10:49.0895717Z U std::bad_weak_ptr::~bad_weak_ptr()@GLIBCXX_3.4.15 2025-05-07T20:10:49.0895955Z U std::basic_ios >::clear(std::_Ios_Iostate)@GLIBCXX_3.4 2025-05-07T20:10:49.0896497Z U std::basic_ostream >& std::__ostream_insert >(std::basic_ostream >&, char const*, long)@GLIBCXX_3.4.9 2025-05-07T20:10:49.0896630Z U std::exception::~exception()@GLIBCXX_3.4 2025-05-07T20:10:49.0896742Z U std::ios_base::Init::Init()@GLIBCXX_3.4 2025-05-07T20:10:49.0896854Z U std::ios_base::Init::~Init()@GLIBCXX_3.4 2025-05-07T20:10:49.0896978Z U std::ios_base::~ios_base()@GLIBCXX_3.4 2025-05-07T20:10:49.0897089Z U std::locale::~locale()@GLIBCXX_3.4 2025-05-07T20:10:49.0897259Z U std::ostream& std::ostream::_M_insert(long)@GLIBCXX_3.4.9 2025-05-07T20:10:49.0897504Z U std::ostream& std::ostream::_M_insert(unsigned long)@GLIBCXX_3.4.9 2025-05-07T20:10:49.0897635Z U std::ostream::operator<<(int)@GLIBCXX_3.4 2025-05-07T20:10:49.0897790Z U std::out_of_range::out_of_range(char const*)@GLIBCXX_3.4.21 2025-05-07T20:10:49.0897914Z U std::out_of_range::~out_of_range()@GLIBCXX_3.4 2025-05-07T20:10:49.0898118Z U std::runtime_error::runtime_error(char const*)@GLIBCXX_3.4.21 2025-05-07T20:10:49.0898518Z U std::runtime_error::runtime_error(std::__cxx11::basic_string, std::allocator > const&)@GLIBCXX_3.4.21 2025-05-07T20:10:49.0898649Z U std::runtime_error::~runtime_error()@GLIBCXX_3.4 2025-05-07T20:10:49.0898759Z U std::terminate()@GLIBCXX_3.4 2025-05-07T20:10:49.0898850Z U strcmp@GLIBC_2.2.5 2025-05-07T20:10:49.0898939Z U strlen@GLIBC_2.2.5 2025-05-07T20:10:49.0899066Z U torch::CppFunction::~CppFunction() 2025-05-07T20:10:49.0899623Z U torch::Library::Library(torch::Library::Kind, std::__cxx11::basic_string, std::allocator >, std::optional, char const*, unsigned int) 2025-05-07T20:10:49.0900058Z U torch::Library::_def(c10::FunctionSchema&&, c10::OperatorName*, std::vector > const&, torch::_RegisterOrVerify) & 2025-05-07T20:10:49.0900311Z U torch::Library::_impl(char const*, torch::CppFunction&&, torch::_RegisterOrVerify) & 2025-05-07T20:10:49.0900427Z U torch::autograd::AnomalyMode::_enabled 2025-05-07T20:10:49.0900705Z U torch::autograd::AutogradContext::AutogradContext(torch::dynamo::autograd::PackedArgs&) 2025-05-07T20:10:49.0900888Z U torch::autograd::AutogradContext::get_and_bump_dirty() const 2025-05-07T20:10:49.0901105Z U torch::autograd::AutogradContext::get_non_differentiable() const 2025-05-07T20:10:49.0901279Z U torch::autograd::AutogradContext::get_saved_variables() const 2025-05-07T20:10:49.0901614Z U torch::autograd::AutogradContext::save_for_backward(std::vector >) 2025-05-07T20:10:49.0901756Z U torch::autograd::AutogradContext::save_variables() 2025-05-07T20:10:49.0901942Z U torch::autograd::ForwardADLevel::try_get_by_idx(unsigned long) 2025-05-07T20:10:49.0902107Z U torch::autograd::InputMetadata::shape_as_dim_vector() const 2025-05-07T20:10:49.0902248Z U torch::autograd::Node::assign_parent() 2025-05-07T20:10:49.0902357Z U torch::autograd::Node::metadata() 2025-05-07T20:10:49.0902493Z U torch::autograd::Node::name[abi:cxx11]() const 2025-05-07T20:10:49.0902723Z U torch::autograd::SavedVariable::SavedVariable(at::Tensor const&, bool, bool) 2025-05-07T20:10:49.0902975Z U torch::autograd::SavedVariable::unpack(std::shared_ptr) const 2025-05-07T20:10:49.0903118Z U torch::autograd::VariableInfo::VariableInfo() 2025-05-07T20:10:49.0903319Z U torch::autograd::VariableInfo::VariableInfo(at::Tensor const&, bool) 2025-05-07T20:10:49.0903521Z U torch::autograd::VariableInfo::zeros(c10::OptionalDeviceGuard&) const 2025-05-07T20:10:49.0908942Z U torch::autograd::_wrap_outputs(std::vector > const&, std::unordered_set, std::equal_to, std::allocator > const&, std::unordered_set, std::equal_to, std::allocator > const&, c10::ArrayRef >, std::shared_ptr const&, std::function > (std::vector >, std::vector >)> const&, std::unordered_set, std::equal_to, std::allocator > const&, std::function const&) 2025-05-07T20:10:49.0909147Z U torch::autograd::deleteNode(torch::autograd::Node*) 2025-05-07T20:10:49.0909292Z U torch::autograd::get_current_graph_task_exec_info() 2025-05-07T20:10:49.0909444Z U torch::autograd::impl::gradient_edge(at::Tensor const&) 2025-05-07T20:10:49.0909595Z U torch::dynamo::autograd::getPyCompilerInterface() 2025-05-07T20:10:49.0909987Z U torch::dynamo::autograd::get_input_metadata(std::vector > const&) 2025-05-07T20:10:49.0910524Z U torch::jit::parseSchema(std::__cxx11::basic_string, std::allocator > const&, bool) 2025-05-07T20:10:49.0911155Z U transpose_embedding_input(at::Tensor, long, at::Tensor, at::Tensor, bool, std::optional const&, long, long, long, bool, std::optional const&, long, long) 2025-05-07T20:10:49.0911587Z U typeinfo for c10::Error 2025-05-07T20:10:49.0911695Z U typeinfo for c10::Type 2025-05-07T20:10:49.0911838Z U typeinfo for std::bad_weak_ptr@GLIBCXX_3.4.15 2025-05-07T20:10:49.0911985Z U typeinfo for std::out_of_range@GLIBCXX_3.4 2025-05-07T20:10:49.0912125Z U typeinfo for std::runtime_error@GLIBCXX_3.4 2025-05-07T20:10:49.0912247Z U typeinfo for torch::autograd::Node 2025-05-07T20:10:49.0912463Z U vtable for __cxxabiv1::__class_type_info@CXXABI_1.3 2025-05-07T20:10:49.0912631Z U vtable for __cxxabiv1::__function_type_info@CXXABI_1.3 2025-05-07T20:10:49.0912855Z U vtable for __cxxabiv1::__pointer_type_info@CXXABI_1.3 2025-05-07T20:10:49.0913024Z U vtable for __cxxabiv1::__si_class_type_info@CXXABI_1.3 2025-05-07T20:10:49.0913187Z U vtable for __cxxabiv1::__vmi_class_type_info@CXXABI_1.3 2025-05-07T20:10:49.0913290Z U vtable for c10::Error 2025-05-07T20:10:49.0913407Z U vtable for c10::ListType 2025-05-07T20:10:49.0913746Z U vtable for std::__cxx11::basic_stringbuf, std::allocator >@GLIBCXX_3.4.21 2025-05-07T20:10:49.0913915Z U vtable for std::bad_weak_ptr@GLIBCXX_3.4.15 2025-05-07T20:10:49.0914156Z U vtable for std::basic_streambuf >@GLIBCXX_3.4 2025-05-07T20:10:49.0914284Z U vtable for torch::autograd::AutogradMeta 2025-05-07T20:10:49.0914400Z U vtable for torch::autograd::Node 2025-05-07T20:10:49.0914593Z w TLS init function for c10::impl::raw_local_dispatch_key_set 2025-05-07T20:10:49.0914705Z w _ITM_deregisterTMCloneTable 2025-05-07T20:10:49.0914812Z w _ITM_registerTMCloneTable 2025-05-07T20:10:49.0914917Z w __cxa_finalize@GLIBC_2.2.5 2025-05-07T20:10:49.0915015Z w __gmon_start__ 2025-05-07T20:10:49.0915116Z w __pthread_key_create 2025-05-07T20:10:49.0915228Z w pthread_mutex_lock@GLIBC_2.2.5 2025-05-07T20:10:49.0915353Z w pthread_mutex_unlock@GLIBC_2.2.5 2025-05-07T20:10:49.0915542Z [CHECK] Listing out external shared libraries linked: 2025-05-07T20:10:49.0915766Z + ldd ./_skbuild/linux-x86_64-3.12/cmake-build/fbgemm_gpu_tbe_index_select.so 2025-05-07T20:10:49.0915774Z 2025-05-07T20:10:49.0915888Z linux-vdso.so.1 (0x00007ffc95144000) 2025-05-07T20:10:49.0915979Z libc10.so => not found 2025-05-07T20:10:49.0916078Z libnvrtc.so.12 => not found 2025-05-07T20:10:49.0916183Z libc10_cuda.so => not found 2025-05-07T20:10:49.0916512Z libnccl.so.2 => not found 2025-05-07T20:10:49.0916605Z libcuda.so.1 => not found 2025-05-07T20:10:49.0917181Z fbgemm_gpu_sparse_async_cumsum.so => /__w/FBGEMM/FBGEMM/fbgemm_gpu/./_skbuild/linux-x86_64-3.12/cmake-build/fbgemm_gpu_sparse_async_cumsum.so (0x00007fead8059000) 2025-05-07T20:10:49.0917709Z fbgemm_gpu_tbe_utils.so => /__w/FBGEMM/FBGEMM/fbgemm_gpu/./_skbuild/linux-x86_64-3.12/cmake-build/fbgemm_gpu_tbe_utils.so (0x00007fead6e00000) 2025-05-07T20:10:49.0917809Z libnvidia-ml.so.1 => not found 2025-05-07T20:10:49.0917904Z libtorch.so => not found 2025-05-07T20:10:49.0918015Z libtorch_cpu.so => not found 2025-05-07T20:10:49.0918113Z libtorch_cuda.so => not found 2025-05-07T20:10:49.0918211Z libcudart.so.12 => not found 2025-05-07T20:10:49.0918386Z libstdc++.so.6 => /lib64/libstdc++.so.6 (0x00007fead6b9c000) 2025-05-07T20:10:49.0918512Z libm.so.6 => /lib64/libm.so.6 (0x00007feada5bf000) 2025-05-07T20:10:49.0918664Z libgcc_s.so.1 => /lib64/libgcc_s.so.1 (0x00007feada591000) 2025-05-07T20:10:49.0918786Z libc.so.6 => /lib64/libc.so.6 (0x00007fead6994000) 2025-05-07T20:10:49.0918933Z /lib64/ld-linux-x86-64.so.2 (0x00007feada6a2000) 2025-05-07T20:10:49.0919021Z libc10.so => not found 2025-05-07T20:10:49.0919118Z libnvrtc.so.12 => not found 2025-05-07T20:10:49.0919228Z libc10_cuda.so => not found 2025-05-07T20:10:49.0919319Z libnccl.so.2 => not found 2025-05-07T20:10:49.0919411Z libcuda.so.1 => not found 2025-05-07T20:10:49.0919522Z libnvidia-ml.so.1 => not found 2025-05-07T20:10:49.0919612Z libtorch.so => not found 2025-05-07T20:10:49.0919711Z libtorch_cpu.so => not found 2025-05-07T20:10:49.0919806Z libtorch_cuda.so => not found 2025-05-07T20:10:49.0919913Z libcudart.so.12 => not found 2025-05-07T20:10:49.0920003Z libtorch.so => not found 2025-05-07T20:10:49.0920090Z libc10.so => not found 2025-05-07T20:10:49.0920194Z libnvrtc.so.12 => not found 2025-05-07T20:10:49.0920309Z libc10_cuda.so => not found 2025-05-07T20:10:49.0920402Z libnccl.so.2 => not found 2025-05-07T20:10:49.0920494Z libcuda.so.1 => not found 2025-05-07T20:10:49.0920601Z libnvidia-ml.so.1 => not found 2025-05-07T20:10:49.0920695Z libtorch_cpu.so => not found 2025-05-07T20:10:49.0920791Z libtorch_cuda.so => not found 2025-05-07T20:10:49.0920896Z libcudart.so.12 => not found 2025-05-07T20:10:49.0921051Z libgomp.so.1 => /lib64/libgomp.so.1 (0x00007fead8003000) 2025-05-07T20:10:49.0921065Z 2025-05-07T20:10:49.0921174Z [CHECK] Displaying ELF information: 2025-05-07T20:10:49.0921427Z + readelf -d ./_skbuild/linux-x86_64-3.12/cmake-build/fbgemm_gpu_tbe_index_select.so 2025-05-07T20:10:49.0921466Z 2025-05-07T20:10:49.0953824Z 2025-05-07T20:10:49.0954370Z Dynamic section at offset 0x220d958 contains 42 entries: 2025-05-07T20:10:49.0954513Z Tag Type Name/Value 2025-05-07T20:10:49.0954724Z 0x0000000000000001 (NEEDED) Shared library: [libc10.so] 2025-05-07T20:10:49.0954944Z 0x0000000000000001 (NEEDED) Shared library: [libnvrtc.so.12] 2025-05-07T20:10:49.0955142Z 0x0000000000000001 (NEEDED) Shared library: [libc10_cuda.so] 2025-05-07T20:10:49.0955339Z 0x0000000000000001 (NEEDED) Shared library: [libnccl.so.2] 2025-05-07T20:10:49.0955539Z 0x0000000000000001 (NEEDED) Shared library: [libcuda.so.1] 2025-05-07T20:10:49.0955796Z 0x0000000000000001 (NEEDED) Shared library: [fbgemm_gpu_sparse_async_cumsum.so] 2025-05-07T20:10:49.0956017Z 0x0000000000000001 (NEEDED) Shared library: [fbgemm_gpu_tbe_utils.so] 2025-05-07T20:10:49.0956237Z 0x0000000000000001 (NEEDED) Shared library: [libnvidia-ml.so.1] 2025-05-07T20:10:49.0956444Z 0x0000000000000001 (NEEDED) Shared library: [libtorch.so] 2025-05-07T20:10:49.0956646Z 0x0000000000000001 (NEEDED) Shared library: [libtorch_cpu.so] 2025-05-07T20:10:49.0956855Z 0x0000000000000001 (NEEDED) Shared library: [libtorch_cuda.so] 2025-05-07T20:10:49.0957177Z 0x0000000000000001 (NEEDED) Shared library: [libcudart.so.12] 2025-05-07T20:10:49.0957382Z 0x0000000000000001 (NEEDED) Shared library: [libstdc++.so.6] 2025-05-07T20:10:49.0957583Z 0x0000000000000001 (NEEDED) Shared library: [libm.so.6] 2025-05-07T20:10:49.0957782Z 0x0000000000000001 (NEEDED) Shared library: [libgcc_s.so.1] 2025-05-07T20:10:49.0958053Z 0x0000000000000001 (NEEDED) Shared library: [libc.so.6] 2025-05-07T20:10:49.0958272Z 0x0000000000000001 (NEEDED) Shared library: [ld-linux-x86-64.so.2] 2025-05-07T20:10:49.0958523Z 0x000000000000000e (SONAME) Library soname: [fbgemm_gpu_tbe_index_select.so] 2025-05-07T20:10:49.0958711Z 0x000000000000000f (RPATH) Library rpath: [$ORIGIN] 2025-05-07T20:10:49.0958832Z 0x000000000000000c (INIT) 0x56000 2025-05-07T20:10:49.0958958Z 0x000000000000000d (FINI) 0x1515ac 2025-05-07T20:10:49.0959082Z 0x0000000000000019 (INIT_ARRAY) 0x220b430 2025-05-07T20:10:49.0959213Z 0x000000000000001b (INIT_ARRAYSZ) 144 (bytes) 2025-05-07T20:10:49.0959343Z 0x000000000000001a (FINI_ARRAY) 0x220b4c0 2025-05-07T20:10:49.0959460Z 0x000000000000001c (FINI_ARRAYSZ) 8 (bytes) 2025-05-07T20:10:49.0959574Z 0x000000006ffffef5 (GNU_HASH) 0x238 2025-05-07T20:10:49.0959687Z 0x0000000000000005 (STRTAB) 0xbb50 2025-05-07T20:10:49.0959804Z 0x0000000000000006 (SYMTAB) 0x2a60 2025-05-07T20:10:49.0959939Z 0x000000000000000a (STRSZ) 242227 (bytes) 2025-05-07T20:10:49.0960057Z 0x000000000000000b (SYMENT) 24 (bytes) 2025-05-07T20:10:49.0960187Z 0x0000000000000003 (PLTGOT) 0x220efe8 2025-05-07T20:10:49.0960324Z 0x0000000000000002 (PLTRELSZ) 16872 (bytes) 2025-05-07T20:10:49.0960429Z 0x0000000000000014 (PLTREL) RELA 2025-05-07T20:10:49.0960552Z 0x0000000000000017 (JMPREL) 0x512d8 2025-05-07T20:10:49.0960700Z 0x0000000000000007 (RELA) 0x47af8 2025-05-07T20:10:49.0960831Z 0x0000000000000008 (RELASZ) 38880 (bytes) 2025-05-07T20:10:49.0960969Z 0x0000000000000009 (RELAENT) 24 (bytes) 2025-05-07T20:10:49.0961099Z 0x000000006ffffffe (VERNEED) 0x47998 2025-05-07T20:10:49.0961207Z 0x000000006fffffff (VERNEEDNUM) 6 2025-05-07T20:10:49.0961324Z 0x000000006ffffff0 (VERSYM) 0x46d84 2025-05-07T20:10:49.0961447Z 0x000000006ffffff9 (RELACOUNT) 571 2025-05-07T20:10:49.0961545Z 0x0000000000000000 (NULL) 0x0 2025-05-07T20:10:49.0961551Z 2025-05-07T20:10:49.0961670Z ################################################################################ 2025-05-07T20:10:49.0961712Z 2025-05-07T20:10:49.0961717Z 2025-05-07T20:10:49.0961848Z ################################################################################ 2025-05-07T20:10:49.0962092Z [CHECK] BUILT LIBRARY: ./_skbuild/linux-x86_64-3.12/cmake-build/fbgemm_gpu_py.so 2025-05-07T20:10:49.0962207Z [CHECK] Listing out library size: 2025-05-07T20:10:49.0962463Z + du -h --block-size=1M ./_skbuild/linux-x86_64-3.12/cmake-build/fbgemm_gpu_py.so 2025-05-07T20:10:49.0962468Z 2025-05-07T20:10:49.0971461Z 73 ./_skbuild/linux-x86_64-3.12/cmake-build/fbgemm_gpu_py.so 2025-05-07T20:10:49.0972691Z 2025-05-07T20:10:49.0973758Z [CHECK] Listing out the GLIBC versions referenced by: ./_skbuild/linux-x86_64-3.12/cmake-build/fbgemm_gpu_py.so 2025-05-07T20:10:49.0974244Z + objdump -TC ./_skbuild/linux-x86_64-3.12/cmake-build/fbgemm_gpu_py.so | grep GLIBC_ | sed 's/.*GLIBC_\([.0-9]*\).*/GLIBC_\1/g' | sort -Vu | cat 2025-05-07T20:10:49.0974251Z 2025-05-07T20:10:49.1395802Z GLIBC_2.2.5 2025-05-07T20:10:49.1396062Z GLIBC_2.3 2025-05-07T20:10:49.1396281Z GLIBC_2.14 2025-05-07T20:10:49.1397995Z 2025-05-07T20:10:49.1398049Z 2025-05-07T20:10:49.1398599Z [CHECK] Listing out the GLIBCXX versions referenced by: ./_skbuild/linux-x86_64-3.12/cmake-build/fbgemm_gpu_py.so 2025-05-07T20:10:49.1399720Z + objdump -TC ./_skbuild/linux-x86_64-3.12/cmake-build/fbgemm_gpu_py.so | grep GLIBCXX_ | sed 's/.*GLIBCXX_\([.0-9]*\).*/GLIBCXX_\1/g' | sort -Vu | cat 2025-05-07T20:10:49.1400313Z 2025-05-07T20:10:49.1809669Z GLIBCXX_3.4 2025-05-07T20:10:49.1809992Z GLIBCXX_3.4.9 2025-05-07T20:10:49.1810394Z GLIBCXX_3.4.11 2025-05-07T20:10:49.1810648Z GLIBCXX_3.4.14 2025-05-07T20:10:49.1810873Z GLIBCXX_3.4.15 2025-05-07T20:10:49.1811225Z GLIBCXX_3.4.18 2025-05-07T20:10:49.1811450Z GLIBCXX_3.4.19 2025-05-07T20:10:49.1811672Z GLIBCXX_3.4.20 2025-05-07T20:10:49.1811878Z GLIBCXX_3.4.21 2025-05-07T20:10:49.1812005Z 2025-05-07T20:10:49.1812010Z 2025-05-07T20:10:49.1837312Z + nm -gDC ./_skbuild/linux-x86_64-3.12/cmake-build/fbgemm_gpu_py.so > /tmp/tmp.L2Tepwx5Yh.symbols.txt 2025-05-07T20:10:49.1838672Z 2025-05-07T20:10:49.2175663Z 2025-05-07T20:10:49.2204882Z [CHECK] Total Number of symbols: 6648 2025-05-07T20:10:49.2229623Z [CHECK] Number of fbgemm symbols: 4516 2025-05-07T20:10:49.2247919Z + nm -gDCu ./_skbuild/linux-x86_64-3.12/cmake-build/fbgemm_gpu_py.so > /tmp/tmp.oFtdbXm6pM.usymbols.txt 2025-05-07T20:10:49.2248597Z 2025-05-07T20:10:49.2286645Z 2025-05-07T20:10:49.2313142Z [CHECK] Listing out undefined symbols (465 total): 2025-05-07T20:10:49.2330018Z U VTT for std::__cxx11::basic_ostringstream, std::allocator >@GLIBCXX_3.4.21 2025-05-07T20:10:49.2331076Z U VTT for std::__cxx11::basic_stringstream, std::allocator >@GLIBCXX_3.4.21 2025-05-07T20:10:49.2331900Z U _Unwind_Resume@GCC_3.0 2025-05-07T20:10:49.2332329Z U __assert_fail@GLIBC_2.2.5 2025-05-07T20:10:49.2332907Z U __cudaPopCallConfiguration@libcudart.so.12 2025-05-07T20:10:49.2333486Z U __cudaPushCallConfiguration@libcudart.so.12 2025-05-07T20:10:49.2334023Z U __cudaRegisterFatBinary@libcudart.so.12 2025-05-07T20:10:49.2334620Z U __cudaRegisterFatBinaryEnd@libcudart.so.12 2025-05-07T20:10:49.2335508Z U __cudaRegisterFunction@libcudart.so.12 2025-05-07T20:10:49.2335961Z U __cudaRegisterVar@libcudart.so.12 2025-05-07T20:10:49.2336326Z U __cudaUnregisterFatBinary@libcudart.so.12 2025-05-07T20:10:49.2336702Z U __cxa_allocate_exception@CXXABI_1.3 2025-05-07T20:10:49.2337040Z U __cxa_atexit@GLIBC_2.2.5 2025-05-07T20:10:49.2337354Z U __cxa_begin_catch@CXXABI_1.3 2025-05-07T20:10:49.2337685Z U __cxa_end_catch@CXXABI_1.3 2025-05-07T20:10:49.2338003Z U __cxa_free_exception@CXXABI_1.3 2025-05-07T20:10:49.2338414Z U __cxa_guard_abort@CXXABI_1.3 2025-05-07T20:10:49.2338735Z U __cxa_guard_acquire@CXXABI_1.3 2025-05-07T20:10:49.2339081Z U __cxa_guard_release@CXXABI_1.3 2025-05-07T20:10:49.2339408Z U __cxa_pure_virtual@CXXABI_1.3 2025-05-07T20:10:49.2339742Z U __cxa_rethrow@CXXABI_1.3 2025-05-07T20:10:49.2340078Z U __cxa_thread_atexit@CXXABI_1.3.7 2025-05-07T20:10:49.2340404Z U __cxa_throw@CXXABI_1.3 2025-05-07T20:10:49.2340731Z U __gxx_personality_v0@CXXABI_1.3 2025-05-07T20:10:49.2341056Z U __once_proxy@GLIBCXX_3.4.11 2025-05-07T20:10:49.2341482Z U __tls_get_addr@GLIBC_2.3 2025-05-07T20:10:49.2341854Z U at::RecordFunction::RecordFunction(at::StepCallbacks&&) 2025-05-07T20:10:49.2342271Z U at::RecordFunction::currentThreadId() 2025-05-07T20:10:49.2342634Z U at::RecordFunction::~RecordFunction() 2025-05-07T20:10:49.2343004Z U at::SavedTensorDefaultHooks::set_tracing(bool) 2025-05-07T20:10:49.2343376Z U at::SplitUntil32Bit::begin() const 2025-05-07T20:10:49.2343706Z U at::SplitUntil32Bit::end() const 2025-05-07T20:10:49.2344079Z U at::SplitUntil32Bit::iterator::operator*() const 2025-05-07T20:10:49.2344464Z U at::SplitUntil32Bit::iterator::operator++() 2025-05-07T20:10:49.2344976Z U at::Tensor::index(std::initializer_list) const 2025-05-07T20:10:49.2345509Z U at::TensorBase::__dispatch_contiguous(c10::MemoryFormat) const 2025-05-07T20:10:49.2346032Z U at::TensorIteratorBase::build(at::TensorIteratorConfig&) 2025-05-07T20:10:49.2346492Z U at::TensorIteratorBase::can_use_32bit_indexing() const 2025-05-07T20:10:49.2346897Z U at::TensorIteratorBase::data_ptr(long) const 2025-05-07T20:10:49.2347283Z U at::TensorIteratorBase::is_contiguous() const 2025-05-07T20:10:49.2347673Z U at::TensorIteratorBase::numel() const 2025-05-07T20:10:49.2348045Z U at::TensorIteratorBase::with_32bit_indexing() const 2025-05-07T20:10:49.2348523Z U at::TensorIteratorConfig::add_borrowed_input(at::TensorBase const&) 2025-05-07T20:10:49.2349081Z U at::TensorIteratorConfig::add_borrowed_output(at::TensorBase const&) 2025-05-07T20:10:49.2349515Z U at::TensorMaker::make_tensor() 2025-05-07T20:10:49.2349875Z U at::_ops::_is_all_true::call(at::Tensor const&) 2025-05-07T20:10:49.2350255Z U at::_ops::_unique::call(at::Tensor const&, bool, bool) 2025-05-07T20:10:49.2350752Z U at::_ops::add_Tensor::call(at::Tensor const&, at::Tensor const&, c10::Scalar const&) 2025-05-07T20:10:49.2351410Z U at::_ops::add__Tensor::call(at::Tensor&, at::Tensor const&, c10::Scalar const&) 2025-05-07T20:10:49.2352033Z U at::_ops::all::call(at::Tensor const&) 2025-05-07T20:10:49.2352616Z U at::_ops::baddbmm::call(at::Tensor const&, at::Tensor const&, at::Tensor const&, c10::Scalar const&, c10::Scalar const&) 2025-05-07T20:10:49.2353292Z U at::_ops::broadcast_to::call(at::Tensor const&, c10::ArrayRef) 2025-05-07T20:10:49.2353830Z U at::_ops::cat::call(c10::IListRef const&, long) 2025-05-07T20:10:49.2354330Z U at::_ops::cat_out::call(c10::IListRef const&, long, at::Tensor&) 2025-05-07T20:10:49.2354832Z U at::_ops::clamp_max::call(at::Tensor const&, c10::Scalar const&) 2025-05-07T20:10:49.2355344Z U at::_ops::clone::call(at::Tensor const&, std::optional) 2025-05-07T20:10:49.2355863Z U at::_ops::contiguous::call(at::Tensor const&, c10::MemoryFormat) 2025-05-07T20:10:49.2356317Z U at::_ops::copy_::call(at::Tensor&, at::Tensor const&, bool) 2025-05-07T20:10:49.2356912Z U at::_ops::cumsum::call(at::Tensor const&, long, std::optional) 2025-05-07T20:10:49.2357596Z U at::_ops::diff::call(at::Tensor const&, long, long, std::optional const&, std::optional const&) 2025-05-07T20:10:49.2358347Z U at::_ops::div_Scalar::call(at::Tensor const&, c10::Scalar const&) 2025-05-07T20:10:49.2359254Z U at::_ops::empty_like::call(at::Tensor const&, std::optional, std::optional, std::optional, std::optional, std::optional) 2025-05-07T20:10:49.2360603Z U at::_ops::empty_memory_format::call(c10::ArrayRef, std::optional, std::optional, std::optional, std::optional, std::optional) 2025-05-07T20:10:49.2361562Z U at::_ops::fill__Scalar::call(at::Tensor&, c10::Scalar const&) 2025-05-07T20:10:49.2362041Z U at::_ops::flatten_using_ints::call(at::Tensor const&, long, long) 2025-05-07T20:10:49.2362465Z U at::_ops::floor::call(at::Tensor const&) 2025-05-07T20:10:49.2363253Z U at::_ops::full::call(c10::ArrayRef, c10::Scalar const&, std::optional, std::optional, std::optional, std::optional) 2025-05-07T20:10:49.2364111Z U at::_ops::ge_Scalar::call(at::Tensor const&, c10::Scalar const&) 2025-05-07T20:10:49.2364889Z U at::_ops::index_put_::call(at::Tensor&, c10::List > const&, at::Tensor const&, bool) 2025-05-07T20:10:49.2365596Z U at::_ops::index_select::call(at::Tensor const&, long, at::Tensor const&) 2025-05-07T20:10:49.2366039Z U at::_ops::item::call(at::Tensor const&) 2025-05-07T20:10:49.2366451Z U at::_ops::le_Tensor::call(at::Tensor const&, at::Tensor const&) 2025-05-07T20:10:49.2366866Z U at::_ops::max::call(at::Tensor const&) 2025-05-07T20:10:49.2367277Z U at::_ops::mul_Tensor::call(at::Tensor const&, at::Tensor const&) 2025-05-07T20:10:49.2368189Z U at::_ops::ones_like::call(at::Tensor const&, std::optional, std::optional, std::optional, std::optional, std::optional) 2025-05-07T20:10:49.2369088Z U at::_ops::permute::call(at::Tensor const&, c10::ArrayRef) 2025-05-07T20:10:49.2369923Z U at::_ops::range::call(c10::Scalar const&, c10::Scalar const&, std::optional, std::optional, std::optional, std::optional) 2025-05-07T20:10:49.2370787Z U at::_ops::reshape::call(at::Tensor const&, c10::ArrayRef) 2025-05-07T20:10:49.2371412Z U at::_ops::resize_::call(at::Tensor const&, c10::ArrayRef, std::optional) 2025-05-07T20:10:49.2372040Z U at::_ops::select_int::call(at::Tensor const&, long, c10::SymInt) 2025-05-07T20:10:49.2372798Z U at::_ops::set__source_Storage_storage_offset::call(at::Tensor&, c10::Storage, c10::SymInt, c10::ArrayRef, c10::ArrayRef) 2025-05-07T20:10:49.2375114Z U at::_ops::slice_Tensor::call(at::Tensor const&, long, std::optional, std::optional, c10::SymInt) 2025-05-07T20:10:49.2375740Z U at::_ops::sort::call(at::Tensor const&, long, bool) 2025-05-07T20:10:49.2376229Z U at::_ops::split_sizes::call(at::Tensor const&, c10::ArrayRef, long) 2025-05-07T20:10:49.2376828Z U at::_ops::squeeze_dim::call(at::Tensor const&, long) 2025-05-07T20:10:49.2377317Z U at::_ops::sub_Tensor::call(at::Tensor const&, at::Tensor const&, c10::Scalar const&) 2025-05-07T20:10:49.2377844Z U at::_ops::sum::call(at::Tensor const&, std::optional) 2025-05-07T20:10:49.2378469Z U at::_ops::tensor_split_indices::call(at::Tensor const&, c10::ArrayRef, long) 2025-05-07T20:10:49.2379151Z U at::_ops::to_dtype::call(at::Tensor const&, c10::ScalarType, bool, bool, std::optional) 2025-05-07T20:10:49.2380190Z U at::_ops::to_dtype_layout::call(at::Tensor const&, std::optional, std::optional, std::optional, std::optional, bool, bool, std::optional) 2025-05-07T20:10:49.2381093Z U at::_ops::transpose_int::call(at::Tensor const&, long, long) 2025-05-07T20:10:49.2381643Z U at::_ops::unique_consecutive::call(at::Tensor const&, bool, bool, std::optional) 2025-05-07T20:10:49.2382155Z U at::_ops::unsqueeze::call(at::Tensor const&, long) 2025-05-07T20:10:49.2382600Z U at::_ops::view::call(at::Tensor const&, c10::ArrayRef) 2025-05-07T20:10:49.2383056Z U at::_ops::view_as::call(at::Tensor const&, at::Tensor const&) 2025-05-07T20:10:49.2383439Z U at::_ops::zero_::call(at::Tensor&) 2025-05-07T20:10:49.2384133Z U at::_ops::zeros::call(c10::ArrayRef, std::optional, std::optional, std::optional, std::optional) 2025-05-07T20:10:49.2385319Z U at::_ops::zeros_like::call(at::Tensor const&, std::optional, std::optional, std::optional, std::optional, std::optional) 2025-05-07T20:10:49.2386475Z U at::checkScalarTypes(char const*, at::TensorArg const&, c10::ArrayRef) 2025-05-07T20:10:49.2387003Z U at::cuda::getCurrentCUDABlasHandle() 2025-05-07T20:10:49.2387367Z U at::cuda::getCurrentDeviceProperties() 2025-05-07T20:10:49.2387753Z U at::cuda::getDeviceProperties(signed char) 2025-05-07T20:10:49.2388145Z U at::cuda::get_p2p_access(signed char, signed char) 2025-05-07T20:10:49.2388759Z U at::detail::computeStorageNbytes(c10::ArrayRef, c10::ArrayRef, unsigned long, unsigned long) 2025-05-07T20:10:49.2389365Z U at::functorch::functorchTLSAccessor() 2025-05-07T20:10:49.2389755Z U at::getStepCallbacksUnlessEmpty(at::RecordScope) 2025-05-07T20:10:49.2390136Z U at::get_num_threads() 2025-05-07T20:10:49.2390426Z U at::get_thread_num() 2025-05-07T20:10:49.2390849Z U at::internal::OpaqueOptionalTensorRef::~OpaqueOptionalTensorRef() 2025-05-07T20:10:49.2391387Z U at::internal::set_thread_num(int) 2025-05-07T20:10:49.2391851Z U at::native::_rowwise_prune(at::Tensor const&, at::Tensor const&, c10::ScalarType) 2025-05-07T20:10:49.2392827Z U at::native::empty_like(at::Tensor const&, std::optional, std::optional, std::optional, std::optional, std::optional) 2025-05-07T20:10:49.2394171Z U at::native::empty_meta_symint(c10::ArrayRef, std::optional, std::optional, std::optional, std::optional, std::optional) 2025-05-07T20:10:49.2395229Z U at::native::resize_(at::Tensor const&, c10::ArrayRef, std::optional) 2025-05-07T20:10:49.2395767Z U at::print(std::ostream&, at::Tensor const&, long) 2025-05-07T20:10:49.2396150Z U at::sequence_number::get_and_increment() 2025-05-07T20:10:49.2396561Z U at::tensor(c10::ArrayRef, c10::TensorOptions const&) 2025-05-07T20:10:49.2396947Z U bcmp@GLIBC_2.2.5 2025-05-07T20:10:49.2397252Z U bool at::Tensor::item() const 2025-05-07T20:10:49.2397648Z U bool* at::TensorBase::data_ptr() const 2025-05-07T20:10:49.2398045Z U bool* at::TensorBase::mutable_data_ptr() const 2025-05-07T20:10:49.2398416Z U c10::AnyType::get() 2025-05-07T20:10:49.2398771Z U c10::AutogradMetaInterface::~AutogradMetaInterface() 2025-05-07T20:10:49.2399243Z U c10::BFloat16* at::TensorBase::data_ptr() const 2025-05-07T20:10:49.2399741Z U c10::BFloat16* at::TensorBase::mutable_data_ptr() const 2025-05-07T20:10:49.2400154Z U c10::BoolType::get() 2025-05-07T20:10:49.2400467Z U c10::DeviceObjType::get() 2025-05-07T20:10:49.2400835Z U c10::DeviceTypeName[abi:cxx11](c10::DeviceType, bool) 2025-05-07T20:10:49.2401296Z U c10::Dispatcher::findSchemaOrThrow(char const*, char const*) 2025-05-07T20:10:49.2401704Z U c10::Dispatcher::realSingleton() 2025-05-07T20:10:49.2402455Z U c10::Dispatcher::runRecordFunction(at::RecordFunction&, std::reference_wrapper, c10::DispatchKey, c10::DispatchKeySet) 2025-05-07T20:10:49.2403726Z U c10::Dispatcher::runRecordFunction(at::RecordFunction&, std::reference_wrapper, c10::DispatchKey, c10::DispatchKeySet, c10::ArrayRef) 2025-05-07T20:10:49.2404889Z U c10::Error::Error(c10::SourceLocation, std::__cxx11::basic_string, std::allocator >) 2025-05-07T20:10:49.2405479Z U c10::Error::what() const 2025-05-07T20:10:49.2405791Z U c10::FloatType::get() 2025-05-07T20:10:49.2406132Z U c10::GradMode::is_enabled() 2025-05-07T20:10:49.2406462Z U c10::GradMode::set_enabled(bool) 2025-05-07T20:10:49.2406827Z U c10::Half* at::TensorBase::data_ptr() const 2025-05-07T20:10:49.2407278Z U c10::Half* at::TensorBase::mutable_data_ptr() const 2025-05-07T20:10:49.2407740Z U c10::IValue::TagType::get(c10::IValue const&) 2025-05-07T20:10:49.2408123Z U c10::IValue::hash(c10::IValue const&) 2025-05-07T20:10:49.2408471Z U c10::IValue::isBoolList() const 2025-05-07T20:10:49.2409022Z U c10::IValue::isIntList() const 2025-05-07T20:10:49.2409517Z U c10::IValue::isSymIntList() const 2025-05-07T20:10:49.2409846Z U c10::IValue::isTensorList() const 2025-05-07T20:10:49.2410205Z U c10::IValue::reportToTensorTypeError() const 2025-05-07T20:10:49.2410744Z U c10::InferenceMode::is_enabled() 2025-05-07T20:10:49.2411066Z U c10::IntType::get() 2025-05-07T20:10:49.2411764Z U c10::ListType::get(std::__cxx11::basic_string, std::allocator > const&, c10::Type::SingletonOrSharedTypePtr) 2025-05-07T20:10:49.2412539Z U c10::MessageLogger::MessageLogger(char const*, int, int) 2025-05-07T20:10:49.2412945Z U c10::MessageLogger::~MessageLogger() 2025-05-07T20:10:49.2413305Z U c10::OperatorHandle::~OperatorHandle() 2025-05-07T20:10:49.2413660Z U c10::OperatorHandle::~OperatorHandle() 2025-05-07T20:10:49.2414160Z U c10::OptionalType::get(c10::Type::SingletonOrSharedTypePtr) 2025-05-07T20:10:49.2414624Z U c10::ParallelGuard::ParallelGuard(bool) 2025-05-07T20:10:49.2414997Z U c10::ParallelGuard::~ParallelGuard() 2025-05-07T20:10:49.2415352Z U c10::ScalarTypeType::get() 2025-05-07T20:10:49.2415844Z U c10::SmallVectorBase::grow_pod(void const*, unsigned long, unsigned long) 2025-05-07T20:10:49.2416572Z U c10::SmallVectorBase::mallocForGrow(unsigned long, unsigned long, unsigned long&) 2025-05-07T20:10:49.2417196Z U c10::StorageImpl::throw_data_ptr_access_error() const 2025-05-07T20:10:49.2417583Z U c10::StringType::get() 2025-05-07T20:10:49.2417942Z U c10::SymBool::expect_true(char const*, long) const 2025-05-07T20:10:49.2418341Z U c10::SymBool::guard_bool(char const*, long) const 2025-05-07T20:10:49.2418754Z U c10::SymFloat::guard_float(char const*, long) const 2025-05-07T20:10:49.2419415Z U c10::SymInt::SymInt(c10::intrusive_ptr >) 2025-05-07T20:10:49.2420076Z U c10::SymInt::guard_int(char const*, long) const 2025-05-07T20:10:49.2420469Z U c10::SymInt::operator%(c10::SymInt const&) const 2025-05-07T20:10:49.2420846Z U c10::SymInt::operator*(c10::SymInt const&) const 2025-05-07T20:10:49.2421240Z U c10::SymInt::operator/(c10::SymInt const&) const 2025-05-07T20:10:49.2421599Z U c10::SymInt::promote_to_negative() 2025-05-07T20:10:49.2421958Z U c10::SymInt::sym_eq(c10::SymInt const&) const 2025-05-07T20:10:49.2422332Z U c10::SymInt::sym_ne(c10::SymInt const&) const 2025-05-07T20:10:49.2422675Z U c10::SymInt::toSymNode() const 2025-05-07T20:10:49.2423004Z U c10::SymIntType::get() 2025-05-07T20:10:49.2423381Z U c10::SymbolicShapeMeta::init_is_contiguous() const 2025-05-07T20:10:49.2423782Z U c10::TensorImpl::requires_grad() const 2025-05-07T20:10:49.2424456Z U c10::TensorImpl::set_autograd_meta(std::unique_ptr >) 2025-05-07T20:10:49.2425206Z U c10::TensorImpl::throw_data_ptr_access_error() const 2025-05-07T20:10:49.2425590Z U c10::TensorType::get() 2025-05-07T20:10:49.2426581Z U c10::TupleType::TupleType(std::vector, std::allocator > >, std::optional, std::shared_ptr) 2025-05-07T20:10:49.2427689Z U c10::Type::isSubtypeOfExt(c10::Type const&, std::ostream*) const 2025-05-07T20:10:49.2428118Z U c10::Type::is_module() const 2025-05-07T20:10:49.2428454Z U c10::UndefinedTensorImpl::_singleton 2025-05-07T20:10:49.2429417Z U c10::Warning::Warning(std::variant, c10::SourceLocation const&, std::__cxx11::basic_string, std::allocator >, bool) 2025-05-07T20:10:49.2430379Z U c10::cuda::CUDACachingAllocator::allocator 2025-05-07T20:10:49.2430801Z U c10::cuda::CUDAKernelLaunchRegistry::get_singleton_ref() 2025-05-07T20:10:49.2431435Z U c10::cuda::CUDAKernelLaunchRegistry::get_uvm_assertions_ptr_for_current_device() 2025-05-07T20:10:49.2432164Z U c10::cuda::CUDAKernelLaunchRegistry::insert(char const*, char const*, unsigned int, char const*, int) 2025-05-07T20:10:49.2432810Z U c10::cuda::CUDAStream::stream() const 2025-05-07T20:10:49.2433158Z U c10::cuda::ExchangeDevice(signed char) 2025-05-07T20:10:49.2433578Z U c10::cuda::GetDevice(signed char*) 2025-05-07T20:10:49.2433934Z U c10::cuda::MaybeSetDevice(signed char) 2025-05-07T20:10:49.2434273Z U c10::cuda::SetDevice(signed char) 2025-05-07T20:10:49.2434760Z U c10::cuda::c10_cuda_check_implementation(int, char const*, char const*, int, bool) 2025-05-07T20:10:49.2435234Z U c10::cuda::current_device() 2025-05-07T20:10:49.2435557Z U c10::cuda::device_count() 2025-05-07T20:10:49.2435908Z U c10::cuda::getCurrentCUDAStream(signed char) 2025-05-07T20:10:49.2436322Z U c10::cuda::getDefaultCUDAStream(signed char) 2025-05-07T20:10:49.2436719Z U c10::cuda::getStreamFromPool(bool, signed char) 2025-05-07T20:10:49.2437109Z U c10::cuda::getStreamFromPool(int, signed char) 2025-05-07T20:10:49.2437520Z U c10::cuda::setCurrentCUDAStream(c10::cuda::CUDAStream) 2025-05-07T20:10:49.2437901Z U c10::cuda::warn_or_error_on_sync() 2025-05-07T20:10:49.2438577Z U c10::detail::ListImpl::ListImpl(std::vector >, c10::Type::SingletonOrSharedTypePtr) 2025-05-07T20:10:49.2439658Z U c10::detail::infer_schema::make_function_schema(c10::ArrayRef, c10::ArrayRef) 2025-05-07T20:10:49.2440540Z U c10::detail::torchCheckFail(char const*, char const*, unsigned int, char const*) 2025-05-07T20:10:49.2441574Z U c10::detail::torchCheckFail(char const*, char const*, unsigned int, std::__cxx11::basic_string, std::allocator > const&) 2025-05-07T20:10:49.2442542Z U c10::detail::torchInternalAssertFail(char const*, char const*, unsigned int, char const*, char const*) 2025-05-07T20:10:49.2443614Z U c10::detail::torchInternalAssertFail(char const*, char const*, unsigned int, char const*, std::__cxx11::basic_string, std::allocator > const&) 2025-05-07T20:10:49.2444604Z U c10::impl::ExcludeDispatchKeyGuard::ExcludeDispatchKeyGuard(c10::DispatchKeySet) 2025-05-07T20:10:49.2445279Z U c10::impl::ExcludeDispatchKeyGuard::ExcludeDispatchKeyGuard(c10::DispatchKeySet) 2025-05-07T20:10:49.2445900Z U c10::impl::ExcludeDispatchKeyGuard::~ExcludeDispatchKeyGuard() 2025-05-07T20:10:49.2446341Z U c10::impl::GPUTrace::gpuTraceState 2025-05-07T20:10:49.2446668Z U c10::impl::GPUTrace::haveState 2025-05-07T20:10:49.2447217Z U c10::impl::OperatorEntry::assertSignatureIsCorrect(c10::impl::CppSignature const&, bool) const 2025-05-07T20:10:49.2447845Z U c10::impl::OperatorEntry::reportError(c10::DispatchKey) const 2025-05-07T20:10:49.2448276Z U c10::impl::PyObjectSlot::PyObjectSlot() 2025-05-07T20:10:49.2448644Z U c10::impl::PyObjectSlot::~PyObjectSlot() 2025-05-07T20:10:49.2449034Z U c10::impl::cow::is_cow_data_ptr(c10::DataPtr const&) 2025-05-07T20:10:49.2449473Z U c10::impl::cow::materialize_cow_storage(c10::StorageImpl&) 2025-05-07T20:10:49.2449880Z U c10::impl::device_guard_impl_registry 2025-05-07T20:10:49.2450223Z U c10::impl::raw_local_dispatch_key_set 2025-05-07T20:10:49.2450603Z U c10::initializeFunctionalityOffsetsAndMasks() 2025-05-07T20:10:49.2451240Z U c10::ivalue::ConstantString::create(std::__cxx11::basic_string, std::allocator >) 2025-05-07T20:10:49.2451854Z U c10::operator*(c10::SymInt const&, int) 2025-05-07T20:10:49.2452216Z U c10::operator+(c10::SymInt const&, int) 2025-05-07T20:10:49.2452586Z U c10::operator+(c10::SymInt const&, unsigned long) 2025-05-07T20:10:49.2453064Z U c10::operator-(c10::SymInt const&, int) 2025-05-07T20:10:49.2453433Z U c10::operator-(c10::SymInt const&, unsigned long) 2025-05-07T20:10:49.2453813Z U c10::operator/(c10::SymInt const&, int) 2025-05-07T20:10:49.2454162Z U c10::operator<(c10::SymInt const&, int) 2025-05-07T20:10:49.2454540Z U c10::operator<<(std::ostream&, c10::Device const&) 2025-05-07T20:10:49.2454933Z U c10::operator<<(std::ostream&, c10::DeviceType) 2025-05-07T20:10:49.2455338Z U c10::operator<<(std::ostream&, c10::OperatorName const&) 2025-05-07T20:10:49.2455958Z U c10::operator<<(std::ostream&, c10::SymInt const&) 2025-05-07T20:10:49.2456329Z U c10::operator==(c10::SymInt const&, int) 2025-05-07T20:10:49.2456693Z U c10::operator>(c10::SymInt const&, int) 2025-05-07T20:10:49.2457047Z U c10::operator>=(c10::SymInt const&, int) 2025-05-07T20:10:49.2457396Z U c10::report_overflow(char const*) 2025-05-07T20:10:49.2457744Z U c10::throwNullDataPtrError() 2025-05-07T20:10:49.2458086Z U c10::typeKindToString(c10::TypeKind) 2025-05-07T20:10:49.2458428Z U c10::warn(c10::Warning const&) 2025-05-07T20:10:49.2458753Z U c10::warnDeprecatedDataPtr() 2025-05-07T20:10:49.2459184Z U caffe2::TypeMeta::error_unsupported_typemeta(caffe2::TypeMeta) 2025-05-07T20:10:49.2459628Z U caffe2::TypeMeta::typeMetaDatas() 2025-05-07T20:10:49.2459948Z U ceil@GLIBC_2.2.5 2025-05-07T20:10:49.2460259Z U cublasGemmStridedBatchedEx 2025-05-07T20:10:49.2460573Z U cublasSetStream_v2 2025-05-07T20:10:49.2460909Z U cudaDeviceGetAttribute@libcudart.so.12 2025-05-07T20:10:49.2461276Z U cudaDeviceGetByPCIBusId@libcudart.so.12 2025-05-07T20:10:49.2461657Z U cudaDeviceSynchronize@libcudart.so.12 2025-05-07T20:10:49.2462064Z U cudaEventCreateWithFlags@libcudart.so.12 2025-05-07T20:10:49.2462427Z U cudaEventDestroy@libcudart.so.12 2025-05-07T20:10:49.2462792Z U cudaEventElapsedTime@libcudart.so.12 2025-05-07T20:10:49.2463136Z U cudaEventQuery@libcudart.so.12 2025-05-07T20:10:49.2463504Z U cudaEventRecord@libcudart.so.12 2025-05-07T20:10:49.2463852Z U cudaEventSynchronize@libcudart.so.12 2025-05-07T20:10:49.2464197Z U cudaFree@libcudart.so.12 2025-05-07T20:10:49.2464523Z U cudaFuncGetAttributes@libcudart.so.12 2025-05-07T20:10:49.2465084Z U cudaFuncSetAttribute@libcudart.so.12 2025-05-07T20:10:49.2465542Z U cudaGetDevice@libcudart.so.12 2025-05-07T20:10:49.2465926Z U cudaGetDeviceCount@libcudart.so.12 2025-05-07T20:10:49.2466299Z U cudaGetDeviceProperties_v2@libcudart.so.12 2025-05-07T20:10:49.2466664Z U cudaGetErrorString@libcudart.so.12 2025-05-07T20:10:49.2467016Z U cudaGetLastError@libcudart.so.12 2025-05-07T20:10:49.2467369Z U cudaHostGetDevicePointer@libcudart.so.12 2025-05-07T20:10:49.2467735Z U cudaHostRegister@libcudart.so.12 2025-05-07T20:10:49.2468079Z U cudaHostUnregister@libcudart.so.12 2025-05-07T20:10:49.2468416Z U cudaLaunchKernel@libcudart.so.12 2025-05-07T20:10:49.2468764Z U cudaMallocManaged@libcudart.so.12 2025-05-07T20:10:49.2469096Z U cudaMemAdvise@libcudart.so.12 2025-05-07T20:10:49.2469446Z U cudaMemPrefetchAsync@libcudart.so.12 2025-05-07T20:10:49.2469792Z U cudaMemcpy2DAsync@libcudart.so.12 2025-05-07T20:10:49.2470143Z U cudaMemcpyAsync@libcudart.so.12 2025-05-07T20:10:49.2470474Z U cudaMemsetAsync@libcudart.so.12 2025-05-07T20:10:49.2470992Z U cudaOccupancyMaxActiveBlocksPerMultiprocessorWithFlags@libcudart.so.12 2025-05-07T20:10:49.2471656Z U cudaPeekAtLastError@libcudart.so.12 2025-05-07T20:10:49.2471994Z U cudaSetDevice@libcudart.so.12 2025-05-07T20:10:49.2472335Z U cudaStreamQuery@libcudart.so.12 2025-05-07T20:10:49.2472685Z U cudaStreamSynchronize@libcudart.so.12 2025-05-07T20:10:49.2473047Z U cudaStreamWaitEvent@libcudart.so.12 2025-05-07T20:10:49.2473429Z U double* at::TensorBase::data_ptr() const 2025-05-07T20:10:49.2473862Z U double* at::TensorBase::mutable_data_ptr() const 2025-05-07T20:10:49.2474293Z U exit@GLIBC_2.2.5 2025-05-07T20:10:49.2474575Z U exp10@GLIBC_2.2.5 2025-05-07T20:10:49.2474861Z U exp2@GLIBC_2.2.5 2025-05-07T20:10:49.2475133Z U exp@GLIBC_2.2.5 2025-05-07T20:10:49.2475424Z U expf@GLIBC_2.2.5 2025-05-07T20:10:49.2475812Z U fbgemm_gpu::asynchronous_complete_cumsum_cpu(at::Tensor const&) 2025-05-07T20:10:49.2476330Z U fbgemm_gpu::asynchronous_complete_cumsum_gpu(at::Tensor const&) 2025-05-07T20:10:49.2476848Z U fbgemm_gpu::asynchronous_exclusive_cumsum_cpu(at::Tensor const&) 2025-05-07T20:10:49.2477360Z U fbgemm_gpu::asynchronous_exclusive_cumsum_gpu(at::Tensor const&) 2025-05-07T20:10:49.2477879Z U fbgemm_gpu::asynchronous_inclusive_cumsum_gpu(at::Tensor const&) 2025-05-07T20:10:49.2478328Z U float* at::TensorBase::data_ptr() const 2025-05-07T20:10:49.2478743Z U float* at::TensorBase::mutable_data_ptr() const 2025-05-07T20:10:49.2479119Z U fmod@GLIBC_2.2.5 2025-05-07T20:10:49.2479387Z U free@GLIBC_2.2.5 2025-05-07T20:10:49.2479698Z U get_info_B_num_bits_from_T(int, int) 2025-05-07T20:10:49.2480029Z U int at::Tensor::item() const 2025-05-07T20:10:49.2480455Z U int const* at::TensorBase::const_data_ptr() const 2025-05-07T20:10:49.2480862Z U int* at::TensorBase::data_ptr() const 2025-05-07T20:10:49.2481249Z U int* at::TensorBase::mutable_data_ptr() const 2025-05-07T20:10:49.2481603Z U isnan@GLIBC_2.2.5 2025-05-07T20:10:49.2481926Z U lgamma@GLIBC_2.2.5 2025-05-07T20:10:49.2482221Z U llrint@GLIBC_2.2.5 2025-05-07T20:10:49.2482506Z U llround@GLIBC_2.2.5 2025-05-07T20:10:49.2482798Z U log10@GLIBC_2.2.5 2025-05-07T20:10:49.2483077Z U log2@GLIBC_2.2.5 2025-05-07T20:10:49.2483362Z U log@GLIBC_2.2.5 2025-05-07T20:10:49.2483630Z U logl@GLIBC_2.2.5 2025-05-07T20:10:49.2483937Z U long at::Tensor::item() const 2025-05-07T20:10:49.2484343Z U long c10::detail::maybe_wrap_dim_slow(long, long, bool) 2025-05-07T20:10:49.2484800Z U long const* at::TensorBase::const_data_ptr() const 2025-05-07T20:10:49.2485225Z U long* at::TensorBase::data_ptr() const 2025-05-07T20:10:49.2485617Z U long* at::TensorBase::mutable_data_ptr() const 2025-05-07T20:10:49.2485986Z U lrint@GLIBC_2.2.5 2025-05-07T20:10:49.2486266Z U madvise@GLIBC_2.2.5 2025-05-07T20:10:49.2486561Z U malloc@GLIBC_2.2.5 2025-05-07T20:10:49.2486841Z U memcmp@GLIBC_2.2.5 2025-05-07T20:10:49.2487131Z U memcpy@GLIBC_2.14 2025-05-07T20:10:49.2487416Z U memmove@GLIBC_2.2.5 2025-05-07T20:10:49.2487700Z U memset@GLIBC_2.2.5 2025-05-07T20:10:49.2487994Z U nextafter@GLIBC_2.2.5 2025-05-07T20:10:49.2488290Z U nvmlDeviceGetCount_v2 2025-05-07T20:10:49.2488619Z U nvmlDeviceGetHandleByIndex_v2 2025-05-07T20:10:49.2488996Z U nvmlDeviceGetNvLinkRemotePciInfo_v2 2025-05-07T20:10:49.2489349Z U nvmlDeviceGetNvLinkState 2025-05-07T20:10:49.2489662Z U nvmlDeviceGetPciInfo_v3 2025-05-07T20:10:49.2489968Z U nvmlInit_v2 2025-05-07T20:10:49.2490258Z U operator delete(void*)@GLIBCXX_3.4 2025-05-07T20:10:49.2490602Z U operator new(unsigned long)@GLIBCXX_3.4 2025-05-07T20:10:49.2490973Z U operator new[](unsigned long)@GLIBCXX_3.4 2025-05-07T20:10:49.2491296Z U pow@GLIBC_2.2.5 2025-05-07T20:10:49.2491582Z U realloc@GLIBC_2.2.5 2025-05-07T20:10:49.2491961Z U short* at::TensorBase::mutable_data_ptr() const 2025-05-07T20:10:49.2492441Z U signed char* at::TensorBase::mutable_data_ptr() const 2025-05-07T20:10:49.2492867Z U sin@GLIBC_2.2.5 2025-05-07T20:10:49.2493266Z U std::_Hash_bytes(void const*, unsigned long, unsigned long)@CXXABI_1.3.5 2025-05-07T20:10:49.2493813Z U std::_Rb_tree_decrement(std::_Rb_tree_node_base*)@GLIBCXX_3.4 2025-05-07T20:10:49.2494303Z U std::_Rb_tree_increment(std::_Rb_tree_node_base const*)@GLIBCXX_3.4 2025-05-07T20:10:49.2494816Z U std::_Rb_tree_increment(std::_Rb_tree_node_base*)@GLIBCXX_3.4 2025-05-07T20:10:49.2495521Z U std::_Rb_tree_insert_and_rebalance(bool, std::_Rb_tree_node_base*, std::_Rb_tree_node_base*, std::_Rb_tree_node_base&)@GLIBCXX_3.4 2025-05-07T20:10:49.2496380Z U std::__cxx11::basic_ostringstream, std::allocator >::basic_ostringstream() 2025-05-07T20:10:49.2497268Z U std::__cxx11::basic_ostringstream, std::allocator >::~basic_ostringstream()@GLIBCXX_3.4.21 2025-05-07T20:10:49.2498139Z U std::__cxx11::basic_stringstream, std::allocator >::basic_stringstream() 2025-05-07T20:10:49.2499012Z U std::__cxx11::basic_stringstream, std::allocator >::~basic_stringstream()@GLIBCXX_3.4.21 2025-05-07T20:10:49.2499917Z U std::__detail::_Prime_rehash_policy::_M_need_rehash(unsigned long, unsigned long, unsigned long) const@GLIBCXX_3.4.18 2025-05-07T20:10:49.2500584Z U std::__once_call@GLIBCXX_3.4.11 2025-05-07T20:10:49.2500953Z U std::__once_callable@GLIBCXX_3.4.11 2025-05-07T20:10:49.2501321Z U std::__throw_bad_alloc()@GLIBCXX_3.4 2025-05-07T20:10:49.2501658Z U std::__throw_bad_array_new_length() 2025-05-07T20:10:49.2502004Z U std::__throw_bad_cast()@GLIBCXX_3.4 2025-05-07T20:10:49.2502355Z U std::__throw_bad_function_call()@GLIBCXX_3.4.14 2025-05-07T20:10:49.2502759Z U std::__throw_length_error(char const*)@GLIBCXX_3.4 2025-05-07T20:10:49.2503171Z U std::__throw_logic_error(char const*)@GLIBCXX_3.4 2025-05-07T20:10:49.2503563Z U std::__throw_out_of_range(char const*)@GLIBCXX_3.4 2025-05-07T20:10:49.2504002Z U std::__throw_out_of_range_fmt(char const*, ...)@GLIBCXX_3.4.20 2025-05-07T20:10:49.2504411Z U std::__throw_system_error(int)@GLIBCXX_3.4.11 2025-05-07T20:10:49.2504803Z U std::bad_weak_ptr::~bad_weak_ptr()@GLIBCXX_3.4.15 2025-05-07T20:10:49.2505258Z U std::basic_filebuf >::close()@GLIBCXX_3.4 2025-05-07T20:10:49.2505933Z U std::basic_ifstream >::basic_ifstream(char const*, std::_Ios_Openmode)@GLIBCXX_3.4 2025-05-07T20:10:49.2506651Z U std::basic_ifstream >::~basic_ifstream()@GLIBCXX_3.4 2025-05-07T20:10:49.2507245Z U std::basic_ios >::clear(std::_Ios_Iostate)@GLIBCXX_3.4 2025-05-07T20:10:49.2507970Z U std::basic_ofstream >::basic_ofstream(char const*, std::_Ios_Openmode)@GLIBCXX_3.4 2025-05-07T20:10:49.2508715Z U std::basic_ofstream >::~basic_ofstream()@GLIBCXX_3.4 2025-05-07T20:10:49.2509668Z U std::basic_ostream >& std::__ostream_insert >(std::basic_ostream >&, char const*, long)@GLIBCXX_3.4.9 2025-05-07T20:10:49.2510551Z U std::chrono::_V2::system_clock::now()@GLIBCXX_3.4.19 2025-05-07T20:10:49.2510929Z U std::cout@GLIBCXX_3.4 2025-05-07T20:10:49.2511405Z U std::ctype::_M_widen_init() const@GLIBCXX_3.4.11 2025-05-07T20:10:49.2511844Z U std::exception::~exception()@GLIBCXX_3.4 2025-05-07T20:10:49.2512218Z U std::ios_base::Init::Init()@GLIBCXX_3.4 2025-05-07T20:10:49.2512602Z U std::ios_base::Init::~Init()@GLIBCXX_3.4 2025-05-07T20:10:49.2512963Z U std::ios_base::~ios_base()@GLIBCXX_3.4 2025-05-07T20:10:49.2513336Z U std::locale::~locale()@GLIBCXX_3.4 2025-05-07T20:10:49.2513781Z U std::ostream& std::ostream::_M_insert(double)@GLIBCXX_3.4.9 2025-05-07T20:10:49.2514287Z U std::ostream& std::ostream::_M_insert(long)@GLIBCXX_3.4.9 2025-05-07T20:10:49.2514853Z U std::ostream& std::ostream::_M_insert(unsigned long)@GLIBCXX_3.4.9 2025-05-07T20:10:49.2515341Z U std::ostream::flush()@GLIBCXX_3.4 2025-05-07T20:10:49.2515717Z U std::ostream::operator<<(int)@GLIBCXX_3.4 2025-05-07T20:10:49.2516086Z U std::ostream::put(char)@GLIBCXX_3.4 2025-05-07T20:10:49.2516460Z U std::ostream::write(char const*, long)@GLIBCXX_3.4 2025-05-07T20:10:49.2516892Z U std::out_of_range::out_of_range(char const*)@GLIBCXX_3.4.21 2025-05-07T20:10:49.2517301Z U std::out_of_range::~out_of_range()@GLIBCXX_3.4 2025-05-07T20:10:49.2517763Z U std::runtime_error::runtime_error(char const*)@GLIBCXX_3.4.21 2025-05-07T20:10:49.2518504Z U std::runtime_error::runtime_error(std::__cxx11::basic_string, std::allocator > const&)@GLIBCXX_3.4.21 2025-05-07T20:10:49.2519217Z U std::runtime_error::~runtime_error()@GLIBCXX_3.4 2025-05-07T20:10:49.2519591Z U std::terminate()@GLIBCXX_3.4 2025-05-07T20:10:49.2519904Z U strcmp@GLIBC_2.2.5 2025-05-07T20:10:49.2520206Z U strlen@GLIBC_2.2.5 2025-05-07T20:10:49.2520489Z U sysconf@GLIBC_2.2.5 2025-05-07T20:10:49.2520815Z U torch::CppFunction::~CppFunction() 2025-05-07T20:10:49.2521648Z U torch::Library::Library(torch::Library::Kind, std::__cxx11::basic_string, std::allocator >, std::optional, char const*, unsigned int) 2025-05-07T20:10:49.2522825Z U torch::Library::_def(c10::FunctionSchema&&, c10::OperatorName*, std::vector > const&, torch::_RegisterOrVerify) & 2025-05-07T20:10:49.2524023Z U torch::Library::_def(std::variant&&, torch::CppFunction&&, std::vector > const&) & 2025-05-07T20:10:49.2524884Z U torch::Library::_impl(char const*, torch::CppFunction&&, torch::_RegisterOrVerify) & 2025-05-07T20:10:49.2525367Z U torch::autograd::AnomalyMode::_enabled 2025-05-07T20:10:49.2525899Z U torch::autograd::AutogradContext::AutogradContext(torch::dynamo::autograd::PackedArgs&) 2025-05-07T20:10:49.2526664Z U torch::autograd::AutogradContext::get_and_bump_dirty() const 2025-05-07T20:10:49.2527158Z U torch::autograd::AutogradContext::get_non_differentiable() const 2025-05-07T20:10:49.2527705Z U torch::autograd::AutogradContext::get_saved_variables() const 2025-05-07T20:10:49.2528355Z U torch::autograd::AutogradContext::save_for_backward(std::vector >) 2025-05-07T20:10:49.2528981Z U torch::autograd::AutogradContext::save_variables() 2025-05-07T20:10:49.2529450Z U torch::autograd::ForwardADLevel::try_get_by_idx(unsigned long) 2025-05-07T20:10:49.2529933Z U torch::autograd::InputMetadata::shape_as_dim_vector() const 2025-05-07T20:10:49.2530362Z U torch::autograd::Node::assign_parent() 2025-05-07T20:10:49.2530729Z U torch::autograd::Node::metadata() 2025-05-07T20:10:49.2531097Z U torch::autograd::Node::name[abi:cxx11]() const 2025-05-07T20:10:49.2531584Z U torch::autograd::SavedVariable::SavedVariable(at::Tensor const&, bool, bool) 2025-05-07T20:10:49.2532233Z U torch::autograd::SavedVariable::unpack(std::shared_ptr) const 2025-05-07T20:10:49.2532776Z U torch::autograd::VariableInfo::VariableInfo() 2025-05-07T20:10:49.2533239Z U torch::autograd::VariableInfo::VariableInfo(at::Tensor const&, bool) 2025-05-07T20:10:49.2533801Z U torch::autograd::VariableInfo::zeros(c10::OptionalDeviceGuard&) const 2025-05-07T20:10:49.2536881Z U torch::autograd::_wrap_outputs(std::vector > const&, std::unordered_set, std::equal_to, std::allocator > const&, std::unordered_set, std::equal_to, std::allocator > const&, c10::ArrayRef >, std::shared_ptr const&, std::function > (std::vector >, std::vector >)> const&, std::unordered_set, std::equal_to, std::allocator > const&, std::function const&) 2025-05-07T20:10:49.2539834Z U torch::autograd::deleteNode(torch::autograd::Node*) 2025-05-07T20:10:49.2540278Z U torch::autograd::get_current_graph_task_exec_info() 2025-05-07T20:10:49.2540719Z U torch::autograd::impl::gradient_edge(at::Tensor const&) 2025-05-07T20:10:49.2541164Z U torch::dynamo::autograd::getPyCompilerInterface() 2025-05-07T20:10:49.2541847Z U torch::dynamo::autograd::get_input_metadata(std::vector > const&) 2025-05-07T20:10:49.2542745Z U torch::jit::parseSchema(std::__cxx11::basic_string, std::allocator > const&, bool) 2025-05-07T20:10:49.2543634Z U torch::jit::parseSchemaOrName(std::__cxx11::basic_string, std::allocator > const&, bool) 2025-05-07T20:10:49.2544347Z U torch::pickle_load(std::vector > const&) 2025-05-07T20:10:49.2544797Z U torch::pickle_save(c10::IValue const&) 2025-05-07T20:10:49.2545586Z U transpose_embedding_input(at::Tensor, long, at::Tensor, at::Tensor, bool, std::optional const&, long, long, long, bool, std::optional const&, long, long) 2025-05-07T20:10:49.2546364Z U typeinfo for c10::Error 2025-05-07T20:10:49.2546683Z U typeinfo for c10::Type 2025-05-07T20:10:49.2547025Z U typeinfo for std::bad_weak_ptr@GLIBCXX_3.4.15 2025-05-07T20:10:49.2547407Z U typeinfo for std::exception@GLIBCXX_3.4 2025-05-07T20:10:49.2547772Z U typeinfo for std::out_of_range@GLIBCXX_3.4 2025-05-07T20:10:49.2548178Z U typeinfo for std::runtime_error@GLIBCXX_3.4 2025-05-07T20:10:49.2548545Z U typeinfo for torch::autograd::Node 2025-05-07T20:10:49.2548955Z U unsigned char* at::TensorBase::data_ptr() const 2025-05-07T20:10:49.2549484Z U unsigned char* at::TensorBase::mutable_data_ptr() const 2025-05-07T20:10:49.2550261Z U void fbgemm::FloatOrHalfToFused8BitRowwiseQuantizedSBFloat(float const*, unsigned long, int, unsigned char*) 2025-05-07T20:10:49.2551432Z U void fbgemm::FloatOrHalfToFused8BitRowwiseQuantizedSBFloat(unsigned short const*, unsigned long, int, unsigned char*) 2025-05-07T20:10:49.2552674Z U void fbgemm::FloatOrHalfToFusedNBitRowwiseQuantizedSBHalf(int, float const*, unsigned long, int, unsigned char*) 2025-05-07T20:10:49.2553781Z U void fbgemm::FloatOrHalfToFusedNBitRowwiseQuantizedSBHalf(int, unsigned short const*, unsigned long, int, unsigned char*) 2025-05-07T20:10:49.2554891Z U void fbgemm::Fused8BitRowwiseQuantizedSBFloatToFloatOrHalf(unsigned char const*, unsigned long, int, float*) 2025-05-07T20:10:49.2555981Z U void fbgemm::Fused8BitRowwiseQuantizedSBFloatToFloatOrHalf(unsigned char const*, unsigned long, int, unsigned short*) 2025-05-07T20:10:49.2557092Z U void fbgemm::FusedNBitRowwiseQuantizedSBHalfToFloatOrHalf(int, unsigned char const*, unsigned long, int, float*, bool) 2025-05-07T20:10:49.2558249Z U void fbgemm::FusedNBitRowwiseQuantizedSBHalfToFloatOrHalf(int, unsigned char const*, unsigned long, int, unsigned short*, bool) 2025-05-07T20:10:49.2559450Z U void fbgemm::FusedNBitRowwiseQuantizedSBHalfToFloatOrHalfRef(int, unsigned char const*, unsigned long, int, float*, bool) 2025-05-07T20:10:49.2560702Z U void fbgemm::FusedNBitRowwiseQuantizedSBHalfToFloatOrHalfRef(int, unsigned char const*, unsigned long, int, unsigned short*, bool) 2025-05-07T20:10:49.2562195Z U void fbgemm::FusedNBitRowwiseQuantizedSBHalfToFloatOrHalfRef(int, unsigned char const*, unsigned long, int, unsigned short*, bool) 2025-05-07T20:10:49.2563066Z U vtable for __cxxabiv1::__class_type_info@CXXABI_1.3 2025-05-07T20:10:49.2563530Z U vtable for __cxxabiv1::__function_type_info@CXXABI_1.3 2025-05-07T20:10:49.2563976Z U vtable for __cxxabiv1::__pointer_type_info@CXXABI_1.3 2025-05-07T20:10:49.2564402Z U vtable for __cxxabiv1::__si_class_type_info@CXXABI_1.3 2025-05-07T20:10:49.2565013Z U vtable for __cxxabiv1::__vmi_class_type_info@CXXABI_1.3 2025-05-07T20:10:49.2565475Z U vtable for at::TensorIterator 2025-05-07T20:10:49.2565813Z U vtable for at::TensorIteratorBase 2025-05-07T20:10:49.2566157Z U vtable for c10::Error 2025-05-07T20:10:49.2566463Z U vtable for c10::ListType 2025-05-07T20:10:49.2567022Z U vtable for std::__cxx11::basic_stringbuf, std::allocator >@GLIBCXX_3.4.21 2025-05-07T20:10:49.2567618Z U vtable for std::bad_weak_ptr@GLIBCXX_3.4.15 2025-05-07T20:10:49.2568087Z U vtable for std::basic_streambuf >@GLIBCXX_3.4 2025-05-07T20:10:49.2568572Z U vtable for torch::autograd::AutogradMeta 2025-05-07T20:10:49.2568919Z U vtable for torch::autograd::Node 2025-05-07T20:10:49.2569328Z w TLS init function for c10::impl::raw_local_dispatch_key_set 2025-05-07T20:10:49.2569730Z w _ITM_deregisterTMCloneTable 2025-05-07T20:10:49.2570058Z w _ITM_registerTMCloneTable 2025-05-07T20:10:49.2570369Z w __cxa_finalize@GLIBC_2.2.5 2025-05-07T20:10:49.2570713Z w __gmon_start__ 2025-05-07T20:10:49.2570997Z w __pthread_key_create 2025-05-07T20:10:49.2571302Z w pthread_mutex_lock@GLIBC_2.2.5 2025-05-07T20:10:49.2571635Z w pthread_mutex_unlock@GLIBC_2.2.5 2025-05-07T20:10:49.2571943Z w pthread_once 2025-05-07T20:10:49.2572269Z [CHECK] Listing out external shared libraries linked: 2025-05-07T20:10:49.2572700Z + ldd ./_skbuild/linux-x86_64-3.12/cmake-build/fbgemm_gpu_py.so 2025-05-07T20:10:49.2573000Z 2025-05-07T20:10:49.2573169Z linux-vdso.so.1 (0x00007fff0bdac000) 2025-05-07T20:10:49.2573459Z libc10.so => not found 2025-05-07T20:10:49.2573757Z libnvrtc.so.12 => not found 2025-05-07T20:10:49.2574018Z libc10_cuda.so => not found 2025-05-07T20:10:49.2574284Z libnccl.so.2 => not found 2025-05-07T20:10:49.2574533Z libcuda.so.1 => not found 2025-05-07T20:10:49.2575071Z fbgemm.so => /__w/FBGEMM/FBGEMM/fbgemm_gpu/./_skbuild/linux-x86_64-3.12/cmake-build/fbgemm.so (0x00007f2abca00000) 2025-05-07T20:10:49.2576140Z fbgemm_gpu_embedding_inplace_ops.so => /__w/FBGEMM/FBGEMM/fbgemm_gpu/./_skbuild/linux-x86_64-3.12/cmake-build/fbgemm_gpu_embedding_inplace_ops.so (0x00007f2abcf8d000) 2025-05-07T20:10:49.2577446Z fbgemm_gpu_tbe_index_select.so => /__w/FBGEMM/FBGEMM/fbgemm_gpu/./_skbuild/linux-x86_64-3.12/cmake-build/fbgemm_gpu_tbe_index_select.so (0x00007f2aba600000) 2025-05-07T20:10:49.2578516Z fbgemm_gpu_tbe_cache.so => /__w/FBGEMM/FBGEMM/fbgemm_gpu/./_skbuild/linux-x86_64-3.12/cmake-build/fbgemm_gpu_tbe_cache.so (0x00007f2ab8e00000) 2025-05-07T20:10:49.2579568Z fbgemm_gpu_tbe_optimizers.so => /__w/FBGEMM/FBGEMM/fbgemm_gpu/./_skbuild/linux-x86_64-3.12/cmake-build/fbgemm_gpu_tbe_optimizers.so (0x00007f2ab8400000) 2025-05-07T20:10:49.2580286Z libnvidia-ml.so.1 => not found 2025-05-07T20:10:49.2580564Z libtorch.so => not found 2025-05-07T20:10:49.2581251Z fbgemm_gpu_sparse_async_cumsum.so => /__w/FBGEMM/FBGEMM/fbgemm_gpu/./_skbuild/linux-x86_64-3.12/cmake-build/fbgemm_gpu_sparse_async_cumsum.so (0x00007f2ab8259000) 2025-05-07T20:10:49.2582404Z fbgemm_gpu_tbe_utils.so => /__w/FBGEMM/FBGEMM/fbgemm_gpu/./_skbuild/linux-x86_64-3.12/cmake-build/fbgemm_gpu_tbe_utils.so (0x00007f2ab7000000) 2025-05-07T20:10:49.2583054Z libtorch_cpu.so => not found 2025-05-07T20:10:49.2583163Z libtorch_cuda.so => not found 2025-05-07T20:10:49.2583259Z libcudart.so.12 => not found 2025-05-07T20:10:49.2583456Z libstdc++.so.6 => /lib64/libstdc++.so.6 (0x00007f2ab6d9c000) 2025-05-07T20:10:49.2583598Z libm.so.6 => /lib64/libm.so.6 (0x00007f2aba525000) 2025-05-07T20:10:49.2583746Z libgcc_s.so.1 => /lib64/libgcc_s.so.1 (0x00007f2abcf5f000) 2025-05-07T20:10:49.2583869Z libc.so.6 => /lib64/libc.so.6 (0x00007f2ab6b94000) 2025-05-07T20:10:49.2583998Z /lib64/ld-linux-x86-64.so.2 (0x00007f2ac1d81000) 2025-05-07T20:10:49.2584096Z libc10.so => not found 2025-05-07T20:10:49.2584193Z libnvrtc.so.12 => not found 2025-05-07T20:10:49.2584284Z libc10_cuda.so => not found 2025-05-07T20:10:49.2584385Z libnccl.so.2 => not found 2025-05-07T20:10:49.2584477Z libcuda.so.1 => not found 2025-05-07T20:10:49.2584835Z asmjit.so => /__w/FBGEMM/FBGEMM/fbgemm_gpu/./_skbuild/linux-x86_64-3.12/cmake-build/asmjit.so (0x00007f2abc989000) 2025-05-07T20:10:49.2584948Z libnvidia-ml.so.1 => not found 2025-05-07T20:10:49.2585038Z libtorch.so => not found 2025-05-07T20:10:49.2585133Z libtorch_cpu.so => not found 2025-05-07T20:10:49.2585230Z libtorch_cuda.so => not found 2025-05-07T20:10:49.2585336Z libtorch.so => not found 2025-05-07T20:10:49.2585425Z libc10.so => not found 2025-05-07T20:10:49.2585519Z libnvrtc.so.12 => not found 2025-05-07T20:10:49.2585610Z libc10_cuda.so => not found 2025-05-07T20:10:49.2585716Z libnccl.so.2 => not found 2025-05-07T20:10:49.2585807Z libcuda.so.1 => not found 2025-05-07T20:10:49.2585904Z libnvidia-ml.so.1 => not found 2025-05-07T20:10:49.2586014Z libtorch_cpu.so => not found 2025-05-07T20:10:49.2586110Z libtorch_cuda.so => not found 2025-05-07T20:10:49.2586202Z libcudart.so.12 => not found 2025-05-07T20:10:49.2586393Z libgomp.so.1 => /lib64/libgomp.so.1 (0x00007f2aba4cf000) 2025-05-07T20:10:49.2586481Z libc10.so => not found 2025-05-07T20:10:49.2586574Z libnvrtc.so.12 => not found 2025-05-07T20:10:49.2586663Z libc10_cuda.so => not found 2025-05-07T20:10:49.2586767Z libnccl.so.2 => not found 2025-05-07T20:10:49.2586856Z libcuda.so.1 => not found 2025-05-07T20:10:49.2586954Z libnvidia-ml.so.1 => not found 2025-05-07T20:10:49.2587062Z libtorch.so => not found 2025-05-07T20:10:49.2587156Z libtorch_cpu.so => not found 2025-05-07T20:10:49.2587251Z libtorch_cuda.so => not found 2025-05-07T20:10:49.2587345Z libcudart.so.12 => not found 2025-05-07T20:10:49.2587475Z libtorch.so => not found 2025-05-07T20:10:49.2587562Z libc10.so => not found 2025-05-07T20:10:49.2587654Z libnvrtc.so.12 => not found 2025-05-07T20:10:49.2587756Z libc10_cuda.so => not found 2025-05-07T20:10:49.2587847Z libnccl.so.2 => not found 2025-05-07T20:10:49.2587935Z libcuda.so.1 => not found 2025-05-07T20:10:49.2588032Z libnvidia-ml.so.1 => not found 2025-05-07T20:10:49.2588142Z libtorch_cpu.so => not found 2025-05-07T20:10:49.2588239Z libtorch_cuda.so => not found 2025-05-07T20:10:49.2588331Z libcudart.so.12 => not found 2025-05-07T20:10:49.2588434Z libtorch.so => not found 2025-05-07T20:10:49.2588520Z libc10.so => not found 2025-05-07T20:10:49.2588611Z libnvrtc.so.12 => not found 2025-05-07T20:10:49.2588704Z libc10_cuda.so => not found 2025-05-07T20:10:49.2588806Z libnccl.so.2 => not found 2025-05-07T20:10:49.2588896Z libcuda.so.1 => not found 2025-05-07T20:10:49.2588993Z libnvidia-ml.so.1 => not found 2025-05-07T20:10:49.2589098Z libtorch_cpu.so => not found 2025-05-07T20:10:49.2589195Z libtorch_cuda.so => not found 2025-05-07T20:10:49.2589287Z libcudart.so.12 => not found 2025-05-07T20:10:49.2589373Z libc10.so => not found 2025-05-07T20:10:49.2589479Z libnvrtc.so.12 => not found 2025-05-07T20:10:49.2589569Z libc10_cuda.so => not found 2025-05-07T20:10:49.2589660Z libnccl.so.2 => not found 2025-05-07T20:10:49.2589765Z libcuda.so.1 => not found 2025-05-07T20:10:49.2589863Z libnvidia-ml.so.1 => not found 2025-05-07T20:10:49.2589978Z libtorch.so => not found 2025-05-07T20:10:49.2590074Z libtorch_cpu.so => not found 2025-05-07T20:10:49.2590180Z libtorch_cuda.so => not found 2025-05-07T20:10:49.2590275Z libcudart.so.12 => not found 2025-05-07T20:10:49.2590368Z libtorch.so => not found 2025-05-07T20:10:49.2590471Z libc10.so => not found 2025-05-07T20:10:49.2590612Z libnvrtc.so.12 => not found 2025-05-07T20:10:49.2590707Z libc10_cuda.so => not found 2025-05-07T20:10:49.2590799Z libnccl.so.2 => not found 2025-05-07T20:10:49.2590901Z libcuda.so.1 => not found 2025-05-07T20:10:49.2591001Z libnvidia-ml.so.1 => not found 2025-05-07T20:10:49.2591097Z libtorch_cpu.so => not found 2025-05-07T20:10:49.2591261Z libtorch_cuda.so => not found 2025-05-07T20:10:49.2591371Z libcudart.so.12 => not found 2025-05-07T20:10:49.2591463Z libtorch.so => not found 2025-05-07T20:10:49.2591550Z libc10.so => not found 2025-05-07T20:10:49.2591825Z libnvrtc.so.12 => not found 2025-05-07T20:10:49.2591919Z libc10_cuda.so => not found 2025-05-07T20:10:49.2592015Z libnccl.so.2 => not found 2025-05-07T20:10:49.2592107Z libcuda.so.1 => not found 2025-05-07T20:10:49.2592221Z libnvidia-ml.so.1 => not found 2025-05-07T20:10:49.2592317Z libtorch_cpu.so => not found 2025-05-07T20:10:49.2592480Z libtorch_cuda.so => not found 2025-05-07T20:10:49.2592637Z librt.so.1 => /lib64/librt.so.1 (0x00007f2ab8dfb000) 2025-05-07T20:10:49.2592819Z libpthread.so.0 => /lib64/libpthread.so.0 (0x00007f2ab8df6000) 2025-05-07T20:10:49.2592824Z 2025-05-07T20:10:49.2592937Z [CHECK] Displaying ELF information: 2025-05-07T20:10:49.2593157Z + readelf -d ./_skbuild/linux-x86_64-3.12/cmake-build/fbgemm_gpu_py.so 2025-05-07T20:10:49.2593162Z 2025-05-07T20:10:49.2593167Z 2025-05-07T20:10:49.2593332Z Dynamic section at offset 0x48e4fa8 contains 47 entries: 2025-05-07T20:10:49.2593452Z Tag Type Name/Value 2025-05-07T20:10:49.2593663Z 0x0000000000000001 (NEEDED) Shared library: [libc10.so] 2025-05-07T20:10:49.2593903Z 0x0000000000000001 (NEEDED) Shared library: [libnvrtc.so.12] 2025-05-07T20:10:49.2594104Z 0x0000000000000001 (NEEDED) Shared library: [libc10_cuda.so] 2025-05-07T20:10:49.2594305Z 0x0000000000000001 (NEEDED) Shared library: [libnccl.so.2] 2025-05-07T20:10:49.2594521Z 0x0000000000000001 (NEEDED) Shared library: [libcuda.so.1] 2025-05-07T20:10:49.2594714Z 0x0000000000000001 (NEEDED) Shared library: [fbgemm.so] 2025-05-07T20:10:49.2594978Z 0x0000000000000001 (NEEDED) Shared library: [fbgemm_gpu_embedding_inplace_ops.so] 2025-05-07T20:10:49.2596294Z 0x0000000000000001 (NEEDED) Shared library: [fbgemm_gpu_tbe_index_select.so] 2025-05-07T20:10:49.2596518Z 0x0000000000000001 (NEEDED) Shared library: [fbgemm_gpu_tbe_cache.so] 2025-05-07T20:10:49.2596754Z 0x0000000000000001 (NEEDED) Shared library: [fbgemm_gpu_tbe_optimizers.so] 2025-05-07T20:10:49.2596986Z 0x0000000000000001 (NEEDED) Shared library: [libnvidia-ml.so.1] 2025-05-07T20:10:49.2597187Z 0x0000000000000001 (NEEDED) Shared library: [libtorch.so] 2025-05-07T20:10:49.2597440Z 0x0000000000000001 (NEEDED) Shared library: [fbgemm_gpu_sparse_async_cumsum.so] 2025-05-07T20:10:49.2597680Z 0x0000000000000001 (NEEDED) Shared library: [fbgemm_gpu_tbe_utils.so] 2025-05-07T20:10:49.2597887Z 0x0000000000000001 (NEEDED) Shared library: [libtorch_cpu.so] 2025-05-07T20:10:49.2598095Z 0x0000000000000001 (NEEDED) Shared library: [libtorch_cuda.so] 2025-05-07T20:10:49.2598318Z 0x0000000000000001 (NEEDED) Shared library: [libcudart.so.12] 2025-05-07T20:10:49.2598521Z 0x0000000000000001 (NEEDED) Shared library: [libstdc++.so.6] 2025-05-07T20:10:49.2598714Z 0x0000000000000001 (NEEDED) Shared library: [libm.so.6] 2025-05-07T20:10:49.2598912Z 0x0000000000000001 (NEEDED) Shared library: [libgcc_s.so.1] 2025-05-07T20:10:49.2599152Z 0x0000000000000001 (NEEDED) Shared library: [libc.so.6] 2025-05-07T20:10:49.2599371Z 0x0000000000000001 (NEEDED) Shared library: [ld-linux-x86-64.so.2] 2025-05-07T20:10:49.2599580Z 0x000000000000000e (SONAME) Library soname: [fbgemm_gpu_py.so] 2025-05-07T20:10:49.2599808Z 0x000000000000000f (RPATH) Library rpath: [$ORIGIN] 2025-05-07T20:10:49.2599931Z 0x000000000000000c (INIT) 0x1bb000 2025-05-07T20:10:49.2600047Z 0x000000000000000d (FINI) 0x75816c 2025-05-07T20:10:49.2600186Z 0x0000000000000019 (INIT_ARRAY) 0x48d6858 2025-05-07T20:10:49.2600326Z 0x000000000000001b (INIT_ARRAYSZ) 1160 (bytes) 2025-05-07T20:10:49.2600449Z 0x000000000000001a (FINI_ARRAY) 0x48d6ce0 2025-05-07T20:10:49.2600574Z 0x000000000000001c (FINI_ARRAYSZ) 8 (bytes) 2025-05-07T20:10:49.2600707Z 0x000000006ffffef5 (GNU_HASH) 0x238 2025-05-07T20:10:49.2600826Z 0x0000000000000005 (STRTAB) 0x33248 2025-05-07T20:10:49.2600943Z 0x0000000000000006 (SYMTAB) 0xc2f0 2025-05-07T20:10:49.2614991Z 0x000000000000000a (STRSZ) 1276767 (bytes) 2025-05-07T20:10:49.2615206Z 0x000000000000000b (SYMENT) 24 (bytes) 2025-05-07T20:10:49.2615337Z 0x0000000000000003 (PLTGOT) 0x48eafe8 2025-05-07T20:10:49.2615488Z 0x0000000000000002 (PLTRELSZ) 68808 (bytes) 2025-05-07T20:10:49.2615608Z 0x0000000000000014 (PLTREL) RELA 2025-05-07T20:10:49.2615720Z 0x0000000000000017 (JMPREL) 0x1a9648 2025-05-07T20:10:49.2615831Z 0x0000000000000007 (RELA) 0x16e320 2025-05-07T20:10:49.2615969Z 0x0000000000000008 (RELASZ) 242472 (bytes) 2025-05-07T20:10:49.2616095Z 0x0000000000000009 (RELAENT) 24 (bytes) 2025-05-07T20:10:49.2616212Z 0x000000006ffffffe (VERNEED) 0x16e1a0 2025-05-07T20:10:49.2616320Z 0x000000006fffffff (VERNEEDNUM) 6 2025-05-07T20:10:49.2616529Z 0x000000006ffffff0 (VERSYM) 0x16ada8 2025-05-07T20:10:49.2616643Z 0x000000006ffffff9 (RELACOUNT) 2870 2025-05-07T20:10:49.2616742Z 0x0000000000000000 (NULL) 0x0 2025-05-07T20:10:49.2616749Z 2025-05-07T20:10:49.2616875Z ################################################################################ 2025-05-07T20:10:49.2616881Z 2025-05-07T20:10:49.2616885Z 2025-05-07T20:10:49.2616998Z ################################################################################ 2025-05-07T20:10:49.2617311Z [CHECK] BUILT LIBRARY: ./_skbuild/linux-x86_64-3.12/cmake-build/fbgemm_gpu_tbe_training_backward.so 2025-05-07T20:10:49.2617455Z [CHECK] Listing out library size: 2025-05-07T20:10:49.2617752Z + du -h --block-size=1M ./_skbuild/linux-x86_64-3.12/cmake-build/fbgemm_gpu_tbe_training_backward.so 2025-05-07T20:10:49.2617757Z 2025-05-07T20:10:49.2617990Z 904 ./_skbuild/linux-x86_64-3.12/cmake-build/fbgemm_gpu_tbe_training_backward.so 2025-05-07T20:10:49.2617997Z 2025-05-07T20:10:49.2618431Z [CHECK] Listing out the GLIBC versions referenced by: ./_skbuild/linux-x86_64-3.12/cmake-build/fbgemm_gpu_tbe_training_backward.so 2025-05-07T20:10:49.2618948Z + objdump -TC ./_skbuild/linux-x86_64-3.12/cmake-build/fbgemm_gpu_tbe_training_backward.so | grep GLIBC_ | sed 's/.*GLIBC_\([.0-9]*\).*/GLIBC_\1/g' | sort -Vu | cat 2025-05-07T20:10:49.2618953Z 2025-05-07T20:10:49.4389512Z GLIBC_2.2.5 2025-05-07T20:10:49.4389802Z GLIBC_2.3 2025-05-07T20:10:49.4390239Z GLIBC_2.14 2025-05-07T20:10:49.4390294Z 2025-05-07T20:10:49.4390312Z 2025-05-07T20:10:49.4392076Z [CHECK] Listing out the GLIBCXX versions referenced by: ./_skbuild/linux-x86_64-3.12/cmake-build/fbgemm_gpu_tbe_training_backward.so 2025-05-07T20:10:49.4393802Z + objdump -TC ./_skbuild/linux-x86_64-3.12/cmake-build/fbgemm_gpu_tbe_training_backward.so | grep GLIBCXX_ | sed 's/.*GLIBCXX_\([.0-9]*\).*/GLIBCXX_\1/g' | sort -Vu | cat 2025-05-07T20:10:49.4393820Z 2025-05-07T20:10:49.6328649Z GLIBCXX_3.4 2025-05-07T20:10:49.6329708Z GLIBCXX_3.4.9 2025-05-07T20:10:49.6329983Z GLIBCXX_3.4.11 2025-05-07T20:10:49.6330444Z GLIBCXX_3.4.14 2025-05-07T20:10:49.6330675Z GLIBCXX_3.4.15 2025-05-07T20:10:49.6330905Z GLIBCXX_3.4.18 2025-05-07T20:10:49.6331117Z GLIBCXX_3.4.20 2025-05-07T20:10:49.6331351Z GLIBCXX_3.4.21 2025-05-07T20:10:49.6331485Z 2025-05-07T20:10:49.6331503Z 2025-05-07T20:10:49.6351169Z + nm -gDC ./_skbuild/linux-x86_64-3.12/cmake-build/fbgemm_gpu_tbe_training_backward.so > /tmp/tmp.OS1CN9d22H.symbols.txt 2025-05-07T20:10:49.6353002Z 2025-05-07T20:10:49.8277256Z 2025-05-07T20:10:49.8354269Z [CHECK] Total Number of symbols: 12682 2025-05-07T20:10:49.8447074Z [CHECK] Number of fbgemm symbols: 2318 2025-05-07T20:10:49.8464292Z + nm -gDCu ./_skbuild/linux-x86_64-3.12/cmake-build/fbgemm_gpu_tbe_training_backward.so > /tmp/tmp.tLPVFCLWlv.usymbols.txt 2025-05-07T20:10:49.8465126Z 2025-05-07T20:10:49.8523456Z 2025-05-07T20:10:49.8550598Z [CHECK] Listing out undefined symbols (273 total): 2025-05-07T20:10:49.8568873Z U VTT for std::__cxx11::basic_ostringstream, std::allocator >@GLIBCXX_3.4.21 2025-05-07T20:10:49.8571280Z U VTT for std::__cxx11::basic_stringstream, std::allocator >@GLIBCXX_3.4.21 2025-05-07T20:10:49.8572887Z U _Unwind_Resume@GCC_3.0 2025-05-07T20:10:49.8573933Z U __cudaPopCallConfiguration@libcudart.so.12 2025-05-07T20:10:49.8575085Z U __cudaPushCallConfiguration@libcudart.so.12 2025-05-07T20:10:49.8576215Z U __cudaRegisterFatBinary@libcudart.so.12 2025-05-07T20:10:49.8577309Z U __cudaRegisterFatBinaryEnd@libcudart.so.12 2025-05-07T20:10:49.8578424Z U __cudaRegisterFunction@libcudart.so.12 2025-05-07T20:10:49.8579490Z U __cudaRegisterVar@libcudart.so.12 2025-05-07T20:10:49.8580574Z U __cudaUnregisterFatBinary@libcudart.so.12 2025-05-07T20:10:49.8581033Z U __cxa_allocate_exception@CXXABI_1.3 2025-05-07T20:10:49.8581559Z U __cxa_atexit@GLIBC_2.2.5 2025-05-07T20:10:49.8581886Z U __cxa_begin_catch@CXXABI_1.3 2025-05-07T20:10:49.8582195Z U __cxa_end_catch@CXXABI_1.3 2025-05-07T20:10:49.8582521Z U __cxa_free_exception@CXXABI_1.3 2025-05-07T20:10:49.8582839Z U __cxa_guard_abort@CXXABI_1.3 2025-05-07T20:10:49.8583169Z U __cxa_guard_acquire@CXXABI_1.3 2025-05-07T20:10:49.8583492Z U __cxa_guard_release@CXXABI_1.3 2025-05-07T20:10:49.8583790Z U __cxa_rethrow@CXXABI_1.3 2025-05-07T20:10:49.8584166Z U __cxa_thread_atexit@CXXABI_1.3.7 2025-05-07T20:10:49.8584469Z U __cxa_throw@CXXABI_1.3 2025-05-07T20:10:49.8584786Z U __gxx_personality_v0@CXXABI_1.3 2025-05-07T20:10:49.8585089Z U __tls_get_addr@GLIBC_2.3 2025-05-07T20:10:49.8585424Z U at::CUDAGeneratorImpl::device_type() 2025-05-07T20:10:49.8585836Z U at::CUDAGeneratorImpl::philox_cuda_state(unsigned long) 2025-05-07T20:10:49.8586237Z U at::Context::deterministicAlgorithms() const 2025-05-07T20:10:49.8586651Z U at::RecordFunction::RecordFunction(at::StepCallbacks&&) 2025-05-07T20:10:49.8587043Z U at::RecordFunction::currentThreadId() 2025-05-07T20:10:49.8587401Z U at::RecordFunction::~RecordFunction() 2025-05-07T20:10:49.8587758Z U at::SavedTensorDefaultHooks::set_tracing(bool) 2025-05-07T20:10:49.8588382Z U at::TensorBase::__dispatch_contiguous(c10::MemoryFormat) const 2025-05-07T20:10:49.8589015Z U at::_ops::clamp::call(at::Tensor const&, std::optional const&, std::optional const&) 2025-05-07T20:10:49.8589601Z U at::_ops::copy_::call(at::Tensor&, at::Tensor const&, bool) 2025-05-07T20:10:49.8590524Z U at::_ops::empty_like::call(at::Tensor const&, std::optional, std::optional, std::optional, std::optional, std::optional) 2025-05-07T20:10:49.8592193Z U at::_ops::empty_memory_format::call(c10::ArrayRef, std::optional, std::optional, std::optional, std::optional, std::optional) 2025-05-07T20:10:49.8594443Z U at::_ops::reshape::call(at::Tensor const&, c10::ArrayRef) 2025-05-07T20:10:49.8595547Z U at::_ops::sparse_coo_tensor_indices_size::call(at::Tensor const&, at::Tensor const&, c10::ArrayRef, std::optional, std::optional, std::optional, std::optional, std::optional) 2025-05-07T20:10:49.8596747Z U at::_ops::to_dtype::call(at::Tensor const&, c10::ScalarType, bool, bool, std::optional) 2025-05-07T20:10:49.8597319Z U at::_ops::unsqueeze::call(at::Tensor const&, long) 2025-05-07T20:10:49.8597886Z U at::_ops::view_as::call(at::Tensor const&, at::Tensor const&) 2025-05-07T20:10:49.8598615Z U at::_ops::zeros::call(c10::ArrayRef, std::optional, std::optional, std::optional, std::optional) 2025-05-07T20:10:49.8599713Z U at::_ops::zeros_like::call(at::Tensor const&, std::optional, std::optional, std::optional, std::optional, std::optional) 2025-05-07T20:10:49.8600525Z U at::cuda::detail::getDefaultCUDAGenerator(signed char) 2025-05-07T20:10:49.8600938Z U at::cuda::getCurrentDeviceProperties() 2025-05-07T20:10:49.8601293Z U at::functorch::functorchTLSAccessor() 2025-05-07T20:10:49.8601684Z U at::getStepCallbacksUnlessEmpty(at::RecordScope) 2025-05-07T20:10:49.8602038Z U at::get_thread_num() 2025-05-07T20:10:49.8602382Z U at::globalContext() 2025-05-07T20:10:49.8602678Z U at::internal::set_thread_num(int) 2025-05-07T20:10:49.8603024Z U at::sequence_number::get_and_increment() 2025-05-07T20:10:49.8603438Z U at::tensor(c10::ArrayRef, c10::TensorOptions const&) 2025-05-07T20:10:49.8603846Z U at::toAccumulateType(c10::ScalarType, bool) 2025-05-07T20:10:49.8604194Z U bcmp@GLIBC_2.2.5 2025-05-07T20:10:49.8604473Z U c10::AnyType::get() 2025-05-07T20:10:49.8604865Z U c10::BFloat16* at::TensorBase::mutable_data_ptr() const 2025-05-07T20:10:49.8605298Z U c10::BoolType::get() 2025-05-07T20:10:49.8605645Z U c10::DeviceTypeName[abi:cxx11](c10::DeviceType, bool) 2025-05-07T20:10:49.8606280Z U c10::Dispatcher::findSchemaOrThrow(char const*, char const*) 2025-05-07T20:10:49.8606691Z U c10::Dispatcher::realSingleton() 2025-05-07T20:10:49.8607451Z U c10::Dispatcher::runRecordFunction(at::RecordFunction&, std::reference_wrapper, c10::DispatchKey, c10::DispatchKeySet) 2025-05-07T20:10:49.8608707Z U c10::Dispatcher::runRecordFunction(at::RecordFunction&, std::reference_wrapper, c10::DispatchKey, c10::DispatchKeySet, c10::ArrayRef) 2025-05-07T20:10:49.8609823Z U c10::Error::Error(c10::SourceLocation, std::__cxx11::basic_string, std::allocator >) 2025-05-07T20:10:49.8610610Z U c10::Error::what() const 2025-05-07T20:10:49.8611046Z U c10::FloatType::get() 2025-05-07T20:10:49.8611366Z U c10::GeneratorImpl::device() const 2025-05-07T20:10:49.8611709Z U c10::GradMode::is_enabled() 2025-05-07T20:10:49.8612022Z U c10::GradMode::set_enabled(bool) 2025-05-07T20:10:49.8612540Z U c10::Half* at::TensorBase::data_ptr() const 2025-05-07T20:10:49.8612949Z U c10::Half* at::TensorBase::mutable_data_ptr() const 2025-05-07T20:10:49.8613385Z U c10::IValue::TagType::get(c10::IValue const&) 2025-05-07T20:10:49.8613763Z U c10::IValue::hash(c10::IValue const&) 2025-05-07T20:10:49.8614109Z U c10::IValue::isBoolList() const 2025-05-07T20:10:49.8614434Z U c10::IValue::isIntList() const 2025-05-07T20:10:49.8614747Z U c10::IValue::isSymIntList() const 2025-05-07T20:10:49.8615073Z U c10::IValue::isTensorList() const 2025-05-07T20:10:49.8615428Z U c10::IValue::reportToTensorTypeError() const 2025-05-07T20:10:49.8615758Z U c10::IntType::get() 2025-05-07T20:10:49.8616115Z U c10::MessageLogger::MessageLogger(char const*, int, int) 2025-05-07T20:10:49.8616494Z U c10::MessageLogger::~MessageLogger() 2025-05-07T20:10:49.8616845Z U c10::OperatorHandle::~OperatorHandle() 2025-05-07T20:10:49.8617186Z U c10::OperatorHandle::~OperatorHandle() 2025-05-07T20:10:49.8617621Z U c10::OptionalType::get(c10::Type::SingletonOrSharedTypePtr) 2025-05-07T20:10:49.8618068Z U c10::ParallelGuard::ParallelGuard(bool) 2025-05-07T20:10:49.8618404Z U c10::ParallelGuard::~ParallelGuard() 2025-05-07T20:10:49.8618891Z U c10::SmallVectorBase::grow_pod(void const*, unsigned long, unsigned long) 2025-05-07T20:10:49.8619409Z U c10::StorageImpl::throw_data_ptr_access_error() const 2025-05-07T20:10:49.8619963Z U c10::StringType::get() 2025-05-07T20:10:49.8620322Z U c10::SymBool::guard_bool(char const*, long) const 2025-05-07T20:10:49.8620718Z U c10::SymFloat::guard_float(char const*, long) const 2025-05-07T20:10:49.8621490Z U c10::SymInt::SymInt(c10::intrusive_ptr >) 2025-05-07T20:10:49.8622330Z U c10::SymInt::guard_int(char const*, long) const 2025-05-07T20:10:49.8622724Z U c10::SymInt::promote_to_negative() 2025-05-07T20:10:49.8623085Z U c10::SymInt::toSymNode() const 2025-05-07T20:10:49.8623415Z U c10::SymIntType::get() 2025-05-07T20:10:49.8623793Z U c10::SymbolicShapeMeta::init_is_contiguous() const 2025-05-07T20:10:49.8624181Z U c10::TensorImpl::requires_grad() const 2025-05-07T20:10:49.8624615Z U c10::TensorImpl::throw_data_ptr_access_error() const 2025-05-07T20:10:49.8624989Z U c10::TensorType::get() 2025-05-07T20:10:49.8625330Z U c10::UndefinedTensorImpl::_singleton 2025-05-07T20:10:49.8626311Z U c10::Warning::Warning(std::variant, c10::SourceLocation const&, std::__cxx11::basic_string, std::allocator >, bool) 2025-05-07T20:10:49.8627289Z U c10::cuda::CUDACachingAllocator::allocator 2025-05-07T20:10:49.8627674Z U c10::cuda::CUDAStream::stream() const 2025-05-07T20:10:49.8628030Z U c10::cuda::ExchangeDevice(signed char) 2025-05-07T20:10:49.8628388Z U c10::cuda::GetDevice(signed char*) 2025-05-07T20:10:49.8628744Z U c10::cuda::MaybeSetDevice(signed char) 2025-05-07T20:10:49.8629091Z U c10::cuda::SetDevice(signed char) 2025-05-07T20:10:49.8629580Z U c10::cuda::c10_cuda_check_implementation(int, char const*, char const*, int, bool) 2025-05-07T20:10:49.8630056Z U c10::cuda::device_count() 2025-05-07T20:10:49.8630421Z U c10::cuda::getCurrentCUDAStream(signed char) 2025-05-07T20:10:49.8630827Z U c10::cuda::getDefaultCUDAStream(signed char) 2025-05-07T20:10:49.8631369Z U c10::cuda::getStreamFromPool(bool, signed char) 2025-05-07T20:10:49.8631777Z U c10::cuda::getStreamFromPool(int, signed char) 2025-05-07T20:10:49.8632191Z U c10::cuda::setCurrentCUDAStream(c10::cuda::CUDAStream) 2025-05-07T20:10:49.8632630Z U c10::cuda::warn_or_error_on_sync() 2025-05-07T20:10:49.8633310Z U c10::detail::ListImpl::ListImpl(std::vector >, c10::Type::SingletonOrSharedTypePtr) 2025-05-07T20:10:49.8634384Z U c10::detail::infer_schema::make_function_schema(c10::ArrayRef, c10::ArrayRef) 2025-05-07T20:10:49.8635301Z U c10::detail::torchCheckFail(char const*, char const*, unsigned int, char const*) 2025-05-07T20:10:49.8636197Z U c10::detail::torchCheckFail(char const*, char const*, unsigned int, std::__cxx11::basic_string, std::allocator > const&) 2025-05-07T20:10:49.8637154Z U c10::detail::torchInternalAssertFail(char const*, char const*, unsigned int, char const*, char const*) 2025-05-07T20:10:49.8638214Z U c10::detail::torchInternalAssertFail(char const*, char const*, unsigned int, char const*, std::__cxx11::basic_string, std::allocator > const&) 2025-05-07T20:10:49.8639038Z U c10::impl::GPUTrace::gpuTraceState 2025-05-07T20:10:49.8639391Z U c10::impl::GPUTrace::haveState 2025-05-07T20:10:49.8639955Z U c10::impl::OperatorEntry::assertSignatureIsCorrect(c10::impl::CppSignature const&, bool) const 2025-05-07T20:10:49.8640588Z U c10::impl::OperatorEntry::reportError(c10::DispatchKey) const 2025-05-07T20:10:49.8641059Z U c10::impl::cow::is_cow_data_ptr(c10::DataPtr const&) 2025-05-07T20:10:49.8641547Z U c10::impl::cow::materialize_cow_storage(c10::StorageImpl&) 2025-05-07T20:10:49.8641958Z U c10::impl::device_guard_impl_registry 2025-05-07T20:10:49.8642303Z U c10::impl::raw_local_dispatch_key_set 2025-05-07T20:10:49.8642692Z U c10::initializeFunctionalityOffsetsAndMasks() 2025-05-07T20:10:49.8643330Z U c10::ivalue::ConstantString::create(std::__cxx11::basic_string, std::allocator >) 2025-05-07T20:10:49.8644060Z U c10::operator-(c10::SymInt const&, int) 2025-05-07T20:10:49.8644451Z U c10::operator<<(std::ostream&, c10::Device const&) 2025-05-07T20:10:49.8644809Z U c10::operator<<(std::ostream&, c10::DeviceType) 2025-05-07T20:10:49.8645197Z U c10::operator<<(std::ostream&, c10::OperatorName const&) 2025-05-07T20:10:49.8645587Z U c10::operator<<(std::ostream&, c10::SymInt const&) 2025-05-07T20:10:49.8645956Z U c10::operator<=(c10::SymInt const&, int) 2025-05-07T20:10:49.8646299Z U c10::operator>(c10::SymInt const&, int) 2025-05-07T20:10:49.8646632Z U c10::operator>=(c10::SymInt const&, int) 2025-05-07T20:10:49.8646976Z U c10::throwNullDataPtrError() 2025-05-07T20:10:49.8647275Z U c10::warn(c10::Warning const&) 2025-05-07T20:10:49.8647597Z U c10::warnDeprecatedDataPtr() 2025-05-07T20:10:49.8647981Z U caffe2::TypeMeta::error_unsupported_typemeta(caffe2::TypeMeta) 2025-05-07T20:10:49.8648397Z U caffe2::TypeMeta::typeMetaDatas() 2025-05-07T20:10:49.8648752Z U cudaDeviceGetAttribute@libcudart.so.12 2025-05-07T20:10:49.8649091Z U cudaDeviceSynchronize@libcudart.so.12 2025-05-07T20:10:49.8649447Z U cudaEventCreateWithFlags@libcudart.so.12 2025-05-07T20:10:49.8649775Z U cudaEventDestroy@libcudart.so.12 2025-05-07T20:10:49.8650146Z U cudaEventElapsedTime@libcudart.so.12 2025-05-07T20:10:49.8650469Z U cudaEventQuery@libcudart.so.12 2025-05-07T20:10:49.8650794Z U cudaEventRecord@libcudart.so.12 2025-05-07T20:10:49.8651111Z U cudaEventSynchronize@libcudart.so.12 2025-05-07T20:10:49.8651698Z U cudaFuncSetAttribute@libcudart.so.12 2025-05-07T20:10:49.8652072Z U cudaGetDeviceProperties_v2@libcudart.so.12 2025-05-07T20:10:49.8652467Z U cudaGetErrorString@libcudart.so.12 2025-05-07T20:10:49.8652806Z U cudaGetLastError@libcudart.so.12 2025-05-07T20:10:49.8653136Z U cudaLaunchKernel@libcudart.so.12 2025-05-07T20:10:49.8653475Z U cudaStreamQuery@libcudart.so.12 2025-05-07T20:10:49.8653812Z U cudaStreamSynchronize@libcudart.so.12 2025-05-07T20:10:49.8654164Z U cudaStreamWaitEvent@libcudart.so.12 2025-05-07T20:10:49.8655149Z U fbgemm::EmbeddingSpMDMKernelSignature::Type fbgemm::GenerateEmbeddingSpMDMWithStrides(long, bool, bool, int, bool, bool, long, long, bool, bool, bool, bool) 2025-05-07T20:10:49.8656343Z U fbgemm::SparseAdaGradSignature::Type fbgemm::GenerateSparseAdaGrad(int, bool, int, bool) 2025-05-07T20:10:49.8656910Z U fbgemm::fbgemmAlignedFree(void*) 2025-05-07T20:10:49.8657332Z U fbgemm_gpu::asynchronous_complete_cumsum_gpu(at::Tensor const&) 2025-05-07T20:10:49.8657751Z U float at::Tensor::item() const 2025-05-07T20:10:49.8658118Z U float* at::TensorBase::data_ptr() const 2025-05-07T20:10:49.8658516Z U float* at::TensorBase::mutable_data_ptr() const 2025-05-07T20:10:49.8658875Z U free@GLIBC_2.2.5 2025-05-07T20:10:49.8659182Z U int* at::TensorBase::data_ptr() const 2025-05-07T20:10:49.8659650Z U int* at::TensorBase::mutable_data_ptr() const 2025-05-07T20:10:49.8660266Z U long c10::detail::maybe_wrap_dim_slow(long, long, bool) 2025-05-07T20:10:49.8660689Z U long* at::TensorBase::data_ptr() const 2025-05-07T20:10:49.8661090Z U long* at::TensorBase::mutable_data_ptr() const 2025-05-07T20:10:49.8661460Z U memcmp@GLIBC_2.2.5 2025-05-07T20:10:49.8661757Z U memcpy@GLIBC_2.14 2025-05-07T20:10:49.8662047Z U memmove@GLIBC_2.2.5 2025-05-07T20:10:49.8662344Z U memset@GLIBC_2.2.5 2025-05-07T20:10:49.8663002Z U operator delete(void*)@GLIBCXX_3.4 2025-05-07T20:10:49.8663348Z U operator new(unsigned long)@GLIBCXX_3.4 2025-05-07T20:10:49.8663924Z U radix_sort_pairs(void*, unsigned long&, int const*, int*, float const*, float*, int, int, int, CUstream_st*) 2025-05-07T20:10:49.8664876Z U radix_sort_pairs(void*, unsigned long&, int const*, int*, int const*, int*, int, int, int, CUstream_st*) 2025-05-07T20:10:49.8665634Z U radix_sort_pairs(void*, unsigned long&, int const*, int*, long const*, long*, int, int, int, CUstream_st*) 2025-05-07T20:10:49.8666418Z U radix_sort_pairs(void*, unsigned long&, long const*, long*, float const*, float*, int, int, int, CUstream_st*) 2025-05-07T20:10:49.8667188Z U radix_sort_pairs(void*, unsigned long&, long const*, long*, int const*, int*, int, int, int, CUstream_st*) 2025-05-07T20:10:49.8667958Z U radix_sort_pairs(void*, unsigned long&, long const*, long*, long const*, long*, int, int, int, CUstream_st*) 2025-05-07T20:10:49.8668502Z U realloc@GLIBC_2.2.5 2025-05-07T20:10:49.8669165Z U split_embedding_codegen_forward_cpu(at::Tensor, at::Tensor, at::Tensor, c10::SymInt, at::Tensor, at::Tensor, at::Tensor, long, at::Tensor, long) 2025-05-07T20:10:49.8670226Z U split_embedding_codegen_grad_indice_weights_cpu(at::Tensor, at::Tensor, at::Tensor, at::Tensor, at::Tensor, at::Tensor, at::Tensor) 2025-05-07T20:10:49.8670854Z U sqrt@GLIBC_2.2.5 2025-05-07T20:10:49.8671150Z U sqrtf@GLIBC_2.2.5 2025-05-07T20:10:49.8671731Z U std::_Hash_bytes(void const*, unsigned long, unsigned long)@CXXABI_1.3.5 2025-05-07T20:10:49.8672425Z U std::__cxx11::basic_ostringstream, std::allocator >::basic_ostringstream() 2025-05-07T20:10:49.8673289Z U std::__cxx11::basic_ostringstream, std::allocator >::~basic_ostringstream()@GLIBCXX_3.4.21 2025-05-07T20:10:49.8674129Z U std::__cxx11::basic_stringstream, std::allocator >::basic_stringstream() 2025-05-07T20:10:49.8674964Z U std::__detail::_Prime_rehash_policy::_M_need_rehash(unsigned long, unsigned long, unsigned long) const@GLIBCXX_3.4.18 2025-05-07T20:10:49.8675592Z U std::__throw_bad_alloc()@GLIBCXX_3.4 2025-05-07T20:10:49.8675937Z U std::__throw_bad_array_new_length() 2025-05-07T20:10:49.8676316Z U std::__throw_bad_function_call()@GLIBCXX_3.4.14 2025-05-07T20:10:49.8676707Z U std::__throw_length_error(char const*)@GLIBCXX_3.4 2025-05-07T20:10:49.8677114Z U std::__throw_logic_error(char const*)@GLIBCXX_3.4 2025-05-07T20:10:49.8677550Z U std::__throw_out_of_range_fmt(char const*, ...)@GLIBCXX_3.4.20 2025-05-07T20:10:49.8677968Z U std::__throw_system_error(int)@GLIBCXX_3.4.11 2025-05-07T20:10:49.8678367Z U std::bad_weak_ptr::~bad_weak_ptr()@GLIBCXX_3.4.15 2025-05-07T20:10:49.8678858Z U std::basic_ios >::clear(std::_Ios_Iostate)@GLIBCXX_3.4 2025-05-07T20:10:49.8679816Z U std::basic_ostream >& std::__ostream_insert >(std::basic_ostream >&, char const*, long)@GLIBCXX_3.4.9 2025-05-07T20:10:49.8680697Z U std::exception::~exception()@GLIBCXX_3.4 2025-05-07T20:10:49.8681055Z U std::ios_base::Init::Init()@GLIBCXX_3.4 2025-05-07T20:10:49.8681422Z U std::ios_base::Init::~Init()@GLIBCXX_3.4 2025-05-07T20:10:49.8681766Z U std::ios_base::~ios_base()@GLIBCXX_3.4 2025-05-07T20:10:49.8682121Z U std::locale::~locale()@GLIBCXX_3.4 2025-05-07T20:10:49.8682536Z U std::ostream& std::ostream::_M_insert(long)@GLIBCXX_3.4.9 2025-05-07T20:10:49.8683121Z U std::ostream& std::ostream::_M_insert(unsigned long)@GLIBCXX_3.4.9 2025-05-07T20:10:49.8683726Z U std::ostream::operator<<(int)@GLIBCXX_3.4 2025-05-07T20:10:49.8684121Z U std::out_of_range::out_of_range(char const*)@GLIBCXX_3.4.21 2025-05-07T20:10:49.8684543Z U std::out_of_range::~out_of_range()@GLIBCXX_3.4 2025-05-07T20:10:49.8685313Z U std::runtime_error::runtime_error(std::__cxx11::basic_string, std::allocator > const&)@GLIBCXX_3.4.21 2025-05-07T20:10:49.8685952Z U std::runtime_error::~runtime_error()@GLIBCXX_3.4 2025-05-07T20:10:49.8686294Z U std::terminate()@GLIBCXX_3.4 2025-05-07T20:10:49.8686585Z U strcmp@GLIBC_2.2.5 2025-05-07T20:10:49.8686856Z U strlen@GLIBC_2.2.5 2025-05-07T20:10:49.8687148Z U torch::CppFunction::~CppFunction() 2025-05-07T20:10:49.8688130Z U torch::Library::Library(torch::Library::Kind, std::__cxx11::basic_string, std::allocator >, std::optional, char const*, unsigned int) 2025-05-07T20:10:49.8689316Z U torch::Library::_def(c10::FunctionSchema&&, c10::OperatorName*, std::vector > const&, torch::_RegisterOrVerify) & 2025-05-07T20:10:49.8690342Z U torch::Library::_impl(char const*, torch::CppFunction&&, torch::_RegisterOrVerify) & 2025-05-07T20:10:49.8690856Z U torch::autograd::AnomalyMode::_enabled 2025-05-07T20:10:49.8691450Z U torch::autograd::AutogradContext::AutogradContext(torch::dynamo::autograd::PackedArgs&) 2025-05-07T20:10:49.8692049Z U torch::autograd::AutogradContext::get_and_bump_dirty() const 2025-05-07T20:10:49.8692575Z U torch::autograd::AutogradContext::get_non_differentiable() const 2025-05-07T20:10:49.8693086Z U torch::autograd::AutogradContext::get_saved_variables() const 2025-05-07T20:10:49.8693756Z U torch::autograd::AutogradContext::save_for_backward(std::vector >) 2025-05-07T20:10:49.8694388Z U torch::autograd::AutogradContext::save_variables() 2025-05-07T20:10:49.8694844Z U torch::autograd::ForwardADLevel::try_get_by_idx(unsigned long) 2025-05-07T20:10:49.8695351Z U torch::autograd::InputMetadata::shape_as_dim_vector() const 2025-05-07T20:10:49.8695769Z U torch::autograd::Node::assign_parent() 2025-05-07T20:10:49.8696138Z U torch::autograd::Node::metadata() 2025-05-07T20:10:49.8696513Z U torch::autograd::Node::name[abi:cxx11]() const 2025-05-07T20:10:49.8697004Z U torch::autograd::SavedVariable::SavedVariable(at::Tensor const&, bool, bool) 2025-05-07T20:10:49.8697656Z U torch::autograd::SavedVariable::unpack(std::shared_ptr) const 2025-05-07T20:10:49.8698187Z U torch::autograd::VariableInfo::VariableInfo() 2025-05-07T20:10:49.8698673Z U torch::autograd::VariableInfo::VariableInfo(at::Tensor const&, bool) 2025-05-07T20:10:49.8699234Z U torch::autograd::VariableInfo::zeros(c10::OptionalDeviceGuard&) const 2025-05-07T20:10:49.8702343Z U torch::autograd::_wrap_outputs(std::vector > const&, std::unordered_set, std::equal_to, std::allocator > const&, std::unordered_set, std::equal_to, std::allocator > const&, c10::ArrayRef >, std::shared_ptr const&, std::function > (std::vector >, std::vector >)> const&, std::unordered_set, std::equal_to, std::allocator > const&, std::function const&) 2025-05-07T20:10:49.8705403Z U torch::autograd::deleteNode(torch::autograd::Node*) 2025-05-07T20:10:49.8705842Z U torch::autograd::get_current_graph_task_exec_info() 2025-05-07T20:10:49.8706307Z U torch::autograd::impl::gradient_edge(at::Tensor const&) 2025-05-07T20:10:49.8706777Z U torch::dynamo::autograd::getPyCompilerInterface() 2025-05-07T20:10:49.8707480Z U torch::dynamo::autograd::get_input_metadata(std::vector > const&) 2025-05-07T20:10:49.8708406Z U torch::jit::parseSchema(std::__cxx11::basic_string, std::allocator > const&, bool) 2025-05-07T20:10:49.8709473Z U transpose_embedding_input(at::Tensor, long, at::Tensor, at::Tensor, bool, std::optional const&, long, long, long, bool, std::optional const&, long, long) 2025-05-07T20:10:49.8710251Z U typeinfo for c10::Error 2025-05-07T20:10:49.8710629Z U typeinfo for std::bad_weak_ptr@GLIBCXX_3.4.15 2025-05-07T20:10:49.8711046Z U typeinfo for std::exception@GLIBCXX_3.4 2025-05-07T20:10:49.8711537Z U typeinfo for std::out_of_range@GLIBCXX_3.4 2025-05-07T20:10:49.8711934Z U typeinfo for std::runtime_error@GLIBCXX_3.4 2025-05-07T20:10:49.8712382Z U typeinfo for torch::autograd::Node 2025-05-07T20:10:49.8713744Z U void internal::csr2csc(internal::HyperCompressedSparseColumn&, int, at::TensorAccessor const&, at::TensorAccessor const&, at::TensorAccessor const&, long, int const*, long) 2025-05-07T20:10:49.8716028Z U void internal::csr2csc(internal::HyperCompressedSparseColumn&, int, at::TensorAccessor const&, at::TensorAccessor const&, at::TensorAccessor const&, long, int const*, long) 2025-05-07T20:10:49.8717378Z U vtable for __cxxabiv1::__class_type_info@CXXABI_1.3 2025-05-07T20:10:49.8717834Z U vtable for __cxxabiv1::__function_type_info@CXXABI_1.3 2025-05-07T20:10:49.8718289Z U vtable for __cxxabiv1::__pointer_type_info@CXXABI_1.3 2025-05-07T20:10:49.8718733Z U vtable for __cxxabiv1::__si_class_type_info@CXXABI_1.3 2025-05-07T20:10:49.8719329Z U vtable for c10::Error 2025-05-07T20:10:49.8719880Z U vtable for std::__cxx11::basic_stringbuf, std::allocator >@GLIBCXX_3.4.21 2025-05-07T20:10:49.8720489Z U vtable for std::bad_weak_ptr@GLIBCXX_3.4.15 2025-05-07T20:10:49.8720990Z U vtable for std::basic_streambuf >@GLIBCXX_3.4 2025-05-07T20:10:49.8721453Z U vtable for torch::autograd::Node 2025-05-07T20:10:49.8721906Z w TLS init function for c10::impl::raw_local_dispatch_key_set 2025-05-07T20:10:49.8722320Z w _ITM_deregisterTMCloneTable 2025-05-07T20:10:49.8722667Z w _ITM_registerTMCloneTable 2025-05-07T20:10:49.8722987Z w __cxa_finalize@GLIBC_2.2.5 2025-05-07T20:10:49.8723309Z w __gmon_start__ 2025-05-07T20:10:49.8723596Z w __pthread_key_create 2025-05-07T20:10:49.8724044Z w pthread_mutex_lock@GLIBC_2.2.5 2025-05-07T20:10:49.8724370Z w pthread_mutex_unlock@GLIBC_2.2.5 2025-05-07T20:10:49.8724718Z [CHECK] Listing out external shared libraries linked: 2025-05-07T20:10:49.8725228Z + ldd ./_skbuild/linux-x86_64-3.12/cmake-build/fbgemm_gpu_tbe_training_backward.so 2025-05-07T20:10:49.8725565Z 2025-05-07T20:10:49.8725731Z linux-vdso.so.1 (0x00007fff30db7000) 2025-05-07T20:10:49.8726191Z libc10.so => not found 2025-05-07T20:10:49.8726519Z libnvrtc.so.12 => not found 2025-05-07T20:10:49.8726804Z libc10_cuda.so => not found 2025-05-07T20:10:49.8727067Z libnccl.so.2 => not found 2025-05-07T20:10:49.8727520Z libcuda.so.1 => not found 2025-05-07T20:10:49.8728193Z fbgemm_gpu_tbe_cache.so => /__w/FBGEMM/FBGEMM/fbgemm_gpu/./_skbuild/linux-x86_64-3.12/cmake-build/fbgemm_gpu_tbe_cache.so (0x00007f66fbc00000) 2025-05-07T20:10:49.8729274Z fbgemm_gpu_tbe_common.so => /__w/FBGEMM/FBGEMM/fbgemm_gpu/./_skbuild/linux-x86_64-3.12/cmake-build/fbgemm_gpu_tbe_common.so (0x00007f66fb800000) 2025-05-07T20:10:49.8730406Z fbgemm_gpu_sparse_async_cumsum.so => /__w/FBGEMM/FBGEMM/fbgemm_gpu/./_skbuild/linux-x86_64-3.12/cmake-build/fbgemm_gpu_sparse_async_cumsum.so (0x00007f6737152000) 2025-05-07T20:10:49.8731188Z libnvidia-ml.so.1 => not found 2025-05-07T20:10:49.8731484Z libtorch.so => not found 2025-05-07T20:10:49.8732006Z fbgemm.so => /__w/FBGEMM/FBGEMM/fbgemm_gpu/./_skbuild/linux-x86_64-3.12/cmake-build/fbgemm.so (0x00007f66fb200000) 2025-05-07T20:10:49.8732998Z fbgemm_gpu_tbe_utils.so => /__w/FBGEMM/FBGEMM/fbgemm_gpu/./_skbuild/linux-x86_64-3.12/cmake-build/fbgemm_gpu_tbe_utils.so (0x00007f66fa000000) 2025-05-07T20:10:49.8733679Z libtorch_cpu.so => not found 2025-05-07T20:10:49.8733979Z libtorch_cuda.so => not found 2025-05-07T20:10:49.8734257Z libcudart.so.12 => not found 2025-05-07T20:10:49.8734606Z libstdc++.so.6 => /lib64/libstdc++.so.6 (0x00007f66f9d9c000) 2025-05-07T20:10:49.8735054Z libm.so.6 => /lib64/libm.so.6 (0x00007f66fd325000) 2025-05-07T20:10:49.8735443Z libgcc_s.so.1 => /lib64/libgcc_s.so.1 (0x00007f6737120000) 2025-05-07T20:10:49.8735845Z libc.so.6 => /lib64/libc.so.6 (0x00007f66f9b94000) 2025-05-07T20:10:49.8736212Z /lib64/ld-linux-x86-64.so.2 (0x00007f67372ff000) 2025-05-07T20:10:49.8736561Z libtorch.so => not found 2025-05-07T20:10:49.8736815Z libc10.so => not found 2025-05-07T20:10:49.8737079Z libnvrtc.so.12 => not found 2025-05-07T20:10:49.8737347Z libc10_cuda.so => not found 2025-05-07T20:10:49.8737740Z libnccl.so.2 => not found 2025-05-07T20:10:49.8738010Z libcuda.so.1 => not found 2025-05-07T20:10:49.8738269Z libnvidia-ml.so.1 => not found 2025-05-07T20:10:49.8738668Z libtorch_cpu.so => not found 2025-05-07T20:10:49.8738922Z libtorch_cuda.so => not found 2025-05-07T20:10:49.8739190Z libcudart.so.12 => not found 2025-05-07T20:10:49.8739495Z libgomp.so.1 => /lib64/libgomp.so.1 (0x00007f66fd2cf000) 2025-05-07T20:10:49.8739841Z libc10.so => not found 2025-05-07T20:10:49.8740076Z libnvrtc.so.12 => not found 2025-05-07T20:10:49.8740339Z libc10_cuda.so => not found 2025-05-07T20:10:49.8740585Z libnccl.so.2 => not found 2025-05-07T20:10:49.8740847Z libcuda.so.1 => not found 2025-05-07T20:10:49.8741438Z fbgemm_gpu_config.so => /__w/FBGEMM/FBGEMM/fbgemm_gpu/./_skbuild/linux-x86_64-3.12/cmake-build/fbgemm_gpu_config.so (0x00007f673710f000) 2025-05-07T20:10:49.8742055Z libnvidia-ml.so.1 => not found 2025-05-07T20:10:49.8742330Z libtorch.so => not found 2025-05-07T20:10:49.8742571Z libtorch_cpu.so => not found 2025-05-07T20:10:49.8742835Z libtorch_cuda.so => not found 2025-05-07T20:10:49.8743129Z libcudart.so.12 => not found 2025-05-07T20:10:49.8743390Z libc10.so => not found 2025-05-07T20:10:49.8743625Z libnvrtc.so.12 => not found 2025-05-07T20:10:49.8743891Z libc10_cuda.so => not found 2025-05-07T20:10:49.8744142Z libnccl.so.2 => not found 2025-05-07T20:10:49.8744382Z libcuda.so.1 => not found 2025-05-07T20:10:49.8744632Z libnvidia-ml.so.1 => not found 2025-05-07T20:10:49.8744883Z libtorch.so => not found 2025-05-07T20:10:49.8745134Z libtorch_cpu.so => not found 2025-05-07T20:10:49.8745385Z libtorch_cuda.so => not found 2025-05-07T20:10:49.8745637Z libcudart.so.12 => not found 2025-05-07T20:10:49.8745908Z libc10.so => not found 2025-05-07T20:10:49.8746142Z libnvrtc.so.12 => not found 2025-05-07T20:10:49.8746384Z libc10_cuda.so => not found 2025-05-07T20:10:49.8746637Z libnccl.so.2 => not found 2025-05-07T20:10:49.8746874Z libcuda.so.1 => not found 2025-05-07T20:10:49.8747376Z asmjit.so => /__w/FBGEMM/FBGEMM/fbgemm_gpu/./_skbuild/linux-x86_64-3.12/cmake-build/asmjit.so (0x00007f66fbb89000) 2025-05-07T20:10:49.8748101Z libnvidia-ml.so.1 => not found 2025-05-07T20:10:49.8748414Z libtorch.so => not found 2025-05-07T20:10:49.8748670Z libtorch_cpu.so => not found 2025-05-07T20:10:49.8749020Z libtorch_cuda.so => not found 2025-05-07T20:10:49.8749284Z libtorch.so => not found 2025-05-07T20:10:49.8749528Z libc10.so => not found 2025-05-07T20:10:49.8749781Z libnvrtc.so.12 => not found 2025-05-07T20:10:49.8750038Z libc10_cuda.so => not found 2025-05-07T20:10:49.8750298Z libnccl.so.2 => not found 2025-05-07T20:10:49.8750545Z libcuda.so.1 => not found 2025-05-07T20:10:49.8750806Z libnvidia-ml.so.1 => not found 2025-05-07T20:10:49.8751081Z libtorch_cpu.so => not found 2025-05-07T20:10:49.8751438Z libtorch_cuda.so => not found 2025-05-07T20:10:49.8751893Z libcudart.so.12 => not found 2025-05-07T20:10:49.8752158Z libtorch.so => not found 2025-05-07T20:10:49.8752497Z libc10.so => not found 2025-05-07T20:10:49.8752741Z libnvrtc.so.12 => not found 2025-05-07T20:10:49.8753012Z libc10_cuda.so => not found 2025-05-07T20:10:49.8753305Z libnccl.so.2 => not found 2025-05-07T20:10:49.8753560Z libcuda.so.1 => not found 2025-05-07T20:10:49.8753809Z libnvidia-ml.so.1 => not found 2025-05-07T20:10:49.8754090Z libtorch_cpu.so => not found 2025-05-07T20:10:49.8754363Z libtorch_cuda.so => not found 2025-05-07T20:10:49.8754737Z libpthread.so.0 => /lib64/libpthread.so.0 (0x00007f66fbb82000) 2025-05-07T20:10:49.8755130Z libtorch.so => not found 2025-05-07T20:10:49.8755381Z libc10.so => not found 2025-05-07T20:10:49.8755630Z libnvrtc.so.12 => not found 2025-05-07T20:10:49.8755889Z libc10_cuda.so => not found 2025-05-07T20:10:49.8756159Z libnccl.so.2 => not found 2025-05-07T20:10:49.8756410Z libcuda.so.1 => not found 2025-05-07T20:10:49.8756679Z libnvidia-ml.so.1 => not found 2025-05-07T20:10:49.8756948Z libtorch_cpu.so => not found 2025-05-07T20:10:49.8757229Z libtorch_cuda.so => not found 2025-05-07T20:10:49.8757539Z librt.so.1 => /lib64/librt.so.1 (0x00007f66fbb79000) 2025-05-07T20:10:49.8757790Z 2025-05-07T20:10:49.8757900Z [CHECK] Displaying ELF information: 2025-05-07T20:10:49.8758377Z + readelf -d ./_skbuild/linux-x86_64-3.12/cmake-build/fbgemm_gpu_tbe_training_backward.so 2025-05-07T20:10:49.8758763Z 2025-05-07T20:10:49.8758767Z 2025-05-07T20:10:49.8758929Z Dynamic section at offset 0x38775ba0 contains 45 entries: 2025-05-07T20:10:49.8759335Z Tag Type Name/Value 2025-05-07T20:10:49.8759757Z 0x0000000000000001 (NEEDED) Shared library: [libc10.so] 2025-05-07T20:10:49.8760270Z 0x0000000000000001 (NEEDED) Shared library: [libnvrtc.so.12] 2025-05-07T20:10:49.8760797Z 0x0000000000000001 (NEEDED) Shared library: [libc10_cuda.so] 2025-05-07T20:10:49.8761309Z 0x0000000000000001 (NEEDED) Shared library: [libnccl.so.2] 2025-05-07T20:10:49.8761820Z 0x0000000000000001 (NEEDED) Shared library: [libcuda.so.1] 2025-05-07T20:10:49.8762348Z 0x0000000000000001 (NEEDED) Shared library: [fbgemm_gpu_tbe_cache.so] 2025-05-07T20:10:49.8762957Z 0x0000000000000001 (NEEDED) Shared library: [fbgemm_gpu_tbe_common.so] 2025-05-07T20:10:49.8763549Z 0x0000000000000001 (NEEDED) Shared library: [fbgemm_gpu_sparse_async_cumsum.so] 2025-05-07T20:10:49.8764121Z 0x0000000000000001 (NEEDED) Shared library: [libnvidia-ml.so.1] 2025-05-07T20:10:49.8764646Z 0x0000000000000001 (NEEDED) Shared library: [libtorch.so] 2025-05-07T20:10:49.8765324Z 0x0000000000000001 (NEEDED) Shared library: [fbgemm.so] 2025-05-07T20:10:49.8765860Z 0x0000000000000001 (NEEDED) Shared library: [fbgemm_gpu_tbe_utils.so] 2025-05-07T20:10:49.8766467Z 0x0000000000000001 (NEEDED) Shared library: [libtorch_cpu.so] 2025-05-07T20:10:49.8767004Z 0x0000000000000001 (NEEDED) Shared library: [libtorch_cuda.so] 2025-05-07T20:10:49.8767540Z 0x0000000000000001 (NEEDED) Shared library: [libcudart.so.12] 2025-05-07T20:10:49.8768062Z 0x0000000000000001 (NEEDED) Shared library: [libstdc++.so.6] 2025-05-07T20:10:49.8768587Z 0x0000000000000001 (NEEDED) Shared library: [libm.so.6] 2025-05-07T20:10:49.8769083Z 0x0000000000000001 (NEEDED) Shared library: [libgcc_s.so.1] 2025-05-07T20:10:49.8769598Z 0x0000000000000001 (NEEDED) Shared library: [libc.so.6] 2025-05-07T20:10:49.8770109Z 0x0000000000000001 (NEEDED) Shared library: [ld-linux-x86-64.so.2] 2025-05-07T20:10:49.8770716Z 0x000000000000000e (SONAME) Library soname: [fbgemm_gpu_tbe_training_backward.so] 2025-05-07T20:10:49.8771282Z 0x000000000000000f (RPATH) Library rpath: [$ORIGIN] 2025-05-07T20:10:49.8771688Z 0x000000000000000c (INIT) 0x652000 2025-05-07T20:10:49.8772035Z 0x000000000000000d (FINI) 0x2f6443c 2025-05-07T20:10:49.8772379Z 0x0000000000000019 (INIT_ARRAY) 0x3871d880 2025-05-07T20:10:49.8772749Z 0x000000000000001b (INIT_ARRAYSZ) 1824 (bytes) 2025-05-07T20:10:49.8773105Z 0x000000000000001a (FINI_ARRAY) 0x3871dfa0 2025-05-07T20:10:49.8773529Z 0x000000000000001c (FINI_ARRAYSZ) 8 (bytes) 2025-05-07T20:10:49.8773881Z 0x000000006ffffef5 (GNU_HASH) 0x238 2025-05-07T20:10:49.8774210Z 0x0000000000000005 (STRTAB) 0x62978 2025-05-07T20:10:49.8774549Z 0x0000000000000006 (SYMTAB) 0x18470 2025-05-07T20:10:49.8774944Z 0x000000000000000a (STRSZ) 5120077 (bytes) 2025-05-07T20:10:49.8775315Z 0x000000000000000b (SYMENT) 24 (bytes) 2025-05-07T20:10:49.8775658Z 0x0000000000000003 (PLTGOT) 0x38788fe8 2025-05-07T20:10:49.8776032Z 0x0000000000000002 (PLTRELSZ) 63264 (bytes) 2025-05-07T20:10:49.8776373Z 0x0000000000000014 (PLTREL) RELA 2025-05-07T20:10:49.8776828Z 0x0000000000000017 (JMPREL) 0x641978 2025-05-07T20:10:49.8777280Z 0x0000000000000007 (RELA) 0x54ae50 2025-05-07T20:10:49.8777609Z 0x0000000000000008 (RELASZ) 1010472 (bytes) 2025-05-07T20:10:49.8777951Z 0x0000000000000009 (RELAENT) 24 (bytes) 2025-05-07T20:10:49.8778267Z 0x000000006ffffffe (VERNEED) 0x54ace0 2025-05-07T20:10:49.8778581Z 0x000000006fffffff (VERNEEDNUM) 6 2025-05-07T20:10:49.8778878Z 0x000000006ffffff0 (VERSYM) 0x5449c6 2025-05-07T20:10:49.8779198Z 0x000000006ffffff9 (RELACOUNT) 28262 2025-05-07T20:10:49.8779506Z 0x0000000000000000 (NULL) 0x0 2025-05-07T20:10:49.8779693Z 2025-05-07T20:10:49.8779798Z ################################################################################ 2025-05-07T20:10:49.8780007Z 2025-05-07T20:10:49.8780011Z 2025-05-07T20:10:49.8780121Z ################################################################################ 2025-05-07T20:10:49.8780640Z [CHECK] BUILT LIBRARY: ./_skbuild/linux-x86_64-3.12/cmake-build/fbgemm_gpu_tbe_training_backward_dense.so 2025-05-07T20:10:49.8781158Z [CHECK] Listing out library size: 2025-05-07T20:10:49.8781682Z + du -h --block-size=1M ./_skbuild/linux-x86_64-3.12/cmake-build/fbgemm_gpu_tbe_training_backward_dense.so 2025-05-07T20:10:49.8782081Z 2025-05-07T20:10:49.8782319Z 59 ./_skbuild/linux-x86_64-3.12/cmake-build/fbgemm_gpu_tbe_training_backward_dense.so 2025-05-07T20:10:49.8782660Z 2025-05-07T20:10:49.8783090Z [CHECK] Listing out the GLIBC versions referenced by: ./_skbuild/linux-x86_64-3.12/cmake-build/fbgemm_gpu_tbe_training_backward_dense.so 2025-05-07T20:10:49.8784137Z + objdump -TC ./_skbuild/linux-x86_64-3.12/cmake-build/fbgemm_gpu_tbe_training_backward_dense.so | grep GLIBC_ | sed 's/.*GLIBC_\([.0-9]*\).*/GLIBC_\1/g' | sort -Vu | cat 2025-05-07T20:10:49.8784785Z 2025-05-07T20:10:49.8866480Z GLIBC_2.2.5 2025-05-07T20:10:49.8866747Z GLIBC_2.3 2025-05-07T20:10:49.8866960Z GLIBC_2.14 2025-05-07T20:10:49.8867101Z 2025-05-07T20:10:49.8867105Z 2025-05-07T20:10:49.8867601Z [CHECK] Listing out the GLIBCXX versions referenced by: ./_skbuild/linux-x86_64-3.12/cmake-build/fbgemm_gpu_tbe_training_backward_dense.so 2025-05-07T20:10:49.8868806Z + objdump -TC ./_skbuild/linux-x86_64-3.12/cmake-build/fbgemm_gpu_tbe_training_backward_dense.so | grep GLIBCXX_ | sed 's/.*GLIBCXX_\([.0-9]*\).*/GLIBCXX_\1/g' | sort -Vu | cat 2025-05-07T20:10:49.8869599Z 2025-05-07T20:10:49.9027925Z GLIBCXX_3.4 2025-05-07T20:10:49.9028688Z GLIBCXX_3.4.9 2025-05-07T20:10:49.9029001Z GLIBCXX_3.4.11 2025-05-07T20:10:49.9029242Z GLIBCXX_3.4.15 2025-05-07T20:10:49.9029484Z GLIBCXX_3.4.18 2025-05-07T20:10:49.9029811Z GLIBCXX_3.4.20 2025-05-07T20:10:49.9030020Z GLIBCXX_3.4.21 2025-05-07T20:10:49.9030162Z 2025-05-07T20:10:49.9030167Z 2025-05-07T20:10:49.9050421Z + nm -gDC ./_skbuild/linux-x86_64-3.12/cmake-build/fbgemm_gpu_tbe_training_backward_dense.so > /tmp/tmp.it0aDCpPSq.symbols.txt 2025-05-07T20:10:49.9052097Z 2025-05-07T20:10:49.9169312Z 2025-05-07T20:10:49.9194732Z [CHECK] Total Number of symbols: 1874 2025-05-07T20:10:49.9213645Z [CHECK] Number of fbgemm symbols: 100 2025-05-07T20:10:49.9231907Z + nm -gDCu ./_skbuild/linux-x86_64-3.12/cmake-build/fbgemm_gpu_tbe_training_backward_dense.so > /tmp/tmp.xyCb5tOmiP.usymbols.txt 2025-05-07T20:10:49.9234183Z 2025-05-07T20:10:49.9254495Z 2025-05-07T20:10:49.9279758Z [CHECK] Listing out undefined symbols (259 total): 2025-05-07T20:10:49.9296679Z U VTT for std::__cxx11::basic_ostringstream, std::allocator >@GLIBCXX_3.4.21 2025-05-07T20:10:49.9299419Z U VTT for std::__cxx11::basic_stringstream, std::allocator >@GLIBCXX_3.4.21 2025-05-07T20:10:49.9301245Z U _Unwind_Resume@GCC_3.0 2025-05-07T20:10:49.9302252Z U __cudaPopCallConfiguration@libcudart.so.12 2025-05-07T20:10:49.9303426Z U __cudaPushCallConfiguration@libcudart.so.12 2025-05-07T20:10:49.9304533Z U __cudaRegisterFatBinary@libcudart.so.12 2025-05-07T20:10:49.9305632Z U __cudaRegisterFatBinaryEnd@libcudart.so.12 2025-05-07T20:10:49.9306708Z U __cudaRegisterFunction@libcudart.so.12 2025-05-07T20:10:49.9307761Z U __cudaRegisterVar@libcudart.so.12 2025-05-07T20:10:49.9308832Z U __cudaUnregisterFatBinary@libcudart.so.12 2025-05-07T20:10:49.9309884Z U __cxa_allocate_exception@CXXABI_1.3 2025-05-07T20:10:49.9310731Z U __cxa_atexit@GLIBC_2.2.5 2025-05-07T20:10:49.9311047Z U __cxa_begin_catch@CXXABI_1.3 2025-05-07T20:10:49.9311515Z U __cxa_end_catch@CXXABI_1.3 2025-05-07T20:10:49.9312016Z U __cxa_free_exception@CXXABI_1.3 2025-05-07T20:10:49.9312365Z U __cxa_guard_abort@CXXABI_1.3 2025-05-07T20:10:49.9312707Z U __cxa_guard_acquire@CXXABI_1.3 2025-05-07T20:10:49.9313042Z U __cxa_guard_release@CXXABI_1.3 2025-05-07T20:10:49.9313374Z U __cxa_rethrow@CXXABI_1.3 2025-05-07T20:10:49.9313699Z U __cxa_thread_atexit@CXXABI_1.3.7 2025-05-07T20:10:49.9314131Z U __cxa_throw@CXXABI_1.3 2025-05-07T20:10:49.9314449Z U __gxx_personality_v0@CXXABI_1.3 2025-05-07T20:10:49.9314776Z U __tls_get_addr@GLIBC_2.3 2025-05-07T20:10:49.9315088Z U adjust_info_B_num_bits(int, int) 2025-05-07T20:10:49.9315468Z U at::Context::deterministicAlgorithms() const 2025-05-07T20:10:49.9315910Z U at::RecordFunction::RecordFunction(at::StepCallbacks&&) 2025-05-07T20:10:49.9316322Z U at::RecordFunction::currentThreadId() 2025-05-07T20:10:49.9316669Z U at::RecordFunction::end() 2025-05-07T20:10:49.9317062Z U at::RecordFunction::~RecordFunction() 2025-05-07T20:10:49.9317440Z U at::SavedTensorDefaultHooks::set_tracing(bool) 2025-05-07T20:10:49.9317881Z U at::TensorBase::__dispatch_contiguous(c10::MemoryFormat) const 2025-05-07T20:10:49.9318358Z U at::_ops::copy_::call(at::Tensor&, at::Tensor const&, bool) 2025-05-07T20:10:49.9319238Z U at::_ops::empty_like::call(at::Tensor const&, std::optional, std::optional, std::optional, std::optional, std::optional) 2025-05-07T20:10:49.9320586Z U at::_ops::empty_memory_format::call(c10::ArrayRef, std::optional, std::optional, std::optional, std::optional, std::optional) 2025-05-07T20:10:49.9321526Z U at::_ops::view_as::call(at::Tensor const&, at::Tensor const&) 2025-05-07T20:10:49.9322290Z U at::_ops::zeros::call(c10::ArrayRef, std::optional, std::optional, std::optional, std::optional) 2025-05-07T20:10:49.9323473Z U at::_ops::zeros_like::call(at::Tensor const&, std::optional, std::optional, std::optional, std::optional, std::optional) 2025-05-07T20:10:49.9324575Z U at::cuda::getCurrentDeviceProperties() 2025-05-07T20:10:49.9324930Z U at::functorch::functorchTLSAccessor() 2025-05-07T20:10:49.9325297Z U at::getStepCallbacksUnlessEmpty(at::RecordScope) 2025-05-07T20:10:49.9325654Z U at::globalContext() 2025-05-07T20:10:49.9325994Z U at::sequence_number::get_and_increment() 2025-05-07T20:10:49.9326315Z U bcmp@GLIBC_2.2.5 2025-05-07T20:10:49.9326592Z U c10::AnyType::get() 2025-05-07T20:10:49.9326959Z U c10::BFloat16* at::TensorBase::mutable_data_ptr() const 2025-05-07T20:10:49.9327365Z U c10::BoolType::get() 2025-05-07T20:10:49.9327702Z U c10::DeviceTypeName[abi:cxx11](c10::DeviceType, bool) 2025-05-07T20:10:49.9327886Z U c10::Dispatcher::findSchemaOrThrow(char const*, char const*) 2025-05-07T20:10:49.9327998Z U c10::Dispatcher::realSingleton() 2025-05-07T20:10:49.9328489Z U c10::Dispatcher::runRecordFunction(at::RecordFunction&, std::reference_wrapper, c10::DispatchKey, c10::DispatchKeySet) 2025-05-07T20:10:49.9329092Z U c10::Dispatcher::runRecordFunction(at::RecordFunction&, std::reference_wrapper, c10::DispatchKey, c10::DispatchKeySet, c10::ArrayRef) 2025-05-07T20:10:49.9329444Z U c10::Error::Error(c10::SourceLocation, std::__cxx11::basic_string, std::allocator >) 2025-05-07T20:10:49.9329545Z U c10::Error::what() const 2025-05-07T20:10:49.9329653Z U c10::FloatType::get() 2025-05-07T20:10:49.9329752Z U c10::GradMode::is_enabled() 2025-05-07T20:10:49.9329858Z U c10::GradMode::set_enabled(bool) 2025-05-07T20:10:49.9330037Z U c10::Half* at::TensorBase::mutable_data_ptr() const 2025-05-07T20:10:49.9330393Z U c10::IValue::TagType::get(c10::IValue const&) 2025-05-07T20:10:49.9330515Z U c10::IValue::hash(c10::IValue const&) 2025-05-07T20:10:49.9330630Z U c10::IValue::isBoolList() const 2025-05-07T20:10:49.9330780Z U c10::IValue::isIntList() const 2025-05-07T20:10:49.9330891Z U c10::IValue::isSymIntList() const 2025-05-07T20:10:49.9331017Z U c10::IValue::isTensorList() const 2025-05-07T20:10:49.9331159Z U c10::IValue::reportToTensorTypeError() const 2025-05-07T20:10:49.9331289Z U c10::IntType::get() 2025-05-07T20:10:49.9331463Z U c10::MessageLogger::MessageLogger(char const*, int, int) 2025-05-07T20:10:49.9331586Z U c10::MessageLogger::~MessageLogger() 2025-05-07T20:10:49.9331713Z U c10::OperatorHandle::~OperatorHandle() 2025-05-07T20:10:49.9331834Z U c10::OperatorHandle::~OperatorHandle() 2025-05-07T20:10:49.9332075Z U c10::OptionalType::get(c10::Type::SingletonOrSharedTypePtr) 2025-05-07T20:10:49.9332352Z U c10::SmallVectorBase::grow_pod(void const*, unsigned long, unsigned long) 2025-05-07T20:10:49.9332692Z U c10::StorageImpl::throw_data_ptr_access_error() const 2025-05-07T20:10:49.9332812Z U c10::StringType::get() 2025-05-07T20:10:49.9332965Z U c10::SymBool::expect_true(char const*, long) const 2025-05-07T20:10:49.9333108Z U c10::SymBool::guard_bool(char const*, long) const 2025-05-07T20:10:49.9333296Z U c10::SymBool::guard_size_oblivious(char const*, long) const 2025-05-07T20:10:49.9333444Z U c10::SymFloat::guard_float(char const*, long) const 2025-05-07T20:10:49.9333636Z U c10::SymFloat::operator/(c10::SymFloat const&) const 2025-05-07T20:10:49.9334090Z U c10::SymInt::SymInt(c10::intrusive_ptr >) 2025-05-07T20:10:49.9334237Z U c10::SymInt::guard_int(char const*, long) const 2025-05-07T20:10:49.9334367Z U c10::SymInt::operator c10::SymFloat() const 2025-05-07T20:10:49.9334519Z U c10::SymInt::operator/(c10::SymInt const&) const 2025-05-07T20:10:49.9334671Z U c10::SymInt::promote_to_negative() 2025-05-07T20:10:49.9334808Z U c10::SymInt::sym_eq(c10::SymInt const&) const 2025-05-07T20:10:49.9334946Z U c10::SymInt::sym_gt(c10::SymInt const&) const 2025-05-07T20:10:49.9335080Z U c10::SymInt::sym_le(c10::SymInt const&) const 2025-05-07T20:10:49.9335193Z U c10::SymInt::toSymNode() const 2025-05-07T20:10:49.9335302Z U c10::SymIntType::get() 2025-05-07T20:10:49.9335456Z U c10::SymbolicShapeMeta::init_is_contiguous() const 2025-05-07T20:10:49.9335585Z U c10::TensorImpl::requires_grad() const 2025-05-07T20:10:49.9335752Z U c10::TensorImpl::throw_data_ptr_access_error() const 2025-05-07T20:10:49.9335857Z U c10::TensorType::get() 2025-05-07T20:10:49.9335982Z U c10::UndefinedTensorImpl::_singleton 2025-05-07T20:10:49.9336717Z U c10::Warning::Warning(std::variant, c10::SourceLocation const&, std::__cxx11::basic_string, std::allocator >, bool) 2025-05-07T20:10:49.9336852Z U c10::cuda::CUDACachingAllocator::allocator 2025-05-07T20:10:49.9336976Z U c10::cuda::CUDAStream::stream() const 2025-05-07T20:10:49.9337111Z U c10::cuda::ExchangeDevice(signed char) 2025-05-07T20:10:49.9337243Z U c10::cuda::GetDevice(signed char*) 2025-05-07T20:10:49.9337371Z U c10::cuda::MaybeSetDevice(signed char) 2025-05-07T20:10:49.9338801Z U c10::cuda::SetDevice(signed char) 2025-05-07T20:10:49.9339080Z U c10::cuda::c10_cuda_check_implementation(int, char const*, char const*, int, bool) 2025-05-07T20:10:49.9339189Z U c10::cuda::device_count() 2025-05-07T20:10:49.9339331Z U c10::cuda::getCurrentCUDAStream(signed char) 2025-05-07T20:10:49.9339491Z U c10::cuda::getDefaultCUDAStream(signed char) 2025-05-07T20:10:49.9339638Z U c10::cuda::getStreamFromPool(bool, signed char) 2025-05-07T20:10:49.9339780Z U c10::cuda::getStreamFromPool(int, signed char) 2025-05-07T20:10:49.9340044Z U c10::cuda::setCurrentCUDAStream(c10::cuda::CUDAStream) 2025-05-07T20:10:49.9340165Z U c10::cuda::warn_or_error_on_sync() 2025-05-07T20:10:49.9340603Z U c10::detail::ListImpl::ListImpl(std::vector >, c10::Type::SingletonOrSharedTypePtr) 2025-05-07T20:10:49.9341147Z U c10::detail::infer_schema::make_function_schema(c10::ArrayRef, c10::ArrayRef) 2025-05-07T20:10:49.9341411Z U c10::detail::torchCheckFail(char const*, char const*, unsigned int, char const*) 2025-05-07T20:10:49.9341925Z U c10::detail::torchCheckFail(char const*, char const*, unsigned int, std::__cxx11::basic_string, std::allocator > const&) 2025-05-07T20:10:49.9342273Z U c10::detail::torchInternalAssertFail(char const*, char const*, unsigned int, char const*, char const*) 2025-05-07T20:10:49.9342869Z U c10::detail::torchInternalAssertFail(char const*, char const*, unsigned int, char const*, std::__cxx11::basic_string, std::allocator > const&) 2025-05-07T20:10:49.9343008Z U c10::impl::GPUTrace::gpuTraceState 2025-05-07T20:10:49.9343128Z U c10::impl::GPUTrace::haveState 2025-05-07T20:10:49.9343486Z U c10::impl::OperatorEntry::assertSignatureIsCorrect(c10::impl::CppSignature const&, bool) const 2025-05-07T20:10:49.9343692Z U c10::impl::OperatorEntry::reportError(c10::DispatchKey) const 2025-05-07T20:10:49.9343885Z U c10::impl::cow::is_cow_data_ptr(c10::DataPtr const&) 2025-05-07T20:10:49.9344058Z U c10::impl::cow::materialize_cow_storage(c10::StorageImpl&) 2025-05-07T20:10:49.9344198Z U c10::impl::device_guard_impl_registry 2025-05-07T20:10:49.9344320Z U c10::impl::raw_local_dispatch_key_set 2025-05-07T20:10:49.9344602Z U c10::initializeFunctionalityOffsetsAndMasks() 2025-05-07T20:10:49.9345104Z U c10::ivalue::ConstantString::create(std::__cxx11::basic_string, std::allocator >) 2025-05-07T20:10:49.9345222Z U c10::operator-(c10::SymInt const&, int) 2025-05-07T20:10:49.9345363Z U c10::operator<<(std::ostream&, c10::Device const&) 2025-05-07T20:10:49.9345508Z U c10::operator<<(std::ostream&, c10::DeviceType) 2025-05-07T20:10:49.9345661Z U c10::operator<<(std::ostream&, c10::OperatorName const&) 2025-05-07T20:10:49.9345805Z U c10::operator<<(std::ostream&, c10::SymFloat const&) 2025-05-07T20:10:49.9345946Z U c10::operator<<(std::ostream&, c10::SymInt const&) 2025-05-07T20:10:49.9346071Z U c10::throwNullDataPtrError() 2025-05-07T20:10:49.9346172Z U c10::warn(c10::Warning const&) 2025-05-07T20:10:49.9346286Z U c10::warnDeprecatedDataPtr() 2025-05-07T20:10:49.9346492Z U caffe2::TypeMeta::error_unsupported_typemeta(caffe2::TypeMeta) 2025-05-07T20:10:49.9346611Z U caffe2::TypeMeta::typeMetaDatas() 2025-05-07T20:10:49.9346740Z U cudaDeviceGetAttribute@libcudart.so.12 2025-05-07T20:10:49.9346909Z U cudaDeviceSynchronize@libcudart.so.12 2025-05-07T20:10:49.9347043Z U cudaEventCreateWithFlags@libcudart.so.12 2025-05-07T20:10:49.9347154Z U cudaEventDestroy@libcudart.so.12 2025-05-07T20:10:49.9347292Z U cudaEventElapsedTime@libcudart.so.12 2025-05-07T20:10:49.9347405Z U cudaEventQuery@libcudart.so.12 2025-05-07T20:10:49.9347519Z U cudaEventRecord@libcudart.so.12 2025-05-07T20:10:49.9347638Z U cudaEventSynchronize@libcudart.so.12 2025-05-07T20:10:49.9347773Z U cudaFuncSetAttribute@libcudart.so.12 2025-05-07T20:10:49.9347936Z U cudaGetDeviceProperties_v2@libcudart.so.12 2025-05-07T20:10:49.9348055Z U cudaGetErrorString@libcudart.so.12 2025-05-07T20:10:49.9348183Z U cudaGetLastError@libcudart.so.12 2025-05-07T20:10:49.9348289Z U cudaLaunchKernel@libcudart.so.12 2025-05-07T20:10:49.9348400Z U cudaStreamQuery@libcudart.so.12 2025-05-07T20:10:49.9348543Z U cudaStreamSynchronize@libcudart.so.12 2025-05-07T20:10:49.9348657Z U cudaStreamWaitEvent@libcudart.so.12 2025-05-07T20:10:49.9350928Z U embedding_ops::split_embedding_backward_codegen_find_long_segments(at::GenericPackedTensorAccessor, at::GenericPackedTensorAccessor, at::GenericPackedTensorAccessor, at::GenericPackedTensorAccessor, at::GenericPackedTensorAccessor, at::GenericPackedTensorAccessor, at::GenericPackedTensorAccessor, int, int, bool) 2025-05-07T20:10:49.9351300Z U fbgemm_gpu::config::is_feature_enabled(fbgemm_gpu::config::FeatureGateName const&) 2025-05-07T20:10:49.9351448Z U float* at::TensorBase::data_ptr() const 2025-05-07T20:10:49.9351820Z U float* at::TensorBase::mutable_data_ptr() const 2025-05-07T20:10:49.9351925Z U free@GLIBC_2.2.5 2025-05-07T20:10:49.9352052Z U int* at::TensorBase::data_ptr() const 2025-05-07T20:10:49.9352227Z U int* at::TensorBase::mutable_data_ptr() const 2025-05-07T20:10:49.9352425Z U long c10::detail::maybe_wrap_dim_slow(long, long, bool) 2025-05-07T20:10:49.9352624Z U long* at::TensorBase::data_ptr() const 2025-05-07T20:10:49.9352777Z U long* at::TensorBase::mutable_data_ptr() const 2025-05-07T20:10:49.9352896Z U memcmp@GLIBC_2.2.5 2025-05-07T20:10:49.9353007Z U memcpy@GLIBC_2.14 2025-05-07T20:10:49.9353107Z U memmove@GLIBC_2.2.5 2025-05-07T20:10:49.9353236Z U memset@GLIBC_2.2.5 2025-05-07T20:10:49.9353358Z U operator delete(void*)@GLIBCXX_3.4 2025-05-07T20:10:49.9353501Z U operator new(unsigned long)@GLIBCXX_3.4 2025-05-07T20:10:49.9353849Z U radix_sort_pairs(void*, unsigned long&, int const*, int*, float const*, float*, int, int, int, CUstream_st*) 2025-05-07T20:10:49.9354202Z U radix_sort_pairs(void*, unsigned long&, long const*, long*, float const*, float*, int, int, int, CUstream_st*) 2025-05-07T20:10:49.9354311Z U realloc@GLIBC_2.2.5 2025-05-07T20:10:49.9354538Z U std::_Hash_bytes(void const*, unsigned long, unsigned long)@CXXABI_1.3.5 2025-05-07T20:10:49.9354908Z U std::__cxx11::basic_ostringstream, std::allocator >::basic_ostringstream() 2025-05-07T20:10:49.9355315Z U std::__cxx11::basic_ostringstream, std::allocator >::~basic_ostringstream()@GLIBCXX_3.4.21 2025-05-07T20:10:49.9355701Z U std::__cxx11::basic_stringstream, std::allocator >::basic_stringstream() 2025-05-07T20:10:49.9356094Z U std::__cxx11::basic_stringstream, std::allocator >::~basic_stringstream()@GLIBCXX_3.4.21 2025-05-07T20:10:49.9356482Z U std::__detail::_Prime_rehash_policy::_M_need_rehash(unsigned long, unsigned long, unsigned long) const@GLIBCXX_3.4.18 2025-05-07T20:10:49.9356617Z U std::__throw_bad_alloc()@GLIBCXX_3.4 2025-05-07T20:10:49.9356741Z U std::__throw_bad_array_new_length() 2025-05-07T20:10:49.9356926Z U std::__throw_length_error(char const*)@GLIBCXX_3.4 2025-05-07T20:10:49.9357078Z U std::__throw_logic_error(char const*)@GLIBCXX_3.4 2025-05-07T20:10:49.9357255Z U std::__throw_out_of_range_fmt(char const*, ...)@GLIBCXX_3.4.20 2025-05-07T20:10:49.9357392Z U std::__throw_system_error(int)@GLIBCXX_3.4.11 2025-05-07T20:10:49.9357557Z U std::bad_weak_ptr::~bad_weak_ptr()@GLIBCXX_3.4.15 2025-05-07T20:10:49.9357801Z U std::basic_ios >::clear(std::_Ios_Iostate)@GLIBCXX_3.4 2025-05-07T20:10:49.9358390Z U std::basic_ostream >& std::__ostream_insert >(std::basic_ostream >&, char const*, long)@GLIBCXX_3.4.9 2025-05-07T20:10:49.9358536Z U std::exception::~exception()@GLIBCXX_3.4 2025-05-07T20:10:49.9358658Z U std::ios_base::Init::Init()@GLIBCXX_3.4 2025-05-07T20:10:49.9358783Z U std::ios_base::Init::~Init()@GLIBCXX_3.4 2025-05-07T20:10:49.9358920Z U std::ios_base::~ios_base()@GLIBCXX_3.4 2025-05-07T20:10:49.9359033Z U std::locale::~locale()@GLIBCXX_3.4 2025-05-07T20:10:49.9359220Z U std::ostream& std::ostream::_M_insert(long)@GLIBCXX_3.4.9 2025-05-07T20:10:49.9359513Z U std::ostream& std::ostream::_M_insert(unsigned long)@GLIBCXX_3.4.9 2025-05-07T20:10:49.9359642Z U std::ostream::operator<<(int)@GLIBCXX_3.4 2025-05-07T20:10:49.9359813Z U std::out_of_range::out_of_range(char const*)@GLIBCXX_3.4.21 2025-05-07T20:10:49.9359992Z U std::out_of_range::~out_of_range()@GLIBCXX_3.4 2025-05-07T20:10:49.9360428Z U std::runtime_error::runtime_error(std::__cxx11::basic_string, std::allocator > const&)@GLIBCXX_3.4.21 2025-05-07T20:10:49.9360569Z U std::runtime_error::~runtime_error()@GLIBCXX_3.4 2025-05-07T20:10:49.9360699Z U std::terminate()@GLIBCXX_3.4 2025-05-07T20:10:49.9360798Z U strcmp@GLIBC_2.2.5 2025-05-07T20:10:49.9360899Z U strlen@GLIBC_2.2.5 2025-05-07T20:10:49.9361027Z U torch::CppFunction::~CppFunction() 2025-05-07T20:10:49.9361648Z U torch::Library::Library(torch::Library::Kind, std::__cxx11::basic_string, std::allocator >, std::optional, char const*, unsigned int) 2025-05-07T20:10:49.9362115Z U torch::Library::_def(c10::FunctionSchema&&, c10::OperatorName*, std::vector > const&, torch::_RegisterOrVerify) & 2025-05-07T20:10:49.9362399Z U torch::Library::_impl(char const*, torch::CppFunction&&, torch::_RegisterOrVerify) & 2025-05-07T20:10:49.9362525Z U torch::autograd::AnomalyMode::_enabled 2025-05-07T20:10:49.9362822Z U torch::autograd::AutogradContext::AutogradContext(torch::dynamo::autograd::PackedArgs&) 2025-05-07T20:10:49.9363022Z U torch::autograd::AutogradContext::get_and_bump_dirty() const 2025-05-07T20:10:49.9363227Z U torch::autograd::AutogradContext::get_non_differentiable() const 2025-05-07T20:10:49.9363442Z U torch::autograd::AutogradContext::get_saved_variables() const 2025-05-07T20:10:49.9363808Z U torch::autograd::AutogradContext::save_for_backward(std::vector >) 2025-05-07T20:10:49.9363966Z U torch::autograd::AutogradContext::save_variables() 2025-05-07T20:10:49.9364157Z U torch::autograd::ForwardADLevel::try_get_by_idx(unsigned long) 2025-05-07T20:10:49.9364348Z U torch::autograd::InputMetadata::shape_as_dim_vector() const 2025-05-07T20:10:49.9364472Z U torch::autograd::Node::assign_parent() 2025-05-07T20:10:49.9364615Z U torch::autograd::Node::metadata() 2025-05-07T20:10:49.9364986Z U torch::autograd::Node::name[abi:cxx11]() const 2025-05-07T20:10:49.9365242Z U torch::autograd::SavedVariable::SavedVariable(at::Tensor const&, bool, bool) 2025-05-07T20:10:49.9365521Z U torch::autograd::SavedVariable::unpack(std::shared_ptr) const 2025-05-07T20:10:49.9365679Z U torch::autograd::VariableInfo::VariableInfo() 2025-05-07T20:10:49.9365899Z U torch::autograd::VariableInfo::VariableInfo(at::Tensor const&, bool) 2025-05-07T20:10:49.9366123Z U torch::autograd::VariableInfo::zeros(c10::OptionalDeviceGuard&) const 2025-05-07T20:10:49.9368924Z U torch::autograd::_wrap_outputs(std::vector > const&, std::unordered_set, std::equal_to, std::allocator > const&, std::unordered_set, std::equal_to, std::allocator > const&, c10::ArrayRef >, std::shared_ptr const&, std::function > (std::vector >, std::vector >)> const&, std::unordered_set, std::equal_to, std::allocator > const&, std::function const&) 2025-05-07T20:10:49.9369090Z U torch::autograd::deleteNode(torch::autograd::Node*) 2025-05-07T20:10:49.9369295Z U torch::autograd::get_current_graph_task_exec_info() 2025-05-07T20:10:49.9369466Z U torch::autograd::impl::gradient_edge(at::Tensor const&) 2025-05-07T20:10:49.9370273Z U torch::autograd::profiler::record_function_enter_new(std::__cxx11::basic_string, std::allocator > const&, std::optional, std::allocator > > const&) 2025-05-07T20:10:49.9370445Z U torch::dynamo::autograd::getPyCompilerInterface() 2025-05-07T20:10:49.9370866Z U torch::dynamo::autograd::get_input_metadata(std::vector > const&) 2025-05-07T20:10:49.9371253Z U torch::jit::parseSchema(std::__cxx11::basic_string, std::allocator > const&, bool) 2025-05-07T20:10:49.9371817Z U transpose_embedding_input(at::Tensor, long, at::Tensor, at::Tensor, bool, std::optional const&, long, long, long, bool, std::optional const&, long, long) 2025-05-07T20:10:49.9371927Z U typeinfo for c10::Error 2025-05-07T20:10:49.9372080Z U typeinfo for std::bad_weak_ptr@GLIBCXX_3.4.15 2025-05-07T20:10:49.9372211Z U typeinfo for std::exception@GLIBCXX_3.4 2025-05-07T20:10:49.9372347Z U typeinfo for std::out_of_range@GLIBCXX_3.4 2025-05-07T20:10:49.9372500Z U typeinfo for std::runtime_error@GLIBCXX_3.4 2025-05-07T20:10:49.9372617Z U typeinfo for torch::autograd::Node 2025-05-07T20:10:49.9374129Z U void embedding_ops::grad_mean_kernel(at::GenericPackedTensorAccessor, at::GenericPackedTensorAccessor, at::GenericPackedTensorAccessor, at::GenericPackedTensorAccessor, fbgemm_gpu::FixedDivisor) 2025-05-07T20:10:49.9375609Z U void embedding_ops::grad_mean_kernel(at::GenericPackedTensorAccessor, at::GenericPackedTensorAccessor, at::GenericPackedTensorAccessor, at::GenericPackedTensorAccessor, fbgemm_gpu::FixedDivisor) 2025-05-07T20:10:49.9377132Z U void embedding_ops::grad_mean_kernel(at::GenericPackedTensorAccessor, at::GenericPackedTensorAccessor, at::GenericPackedTensorAccessor, at::GenericPackedTensorAccessor, fbgemm_gpu::FixedDivisor) 2025-05-07T20:10:49.9378505Z U void embedding_ops::grad_mean_kernel(at::GenericPackedTensorAccessor, at::GenericPackedTensorAccessor, at::GenericPackedTensorAccessor, at::GenericPackedTensorAccessor, fbgemm_gpu::FixedDivisor) 2025-05-07T20:10:49.9379856Z U void embedding_ops::grad_mean_kernel(at::GenericPackedTensorAccessor, at::GenericPackedTensorAccessor, at::GenericPackedTensorAccessor, at::GenericPackedTensorAccessor, fbgemm_gpu::FixedDivisor) 2025-05-07T20:10:49.9381251Z U void embedding_ops::grad_mean_kernel(at::GenericPackedTensorAccessor, at::GenericPackedTensorAccessor, at::GenericPackedTensorAccessor, at::GenericPackedTensorAccessor, fbgemm_gpu::FixedDivisor) 2025-05-07T20:10:49.9381420Z U vtable for __cxxabiv1::__class_type_info@CXXABI_1.3 2025-05-07T20:10:49.9381575Z U vtable for __cxxabiv1::__function_type_info@CXXABI_1.3 2025-05-07T20:10:49.9381730Z U vtable for __cxxabiv1::__pointer_type_info@CXXABI_1.3 2025-05-07T20:10:49.9382017Z U vtable for __cxxabiv1::__si_class_type_info@CXXABI_1.3 2025-05-07T20:10:49.9382112Z U vtable for c10::Error 2025-05-07T20:10:49.9382584Z U vtable for std::__cxx11::basic_stringbuf, std::allocator >@GLIBCXX_3.4.21 2025-05-07T20:10:49.9382733Z U vtable for std::bad_weak_ptr@GLIBCXX_3.4.15 2025-05-07T20:10:49.9382948Z U vtable for std::basic_streambuf >@GLIBCXX_3.4 2025-05-07T20:10:49.9383058Z U vtable for torch::autograd::Node 2025-05-07T20:10:49.9383227Z w TLS init function for c10::impl::raw_local_dispatch_key_set 2025-05-07T20:10:49.9383349Z w _ITM_deregisterTMCloneTable 2025-05-07T20:10:49.9383446Z w _ITM_registerTMCloneTable 2025-05-07T20:10:49.9383549Z w __cxa_finalize@GLIBC_2.2.5 2025-05-07T20:10:49.9383650Z w __gmon_start__ 2025-05-07T20:10:49.9383743Z w __pthread_key_create 2025-05-07T20:10:49.9383847Z w pthread_mutex_lock@GLIBC_2.2.5 2025-05-07T20:10:49.9383969Z w pthread_mutex_unlock@GLIBC_2.2.5 2025-05-07T20:10:49.9384133Z [CHECK] Listing out external shared libraries linked: 2025-05-07T20:10:49.9384383Z + ldd ./_skbuild/linux-x86_64-3.12/cmake-build/fbgemm_gpu_tbe_training_backward_dense.so 2025-05-07T20:10:49.9384391Z 2025-05-07T20:10:49.9384549Z linux-vdso.so.1 (0x00007ffcaeb63000) 2025-05-07T20:10:49.9384633Z libc10.so => not found 2025-05-07T20:10:49.9384725Z libnvrtc.so.12 => not found 2025-05-07T20:10:49.9384828Z libc10_cuda.so => not found 2025-05-07T20:10:49.9384918Z libnccl.so.2 => not found 2025-05-07T20:10:49.9385004Z libcuda.so.1 => not found 2025-05-07T20:10:49.9385553Z fbgemm_gpu_tbe_training_backward.so => /__w/FBGEMM/FBGEMM/fbgemm_gpu/./_skbuild/linux-x86_64-3.12/cmake-build/fbgemm_gpu_tbe_training_backward.so (0x00007f4f09000000) 2025-05-07T20:10:49.9385676Z libnvidia-ml.so.1 => not found 2025-05-07T20:10:49.9385765Z libtorch.so => not found 2025-05-07T20:10:49.9385857Z libtorch_cpu.so => not found 2025-05-07T20:10:49.9385963Z libtorch_cuda.so => not found 2025-05-07T20:10:49.9386054Z libcudart.so.12 => not found 2025-05-07T20:10:49.9386208Z libstdc++.so.6 => /lib64/libstdc++.so.6 (0x00007f4f08d9c000) 2025-05-07T20:10:49.9386360Z libgomp.so.1 => /lib64/libgomp.so.1 (0x00007f4f46a0b000) 2025-05-07T20:10:49.9386502Z libgcc_s.so.1 => /lib64/libgcc_s.so.1 (0x00007f4f42dd2000) 2025-05-07T20:10:49.9386621Z libc.so.6 => /lib64/libc.so.6 (0x00007f4f08b94000) 2025-05-07T20:10:49.9386738Z /lib64/ld-linux-x86-64.so.2 (0x00007f4f46a69000) 2025-05-07T20:10:49.9386833Z libc10.so => not found 2025-05-07T20:10:49.9386922Z libnvrtc.so.12 => not found 2025-05-07T20:10:49.9387004Z libc10_cuda.so => not found 2025-05-07T20:10:49.9387104Z libnccl.so.2 => not found 2025-05-07T20:10:49.9387191Z libcuda.so.1 => not found 2025-05-07T20:10:49.9387819Z fbgemm_gpu_tbe_cache.so => /__w/FBGEMM/FBGEMM/fbgemm_gpu/./_skbuild/linux-x86_64-3.12/cmake-build/fbgemm_gpu_tbe_cache.so (0x00007f4f07400000) 2025-05-07T20:10:49.9388292Z fbgemm_gpu_tbe_common.so => /__w/FBGEMM/FBGEMM/fbgemm_gpu/./_skbuild/linux-x86_64-3.12/cmake-build/fbgemm_gpu_tbe_common.so (0x00007f4f07000000) 2025-05-07T20:10:49.9389016Z fbgemm_gpu_sparse_async_cumsum.so => /__w/FBGEMM/FBGEMM/fbgemm_gpu/./_skbuild/linux-x86_64-3.12/cmake-build/fbgemm_gpu_sparse_async_cumsum.so (0x00007f4f06e59000) 2025-05-07T20:10:49.9389120Z libnvidia-ml.so.1 => not found 2025-05-07T20:10:49.9389401Z libtorch.so => not found 2025-05-07T20:10:49.9389794Z fbgemm.so => /__w/FBGEMM/FBGEMM/fbgemm_gpu/./_skbuild/linux-x86_64-3.12/cmake-build/fbgemm.so (0x00007f4f06800000) 2025-05-07T20:10:49.9390263Z fbgemm_gpu_tbe_utils.so => /__w/FBGEMM/FBGEMM/fbgemm_gpu/./_skbuild/linux-x86_64-3.12/cmake-build/fbgemm_gpu_tbe_utils.so (0x00007f4f05600000) 2025-05-07T20:10:49.9390367Z libtorch_cpu.so => not found 2025-05-07T20:10:49.9390489Z libtorch_cuda.so => not found 2025-05-07T20:10:49.9390588Z libcudart.so.12 => not found 2025-05-07T20:10:49.9390744Z libm.so.6 => /lib64/libm.so.6 (0x00007f4f07325000) 2025-05-07T20:10:49.9390861Z libtorch.so => not found 2025-05-07T20:10:49.9390951Z libc10.so => not found 2025-05-07T20:10:49.9391050Z libnvrtc.so.12 => not found 2025-05-07T20:10:49.9391150Z libc10_cuda.so => not found 2025-05-07T20:10:49.9391338Z libnccl.so.2 => not found 2025-05-07T20:10:49.9391434Z libcuda.so.1 => not found 2025-05-07T20:10:49.9391535Z libnvidia-ml.so.1 => not found 2025-05-07T20:10:49.9391655Z libtorch_cpu.so => not found 2025-05-07T20:10:49.9391754Z libtorch_cuda.so => not found 2025-05-07T20:10:49.9391852Z libcudart.so.12 => not found 2025-05-07T20:10:49.9391942Z libc10.so => not found 2025-05-07T20:10:49.9392054Z libnvrtc.so.12 => not found 2025-05-07T20:10:49.9392149Z libc10_cuda.so => not found 2025-05-07T20:10:49.9392246Z libnccl.so.2 => not found 2025-05-07T20:10:49.9392393Z libcuda.so.1 => not found 2025-05-07T20:10:49.9392839Z fbgemm_gpu_config.so => /__w/FBGEMM/FBGEMM/fbgemm_gpu/./_skbuild/linux-x86_64-3.12/cmake-build/fbgemm_gpu_config.so (0x00007f4f42dbb000) 2025-05-07T20:10:49.9392942Z libnvidia-ml.so.1 => not found 2025-05-07T20:10:49.9393081Z libtorch.so => not found 2025-05-07T20:10:49.9393178Z libtorch_cpu.so => not found 2025-05-07T20:10:49.9393277Z libtorch_cuda.so => not found 2025-05-07T20:10:49.9393377Z libcudart.so.12 => not found 2025-05-07T20:10:49.9393476Z libc10.so => not found 2025-05-07T20:10:49.9393571Z libnvrtc.so.12 => not found 2025-05-07T20:10:49.9393667Z libc10_cuda.so => not found 2025-05-07T20:10:49.9393778Z libnccl.so.2 => not found 2025-05-07T20:10:49.9393874Z libcuda.so.1 => not found 2025-05-07T20:10:49.9393976Z libnvidia-ml.so.1 => not found 2025-05-07T20:10:49.9394072Z libtorch.so => not found 2025-05-07T20:10:49.9394181Z libtorch_cpu.so => not found 2025-05-07T20:10:49.9394302Z libtorch_cuda.so => not found 2025-05-07T20:10:49.9394399Z libcudart.so.12 => not found 2025-05-07T20:10:49.9394507Z libc10.so => not found 2025-05-07T20:10:49.9394601Z libnvrtc.so.12 => not found 2025-05-07T20:10:49.9394697Z libc10_cuda.so => not found 2025-05-07T20:10:49.9394792Z libnccl.so.2 => not found 2025-05-07T20:10:49.9394900Z libcuda.so.1 => not found 2025-05-07T20:10:49.9395266Z asmjit.so => /__w/FBGEMM/FBGEMM/fbgemm_gpu/./_skbuild/linux-x86_64-3.12/cmake-build/asmjit.so (0x00007f4f42d3c000) 2025-05-07T20:10:49.9395367Z libnvidia-ml.so.1 => not found 2025-05-07T20:10:49.9395472Z libtorch.so => not found 2025-05-07T20:10:49.9395571Z libtorch_cpu.so => not found 2025-05-07T20:10:49.9395672Z libtorch_cuda.so => not found 2025-05-07T20:10:49.9395762Z libtorch.so => not found 2025-05-07T20:10:49.9395863Z libc10.so => not found 2025-05-07T20:10:49.9395955Z libnvrtc.so.12 => not found 2025-05-07T20:10:49.9396049Z libc10_cuda.so => not found 2025-05-07T20:10:49.9396156Z libnccl.so.2 => not found 2025-05-07T20:10:49.9396246Z libcuda.so.1 => not found 2025-05-07T20:10:49.9396344Z libnvidia-ml.so.1 => not found 2025-05-07T20:10:49.9396441Z libtorch_cpu.so => not found 2025-05-07T20:10:49.9396544Z libtorch_cuda.so => not found 2025-05-07T20:10:49.9396635Z libcudart.so.12 => not found 2025-05-07T20:10:49.9396728Z libtorch.so => not found 2025-05-07T20:10:49.9396825Z libc10.so => not found 2025-05-07T20:10:49.9396941Z libnvrtc.so.12 => not found 2025-05-07T20:10:49.9397037Z libc10_cuda.so => not found 2025-05-07T20:10:49.9397132Z libnccl.so.2 => not found 2025-05-07T20:10:49.9397233Z libcuda.so.1 => not found 2025-05-07T20:10:49.9397331Z libnvidia-ml.so.1 => not found 2025-05-07T20:10:49.9397454Z libtorch_cpu.so => not found 2025-05-07T20:10:49.9397561Z libtorch_cuda.so => not found 2025-05-07T20:10:49.9397740Z libpthread.so.0 => /lib64/libpthread.so.0 (0x00007f4f42d2f000) 2025-05-07T20:10:49.9397833Z libtorch.so => not found 2025-05-07T20:10:49.9397926Z libc10.so => not found 2025-05-07T20:10:49.9398030Z libnvrtc.so.12 => not found 2025-05-07T20:10:49.9398122Z libc10_cuda.so => not found 2025-05-07T20:10:49.9398216Z libnccl.so.2 => not found 2025-05-07T20:10:49.9398321Z libcuda.so.1 => not found 2025-05-07T20:10:49.9398423Z libnvidia-ml.so.1 => not found 2025-05-07T20:10:49.9398519Z libtorch_cpu.so => not found 2025-05-07T20:10:49.9398621Z libtorch_cuda.so => not found 2025-05-07T20:10:49.9398767Z librt.so.1 => /lib64/librt.so.1 (0x00007f4f42d26000) 2025-05-07T20:10:49.9398773Z 2025-05-07T20:10:49.9398881Z [CHECK] Displaying ELF information: 2025-05-07T20:10:49.9399183Z + readelf -d ./_skbuild/linux-x86_64-3.12/cmake-build/fbgemm_gpu_tbe_training_backward_dense.so 2025-05-07T20:10:49.9399188Z 2025-05-07T20:10:49.9399227Z 2025-05-07T20:10:49.9399393Z Dynamic section at offset 0x3a27010 contains 41 entries: 2025-05-07T20:10:49.9399507Z Tag Type Name/Value 2025-05-07T20:10:49.9399719Z 0x0000000000000001 (NEEDED) Shared library: [libc10.so] 2025-05-07T20:10:49.9399927Z 0x0000000000000001 (NEEDED) Shared library: [libnvrtc.so.12] 2025-05-07T20:10:49.9400125Z 0x0000000000000001 (NEEDED) Shared library: [libc10_cuda.so] 2025-05-07T20:10:49.9400323Z 0x0000000000000001 (NEEDED) Shared library: [libnccl.so.2] 2025-05-07T20:10:49.9400551Z 0x0000000000000001 (NEEDED) Shared library: [libcuda.so.1] 2025-05-07T20:10:49.9400813Z 0x0000000000000001 (NEEDED) Shared library: [fbgemm_gpu_tbe_training_backward.so] 2025-05-07T20:10:49.9401027Z 0x0000000000000001 (NEEDED) Shared library: [libnvidia-ml.so.1] 2025-05-07T20:10:49.9401243Z 0x0000000000000001 (NEEDED) Shared library: [libtorch.so] 2025-05-07T20:10:49.9401443Z 0x0000000000000001 (NEEDED) Shared library: [libtorch_cpu.so] 2025-05-07T20:10:49.9401648Z 0x0000000000000001 (NEEDED) Shared library: [libtorch_cuda.so] 2025-05-07T20:10:49.9401884Z 0x0000000000000001 (NEEDED) Shared library: [libcudart.so.12] 2025-05-07T20:10:49.9402083Z 0x0000000000000001 (NEEDED) Shared library: [libstdc++.so.6] 2025-05-07T20:10:49.9402282Z 0x0000000000000001 (NEEDED) Shared library: [libgomp.so.1] 2025-05-07T20:10:49.9402487Z 0x0000000000000001 (NEEDED) Shared library: [libgcc_s.so.1] 2025-05-07T20:10:49.9402682Z 0x0000000000000001 (NEEDED) Shared library: [libc.so.6] 2025-05-07T20:10:49.9402898Z 0x0000000000000001 (NEEDED) Shared library: [ld-linux-x86-64.so.2] 2025-05-07T20:10:49.9403185Z 0x000000000000000e (SONAME) Library soname: [fbgemm_gpu_tbe_training_backward_dense.so] 2025-05-07T20:10:49.9403381Z 0x000000000000000f (RPATH) Library rpath: [$ORIGIN] 2025-05-07T20:10:49.9403497Z 0x000000000000000c (INIT) 0x80000 2025-05-07T20:10:49.9403614Z 0x000000000000000d (FINI) 0x261c5c 2025-05-07T20:10:49.9403748Z 0x0000000000000019 (INIT_ARRAY) 0x3a223b0 2025-05-07T20:10:49.9403878Z 0x000000000000001b (INIT_ARRAYSZ) 184 (bytes) 2025-05-07T20:10:49.9404001Z 0x000000000000001a (FINI_ARRAY) 0x3a22468 2025-05-07T20:10:49.9404141Z 0x000000000000001c (FINI_ARRAYSZ) 8 (bytes) 2025-05-07T20:10:49.9404254Z 0x000000006ffffef5 (GNU_HASH) 0x238 2025-05-07T20:10:49.9404369Z 0x0000000000000005 (STRTAB) 0xe368 2025-05-07T20:10:49.9404502Z 0x0000000000000006 (SYMTAB) 0x33a0 2025-05-07T20:10:49.9404649Z 0x000000000000000a (STRSZ) 374997 (bytes) 2025-05-07T20:10:49.9404768Z 0x000000000000000b (SYMENT) 24 (bytes) 2025-05-07T20:10:49.9404890Z 0x0000000000000003 (PLTGOT) 0x3a28fe8 2025-05-07T20:10:49.9405076Z 0x0000000000000002 (PLTRELSZ) 18456 (bytes) 2025-05-07T20:10:49.9405185Z 0x0000000000000014 (PLTREL) RELA 2025-05-07T20:10:49.9405296Z 0x0000000000000017 (JMPREL) 0x7b2d8 2025-05-07T20:10:49.9405416Z 0x0000000000000007 (RELA) 0x6ac28 2025-05-07T20:10:49.9405546Z 0x0000000000000008 (RELASZ) 67248 (bytes) 2025-05-07T20:10:49.9405667Z 0x0000000000000009 (RELAENT) 24 (bytes) 2025-05-07T20:10:49.9405788Z 0x000000006ffffffe (VERNEED) 0x6aae8 2025-05-07T20:10:49.9405907Z 0x000000006fffffff (VERNEEDNUM) 5 2025-05-07T20:10:49.9406067Z 0x000000006ffffff0 (VERSYM) 0x69c3e 2025-05-07T20:10:49.9406182Z 0x000000006ffffff9 (RELACOUNT) 1392 2025-05-07T20:10:49.9406303Z 0x0000000000000000 (NULL) 0x0 2025-05-07T20:10:49.9406308Z 2025-05-07T20:10:49.9406424Z ################################################################################ 2025-05-07T20:10:49.9406429Z 2025-05-07T20:10:49.9406433Z 2025-05-07T20:10:49.9406546Z ################################################################################ 2025-05-07T20:10:49.9406895Z [CHECK] BUILT LIBRARY: ./_skbuild/linux-x86_64-3.12/cmake-build/fbgemm_gpu_tbe_training_backward_vbe.so 2025-05-07T20:10:49.9407000Z [CHECK] Listing out library size: 2025-05-07T20:10:49.9407314Z + du -h --block-size=1M ./_skbuild/linux-x86_64-3.12/cmake-build/fbgemm_gpu_tbe_training_backward_vbe.so 2025-05-07T20:10:49.9407318Z 2025-05-07T20:10:49.9407605Z 328 ./_skbuild/linux-x86_64-3.12/cmake-build/fbgemm_gpu_tbe_training_backward_vbe.so 2025-05-07T20:10:49.9407635Z 2025-05-07T20:10:49.9408079Z [CHECK] Listing out the GLIBC versions referenced by: ./_skbuild/linux-x86_64-3.12/cmake-build/fbgemm_gpu_tbe_training_backward_vbe.so 2025-05-07T20:10:49.9408616Z + objdump -TC ./_skbuild/linux-x86_64-3.12/cmake-build/fbgemm_gpu_tbe_training_backward_vbe.so | grep GLIBC_ | sed 's/.*GLIBC_\([.0-9]*\).*/GLIBC_\1/g' | sort -Vu | cat 2025-05-07T20:10:49.9408621Z 2025-05-07T20:10:50.0034783Z GLIBC_2.2.5 2025-05-07T20:10:50.0034887Z GLIBC_2.3 2025-05-07T20:10:50.0034972Z GLIBC_2.14 2025-05-07T20:10:50.0036201Z 2025-05-07T20:10:50.0036251Z 2025-05-07T20:10:50.0036773Z [CHECK] Listing out the GLIBCXX versions referenced by: ./_skbuild/linux-x86_64-3.12/cmake-build/fbgemm_gpu_tbe_training_backward_vbe.so 2025-05-07T20:10:50.0037543Z + objdump -TC ./_skbuild/linux-x86_64-3.12/cmake-build/fbgemm_gpu_tbe_training_backward_vbe.so | grep GLIBCXX_ | sed 's/.*GLIBCXX_\([.0-9]*\).*/GLIBCXX_\1/g' | sort -Vu | cat 2025-05-07T20:10:50.0037549Z 2025-05-07T20:10:50.0673763Z GLIBCXX_3.4 2025-05-07T20:10:50.0673889Z GLIBCXX_3.4.9 2025-05-07T20:10:50.0673976Z GLIBCXX_3.4.11 2025-05-07T20:10:50.0674074Z GLIBCXX_3.4.18 2025-05-07T20:10:50.0674159Z GLIBCXX_3.4.20 2025-05-07T20:10:50.0674242Z GLIBCXX_3.4.21 2025-05-07T20:10:50.0675117Z 2025-05-07T20:10:50.0675251Z 2025-05-07T20:10:50.0696515Z + nm -gDC ./_skbuild/linux-x86_64-3.12/cmake-build/fbgemm_gpu_tbe_training_backward_vbe.so > /tmp/tmp.97CNah0KFq.symbols.txt 2025-05-07T20:10:50.0696534Z 2025-05-07T20:10:50.1324976Z 2025-05-07T20:10:50.1378207Z [CHECK] Total Number of symbols: 3739 2025-05-07T20:10:50.1434460Z [CHECK] Number of fbgemm symbols: 551 2025-05-07T20:10:50.1453510Z + nm -gDCu ./_skbuild/linux-x86_64-3.12/cmake-build/fbgemm_gpu_tbe_training_backward_vbe.so > /tmp/tmp.EtcLWVQTXZ.usymbols.txt 2025-05-07T20:10:50.1454096Z 2025-05-07T20:10:50.1486990Z 2025-05-07T20:10:50.1514635Z [CHECK] Listing out undefined symbols (178 total): 2025-05-07T20:10:50.1531395Z U VTT for std::__cxx11::basic_ostringstream, std::allocator >@GLIBCXX_3.4.21 2025-05-07T20:10:50.1534149Z U VTT for std::__cxx11::basic_stringstream, std::allocator >@GLIBCXX_3.4.21 2025-05-07T20:10:50.1535759Z U _Unwind_Resume@GCC_3.0 2025-05-07T20:10:50.1536675Z U __cudaPopCallConfiguration@libcudart.so.12 2025-05-07T20:10:50.1537139Z U __cudaPushCallConfiguration@libcudart.so.12 2025-05-07T20:10:50.1537526Z U __cudaRegisterFatBinary@libcudart.so.12 2025-05-07T20:10:50.1537914Z U __cudaRegisterFatBinaryEnd@libcudart.so.12 2025-05-07T20:10:50.1538292Z U __cudaRegisterFunction@libcudart.so.12 2025-05-07T20:10:50.1538658Z U __cudaRegisterVar@libcudart.so.12 2025-05-07T20:10:50.1539017Z U __cudaUnregisterFatBinary@libcudart.so.12 2025-05-07T20:10:50.1539375Z U __cxa_atexit@GLIBC_2.2.5 2025-05-07T20:10:50.1539703Z U __cxa_begin_catch@CXXABI_1.3 2025-05-07T20:10:50.1540028Z U __cxa_end_catch@CXXABI_1.3 2025-05-07T20:10:50.1540350Z U __cxa_rethrow@CXXABI_1.3 2025-05-07T20:10:50.1540666Z U __cxa_thread_atexit@CXXABI_1.3.7 2025-05-07T20:10:50.1541010Z U __gxx_personality_v0@CXXABI_1.3 2025-05-07T20:10:50.1541327Z U __tls_get_addr@GLIBC_2.3 2025-05-07T20:10:50.1541659Z U adjust_info_B_num_bits(int, int) 2025-05-07T20:10:50.1542012Z U at::CUDAGeneratorImpl::device_type() 2025-05-07T20:10:50.1542429Z U at::CUDAGeneratorImpl::philox_cuda_state(unsigned long) 2025-05-07T20:10:50.1543017Z U at::Context::deterministicAlgorithms() const 2025-05-07T20:10:50.1543452Z U at::TensorBase::__dispatch_contiguous(c10::MemoryFormat) const 2025-05-07T20:10:50.1543929Z U at::_ops::copy_::call(at::Tensor&, at::Tensor const&, bool) 2025-05-07T20:10:50.1544963Z U at::_ops::empty_like::call(at::Tensor const&, std::optional, std::optional, std::optional, std::optional, std::optional) 2025-05-07T20:10:50.1546278Z U at::_ops::empty_memory_format::call(c10::ArrayRef, std::optional, std::optional, std::optional, std::optional, std::optional) 2025-05-07T20:10:50.1547390Z U at::_ops::reshape::call(at::Tensor const&, c10::ArrayRef) 2025-05-07T20:10:50.1548024Z U at::_ops::to_dtype::call(at::Tensor const&, c10::ScalarType, bool, bool, std::optional) 2025-05-07T20:10:50.1548975Z U at::_ops::zeros::call(c10::ArrayRef, std::optional, std::optional, std::optional, std::optional) 2025-05-07T20:10:50.1550163Z U at::_ops::zeros_like::call(at::Tensor const&, std::optional, std::optional, std::optional, std::optional, std::optional) 2025-05-07T20:10:50.1551041Z U at::cuda::detail::getDefaultCUDAGenerator(signed char) 2025-05-07T20:10:50.1551530Z U at::cuda::getCurrentDeviceProperties() 2025-05-07T20:10:50.1551894Z U at::globalContext() 2025-05-07T20:10:50.1552346Z U c10::BFloat16* at::TensorBase::mutable_data_ptr() const 2025-05-07T20:10:50.1552789Z U c10::BoolType::get() 2025-05-07T20:10:50.1553155Z U c10::DeviceTypeName[abi:cxx11](c10::DeviceType, bool) 2025-05-07T20:10:50.1553546Z U c10::FloatType::get() 2025-05-07T20:10:50.1553878Z U c10::GeneratorImpl::device() const 2025-05-07T20:10:50.1554277Z U c10::Half* at::TensorBase::mutable_data_ptr() const 2025-05-07T20:10:50.1554726Z U c10::IValue::reportToTensorTypeError() const 2025-05-07T20:10:50.1555118Z U c10::IntType::get() 2025-05-07T20:10:50.1555498Z U c10::MessageLogger::MessageLogger(char const*, int, int) 2025-05-07T20:10:50.1555914Z U c10::MessageLogger::~MessageLogger() 2025-05-07T20:10:50.1556303Z U c10::StorageImpl::throw_data_ptr_access_error() const 2025-05-07T20:10:50.1556780Z U c10::SymBool::expect_true(char const*, long) const 2025-05-07T20:10:50.1557179Z U c10::SymBool::guard_bool(char const*, long) const 2025-05-07T20:10:50.1557620Z U c10::SymBool::guard_size_oblivious(char const*, long) const 2025-05-07T20:10:50.1558057Z U c10::SymFloat::guard_float(char const*, long) const 2025-05-07T20:10:50.1572313Z U c10::SymInt::SymInt(c10::intrusive_ptr >) 2025-05-07T20:10:50.1573086Z U c10::SymInt::guard_int(char const*, long) const 2025-05-07T20:10:50.1573505Z U c10::SymInt::operator/(c10::SymInt const&) const 2025-05-07T20:10:50.1573875Z U c10::SymInt::promote_to_negative() 2025-05-07T20:10:50.1574245Z U c10::SymInt::sym_eq(c10::SymInt const&) const 2025-05-07T20:10:50.1574617Z U c10::SymInt::sym_gt(c10::SymInt const&) const 2025-05-07T20:10:50.1574986Z U c10::SymInt::sym_le(c10::SymInt const&) const 2025-05-07T20:10:50.1575347Z U c10::SymInt::toSymNode() const 2025-05-07T20:10:50.1575667Z U c10::SymIntType::get() 2025-05-07T20:10:50.1576043Z U c10::SymbolicShapeMeta::init_is_contiguous() const 2025-05-07T20:10:50.1576468Z U c10::TensorImpl::throw_data_ptr_access_error() const 2025-05-07T20:10:50.1576832Z U c10::TensorType::get() 2025-05-07T20:10:50.1577168Z U c10::UndefinedTensorImpl::_singleton 2025-05-07T20:10:50.1578304Z U c10::Warning::Warning(std::variant, c10::SourceLocation const&, std::__cxx11::basic_string, std::allocator >, bool) 2025-05-07T20:10:50.1579296Z U c10::cuda::CUDACachingAllocator::allocator 2025-05-07T20:10:50.1579665Z U c10::cuda::CUDAStream::stream() const 2025-05-07T20:10:50.1580010Z U c10::cuda::ExchangeDevice(signed char) 2025-05-07T20:10:50.1580360Z U c10::cuda::GetDevice(signed char*) 2025-05-07T20:10:50.1580688Z U c10::cuda::MaybeSetDevice(signed char) 2025-05-07T20:10:50.1581073Z U c10::cuda::SetDevice(signed char) 2025-05-07T20:10:50.1581531Z U c10::cuda::c10_cuda_check_implementation(int, char const*, char const*, int, bool) 2025-05-07T20:10:50.1581994Z U c10::cuda::device_count() 2025-05-07T20:10:50.1582347Z U c10::cuda::getCurrentCUDAStream(signed char) 2025-05-07T20:10:50.1582736Z U c10::cuda::getDefaultCUDAStream(signed char) 2025-05-07T20:10:50.1583133Z U c10::cuda::getStreamFromPool(bool, signed char) 2025-05-07T20:10:50.1583531Z U c10::cuda::getStreamFromPool(int, signed char) 2025-05-07T20:10:50.1583952Z U c10::cuda::setCurrentCUDAStream(c10::cuda::CUDAStream) 2025-05-07T20:10:50.1584353Z U c10::cuda::warn_or_error_on_sync() 2025-05-07T20:10:50.1585197Z U c10::detail::infer_schema::make_function_schema(c10::ArrayRef, c10::ArrayRef) 2025-05-07T20:10:50.1586083Z U c10::detail::torchCheckFail(char const*, char const*, unsigned int, char const*) 2025-05-07T20:10:50.1586945Z U c10::detail::torchCheckFail(char const*, char const*, unsigned int, std::__cxx11::basic_string, std::allocator > const&) 2025-05-07T20:10:50.1587919Z U c10::detail::torchInternalAssertFail(char const*, char const*, unsigned int, char const*, char const*) 2025-05-07T20:10:50.1589158Z U c10::detail::torchInternalAssertFail(char const*, char const*, unsigned int, char const*, std::__cxx11::basic_string, std::allocator > const&) 2025-05-07T20:10:50.1590076Z U c10::impl::GPUTrace::gpuTraceState 2025-05-07T20:10:50.1590412Z U c10::impl::GPUTrace::haveState 2025-05-07T20:10:50.1590794Z U c10::impl::cow::is_cow_data_ptr(c10::DataPtr const&) 2025-05-07T20:10:50.1591321Z U c10::impl::cow::materialize_cow_storage(c10::StorageImpl&) 2025-05-07T20:10:50.1591739Z U c10::impl::device_guard_impl_registry 2025-05-07T20:10:50.1592146Z U c10::operator+(c10::SymInt const&, int) 2025-05-07T20:10:50.1592641Z U c10::operator-(c10::SymInt const&, int) 2025-05-07T20:10:50.1593106Z U c10::operator<<(std::ostream&, c10::Device const&) 2025-05-07T20:10:50.1593513Z U c10::operator<<(std::ostream&, c10::DeviceType) 2025-05-07T20:10:50.1593918Z U c10::operator<<(std::ostream&, c10::SymInt const&) 2025-05-07T20:10:50.1594288Z U c10::throwNullDataPtrError() 2025-05-07T20:10:50.1594626Z U c10::warn(c10::Warning const&) 2025-05-07T20:10:50.1594954Z U c10::warnDeprecatedDataPtr() 2025-05-07T20:10:50.1595431Z U caffe2::TypeMeta::error_unsupported_typemeta(caffe2::TypeMeta) 2025-05-07T20:10:50.1595880Z U caffe2::TypeMeta::typeMetaDatas() 2025-05-07T20:10:50.1596296Z U cudaDeviceGetAttribute@libcudart.so.12 2025-05-07T20:10:50.1596679Z U cudaDeviceSynchronize@libcudart.so.12 2025-05-07T20:10:50.1597054Z U cudaEventCreateWithFlags@libcudart.so.12 2025-05-07T20:10:50.1597483Z U cudaEventDestroy@libcudart.so.12 2025-05-07T20:10:50.1597834Z U cudaEventElapsedTime@libcudart.so.12 2025-05-07T20:10:50.1598197Z U cudaEventQuery@libcudart.so.12 2025-05-07T20:10:50.1598547Z U cudaEventRecord@libcudart.so.12 2025-05-07T20:10:50.1598896Z U cudaEventSynchronize@libcudart.so.12 2025-05-07T20:10:50.1599269Z U cudaFuncSetAttribute@libcudart.so.12 2025-05-07T20:10:50.1599641Z U cudaGetDeviceProperties_v2@libcudart.so.12 2025-05-07T20:10:50.1600027Z U cudaGetErrorString@libcudart.so.12 2025-05-07T20:10:50.1600406Z U cudaGetLastError@libcudart.so.12 2025-05-07T20:10:50.1600763Z U cudaLaunchKernel@libcudart.so.12 2025-05-07T20:10:50.1601117Z U cudaStreamQuery@libcudart.so.12 2025-05-07T20:10:50.1601471Z U cudaStreamSynchronize@libcudart.so.12 2025-05-07T20:10:50.1601844Z U cudaStreamWaitEvent@libcudart.so.12 2025-05-07T20:10:50.1604337Z U embedding_ops::split_embedding_backward_codegen_find_long_segments(at::GenericPackedTensorAccessor, at::GenericPackedTensorAccessor, at::GenericPackedTensorAccessor, at::GenericPackedTensorAccessor, at::GenericPackedTensorAccessor, at::GenericPackedTensorAccessor, at::GenericPackedTensorAccessor, int, int, bool) 2025-05-07T20:10:50.1606887Z U fbgemm_gpu::asynchronous_complete_cumsum_gpu(at::Tensor const&) 2025-05-07T20:10:50.1607310Z U float at::Tensor::item() const 2025-05-07T20:10:50.1607674Z U float* at::TensorBase::data_ptr() const 2025-05-07T20:10:50.1608075Z U float* at::TensorBase::mutable_data_ptr() const 2025-05-07T20:10:50.1608520Z U int* at::TensorBase::data_ptr() const 2025-05-07T20:10:50.1608908Z U int* at::TensorBase::mutable_data_ptr() const 2025-05-07T20:10:50.1609336Z U long c10::detail::maybe_wrap_dim_slow(long, long, bool) 2025-05-07T20:10:50.1609766Z U long* at::TensorBase::data_ptr() const 2025-05-07T20:10:50.1610192Z U long* at::TensorBase::mutable_data_ptr() const 2025-05-07T20:10:50.1610563Z U memcpy@GLIBC_2.14 2025-05-07T20:10:50.1610844Z U memmove@GLIBC_2.2.5 2025-05-07T20:10:50.1611148Z U memset@GLIBC_2.2.5 2025-05-07T20:10:50.1611464Z U operator delete(void*)@GLIBCXX_3.4 2025-05-07T20:10:50.1611811Z U operator new(unsigned long)@GLIBCXX_3.4 2025-05-07T20:10:50.1612381Z U radix_sort_pairs(void*, unsigned long&, int const*, int*, float const*, float*, int, int, int, CUstream_st*) 2025-05-07T20:10:50.1613137Z U radix_sort_pairs(void*, unsigned long&, int const*, int*, int const*, int*, int, int, int, CUstream_st*) 2025-05-07T20:10:50.1613881Z U radix_sort_pairs(void*, unsigned long&, int const*, int*, long const*, long*, int, int, int, CUstream_st*) 2025-05-07T20:10:50.1614650Z U radix_sort_pairs(void*, unsigned long&, long const*, long*, float const*, float*, int, int, int, CUstream_st*) 2025-05-07T20:10:50.1615412Z U radix_sort_pairs(void*, unsigned long&, long const*, long*, int const*, int*, int, int, int, CUstream_st*) 2025-05-07T20:10:50.1616177Z U radix_sort_pairs(void*, unsigned long&, long const*, long*, long const*, long*, int, int, int, CUstream_st*) 2025-05-07T20:10:50.1617061Z U std::__cxx11::basic_ostringstream, std::allocator >::basic_ostringstream() 2025-05-07T20:10:50.1617899Z U std::__cxx11::basic_ostringstream, std::allocator >::~basic_ostringstream()@GLIBCXX_3.4.21 2025-05-07T20:10:50.1618827Z U std::__cxx11::basic_stringstream, std::allocator >::basic_stringstream() 2025-05-07T20:10:50.1619584Z U std::__detail::_Prime_rehash_policy::_M_need_rehash(unsigned long, unsigned long, unsigned long) const@GLIBCXX_3.4.18 2025-05-07T20:10:50.1620155Z U std::__throw_bad_alloc()@GLIBCXX_3.4 2025-05-07T20:10:50.1620479Z U std::__throw_bad_array_new_length() 2025-05-07T20:10:50.1620816Z U std::__throw_length_error(char const*)@GLIBCXX_3.4 2025-05-07T20:10:50.1621224Z U std::__throw_logic_error(char const*)@GLIBCXX_3.4 2025-05-07T20:10:50.1621609Z U std::__throw_out_of_range_fmt(char const*, ...)@GLIBCXX_3.4.20 2025-05-07T20:10:50.1622010Z U std::__throw_system_error(int)@GLIBCXX_3.4.11 2025-05-07T20:10:50.1622465Z U std::basic_ios >::clear(std::_Ios_Iostate)@GLIBCXX_3.4 2025-05-07T20:10:50.1623349Z U std::basic_ostream >& std::__ostream_insert >(std::basic_ostream >&, char const*, long)@GLIBCXX_3.4.9 2025-05-07T20:10:50.1624120Z U std::ios_base::Init::Init()@GLIBCXX_3.4 2025-05-07T20:10:50.1624449Z U std::ios_base::Init::~Init()@GLIBCXX_3.4 2025-05-07T20:10:50.1624790Z U std::ios_base::~ios_base()@GLIBCXX_3.4 2025-05-07T20:10:50.1625118Z U std::locale::~locale()@GLIBCXX_3.4 2025-05-07T20:10:50.1625489Z U std::ostream& std::ostream::_M_insert(long)@GLIBCXX_3.4.9 2025-05-07T20:10:50.1626172Z U std::ostream& std::ostream::_M_insert(unsigned long)@GLIBCXX_3.4.9 2025-05-07T20:10:50.1626644Z U std::ostream::operator<<(int)@GLIBCXX_3.4 2025-05-07T20:10:50.1627182Z U std::terminate()@GLIBCXX_3.4 2025-05-07T20:10:50.1627543Z U strlen@GLIBC_2.2.5 2025-05-07T20:10:50.1627870Z U torch::CppFunction::~CppFunction() 2025-05-07T20:10:50.1628737Z U torch::Library::Library(torch::Library::Kind, std::__cxx11::basic_string, std::allocator >, std::optional, char const*, unsigned int) 2025-05-07T20:10:50.1629902Z U torch::Library::_def(c10::FunctionSchema&&, c10::OperatorName*, std::vector > const&, torch::_RegisterOrVerify) & 2025-05-07T20:10:50.1630755Z U torch::Library::_impl(char const*, torch::CppFunction&&, torch::_RegisterOrVerify) & 2025-05-07T20:10:50.1631581Z U torch::jit::parseSchema(std::__cxx11::basic_string, std::allocator > const&, bool) 2025-05-07T20:10:50.1632620Z U transpose_embedding_input(at::Tensor, long, at::Tensor, at::Tensor, bool, std::optional const&, long, long, long, bool, std::optional const&, long, long) 2025-05-07T20:10:50.1635439Z U void embedding_ops::grad_mean_vbe_kernel(at::GenericPackedTensorAccessor, at::GenericPackedTensorAccessor, at::GenericPackedTensorAccessor, at::GenericPackedTensorAccessor, at::GenericPackedTensorAccessor, at::GenericPackedTensorAccessor, int, unsigned int) 2025-05-07T20:10:50.1639651Z U void embedding_ops::grad_mean_vbe_kernel(at::GenericPackedTensorAccessor, at::GenericPackedTensorAccessor, at::GenericPackedTensorAccessor, at::GenericPackedTensorAccessor, at::GenericPackedTensorAccessor, at::GenericPackedTensorAccessor, int, unsigned int) 2025-05-07T20:10:50.1643866Z U void embedding_ops::grad_mean_vbe_kernel(at::GenericPackedTensorAccessor, at::GenericPackedTensorAccessor, at::GenericPackedTensorAccessor, at::GenericPackedTensorAccessor, at::GenericPackedTensorAccessor, at::GenericPackedTensorAccessor, int, unsigned int) 2025-05-07T20:10:50.1647860Z U void embedding_ops::grad_mean_vbe_kernel(at::GenericPackedTensorAccessor, at::GenericPackedTensorAccessor, at::GenericPackedTensorAccessor, at::GenericPackedTensorAccessor, at::GenericPackedTensorAccessor, at::GenericPackedTensorAccessor, int, unsigned int) 2025-05-07T20:10:50.1652097Z U void embedding_ops::grad_mean_vbe_kernel(at::GenericPackedTensorAccessor, at::GenericPackedTensorAccessor, at::GenericPackedTensorAccessor, at::GenericPackedTensorAccessor, at::GenericPackedTensorAccessor, at::GenericPackedTensorAccessor, int, unsigned int) 2025-05-07T20:10:50.1656114Z U void embedding_ops::grad_mean_vbe_kernel(at::GenericPackedTensorAccessor, at::GenericPackedTensorAccessor, at::GenericPackedTensorAccessor, at::GenericPackedTensorAccessor, at::GenericPackedTensorAccessor, at::GenericPackedTensorAccessor, int, unsigned int) 2025-05-07T20:10:50.1659878Z U void embedding_ops::split_embedding_backward_count_unique_indices_kernel(at::GenericPackedTensorAccessor, at::GenericPackedTensorAccessor, at::GenericPackedTensorAccessor, at::GenericPackedTensorAccessor, at::GenericPackedTensorAccessor, int) 2025-05-07T20:10:50.1661896Z U vtable for __cxxabiv1::__class_type_info@CXXABI_1.3 2025-05-07T20:10:50.1662331Z U vtable for __cxxabiv1::__function_type_info@CXXABI_1.3 2025-05-07T20:10:50.1662760Z U vtable for __cxxabiv1::__si_class_type_info@CXXABI_1.3 2025-05-07T20:10:50.1663378Z U vtable for std::__cxx11::basic_stringbuf, std::allocator >@GLIBCXX_3.4.21 2025-05-07T20:10:50.1664060Z U vtable for std::basic_streambuf >@GLIBCXX_3.4 2025-05-07T20:10:50.1664512Z w _ITM_deregisterTMCloneTable 2025-05-07T20:10:50.1665029Z w _ITM_registerTMCloneTable 2025-05-07T20:10:50.1665347Z w __cxa_finalize@GLIBC_2.2.5 2025-05-07T20:10:50.1665779Z w __gmon_start__ 2025-05-07T20:10:50.1666058Z w __pthread_key_create 2025-05-07T20:10:50.1666371Z w pthread_mutex_lock@GLIBC_2.2.5 2025-05-07T20:10:50.1666759Z w pthread_mutex_unlock@GLIBC_2.2.5 2025-05-07T20:10:50.1667141Z [CHECK] Listing out external shared libraries linked: 2025-05-07T20:10:50.1667647Z + ldd ./_skbuild/linux-x86_64-3.12/cmake-build/fbgemm_gpu_tbe_training_backward_vbe.so 2025-05-07T20:10:50.1668030Z 2025-05-07T20:10:50.1668187Z linux-vdso.so.1 (0x00007ffcd297c000) 2025-05-07T20:10:50.1668484Z libc10.so => not found 2025-05-07T20:10:50.1668741Z libnvrtc.so.12 => not found 2025-05-07T20:10:50.1669001Z libc10_cuda.so => not found 2025-05-07T20:10:50.1669275Z libnccl.so.2 => not found 2025-05-07T20:10:50.1669528Z libcuda.so.1 => not found 2025-05-07T20:10:50.1670277Z fbgemm_gpu_tbe_training_backward.so => /__w/FBGEMM/FBGEMM/fbgemm_gpu/./_skbuild/linux-x86_64-3.12/cmake-build/fbgemm_gpu_tbe_training_backward.so (0x00007fc72b000000) 2025-05-07T20:10:50.1671096Z libnvidia-ml.so.1 => not found 2025-05-07T20:10:50.1671474Z libtorch.so => not found 2025-05-07T20:10:50.1671731Z libtorch_cpu.so => not found 2025-05-07T20:10:50.1672019Z libtorch_cuda.so => not found 2025-05-07T20:10:50.1672300Z libcudart.so.12 => not found 2025-05-07T20:10:50.1672632Z libstdc++.so.6 => /lib64/libstdc++.so.6 (0x00007fc72ad9c000) 2025-05-07T20:10:50.1673060Z libgomp.so.1 => /lib64/libgomp.so.1 (0x00007fc764daa000) 2025-05-07T20:10:50.1673463Z libgcc_s.so.1 => /lib64/libgcc_s.so.1 (0x00007fc764d7c000) 2025-05-07T20:10:50.1673855Z libc.so.6 => /lib64/libc.so.6 (0x00007fc72ab94000) 2025-05-07T20:10:50.1674208Z /lib64/ld-linux-x86-64.so.2 (0x00007fc779d38000) 2025-05-07T20:10:50.1674536Z libc10.so => not found 2025-05-07T20:10:50.1674775Z libnvrtc.so.12 => not found 2025-05-07T20:10:50.1675042Z libc10_cuda.so => not found 2025-05-07T20:10:50.1675308Z libnccl.so.2 => not found 2025-05-07T20:10:50.1675558Z libcuda.so.1 => not found 2025-05-07T20:10:50.1676197Z fbgemm_gpu_tbe_cache.so => /__w/FBGEMM/FBGEMM/fbgemm_gpu/./_skbuild/linux-x86_64-3.12/cmake-build/fbgemm_gpu_tbe_cache.so (0x00007fc729400000) 2025-05-07T20:10:50.1677291Z fbgemm_gpu_tbe_common.so => /__w/FBGEMM/FBGEMM/fbgemm_gpu/./_skbuild/linux-x86_64-3.12/cmake-build/fbgemm_gpu_tbe_common.so (0x00007fc729000000) 2025-05-07T20:10:50.1678432Z fbgemm_gpu_sparse_async_cumsum.so => /__w/FBGEMM/FBGEMM/fbgemm_gpu/./_skbuild/linux-x86_64-3.12/cmake-build/fbgemm_gpu_sparse_async_cumsum.so (0x00007fc728e59000) 2025-05-07T20:10:50.1679191Z libnvidia-ml.so.1 => not found 2025-05-07T20:10:50.1679515Z libtorch.so => not found 2025-05-07T20:10:50.1680031Z fbgemm.so => /__w/FBGEMM/FBGEMM/fbgemm_gpu/./_skbuild/linux-x86_64-3.12/cmake-build/fbgemm.so (0x00007fc728800000) 2025-05-07T20:10:50.1680958Z fbgemm_gpu_tbe_utils.so => /__w/FBGEMM/FBGEMM/fbgemm_gpu/./_skbuild/linux-x86_64-3.12/cmake-build/fbgemm_gpu_tbe_utils.so (0x00007fc727600000) 2025-05-07T20:10:50.1681634Z libtorch_cpu.so => not found 2025-05-07T20:10:50.1681905Z libtorch_cuda.so => not found 2025-05-07T20:10:50.1682175Z libcudart.so.12 => not found 2025-05-07T20:10:50.1682465Z libm.so.6 => /lib64/libm.so.6 (0x00007fc729325000) 2025-05-07T20:10:50.1682801Z libtorch.so => not found 2025-05-07T20:10:50.1683056Z libc10.so => not found 2025-05-07T20:10:50.1683298Z libnvrtc.so.12 => not found 2025-05-07T20:10:50.1683673Z libc10_cuda.so => not found 2025-05-07T20:10:50.1683909Z libnccl.so.2 => not found 2025-05-07T20:10:50.1684151Z libcuda.so.1 => not found 2025-05-07T20:10:50.1684389Z libnvidia-ml.so.1 => not found 2025-05-07T20:10:50.1684815Z libtorch_cpu.so => not found 2025-05-07T20:10:50.1685121Z libtorch_cuda.so => not found 2025-05-07T20:10:50.1685390Z libcudart.so.12 => not found 2025-05-07T20:10:50.1685637Z libc10.so => not found 2025-05-07T20:10:50.1685887Z libnvrtc.so.12 => not found 2025-05-07T20:10:50.1686151Z libc10_cuda.so => not found 2025-05-07T20:10:50.1686396Z libnccl.so.2 => not found 2025-05-07T20:10:50.1686650Z libcuda.so.1 => not found 2025-05-07T20:10:50.1687234Z fbgemm_gpu_config.so => /__w/FBGEMM/FBGEMM/fbgemm_gpu/./_skbuild/linux-x86_64-3.12/cmake-build/fbgemm_gpu_config.so (0x00007fc779d19000) 2025-05-07T20:10:50.1687919Z libnvidia-ml.so.1 => not found 2025-05-07T20:10:50.1688180Z libtorch.so => not found 2025-05-07T20:10:50.1688436Z libtorch_cpu.so => not found 2025-05-07T20:10:50.1688697Z libtorch_cuda.so => not found 2025-05-07T20:10:50.1688959Z libcudart.so.12 => not found 2025-05-07T20:10:50.1689208Z libc10.so => not found 2025-05-07T20:10:50.1689456Z libnvrtc.so.12 => not found 2025-05-07T20:10:50.1689720Z libc10_cuda.so => not found 2025-05-07T20:10:50.1689966Z libnccl.so.2 => not found 2025-05-07T20:10:50.1690218Z libcuda.so.1 => not found 2025-05-07T20:10:50.1690467Z libnvidia-ml.so.1 => not found 2025-05-07T20:10:50.1690737Z libtorch.so => not found 2025-05-07T20:10:50.1691007Z libtorch_cpu.so => not found 2025-05-07T20:10:50.1691260Z libtorch_cuda.so => not found 2025-05-07T20:10:50.1691519Z libcudart.so.12 => not found 2025-05-07T20:10:50.1691763Z libc10.so => not found 2025-05-07T20:10:50.1692002Z libnvrtc.so.12 => not found 2025-05-07T20:10:50.1692257Z libc10_cuda.so => not found 2025-05-07T20:10:50.1692502Z libnccl.so.2 => not found 2025-05-07T20:10:50.1692750Z libcuda.so.1 => not found 2025-05-07T20:10:50.1693251Z asmjit.so => /__w/FBGEMM/FBGEMM/fbgemm_gpu/./_skbuild/linux-x86_64-3.12/cmake-build/asmjit.so (0x00007fc764d05000) 2025-05-07T20:10:50.1693808Z libnvidia-ml.so.1 => not found 2025-05-07T20:10:50.1694069Z libtorch.so => not found 2025-05-07T20:10:50.1694320Z libtorch_cpu.so => not found 2025-05-07T20:10:50.1694571Z libtorch_cuda.so => not found 2025-05-07T20:10:50.1694827Z libtorch.so => not found 2025-05-07T20:10:50.1695057Z libc10.so => not found 2025-05-07T20:10:50.1695293Z libnvrtc.so.12 => not found 2025-05-07T20:10:50.1695547Z libc10_cuda.so => not found 2025-05-07T20:10:50.1695791Z libnccl.so.2 => not found 2025-05-07T20:10:50.1696029Z libcuda.so.1 => not found 2025-05-07T20:10:50.1696378Z libnvidia-ml.so.1 => not found 2025-05-07T20:10:50.1696816Z libtorch_cpu.so => not found 2025-05-07T20:10:50.1697076Z libtorch_cuda.so => not found 2025-05-07T20:10:50.1697338Z libcudart.so.12 => not found 2025-05-07T20:10:50.1697613Z libtorch.so => not found 2025-05-07T20:10:50.1697864Z libc10.so => not found 2025-05-07T20:10:50.1698092Z libnvrtc.so.12 => not found 2025-05-07T20:10:50.1698337Z libc10_cuda.so => not found 2025-05-07T20:10:50.1698579Z libnccl.so.2 => not found 2025-05-07T20:10:50.1698812Z libcuda.so.1 => not found 2025-05-07T20:10:50.1699099Z libnvidia-ml.so.1 => not found 2025-05-07T20:10:50.1699355Z libtorch_cpu.so => not found 2025-05-07T20:10:50.1699612Z libtorch_cuda.so => not found 2025-05-07T20:10:50.1699940Z libpthread.so.0 => /lib64/libpthread.so.0 (0x00007fc72ab8f000) 2025-05-07T20:10:50.1700303Z libtorch.so => not found 2025-05-07T20:10:50.1700524Z libc10.so => not found 2025-05-07T20:10:50.1700756Z libnvrtc.so.12 => not found 2025-05-07T20:10:50.1700999Z libc10_cuda.so => not found 2025-05-07T20:10:50.1701247Z libnccl.so.2 => not found 2025-05-07T20:10:50.1701491Z libcuda.so.1 => not found 2025-05-07T20:10:50.1701733Z libnvidia-ml.so.1 => not found 2025-05-07T20:10:50.1701992Z libtorch_cpu.so => not found 2025-05-07T20:10:50.1702244Z libtorch_cuda.so => not found 2025-05-07T20:10:50.1702547Z librt.so.1 => /lib64/librt.so.1 (0x00007fc72ab86000) 2025-05-07T20:10:50.1702781Z 2025-05-07T20:10:50.1702886Z [CHECK] Displaying ELF information: 2025-05-07T20:10:50.1703365Z + readelf -d ./_skbuild/linux-x86_64-3.12/cmake-build/fbgemm_gpu_tbe_training_backward_vbe.so 2025-05-07T20:10:50.1703751Z 2025-05-07T20:10:50.1703755Z 2025-05-07T20:10:50.1703920Z Dynamic section at offset 0x147859a8 contains 41 entries: 2025-05-07T20:10:50.1704280Z Tag Type Name/Value 2025-05-07T20:10:50.1704674Z 0x0000000000000001 (NEEDED) Shared library: [libc10.so] 2025-05-07T20:10:50.1705163Z 0x0000000000000001 (NEEDED) Shared library: [libnvrtc.so.12] 2025-05-07T20:10:50.1705672Z 0x0000000000000001 (NEEDED) Shared library: [libc10_cuda.so] 2025-05-07T20:10:50.1706159Z 0x0000000000000001 (NEEDED) Shared library: [libnccl.so.2] 2025-05-07T20:10:50.1706688Z 0x0000000000000001 (NEEDED) Shared library: [libcuda.so.1] 2025-05-07T20:10:50.1707248Z 0x0000000000000001 (NEEDED) Shared library: [fbgemm_gpu_tbe_training_backward.so] 2025-05-07T20:10:50.1707818Z 0x0000000000000001 (NEEDED) Shared library: [libnvidia-ml.so.1] 2025-05-07T20:10:50.1708321Z 0x0000000000000001 (NEEDED) Shared library: [libtorch.so] 2025-05-07T20:10:50.1708807Z 0x0000000000000001 (NEEDED) Shared library: [libtorch_cpu.so] 2025-05-07T20:10:50.1709303Z 0x0000000000000001 (NEEDED) Shared library: [libtorch_cuda.so] 2025-05-07T20:10:50.1709833Z 0x0000000000000001 (NEEDED) Shared library: [libcudart.so.12] 2025-05-07T20:10:50.1710343Z 0x0000000000000001 (NEEDED) Shared library: [libstdc++.so.6] 2025-05-07T20:10:50.1710839Z 0x0000000000000001 (NEEDED) Shared library: [libgomp.so.1] 2025-05-07T20:10:50.1711394Z 0x0000000000000001 (NEEDED) Shared library: [libgcc_s.so.1] 2025-05-07T20:10:50.1712055Z 0x0000000000000001 (NEEDED) Shared library: [libc.so.6] 2025-05-07T20:10:50.1712556Z 0x0000000000000001 (NEEDED) Shared library: [ld-linux-x86-64.so.2] 2025-05-07T20:10:50.1713148Z 0x000000000000000e (SONAME) Library soname: [fbgemm_gpu_tbe_training_backward_vbe.so] 2025-05-07T20:10:50.1713724Z 0x000000000000000f (RPATH) Library rpath: [$ORIGIN] 2025-05-07T20:10:50.1714120Z 0x000000000000000c (INIT) 0x1dc000 2025-05-07T20:10:50.1714447Z 0x000000000000000d (FINI) 0xe754cc 2025-05-07T20:10:50.1714773Z 0x0000000000000019 (INIT_ARRAY) 0x1476a588 2025-05-07T20:10:50.1715133Z 0x000000000000001b (INIT_ARRAYSZ) 680 (bytes) 2025-05-07T20:10:50.1715485Z 0x000000000000001a (FINI_ARRAY) 0x1476a830 2025-05-07T20:10:50.1715826Z 0x000000000000001c (FINI_ARRAYSZ) 8 (bytes) 2025-05-07T20:10:50.1716156Z 0x000000006ffffef5 (GNU_HASH) 0x238 2025-05-07T20:10:50.1716512Z 0x0000000000000005 (STRTAB) 0x1c8a0 2025-05-07T20:10:50.1716831Z 0x0000000000000006 (SYMTAB) 0x6a00 2025-05-07T20:10:50.1717172Z 0x000000000000000a (STRSZ) 1486798 (bytes) 2025-05-07T20:10:50.1717532Z 0x000000000000000b (SYMENT) 24 (bytes) 2025-05-07T20:10:50.1717904Z 0x0000000000000003 (PLTGOT) 0x1478afe8 2025-05-07T20:10:50.1718265Z 0x0000000000000002 (PLTRELSZ) 22152 (bytes) 2025-05-07T20:10:50.1718598Z 0x0000000000000014 (PLTREL) RELA 2025-05-07T20:10:50.1718913Z 0x0000000000000017 (JMPREL) 0x1d5988 2025-05-07T20:10:50.1719238Z 0x0000000000000007 (RELA) 0x1896c8 2025-05-07T20:10:50.1719588Z 0x0000000000000008 (RELASZ) 312000 (bytes) 2025-05-07T20:10:50.1719945Z 0x0000000000000009 (RELAENT) 24 (bytes) 2025-05-07T20:10:50.1720278Z 0x000000006ffffffe (VERNEED) 0x1895a8 2025-05-07T20:10:50.1720611Z 0x000000006fffffff (VERNEEDNUM) 5 2025-05-07T20:10:50.1720925Z 0x000000006ffffff0 (VERSYM) 0x18786e 2025-05-07T20:10:50.1721254Z 0x000000006ffffff9 (RELACOUNT) 8035 2025-05-07T20:10:50.1721557Z 0x0000000000000000 (NULL) 0x0 2025-05-07T20:10:50.1721761Z 2025-05-07T20:10:50.1721870Z ################################################################################ 2025-05-07T20:10:50.1722092Z 2025-05-07T20:10:50.1722096Z 2025-05-07T20:10:50.1722212Z ################################################################################ 2025-05-07T20:10:50.1722749Z [CHECK] BUILT LIBRARY: ./_skbuild/linux-x86_64-3.12/cmake-build/fbgemm_gpu_tbe_training_backward_gwd.so 2025-05-07T20:10:50.1723278Z [CHECK] Listing out library size: 2025-05-07T20:10:50.1723859Z + du -h --block-size=1M ./_skbuild/linux-x86_64-3.12/cmake-build/fbgemm_gpu_tbe_training_backward_gwd.so 2025-05-07T20:10:50.1724251Z 2025-05-07T20:10:50.1724478Z 142 ./_skbuild/linux-x86_64-3.12/cmake-build/fbgemm_gpu_tbe_training_backward_gwd.so 2025-05-07T20:10:50.1724837Z 2025-05-07T20:10:50.1725420Z [CHECK] Listing out the GLIBC versions referenced by: ./_skbuild/linux-x86_64-3.12/cmake-build/fbgemm_gpu_tbe_training_backward_gwd.so 2025-05-07T20:10:50.1726475Z + objdump -TC ./_skbuild/linux-x86_64-3.12/cmake-build/fbgemm_gpu_tbe_training_backward_gwd.so | grep GLIBC_ | sed 's/.*GLIBC_\([.0-9]*\).*/GLIBC_\1/g' | sort -Vu | cat 2025-05-07T20:10:50.1727100Z 2025-05-07T20:10:50.1919219Z GLIBC_2.2.5 2025-05-07T20:10:50.1919972Z GLIBC_2.3 2025-05-07T20:10:50.1920631Z GLIBC_2.14 2025-05-07T20:10:50.1920994Z 2025-05-07T20:10:50.1921015Z 2025-05-07T20:10:50.1922817Z [CHECK] Listing out the GLIBCXX versions referenced by: ./_skbuild/linux-x86_64-3.12/cmake-build/fbgemm_gpu_tbe_training_backward_gwd.so 2025-05-07T20:10:50.1926191Z + objdump -TC ./_skbuild/linux-x86_64-3.12/cmake-build/fbgemm_gpu_tbe_training_backward_gwd.so | grep GLIBCXX_ | sed 's/.*GLIBCXX_\([.0-9]*\).*/GLIBCXX_\1/g' | sort -Vu | cat 2025-05-07T20:10:50.1927441Z 2025-05-07T20:10:50.2190213Z GLIBCXX_3.4 2025-05-07T20:10:50.2190496Z GLIBCXX_3.4.9 2025-05-07T20:10:50.2190894Z GLIBCXX_3.4.11 2025-05-07T20:10:50.2191099Z GLIBCXX_3.4.18 2025-05-07T20:10:50.2191373Z GLIBCXX_3.4.20 2025-05-07T20:10:50.2191570Z GLIBCXX_3.4.21 2025-05-07T20:10:50.2191700Z 2025-05-07T20:10:50.2191717Z 2025-05-07T20:10:50.2211722Z + nm -gDC ./_skbuild/linux-x86_64-3.12/cmake-build/fbgemm_gpu_tbe_training_backward_gwd.so > /tmp/tmp.bL3GtQYQof.symbols.txt 2025-05-07T20:10:50.2213336Z 2025-05-07T20:10:50.2447701Z 2025-05-07T20:10:50.2472908Z [CHECK] Total Number of symbols: 1629 2025-05-07T20:10:50.2494291Z [CHECK] Number of fbgemm symbols: 227 2025-05-07T20:10:50.2511589Z + nm -gDCu ./_skbuild/linux-x86_64-3.12/cmake-build/fbgemm_gpu_tbe_training_backward_gwd.so > /tmp/tmp.OsJ2rSVhNq.usymbols.txt 2025-05-07T20:10:50.2513250Z 2025-05-07T20:10:50.2533396Z 2025-05-07T20:10:50.2557739Z [CHECK] Listing out undefined symbols (171 total): 2025-05-07T20:10:50.2574848Z U VTT for std::__cxx11::basic_ostringstream, std::allocator >@GLIBCXX_3.4.21 2025-05-07T20:10:50.2577387Z U VTT for std::__cxx11::basic_stringstream, std::allocator >@GLIBCXX_3.4.21 2025-05-07T20:10:50.2578671Z U _Unwind_Resume@GCC_3.0 2025-05-07T20:10:50.2579105Z U __cudaPopCallConfiguration@libcudart.so.12 2025-05-07T20:10:50.2579615Z U __cudaPushCallConfiguration@libcudart.so.12 2025-05-07T20:10:50.2580012Z U __cudaRegisterFatBinary@libcudart.so.12 2025-05-07T20:10:50.2580401Z U __cudaRegisterFatBinaryEnd@libcudart.so.12 2025-05-07T20:10:50.2580776Z U __cudaRegisterFunction@libcudart.so.12 2025-05-07T20:10:50.2581139Z U __cudaRegisterVar@libcudart.so.12 2025-05-07T20:10:50.2581602Z U __cudaUnregisterFatBinary@libcudart.so.12 2025-05-07T20:10:50.2581949Z U __cxa_atexit@GLIBC_2.2.5 2025-05-07T20:10:50.2582242Z U __cxa_begin_catch@CXXABI_1.3 2025-05-07T20:10:50.2582552Z U __cxa_end_catch@CXXABI_1.3 2025-05-07T20:10:50.2582847Z U __cxa_rethrow@CXXABI_1.3 2025-05-07T20:10:50.2583139Z U __cxa_thread_atexit@CXXABI_1.3.7 2025-05-07T20:10:50.2583456Z U __gxx_personality_v0@CXXABI_1.3 2025-05-07T20:10:50.2583755Z U __tls_get_addr@GLIBC_2.3 2025-05-07T20:10:50.2584077Z U at::CUDAGeneratorImpl::device_type() 2025-05-07T20:10:50.2584452Z U at::CUDAGeneratorImpl::philox_cuda_state(unsigned long) 2025-05-07T20:10:50.2584863Z U at::Context::deterministicAlgorithms() const 2025-05-07T20:10:50.2585287Z U at::TensorBase::__dispatch_contiguous(c10::MemoryFormat) const 2025-05-07T20:10:50.2585905Z U at::_ops::copy_::call(at::Tensor&, at::Tensor const&, bool) 2025-05-07T20:10:50.2586769Z U at::_ops::empty_like::call(at::Tensor const&, std::optional, std::optional, std::optional, std::optional, std::optional) 2025-05-07T20:10:50.2588158Z U at::_ops::empty_memory_format::call(c10::ArrayRef, std::optional, std::optional, std::optional, std::optional, std::optional) 2025-05-07T20:10:50.2589102Z U at::_ops::reshape::call(at::Tensor const&, c10::ArrayRef) 2025-05-07T20:10:50.2589724Z U at::_ops::to_dtype::call(at::Tensor const&, c10::ScalarType, bool, bool, std::optional) 2025-05-07T20:10:50.2590647Z U at::_ops::zeros::call(c10::ArrayRef, std::optional, std::optional, std::optional, std::optional) 2025-05-07T20:10:50.2592079Z U at::_ops::zeros_like::call(at::Tensor const&, std::optional, std::optional, std::optional, std::optional, std::optional) 2025-05-07T20:10:50.2592961Z U at::cuda::detail::getDefaultCUDAGenerator(signed char) 2025-05-07T20:10:50.2593373Z U at::cuda::getCurrentDeviceProperties() 2025-05-07T20:10:50.2593737Z U at::globalContext() 2025-05-07T20:10:50.2594163Z U c10::BFloat16* at::TensorBase::mutable_data_ptr() const 2025-05-07T20:10:50.2594589Z U c10::BoolType::get() 2025-05-07T20:10:50.2594966Z U c10::DeviceTypeName[abi:cxx11](c10::DeviceType, bool) 2025-05-07T20:10:50.2595344Z U c10::FloatType::get() 2025-05-07T20:10:50.2595681Z U c10::GeneratorImpl::device() const 2025-05-07T20:10:50.2596084Z U c10::Half* at::TensorBase::mutable_data_ptr() const 2025-05-07T20:10:50.2596531Z U c10::IValue::reportToTensorTypeError() const 2025-05-07T20:10:50.2596896Z U c10::IntType::get() 2025-05-07T20:10:50.2597291Z U c10::MessageLogger::MessageLogger(char const*, int, int) 2025-05-07T20:10:50.2597710Z U c10::MessageLogger::~MessageLogger() 2025-05-07T20:10:50.2598097Z U c10::StorageImpl::throw_data_ptr_access_error() const 2025-05-07T20:10:50.2598557Z U c10::SymBool::guard_bool(char const*, long) const 2025-05-07T20:10:50.2598978Z U c10::SymFloat::guard_float(char const*, long) const 2025-05-07T20:10:50.2599650Z U c10::SymInt::SymInt(c10::intrusive_ptr >) 2025-05-07T20:10:50.2600329Z U c10::SymInt::guard_int(char const*, long) const 2025-05-07T20:10:50.2600698Z U c10::SymInt::promote_to_negative() 2025-05-07T20:10:50.2601039Z U c10::SymIntType::get() 2025-05-07T20:10:50.2601412Z U c10::SymbolicShapeMeta::init_is_contiguous() const 2025-05-07T20:10:50.2601836Z U c10::TensorImpl::throw_data_ptr_access_error() const 2025-05-07T20:10:50.2602222Z U c10::TensorType::get() 2025-05-07T20:10:50.2602553Z U c10::UndefinedTensorImpl::_singleton 2025-05-07T20:10:50.2603526Z U c10::Warning::Warning(std::variant, c10::SourceLocation const&, std::__cxx11::basic_string, std::allocator >, bool) 2025-05-07T20:10:50.2604610Z U c10::cuda::CUDACachingAllocator::allocator 2025-05-07T20:10:50.2604969Z U c10::cuda::CUDAStream::stream() const 2025-05-07T20:10:50.2605323Z U c10::cuda::ExchangeDevice(signed char) 2025-05-07T20:10:50.2605659Z U c10::cuda::GetDevice(signed char*) 2025-05-07T20:10:50.2606007Z U c10::cuda::MaybeSetDevice(signed char) 2025-05-07T20:10:50.2606368Z U c10::cuda::SetDevice(signed char) 2025-05-07T20:10:50.2606844Z U c10::cuda::c10_cuda_check_implementation(int, char const*, char const*, int, bool) 2025-05-07T20:10:50.2607324Z U c10::cuda::device_count() 2025-05-07T20:10:50.2607661Z U c10::cuda::getCurrentCUDAStream(signed char) 2025-05-07T20:10:50.2608052Z U c10::cuda::getDefaultCUDAStream(signed char) 2025-05-07T20:10:50.2608431Z U c10::cuda::getStreamFromPool(bool, signed char) 2025-05-07T20:10:50.2608825Z U c10::cuda::getStreamFromPool(int, signed char) 2025-05-07T20:10:50.2609267Z U c10::cuda::setCurrentCUDAStream(c10::cuda::CUDAStream) 2025-05-07T20:10:50.2609648Z U c10::cuda::warn_or_error_on_sync() 2025-05-07T20:10:50.2610392Z U c10::detail::infer_schema::make_function_schema(c10::ArrayRef, c10::ArrayRef) 2025-05-07T20:10:50.2611265Z U c10::detail::torchCheckFail(char const*, char const*, unsigned int, char const*) 2025-05-07T20:10:50.2612129Z U c10::detail::torchCheckFail(char const*, char const*, unsigned int, std::__cxx11::basic_string, std::allocator > const&) 2025-05-07T20:10:50.2613074Z U c10::detail::torchInternalAssertFail(char const*, char const*, unsigned int, char const*, char const*) 2025-05-07T20:10:50.2614196Z U c10::detail::torchInternalAssertFail(char const*, char const*, unsigned int, char const*, std::__cxx11::basic_string, std::allocator > const&) 2025-05-07T20:10:50.2614976Z U c10::impl::GPUTrace::gpuTraceState 2025-05-07T20:10:50.2615311Z U c10::impl::GPUTrace::haveState 2025-05-07T20:10:50.2615654Z U c10::impl::cow::is_cow_data_ptr(c10::DataPtr const&) 2025-05-07T20:10:50.2616068Z U c10::impl::cow::materialize_cow_storage(c10::StorageImpl&) 2025-05-07T20:10:50.2616469Z U c10::impl::device_guard_impl_registry 2025-05-07T20:10:50.2616833Z U c10::operator<<(std::ostream&, c10::Device const&) 2025-05-07T20:10:50.2617211Z U c10::operator<<(std::ostream&, c10::DeviceType) 2025-05-07T20:10:50.2618727Z U c10::throwNullDataPtrError() 2025-05-07T20:10:50.2619060Z U c10::warn(c10::Warning const&) 2025-05-07T20:10:50.2619376Z U c10::warnDeprecatedDataPtr() 2025-05-07T20:10:50.2619777Z U caffe2::TypeMeta::error_unsupported_typemeta(caffe2::TypeMeta) 2025-05-07T20:10:50.2620189Z U caffe2::TypeMeta::typeMetaDatas() 2025-05-07T20:10:50.2620524Z U cudaDeviceGetAttribute@libcudart.so.12 2025-05-07T20:10:50.2620878Z U cudaDeviceSynchronize@libcudart.so.12 2025-05-07T20:10:50.2621222Z U cudaEventCreateWithFlags@libcudart.so.12 2025-05-07T20:10:50.2621575Z U cudaEventDestroy@libcudart.so.12 2025-05-07T20:10:50.2621901Z U cudaEventElapsedTime@libcudart.so.12 2025-05-07T20:10:50.2622238Z U cudaEventQuery@libcudart.so.12 2025-05-07T20:10:50.2622549Z U cudaEventRecord@libcudart.so.12 2025-05-07T20:10:50.2622887Z U cudaEventSynchronize@libcudart.so.12 2025-05-07T20:10:50.2623236Z U cudaFuncSetAttribute@libcudart.so.12 2025-05-07T20:10:50.2623581Z U cudaGetDeviceProperties_v2@libcudart.so.12 2025-05-07T20:10:50.2623939Z U cudaGetErrorString@libcudart.so.12 2025-05-07T20:10:50.2624439Z U cudaGetLastError@libcudart.so.12 2025-05-07T20:10:50.2624783Z U cudaLaunchKernel@libcudart.so.12 2025-05-07T20:10:50.2625115Z U cudaStreamQuery@libcudart.so.12 2025-05-07T20:10:50.2625474Z U cudaStreamSynchronize@libcudart.so.12 2025-05-07T20:10:50.2625878Z U cudaStreamWaitEvent@libcudart.so.12 2025-05-07T20:10:50.2628443Z U embedding_ops::split_embedding_backward_codegen_find_long_segments(at::GenericPackedTensorAccessor, at::GenericPackedTensorAccessor, at::GenericPackedTensorAccessor, at::GenericPackedTensorAccessor, at::GenericPackedTensorAccessor, at::GenericPackedTensorAccessor, at::GenericPackedTensorAccessor, int, int, bool) 2025-05-07T20:10:50.2631186Z U fbgemm_gpu::asynchronous_complete_cumsum_gpu(at::Tensor const&) 2025-05-07T20:10:50.2631719Z U float at::Tensor::item() const 2025-05-07T20:10:50.2632088Z U float* at::TensorBase::data_ptr() const 2025-05-07T20:10:50.2632515Z U float* at::TensorBase::mutable_data_ptr() const 2025-05-07T20:10:50.2632977Z U int* at::TensorBase::data_ptr() const 2025-05-07T20:10:50.2633363Z U int* at::TensorBase::mutable_data_ptr() const 2025-05-07T20:10:50.2633859Z U long c10::detail::maybe_wrap_dim_slow(long, long, bool) 2025-05-07T20:10:50.2634283Z U long* at::TensorBase::data_ptr() const 2025-05-07T20:10:50.2634684Z U long* at::TensorBase::mutable_data_ptr() const 2025-05-07T20:10:50.2635038Z U memcpy@GLIBC_2.14 2025-05-07T20:10:50.2635336Z U memmove@GLIBC_2.2.5 2025-05-07T20:10:50.2635632Z U memset@GLIBC_2.2.5 2025-05-07T20:10:50.2635937Z U operator delete(void*)@GLIBCXX_3.4 2025-05-07T20:10:50.2636306Z U operator new(unsigned long)@GLIBCXX_3.4 2025-05-07T20:10:50.2636882Z U radix_sort_pairs(void*, unsigned long&, int const*, int*, float const*, float*, int, int, int, CUstream_st*) 2025-05-07T20:10:50.2637719Z U radix_sort_pairs(void*, unsigned long&, int const*, int*, int const*, int*, int, int, int, CUstream_st*) 2025-05-07T20:10:50.2638509Z U radix_sort_pairs(void*, unsigned long&, long const*, long*, float const*, float*, int, int, int, CUstream_st*) 2025-05-07T20:10:50.2639312Z U radix_sort_pairs(void*, unsigned long&, long const*, long*, int const*, int*, int, int, int, CUstream_st*) 2025-05-07T20:10:50.2640129Z U std::__cxx11::basic_ostringstream, std::allocator >::basic_ostringstream() 2025-05-07T20:10:50.2640990Z U std::__cxx11::basic_ostringstream, std::allocator >::~basic_ostringstream()@GLIBCXX_3.4.21 2025-05-07T20:10:50.2641865Z U std::__cxx11::basic_stringstream, std::allocator >::basic_stringstream() 2025-05-07T20:10:50.2642718Z U std::__detail::_Prime_rehash_policy::_M_need_rehash(unsigned long, unsigned long, unsigned long) const@GLIBCXX_3.4.18 2025-05-07T20:10:50.2643324Z U std::__throw_bad_alloc()@GLIBCXX_3.4 2025-05-07T20:10:50.2643671Z U std::__throw_bad_array_new_length() 2025-05-07T20:10:50.2644036Z U std::__throw_length_error(char const*)@GLIBCXX_3.4 2025-05-07T20:10:50.2644440Z U std::__throw_logic_error(char const*)@GLIBCXX_3.4 2025-05-07T20:10:50.2644871Z U std::__throw_out_of_range_fmt(char const*, ...)@GLIBCXX_3.4.20 2025-05-07T20:10:50.2645295Z U std::__throw_system_error(int)@GLIBCXX_3.4.11 2025-05-07T20:10:50.2645794Z U std::basic_ios >::clear(std::_Ios_Iostate)@GLIBCXX_3.4 2025-05-07T20:10:50.2646748Z U std::basic_ostream >& std::__ostream_insert >(std::basic_ostream >&, char const*, long)@GLIBCXX_3.4.9 2025-05-07T20:10:50.2647612Z U std::ios_base::Init::Init()@GLIBCXX_3.4 2025-05-07T20:10:50.2647980Z U std::ios_base::Init::~Init()@GLIBCXX_3.4 2025-05-07T20:10:50.2648335Z U std::ios_base::~ios_base()@GLIBCXX_3.4 2025-05-07T20:10:50.2648702Z U std::locale::~locale()@GLIBCXX_3.4 2025-05-07T20:10:50.2649128Z U std::ostream& std::ostream::_M_insert(long)@GLIBCXX_3.4.9 2025-05-07T20:10:50.2649674Z U std::ostream& std::ostream::_M_insert(unsigned long)@GLIBCXX_3.4.9 2025-05-07T20:10:50.2650193Z U std::ostream::operator<<(int)@GLIBCXX_3.4 2025-05-07T20:10:50.2650540Z U std::terminate()@GLIBCXX_3.4 2025-05-07T20:10:50.2650863Z U strlen@GLIBC_2.2.5 2025-05-07T20:10:50.2651172Z U torch::CppFunction::~CppFunction() 2025-05-07T20:10:50.2652224Z U torch::Library::Library(torch::Library::Kind, std::__cxx11::basic_string, std::allocator >, std::optional, char const*, unsigned int) 2025-05-07T20:10:50.2653326Z U torch::Library::_def(c10::FunctionSchema&&, c10::OperatorName*, std::vector > const&, torch::_RegisterOrVerify) & 2025-05-07T20:10:50.2654099Z U torch::Library::_impl(char const*, torch::CppFunction&&, torch::_RegisterOrVerify) & 2025-05-07T20:10:50.2654790Z U torch::jit::parseSchema(std::__cxx11::basic_string, std::allocator > const&, bool) 2025-05-07T20:10:50.2655310Z U transpose_embedding_input(at::Tensor, long, at::Tensor, at::Tensor, bool, std::optional const&, long, long, long, bool, std::optional const&, long, long) 2025-05-07T20:10:50.2656685Z U void embedding_ops::grad_mean_kernel(at::GenericPackedTensorAccessor, at::GenericPackedTensorAccessor, at::GenericPackedTensorAccessor, at::GenericPackedTensorAccessor, fbgemm_gpu::FixedDivisor) 2025-05-07T20:10:50.2658074Z U void embedding_ops::grad_mean_kernel(at::GenericPackedTensorAccessor, at::GenericPackedTensorAccessor, at::GenericPackedTensorAccessor, at::GenericPackedTensorAccessor, fbgemm_gpu::FixedDivisor) 2025-05-07T20:10:50.2659357Z U void embedding_ops::grad_mean_kernel(at::GenericPackedTensorAccessor, at::GenericPackedTensorAccessor, at::GenericPackedTensorAccessor, at::GenericPackedTensorAccessor, fbgemm_gpu::FixedDivisor) 2025-05-07T20:10:50.2660654Z U void embedding_ops::grad_mean_kernel(at::GenericPackedTensorAccessor, at::GenericPackedTensorAccessor, at::GenericPackedTensorAccessor, at::GenericPackedTensorAccessor, fbgemm_gpu::FixedDivisor) 2025-05-07T20:10:50.2661911Z U void embedding_ops::grad_mean_kernel(at::GenericPackedTensorAccessor, at::GenericPackedTensorAccessor, at::GenericPackedTensorAccessor, at::GenericPackedTensorAccessor, fbgemm_gpu::FixedDivisor) 2025-05-07T20:10:50.2663403Z U void embedding_ops::grad_mean_kernel(at::GenericPackedTensorAccessor, at::GenericPackedTensorAccessor, at::GenericPackedTensorAccessor, at::GenericPackedTensorAccessor, fbgemm_gpu::FixedDivisor) 2025-05-07T20:10:50.2665732Z U void embedding_ops::grad_mean_vbe_kernel(at::GenericPackedTensorAccessor, at::GenericPackedTensorAccessor, at::GenericPackedTensorAccessor, at::GenericPackedTensorAccessor, at::GenericPackedTensorAccessor, at::GenericPackedTensorAccessor, int, unsigned int) 2025-05-07T20:10:50.2667987Z U void embedding_ops::grad_mean_vbe_kernel(at::GenericPackedTensorAccessor, at::GenericPackedTensorAccessor, at::GenericPackedTensorAccessor, at::GenericPackedTensorAccessor, at::GenericPackedTensorAccessor, at::GenericPackedTensorAccessor, int, unsigned int) 2025-05-07T20:10:50.2670131Z U void embedding_ops::grad_mean_vbe_kernel(at::GenericPackedTensorAccessor, at::GenericPackedTensorAccessor, at::GenericPackedTensorAccessor, at::GenericPackedTensorAccessor, at::GenericPackedTensorAccessor, at::GenericPackedTensorAccessor, int, unsigned int) 2025-05-07T20:10:50.2672249Z U void embedding_ops::grad_mean_vbe_kernel(at::GenericPackedTensorAccessor, at::GenericPackedTensorAccessor, at::GenericPackedTensorAccessor, at::GenericPackedTensorAccessor, at::GenericPackedTensorAccessor, at::GenericPackedTensorAccessor, int, unsigned int) 2025-05-07T20:10:50.2674185Z U void embedding_ops::grad_mean_vbe_kernel(at::GenericPackedTensorAccessor, at::GenericPackedTensorAccessor, at::GenericPackedTensorAccessor, at::GenericPackedTensorAccessor, at::GenericPackedTensorAccessor, at::GenericPackedTensorAccessor, int, unsigned int) 2025-05-07T20:10:50.2676115Z U void embedding_ops::grad_mean_vbe_kernel(at::GenericPackedTensorAccessor, at::GenericPackedTensorAccessor, at::GenericPackedTensorAccessor, at::GenericPackedTensorAccessor, at::GenericPackedTensorAccessor, at::GenericPackedTensorAccessor, int, unsigned int) 2025-05-07T20:10:50.2677869Z U void embedding_ops::split_embedding_backward_count_unique_indices_kernel(at::GenericPackedTensorAccessor, at::GenericPackedTensorAccessor, at::GenericPackedTensorAccessor, at::GenericPackedTensorAccessor, at::GenericPackedTensorAccessor, int) 2025-05-07T20:10:50.2678077Z U vtable for __cxxabiv1::__class_type_info@CXXABI_1.3 2025-05-07T20:10:50.2678246Z U vtable for __cxxabiv1::__function_type_info@CXXABI_1.3 2025-05-07T20:10:50.2678408Z U vtable for __cxxabiv1::__si_class_type_info@CXXABI_1.3 2025-05-07T20:10:50.2678757Z U vtable for std::__cxx11::basic_stringbuf, std::allocator >@GLIBCXX_3.4.21 2025-05-07T20:10:50.2679013Z U vtable for std::basic_streambuf >@GLIBCXX_3.4 2025-05-07T20:10:50.2679129Z w _ITM_deregisterTMCloneTable 2025-05-07T20:10:50.2679252Z w _ITM_registerTMCloneTable 2025-05-07T20:10:50.2679354Z w __cxa_finalize@GLIBC_2.2.5 2025-05-07T20:10:50.2679449Z w __gmon_start__ 2025-05-07T20:10:50.2679563Z w __pthread_key_create 2025-05-07T20:10:50.2679673Z w pthread_mutex_lock@GLIBC_2.2.5 2025-05-07T20:10:50.2679786Z w pthread_mutex_unlock@GLIBC_2.2.5 2025-05-07T20:10:50.2679941Z [CHECK] Listing out external shared libraries linked: 2025-05-07T20:10:50.2680215Z + ldd ./_skbuild/linux-x86_64-3.12/cmake-build/fbgemm_gpu_tbe_training_backward_gwd.so 2025-05-07T20:10:50.2680223Z 2025-05-07T20:10:50.2680404Z linux-vdso.so.1 (0x00007ffc55d8c000) 2025-05-07T20:10:50.2680497Z libc10.so => not found 2025-05-07T20:10:50.2680597Z libnvrtc.so.12 => not found 2025-05-07T20:10:50.2680704Z libc10_cuda.so => not found 2025-05-07T20:10:50.2680799Z libnccl.so.2 => not found 2025-05-07T20:10:50.2680891Z libcuda.so.1 => not found 2025-05-07T20:10:50.2681470Z fbgemm_gpu_tbe_training_backward.so => /__w/FBGEMM/FBGEMM/fbgemm_gpu/./_skbuild/linux-x86_64-3.12/cmake-build/fbgemm_gpu_tbe_training_backward.so (0x00007f532f800000) 2025-05-07T20:10:50.2681617Z libnvidia-ml.so.1 => not found 2025-05-07T20:10:50.2681711Z libtorch.so => not found 2025-05-07T20:10:50.2681809Z libtorch_cpu.so => not found 2025-05-07T20:10:50.2681920Z libtorch_cuda.so => not found 2025-05-07T20:10:50.2682019Z libcudart.so.12 => not found 2025-05-07T20:10:50.2682207Z libstdc++.so.6 => /lib64/libstdc++.so.6 (0x00007f532f59c000) 2025-05-07T20:10:50.2682364Z libgcc_s.so.1 => /lib64/libgcc_s.so.1 (0x00007f53695d2000) 2025-05-07T20:10:50.2682503Z libc.so.6 => /lib64/libc.so.6 (0x00007f532f394000) 2025-05-07T20:10:50.2682629Z /lib64/ld-linux-x86-64.so.2 (0x00007f53725fd000) 2025-05-07T20:10:50.2682724Z libc10.so => not found 2025-05-07T20:10:50.2682841Z libnvrtc.so.12 => not found 2025-05-07T20:10:50.2682934Z libc10_cuda.so => not found 2025-05-07T20:10:50.2683025Z libnccl.so.2 => not found 2025-05-07T20:10:50.2683127Z libcuda.so.1 => not found 2025-05-07T20:10:50.2683600Z fbgemm_gpu_tbe_cache.so => /__w/FBGEMM/FBGEMM/fbgemm_gpu/./_skbuild/linux-x86_64-3.12/cmake-build/fbgemm_gpu_tbe_cache.so (0x00007f532dc00000) 2025-05-07T20:10:50.2684077Z fbgemm_gpu_tbe_common.so => /__w/FBGEMM/FBGEMM/fbgemm_gpu/./_skbuild/linux-x86_64-3.12/cmake-build/fbgemm_gpu_tbe_common.so (0x00007f532d800000) 2025-05-07T20:10:50.2684639Z fbgemm_gpu_sparse_async_cumsum.so => /__w/FBGEMM/FBGEMM/fbgemm_gpu/./_skbuild/linux-x86_64-3.12/cmake-build/fbgemm_gpu_sparse_async_cumsum.so (0x00007f532d659000) 2025-05-07T20:10:50.2684742Z libnvidia-ml.so.1 => not found 2025-05-07T20:10:50.2684836Z libtorch.so => not found 2025-05-07T20:10:50.2685191Z fbgemm.so => /__w/FBGEMM/FBGEMM/fbgemm_gpu/./_skbuild/linux-x86_64-3.12/cmake-build/fbgemm.so (0x00007f532d000000) 2025-05-07T20:10:50.2685667Z fbgemm_gpu_tbe_utils.so => /__w/FBGEMM/FBGEMM/fbgemm_gpu/./_skbuild/linux-x86_64-3.12/cmake-build/fbgemm_gpu_tbe_utils.so (0x00007f532be00000) 2025-05-07T20:10:50.2685765Z libtorch_cpu.so => not found 2025-05-07T20:10:50.2685891Z libtorch_cuda.so => not found 2025-05-07T20:10:50.2685997Z libcudart.so.12 => not found 2025-05-07T20:10:50.2686127Z libm.so.6 => /lib64/libm.so.6 (0x00007f532db25000) 2025-05-07T20:10:50.2686222Z libtorch.so => not found 2025-05-07T20:10:50.2686331Z libc10.so => not found 2025-05-07T20:10:50.2686428Z libnvrtc.so.12 => not found 2025-05-07T20:10:50.2686520Z libc10_cuda.so => not found 2025-05-07T20:10:50.2686612Z libnccl.so.2 => not found 2025-05-07T20:10:50.2686721Z libcuda.so.1 => not found 2025-05-07T20:10:50.2686821Z libnvidia-ml.so.1 => not found 2025-05-07T20:10:50.2686917Z libtorch_cpu.so => not found 2025-05-07T20:10:50.2687026Z libtorch_cuda.so => not found 2025-05-07T20:10:50.2687149Z libcudart.so.12 => not found 2025-05-07T20:10:50.2687307Z libgomp.so.1 => /lib64/libgomp.so.1 (0x00007f536957a000) 2025-05-07T20:10:50.2687397Z libc10.so => not found 2025-05-07T20:10:50.2687506Z libnvrtc.so.12 => not found 2025-05-07T20:10:50.2687604Z libc10_cuda.so => not found 2025-05-07T20:10:50.2687703Z libnccl.so.2 => not found 2025-05-07T20:10:50.2687808Z libcuda.so.1 => not found 2025-05-07T20:10:50.2688258Z fbgemm_gpu_config.so => /__w/FBGEMM/FBGEMM/fbgemm_gpu/./_skbuild/linux-x86_64-3.12/cmake-build/fbgemm_gpu_config.so (0x00007f536956d000) 2025-05-07T20:10:50.2688365Z libnvidia-ml.so.1 => not found 2025-05-07T20:10:50.2688461Z libtorch.so => not found 2025-05-07T20:10:50.2688578Z libtorch_cpu.so => not found 2025-05-07T20:10:50.2688680Z libtorch_cuda.so => not found 2025-05-07T20:10:50.2688780Z libcudart.so.12 => not found 2025-05-07T20:10:50.2688881Z libc10.so => not found 2025-05-07T20:10:50.2688981Z libnvrtc.so.12 => not found 2025-05-07T20:10:50.2689079Z libc10_cuda.so => not found 2025-05-07T20:10:50.2689176Z libnccl.so.2 => not found 2025-05-07T20:10:50.2689283Z libcuda.so.1 => not found 2025-05-07T20:10:50.2689385Z libnvidia-ml.so.1 => not found 2025-05-07T20:10:50.2689481Z libtorch.so => not found 2025-05-07T20:10:50.2689593Z libtorch_cpu.so => not found 2025-05-07T20:10:50.2689696Z libtorch_cuda.so => not found 2025-05-07T20:10:50.2689820Z libcudart.so.12 => not found 2025-05-07T20:10:50.2689909Z libc10.so => not found 2025-05-07T20:10:50.2690014Z libnvrtc.so.12 => not found 2025-05-07T20:10:50.2690109Z libc10_cuda.so => not found 2025-05-07T20:10:50.2690203Z libnccl.so.2 => not found 2025-05-07T20:10:50.2690312Z libcuda.so.1 => not found 2025-05-07T20:10:50.2690722Z asmjit.so => /__w/FBGEMM/FBGEMM/fbgemm_gpu/./_skbuild/linux-x86_64-3.12/cmake-build/asmjit.so (0x00007f532f31d000) 2025-05-07T20:10:50.2690827Z libnvidia-ml.so.1 => not found 2025-05-07T20:10:50.2690922Z libtorch.so => not found 2025-05-07T20:10:50.2691034Z libtorch_cpu.so => not found 2025-05-07T20:10:50.2691133Z libtorch_cuda.so => not found 2025-05-07T20:10:50.2691232Z libtorch.so => not found 2025-05-07T20:10:50.2691337Z libc10.so => not found 2025-05-07T20:10:50.2691435Z libnvrtc.so.12 => not found 2025-05-07T20:10:50.2691528Z libc10_cuda.so => not found 2025-05-07T20:10:50.2691623Z libnccl.so.2 => not found 2025-05-07T20:10:50.2691733Z libcuda.so.1 => not found 2025-05-07T20:10:50.2691837Z libnvidia-ml.so.1 => not found 2025-05-07T20:10:50.2691939Z libtorch_cpu.so => not found 2025-05-07T20:10:50.2692054Z libtorch_cuda.so => not found 2025-05-07T20:10:50.2692153Z libcudart.so.12 => not found 2025-05-07T20:10:50.2692247Z libtorch.so => not found 2025-05-07T20:10:50.2692342Z libc10.so => not found 2025-05-07T20:10:50.2692451Z libnvrtc.so.12 => not found 2025-05-07T20:10:50.2692545Z libc10_cuda.so => not found 2025-05-07T20:10:50.2692640Z libnccl.so.2 => not found 2025-05-07T20:10:50.2692749Z libcuda.so.1 => not found 2025-05-07T20:10:50.2692851Z libnvidia-ml.so.1 => not found 2025-05-07T20:10:50.2692948Z libtorch_cpu.so => not found 2025-05-07T20:10:50.2693045Z libtorch_cuda.so => not found 2025-05-07T20:10:50.2693345Z libpthread.so.0 => /lib64/libpthread.so.0 (0x00007f5369558000) 2025-05-07T20:10:50.2693437Z libtorch.so => not found 2025-05-07T20:10:50.2693525Z libc10.so => not found 2025-05-07T20:10:50.2693659Z libnvrtc.so.12 => not found 2025-05-07T20:10:50.2693751Z libc10_cuda.so => not found 2025-05-07T20:10:50.2693843Z libnccl.so.2 => not found 2025-05-07T20:10:50.2693930Z libcuda.so.1 => not found 2025-05-07T20:10:50.2694041Z libnvidia-ml.so.1 => not found 2025-05-07T20:10:50.2694136Z libtorch_cpu.so => not found 2025-05-07T20:10:50.2694232Z libtorch_cuda.so => not found 2025-05-07T20:10:50.2694376Z librt.so.1 => /lib64/librt.so.1 (0x00007f536954f000) 2025-05-07T20:10:50.2694382Z 2025-05-07T20:10:50.2694489Z [CHECK] Displaying ELF information: 2025-05-07T20:10:50.2694779Z + readelf -d ./_skbuild/linux-x86_64-3.12/cmake-build/fbgemm_gpu_tbe_training_backward_gwd.so 2025-05-07T20:10:50.2694811Z 2025-05-07T20:10:50.2694815Z 2025-05-07T20:10:50.2694988Z Dynamic section at offset 0x8d68cc8 contains 40 entries: 2025-05-07T20:10:50.2695102Z Tag Type Name/Value 2025-05-07T20:10:50.2695298Z 0x0000000000000001 (NEEDED) Shared library: [libc10.so] 2025-05-07T20:10:50.2695515Z 0x0000000000000001 (NEEDED) Shared library: [libnvrtc.so.12] 2025-05-07T20:10:50.2695712Z 0x0000000000000001 (NEEDED) Shared library: [libc10_cuda.so] 2025-05-07T20:10:50.2695905Z 0x0000000000000001 (NEEDED) Shared library: [libnccl.so.2] 2025-05-07T20:10:50.2696098Z 0x0000000000000001 (NEEDED) Shared library: [libcuda.so.1] 2025-05-07T20:10:50.2696371Z 0x0000000000000001 (NEEDED) Shared library: [fbgemm_gpu_tbe_training_backward.so] 2025-05-07T20:10:50.2696580Z 0x0000000000000001 (NEEDED) Shared library: [libnvidia-ml.so.1] 2025-05-07T20:10:50.2696774Z 0x0000000000000001 (NEEDED) Shared library: [libtorch.so] 2025-05-07T20:10:50.2696988Z 0x0000000000000001 (NEEDED) Shared library: [libtorch_cpu.so] 2025-05-07T20:10:50.2697189Z 0x0000000000000001 (NEEDED) Shared library: [libtorch_cuda.so] 2025-05-07T20:10:50.2697387Z 0x0000000000000001 (NEEDED) Shared library: [libcudart.so.12] 2025-05-07T20:10:50.2697626Z 0x0000000000000001 (NEEDED) Shared library: [libstdc++.so.6] 2025-05-07T20:10:50.2697821Z 0x0000000000000001 (NEEDED) Shared library: [libgcc_s.so.1] 2025-05-07T20:10:50.2698008Z 0x0000000000000001 (NEEDED) Shared library: [libc.so.6] 2025-05-07T20:10:50.2698255Z 0x0000000000000001 (NEEDED) Shared library: [ld-linux-x86-64.so.2] 2025-05-07T20:10:50.2698525Z 0x000000000000000e (SONAME) Library soname: [fbgemm_gpu_tbe_training_backward_gwd.so] 2025-05-07T20:10:50.2698710Z 0x000000000000000f (RPATH) Library rpath: [$ORIGIN] 2025-05-07T20:10:50.2698839Z 0x000000000000000c (INIT) 0xbe000 2025-05-07T20:10:50.2698952Z 0x000000000000000d (FINI) 0x5f04ec 2025-05-07T20:10:50.2699072Z 0x0000000000000019 (INIT_ARRAY) 0x8d5ea18 2025-05-07T20:10:50.2699199Z 0x000000000000001b (INIT_ARRAYSZ) 200 (bytes) 2025-05-07T20:10:50.2699329Z 0x000000000000001a (FINI_ARRAY) 0x8d5eae0 2025-05-07T20:10:50.2699450Z 0x000000000000001c (FINI_ARRAYSZ) 8 (bytes) 2025-05-07T20:10:50.2699563Z 0x000000006ffffef5 (GNU_HASH) 0x238 2025-05-07T20:10:50.2699686Z 0x0000000000000005 (STRTAB) 0xc600 2025-05-07T20:10:50.2699794Z 0x0000000000000006 (SYMTAB) 0x2d30 2025-05-07T20:10:50.2699930Z 0x000000000000000a (STRSZ) 597451 (bytes) 2025-05-07T20:10:50.2700048Z 0x000000000000000b (SYMENT) 24 (bytes) 2025-05-07T20:10:50.2700180Z 0x0000000000000003 (PLTGOT) 0x8d6afe8 2025-05-07T20:10:50.2700313Z 0x0000000000000002 (PLTRELSZ) 12672 (bytes) 2025-05-07T20:10:50.2700422Z 0x0000000000000014 (PLTREL) RELA 2025-05-07T20:10:50.2700546Z 0x0000000000000017 (JMPREL) 0xbab38 2025-05-07T20:10:50.2700654Z 0x0000000000000007 (RELA) 0x9f1a8 2025-05-07T20:10:50.2700788Z 0x0000000000000008 (RELASZ) 113040 (bytes) 2025-05-07T20:10:50.2700947Z 0x0000000000000009 (RELAENT) 24 (bytes) 2025-05-07T20:10:50.2701065Z 0x000000006ffffffe (VERNEED) 0x9f088 2025-05-07T20:10:50.2701175Z 0x000000006fffffff (VERNEEDNUM) 5 2025-05-07T20:10:50.2701287Z 0x000000006ffffff0 (VERSYM) 0x9e3cc 2025-05-07T20:10:50.2701413Z 0x000000006ffffff9 (RELACOUNT) 3303 2025-05-07T20:10:50.2701515Z 0x0000000000000000 (NULL) 0x0 2025-05-07T20:10:50.2701519Z 2025-05-07T20:10:50.2701633Z ################################################################################ 2025-05-07T20:10:50.2701638Z 2025-05-07T20:10:50.2701642Z 2025-05-07T20:10:50.2701889Z [CHECK] Verifying sample subset of symbols in the built libraries ... 2025-05-07T20:10:50.2782107Z [CHECK] Symbol NOT found in ./_skbuild/linux-x86_64-3.12/cmake-build/asmjit.so: fbgemm_gpu::asynchronous_inclusive_cumsum_cpu 2025-05-07T20:10:50.2805407Z [CHECK] Symbol NOT found in ./_skbuild/linux-x86_64-3.12/cmake-build/fbgemm_gpu_config.so: fbgemm_gpu::asynchronous_inclusive_cumsum_cpu 2025-05-07T20:10:50.3037675Z [CHECK] Symbol NOT found in ./_skbuild/linux-x86_64-3.12/cmake-build/fbgemm.so: fbgemm_gpu::asynchronous_inclusive_cumsum_cpu 2025-05-07T20:10:50.3075012Z [CHECK] Symbol NOT found in ./_skbuild/linux-x86_64-3.12/cmake-build/fbgemm_gpu_tbe_common.so: fbgemm_gpu::asynchronous_inclusive_cumsum_cpu 2025-05-07T20:10:50.3127115Z [CHECK] Symbol NOT found in ./_skbuild/linux-x86_64-3.12/cmake-build/fbgemm_gpu_tbe_cache.so: fbgemm_gpu::asynchronous_inclusive_cumsum_cpu 2025-05-07T20:10:50.3161222Z [CHECK] Symbol NOT found in ./_skbuild/linux-x86_64-3.12/cmake-build/fbgemm_gpu_tbe_optimizers.so: fbgemm_gpu::asynchronous_inclusive_cumsum_cpu 2025-05-07T20:10:50.3195688Z [CHECK] Symbol NOT found in ./_skbuild/linux-x86_64-3.12/cmake-build/fbgemm_gpu_tbe_utils.so: fbgemm_gpu::asynchronous_inclusive_cumsum_cpu 2025-05-07T20:10:50.3222826Z [CHECK] Found symbol in ./_skbuild/linux-x86_64-3.12/cmake-build/fbgemm_gpu_sparse_async_cumsum.so: fbgemm_gpu::asynchronous_inclusive_cumsum_cpu 2025-05-07T20:10:50.3338156Z [CHECK] Symbol NOT found in ./_skbuild/linux-x86_64-3.12/cmake-build/asmjit.so: fbgemm_gpu::jagged_2d_to_dense 2025-05-07T20:10:50.3361912Z [CHECK] Symbol NOT found in ./_skbuild/linux-x86_64-3.12/cmake-build/fbgemm_gpu_config.so: fbgemm_gpu::jagged_2d_to_dense 2025-05-07T20:10:50.3592980Z [CHECK] Symbol NOT found in ./_skbuild/linux-x86_64-3.12/cmake-build/fbgemm.so: fbgemm_gpu::jagged_2d_to_dense 2025-05-07T20:10:50.3628910Z [CHECK] Symbol NOT found in ./_skbuild/linux-x86_64-3.12/cmake-build/fbgemm_gpu_tbe_common.so: fbgemm_gpu::jagged_2d_to_dense 2025-05-07T20:10:50.3678793Z [CHECK] Symbol NOT found in ./_skbuild/linux-x86_64-3.12/cmake-build/fbgemm_gpu_tbe_cache.so: fbgemm_gpu::jagged_2d_to_dense 2025-05-07T20:10:50.3713454Z [CHECK] Symbol NOT found in ./_skbuild/linux-x86_64-3.12/cmake-build/fbgemm_gpu_tbe_optimizers.so: fbgemm_gpu::jagged_2d_to_dense 2025-05-07T20:10:50.3746244Z [CHECK] Symbol NOT found in ./_skbuild/linux-x86_64-3.12/cmake-build/fbgemm_gpu_tbe_utils.so: fbgemm_gpu::jagged_2d_to_dense 2025-05-07T20:10:50.3774177Z [CHECK] Symbol NOT found in ./_skbuild/linux-x86_64-3.12/cmake-build/fbgemm_gpu_sparse_async_cumsum.so: fbgemm_gpu::jagged_2d_to_dense 2025-05-07T20:10:50.4186369Z [CHECK] Symbol NOT found in ./_skbuild/linux-x86_64-3.12/cmake-build/fbgemm_gpu_tbe_training_backward_pt2.so: fbgemm_gpu::jagged_2d_to_dense 2025-05-07T20:10:50.4544469Z [CHECK] Symbol NOT found in ./_skbuild/linux-x86_64-3.12/cmake-build/fbgemm_gpu_tbe_inference.so: fbgemm_gpu::jagged_2d_to_dense 2025-05-07T20:10:50.4789962Z [CHECK] Symbol NOT found in ./_skbuild/linux-x86_64-3.12/cmake-build/fbgemm_gpu_tbe_training_backward_split_host.so: fbgemm_gpu::jagged_2d_to_dense 2025-05-07T20:10:50.5707126Z [CHECK] Symbol NOT found in ./_skbuild/linux-x86_64-3.12/cmake-build/fbgemm_gpu_tbe_training_forward.so: fbgemm_gpu::jagged_2d_to_dense 2025-05-07T20:10:50.5742280Z [CHECK] Symbol NOT found in ./_skbuild/linux-x86_64-3.12/cmake-build/fbgemm_gpu_embedding_inplace_ops.so: fbgemm_gpu::jagged_2d_to_dense 2025-05-07T20:10:50.5831547Z [CHECK] Symbol NOT found in ./_skbuild/linux-x86_64-3.12/cmake-build/fbgemm_gpu_tbe_index_select.so: fbgemm_gpu::jagged_2d_to_dense 2025-05-07T20:10:50.6156397Z [CHECK] Found symbol in ./_skbuild/linux-x86_64-3.12/cmake-build/fbgemm_gpu_py.so: fbgemm_gpu::jagged_2d_to_dense 2025-05-07T20:10:50.6159689Z ################################################################################ 2025-05-07T20:10:50.6160001Z [BUILD] Wheel Audit: dist/fbgemm_gpu_nightly-2025.5.7-cp312-cp312-manylinux_2_28_x86_64.whl 2025-05-07T20:10:50.6160008Z 2025-05-07T20:10:50.6160483Z + conda run --no-capture-output -n build_binary auditwheel show dist/fbgemm_gpu_nightly-2025.5.7-cp312-cp312-manylinux_2_28_x86_64.whl 2025-05-07T20:10:50.6160650Z 2025-05-07T20:11:02.2729833Z 2025-05-07T20:11:02.2730572Z fbgemm_gpu_nightly-2025.5.7-cp312-cp312-manylinux_2_28_x86_64.whl is 2025-05-07T20:11:02.2731228Z consistent with the following platform tag: "linux_x86_64". 2025-05-07T20:11:02.2731582Z 2025-05-07T20:11:02.2731766Z The wheel references external versioned symbols in these 2025-05-07T20:11:02.2732330Z system-provided shared libraries: librt.so.1 with versions 2025-05-07T20:11:02.2732744Z {'GLIBC_2.2.5'}, libgcc_s.so.1 with versions {'GCC_3.0'}, 2025-05-07T20:11:02.2733164Z libstdc++.so.6 with versions {'GLIBCXX_3.4.11', 'CXXABI_1.3', 2025-05-07T20:11:02.2733600Z 'GLIBCXX_3.4.18', 'CXXABI_1.3.5', 'CXXABI_1.3.3', 'GLIBCXX_3.4.15', 2025-05-07T20:11:02.2734048Z 'GLIBCXX_3.4.14', 'GLIBCXX_3.4.9', 'GLIBCXX_3.4', 'GLIBCXX_3.4.20', 2025-05-07T20:11:02.2734496Z 'CXXABI_1.3.11', 'GLIBCXX_3.4.21', 'GLIBCXX_3.4.19', 'CXXABI_1.3.7'}, 2025-05-07T20:11:02.2734936Z libc.so.6 with versions {'GLIBC_2.3.2', 'GLIBC_2.14', 'GLIBC_2.3', 2025-05-07T20:11:02.2735375Z 'GLIBC_2.3.3', 'GLIBC_2.7', 'GLIBC_2.2.5', 'GLIBC_2.17', 'GLIBC_2.6'}, 2025-05-07T20:11:02.2735805Z libpthread.so.0 with versions {'GLIBC_2.2.5', 'GLIBC_2.3.2', 2025-05-07T20:11:02.2736228Z 'GLIBC_2.3.4'}, libm.so.6 with versions {'GLIBC_2.2.5'}, 2025-05-07T20:11:02.2736843Z libcudart.so.12 with versions {'libcudart.so.12'}, libgomp.so.1 with 2025-05-07T20:11:02.2737323Z versions {'OMP_1.0'}, libdl.so.2 with versions {'GLIBC_2.2.5', 2025-05-07T20:11:02.2737664Z 'GLIBC_2.3.4'} 2025-05-07T20:11:02.2737809Z 2025-05-07T20:11:02.2738007Z This constrains the platform tag to "manylinux_2_27_x86_64". In order 2025-05-07T20:11:02.2738580Z to achieve a more compatible tag, you would need to recompile a new 2025-05-07T20:11:02.2739029Z wheel from source on a system with earlier versions of these 2025-05-07T20:11:02.2739433Z libraries, such as a recent manylinux image. 2025-05-07T20:11:02.3482628Z 2025-05-07T20:11:02.3482785Z 2025-05-07T20:11:02.3483597Z ################################################################################ 2025-05-07T20:11:02.3484683Z [BUILD] Enumerating the built wheels ... 2025-05-07T20:11:02.3485403Z + ls -lth dist/fbgemm_gpu_nightly-2025.5.7-cp312-cp312-manylinux_2_28_x86_64.whl 2025-05-07T20:11:02.3485766Z 2025-05-07T20:11:02.3503646Z -rw-r--r--. 1 root root 505M May 7 20:10 dist/fbgemm_gpu_nightly-2025.5.7-cp312-cp312-manylinux_2_28_x86_64.whl 2025-05-07T20:11:02.3504143Z 2025-05-07T20:11:02.3504280Z [BUILD] Enumerating the wheel SHAs ... 2025-05-07T20:11:02.3506158Z + sha1sum dist/fbgemm_gpu_nightly-2025.5.7-cp312-cp312-manylinux_2_28_x86_64.whl 2025-05-07T20:11:02.3506536Z 2025-05-07T20:11:03.2972862Z 93e77f057140a02ac3b29f80d51810bece24f724 dist/fbgemm_gpu_nightly-2025.5.7-cp312-cp312-manylinux_2_28_x86_64.whl 2025-05-07T20:11:03.2974655Z 2025-05-07T20:11:03.2975452Z + sha256sum dist/fbgemm_gpu_nightly-2025.5.7-cp312-cp312-manylinux_2_28_x86_64.whl 2025-05-07T20:11:03.2976176Z 2025-05-07T20:11:05.4981912Z 6a3ebd3669b1ce61c529224063f1a9ed5507b63a554598a5d91a1fe812635010 dist/fbgemm_gpu_nightly-2025.5.7-cp312-cp312-manylinux_2_28_x86_64.whl 2025-05-07T20:11:05.4983228Z 2025-05-07T20:11:05.4983486Z + md5sum dist/fbgemm_gpu_nightly-2025.5.7-cp312-cp312-manylinux_2_28_x86_64.whl 2025-05-07T20:11:05.4983858Z 2025-05-07T20:11:06.3419088Z 5c2753ed3de868e5d705c2c1dda0dc63 dist/fbgemm_gpu_nightly-2025.5.7-cp312-cp312-manylinux_2_28_x86_64.whl 2025-05-07T20:11:06.3419703Z 2025-05-07T20:11:06.3419867Z [BUILD] FBGEMM-GPU build + package completed 2025-05-07T20:11:06.3528703Z ##[group]Run actions/upload-artifact@v4 2025-05-07T20:11:06.3529031Z with: 2025-05-07T20:11:06.3529304Z name: fbgemm_default_x86_clang_py3.12_cu12.6.3.whl 2025-05-07T20:11:06.3529642Z path: fbgemm_gpu/dist/*.whl 2025-05-07T20:11:06.3529939Z if-no-files-found: error 2025-05-07T20:11:06.3530199Z compression-level: 6 2025-05-07T20:11:06.3530459Z overwrite: false 2025-05-07T20:11:06.3530807Z include-hidden-files: false 2025-05-07T20:11:06.3531057Z env: 2025-05-07T20:11:06.3531291Z PRELUDE: .github/scripts/setup_env.bash 2025-05-07T20:11:06.3531591Z BUILD_ENV: build_binary 2025-05-07T20:11:06.3531850Z BUILD_TARGET: default 2025-05-07T20:11:06.3532080Z BUILD_VARIANT: cuda 2025-05-07T20:11:06.3532348Z BUILD_CUDA_VERSION: 12.6.3 2025-05-07T20:11:06.3532604Z ##[endgroup] 2025-05-07T20:11:06.3536398Z ##[command]/usr/bin/docker exec 2c96c3f709dd7ccfbab62bd0bb03d0fe2bfdbbcabb6920f028364b7a349d9b1f sh -c "cat /etc/*release | grep ^ID" 2025-05-07T20:11:06.7523818Z With the provided path, there will be 1 file uploaded 2025-05-07T20:11:06.7525860Z Artifact name is valid! 2025-05-07T20:11:06.7526837Z Root directory input is valid! 2025-05-07T20:11:06.8291316Z Beginning upload of artifact content to blob storage 2025-05-07T20:11:07.4223116Z Uploaded bytes 8388608 2025-05-07T20:11:07.6940510Z Uploaded bytes 16777216 2025-05-07T20:11:08.0403930Z Uploaded bytes 25165824 2025-05-07T20:11:08.3596752Z Uploaded bytes 33554432 2025-05-07T20:11:08.6634296Z Uploaded bytes 41943040 2025-05-07T20:11:08.9734766Z Uploaded bytes 50331648 2025-05-07T20:11:09.2957747Z Uploaded bytes 58720256 2025-05-07T20:11:09.5799846Z Uploaded bytes 67108864 2025-05-07T20:11:09.8908015Z Uploaded bytes 75497472 2025-05-07T20:11:10.2764393Z Uploaded bytes 83886080 2025-05-07T20:11:10.4901988Z Uploaded bytes 92274688 2025-05-07T20:11:10.8395848Z Uploaded bytes 100663296 2025-05-07T20:11:11.1377781Z Uploaded bytes 109051904 2025-05-07T20:11:11.4952301Z Uploaded bytes 117440512 2025-05-07T20:11:11.7498211Z Uploaded bytes 125829120 2025-05-07T20:11:12.1223247Z Uploaded bytes 134217728 2025-05-07T20:11:12.3923960Z Uploaded bytes 142606336 2025-05-07T20:11:12.7700650Z Uploaded bytes 150994944 2025-05-07T20:11:13.0270945Z Uploaded bytes 159383552 2025-05-07T20:11:13.3034391Z Uploaded bytes 167772160 2025-05-07T20:11:13.5531233Z Uploaded bytes 176160768 2025-05-07T20:11:13.9143223Z Uploaded bytes 184549376 2025-05-07T20:11:14.2041652Z Uploaded bytes 192937984 2025-05-07T20:11:14.4838903Z Uploaded bytes 201326592 2025-05-07T20:11:14.8298400Z Uploaded bytes 209715200 2025-05-07T20:11:15.0659043Z Uploaded bytes 218103808 2025-05-07T20:11:15.4233866Z Uploaded bytes 226492416 2025-05-07T20:11:15.7017168Z Uploaded bytes 234881024 2025-05-07T20:11:15.9569466Z Uploaded bytes 243269632 2025-05-07T20:11:16.2096372Z Uploaded bytes 251658240 2025-05-07T20:11:16.4808086Z Uploaded bytes 260046848 2025-05-07T20:11:16.7816217Z Uploaded bytes 268435456 2025-05-07T20:11:17.0461767Z Uploaded bytes 276824064 2025-05-07T20:11:17.3631510Z Uploaded bytes 285212672 2025-05-07T20:11:17.5511750Z Uploaded bytes 293601280 2025-05-07T20:11:17.9138848Z Uploaded bytes 301989888 2025-05-07T20:11:18.2211983Z Uploaded bytes 310378496 2025-05-07T20:11:18.5355731Z Uploaded bytes 318767104 2025-05-07T20:11:18.8436043Z Uploaded bytes 327155712 2025-05-07T20:11:19.1568688Z Uploaded bytes 335544320 2025-05-07T20:11:19.3994546Z Uploaded bytes 343932928 2025-05-07T20:11:19.7149286Z Uploaded bytes 352321536 2025-05-07T20:11:20.0194173Z Uploaded bytes 360710144 2025-05-07T20:11:20.2960574Z Uploaded bytes 369098752 2025-05-07T20:11:20.6037230Z Uploaded bytes 377487360 2025-05-07T20:11:20.8523713Z Uploaded bytes 385875968 2025-05-07T20:11:21.2454199Z Uploaded bytes 394264576 2025-05-07T20:11:21.5298279Z Uploaded bytes 402653184 2025-05-07T20:11:21.8471325Z Uploaded bytes 411041792 2025-05-07T20:11:22.0643292Z Uploaded bytes 419430400 2025-05-07T20:11:22.4114338Z Uploaded bytes 427819008 2025-05-07T20:11:22.6493620Z Uploaded bytes 436207616 2025-05-07T20:11:22.9425568Z Uploaded bytes 444596224 2025-05-07T20:11:23.1786767Z Uploaded bytes 452984832 2025-05-07T20:11:23.5036116Z Uploaded bytes 461373440 2025-05-07T20:11:23.8171510Z Uploaded bytes 469762048 2025-05-07T20:11:24.0640040Z Uploaded bytes 478150656 2025-05-07T20:11:24.3547982Z Uploaded bytes 486539264 2025-05-07T20:11:24.6275746Z Uploaded bytes 494927872 2025-05-07T20:11:24.8955519Z Uploaded bytes 503316480 2025-05-07T20:11:25.1643909Z Uploaded bytes 511705088 2025-05-07T20:11:25.4604392Z Uploaded bytes 518311089 2025-05-07T20:11:25.4764272Z Finished uploading artifact content to blob storage! 2025-05-07T20:11:25.4766739Z SHA256 digest of uploaded artifact zip is ec2f35797a5b3d1a5716a97d177b49243e353ace74ecff93200a07588fea455a 2025-05-07T20:11:25.4768502Z Finalizing artifact upload 2025-05-07T20:11:25.5672635Z Artifact fbgemm_default_x86_clang_py3.12_cu12.6.3.whl.zip successfully finalized. Artifact ID 3081456623 2025-05-07T20:11:25.5675498Z Artifact fbgemm_default_x86_clang_py3.12_cu12.6.3.whl has been successfully uploaded! Final size is 518311089 bytes. Artifact ID is 3081456623 2025-05-07T20:11:25.5687201Z Artifact download URL: https://github.com/pytorch/FBGEMM/actions/runs/14891846252/artifacts/3081456623 2025-05-07T20:11:25.5927432Z Post job cleanup. 2025-05-07T20:11:25.5932522Z ##[command]/usr/bin/docker exec 2c96c3f709dd7ccfbab62bd0bb03d0fe2bfdbbcabb6920f028364b7a349d9b1f sh -c "cat /etc/*release | grep ^ID" 2025-05-07T20:11:25.8759417Z [command]/usr/bin/git version 2025-05-07T20:11:25.8793590Z git version 2.47.1 2025-05-07T20:11:25.8826014Z Copying '/github/home/.gitconfig' to '/__w/_temp/fa5a1273-7fb4-46ce-8e6d-27f970cc9cb7/.gitconfig' 2025-05-07T20:11:25.8834856Z Temporarily overriding HOME='/__w/_temp/fa5a1273-7fb4-46ce-8e6d-27f970cc9cb7' before making global git config changes 2025-05-07T20:11:25.8835747Z Adding repository directory to the temporary git global config as a safe directory 2025-05-07T20:11:25.8848816Z [command]/usr/bin/git config --global --add safe.directory /__w/FBGEMM/FBGEMM 2025-05-07T20:11:25.8884099Z [command]/usr/bin/git config --local --name-only --get-regexp core\.sshCommand 2025-05-07T20:11:25.8910991Z [command]/usr/bin/git submodule foreach --recursive sh -c "git config --local --name-only --get-regexp 'core\.sshCommand' && git config --local --unset-all 'core.sshCommand' || :" 2025-05-07T20:11:25.9192388Z Entering 'external/asmjit' 2025-05-07T20:11:25.9243710Z Entering 'external/composable_kernel' 2025-05-07T20:11:25.9319228Z Entering 'external/cpuinfo' 2025-05-07T20:11:25.9385368Z Entering 'external/cutlass' 2025-05-07T20:11:25.9441597Z Entering 'external/googletest' 2025-05-07T20:11:25.9509263Z Entering 'external/hipify_torch' 2025-05-07T20:11:25.9557201Z Entering 'external/json' 2025-05-07T20:11:25.9638974Z [command]/usr/bin/git config --local --name-only --get-regexp http\.https\:\/\/github\.com\/\.extraheader 2025-05-07T20:11:25.9658563Z http.https://github.com/.extraheader 2025-05-07T20:11:25.9663930Z [command]/usr/bin/git config --local --unset-all http.https://github.com/.extraheader 2025-05-07T20:11:25.9690985Z [command]/usr/bin/git submodule foreach --recursive sh -c "git config --local --name-only --get-regexp 'http\.https\:\/\/github\.com\/\.extraheader' && git config --local --unset-all 'http.https://github.com/.extraheader' || :" 2025-05-07T20:11:25.9959100Z Entering 'external/asmjit' 2025-05-07T20:11:26.0013612Z http.https://github.com/.extraheader 2025-05-07T20:11:26.0049213Z Entering 'external/composable_kernel' 2025-05-07T20:11:26.0094028Z http.https://github.com/.extraheader 2025-05-07T20:11:26.0140188Z Entering 'external/cpuinfo' 2025-05-07T20:11:26.0172351Z http.https://github.com/.extraheader 2025-05-07T20:11:26.0205231Z Entering 'external/cutlass' 2025-05-07T20:11:26.0253646Z http.https://github.com/.extraheader 2025-05-07T20:11:26.0297558Z Entering 'external/googletest' 2025-05-07T20:11:26.0332421Z http.https://github.com/.extraheader 2025-05-07T20:11:26.0367041Z Entering 'external/hipify_torch' 2025-05-07T20:11:26.0414537Z http.https://github.com/.extraheader 2025-05-07T20:11:26.0454583Z Entering 'external/json' 2025-05-07T20:11:26.0488640Z http.https://github.com/.extraheader 2025-05-07T20:11:26.0657001Z Stop and remove container: caafb01e9845451cad3dc12376cedc73_amazonlinux2023_f45b6e 2025-05-07T20:11:26.0662629Z ##[command]/usr/bin/docker rm --force 2c96c3f709dd7ccfbab62bd0bb03d0fe2bfdbbcabb6920f028364b7a349d9b1f 2025-05-07T20:11:26.8675702Z 2c96c3f709dd7ccfbab62bd0bb03d0fe2bfdbbcabb6920f028364b7a349d9b1f 2025-05-07T20:11:26.8712587Z Remove container network: github_network_b577c48a8b564d3ca14c8193d220a990 2025-05-07T20:11:26.8717499Z ##[command]/usr/bin/docker network rm github_network_b577c48a8b564d3ca14c8193d220a990 2025-05-07T20:11:27.7428134Z github_network_b577c48a8b564d3ca14c8193d220a990 2025-05-07T20:11:27.7470037Z A job completed hook has been configured by the self-hosted runner administrator 2025-05-07T20:11:27.7488698Z ##[group]Run '/home/ec2-user/runner-scripts/after_job.sh' 2025-05-07T20:11:27.7493939Z shell: /usr/bin/bash --noprofile --norc -e -o pipefail {0} 2025-05-07T20:11:27.7494332Z ##[endgroup] 2025-05-07T20:11:27.7623345Z [!ALERT!] Swap in detected! [!ALERT!] 2025-05-07T20:11:37.8229021Z [!ALERT!] Swap out detected [!ALERT!] 2025-05-07T20:11:53.7064911Z Cleaning up orphan processes